Abstract
In this paper, we introduce a novel approach to automatically regulate receptive fields in deep image parsing networks. Unlike previous work which placed much importance on obtaining better receptive fields using manually selected dilated convolutional kernels, our approach uses two affine transformation layers in the network’s backbone and operates on feature maps. Feature maps are inflated or shrunk by the new layer, thereby changing the receptive fields in the following layers. By use of end-to-end training, the whole framework is data-driven, without laborious manual intervention. The proposed method is generic across datasets and different tasks. We have conducted extensive experiments on both general image parsing tasks, and face parsing tasks as concrete examples, to demonstrate the method’s superior ability to regulate over manual designs.
| Original language | English |
|---|---|
| Pages (from-to) | 231-244 |
| Number of pages | 14 |
| Journal | Computational Visual Media |
| Volume | 4 |
| Issue number | 3 |
| DOIs | |
| State | Published - 1 Sep 2018 |
| Externally published | Yes |
Keywords
- data-driven
- face parsing
- receptive field
- semantic segmentation
Fingerprint
Dive into the research topics of 'Learning adaptive receptive fields for deep image parsing networks'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver