Hypercolumns for object segmentation and fine-grained localization
Top Cited Papers
- 1 June 2015
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919,p. 447-456
- https://doi.org/10.1109/cvpr.2015.7298642
Abstract
Recognition algorithms based on convolutional networks (CNNs) typically use the output of the last layer as a feature representation. However, the information in this layer may be too coarse spatially to allow precise localization. On the contrary, earlier layers may be precise in localization but will not capture semantics. To get the best of both worlds, we define the hypercolumn at a pixel as the vector of activations of all CNN units above that pixel. Using hypercolumns as pixel descriptors, we show results on three fine-grained localization tasks: simultaneous detection and segmentation [22], where we improve state-of-the-art from 49.7 mean AP r [22] to 60.0, keypoint localization, where we get a 3.3 point boost over [20], and part labeling, where we show a 6.6 point gain over a strong baseline.Keywords
All Related Versions
This publication has 24 references indexed in Scilit:
- Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body PartsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- DeepPose: Human Pose Estimation via Deep Neural NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2014
- Pedestrian Parsing via Deep Decompositional NetworkPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing ItemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Pedestrian Detection with Unsupervised Multi-stage Feature LearningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Learning Hierarchical Features for Scene LabelingIEEE Transactions on Pattern Analysis and Machine Intelligence, 2012
- The truth about cats and dogsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Object Detection with Discriminatively Trained Part-Based ModelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Robust computation of optical flow in a multi-scale differential frameworkInternational Journal of Computer Vision, 1995
- Preattentive texture discrimination with early vision mechanismsJournal of the Optical Society of America A, 1990