Hypercolumns for object segmentation and fine-grained localization

1 June 2015

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919, p. 447-456
https://doi.org/10.1109/cvpr.2015.7298642

Abstract

Recognition algorithms based on convolutional networks (CNNs) typically use the output of the last layer as a feature representation. However, the information in this layer may be too coarse spatially to allow precise localization. On the contrary, earlier layers may be precise in localization but will not capture semantics. To get the best of both worlds, we define the hypercolumn at a pixel as the vector of activations of all CNN units above that pixel. Using hypercolumns as pixel descriptors, we show results on three fine-grained localization tasks: simultaneous detection and segmentation [22], where we improve state-of-the-art from 49.7 mean AP^r [22] to 60.0, keypoint localization, where we get a 3.3 point boost over [20], and part labeling, where we show a 6.6 point gain over a strong baseline.
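
The hypercolumn definition in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `upsample_nearest` and `hypercolumns`, the toy layer shapes, and the random activations are all hypothetical, and nearest-neighbor upsampling is swapped in for brevity where a smoother interpolation would normally be used. Each feature map is resized to the image resolution, the maps are stacked along the channel axis, and the descriptor for a pixel is the column of values at that location.

```python
import numpy as np

def upsample_nearest(fmap, out_h, out_w):
    """Resize a (C, h, w) feature map to (C, out_h, out_w) by nearest-neighbor lookup."""
    _, h, w = fmap.shape
    rows = np.arange(out_h) * h // out_h  # source row for each output row
    cols = np.arange(out_w) * w // out_w  # source column for each output column
    return fmap[:, rows[:, None], cols[None, :]]

def hypercolumns(feature_maps, out_h, out_w):
    """Stack upsampled maps along channels; result has shape (sum of C_i, out_h, out_w)."""
    upsampled = [upsample_nearest(f, out_h, out_w) for f in feature_maps]
    return np.concatenate(upsampled, axis=0)

# Toy stand-ins for activations from three layers of decreasing resolution
# (assumed shapes, random values; a real network would supply these).
rng = np.random.default_rng(0)
layers = [
    rng.standard_normal((4, 8, 8)),   # early layer: fine spatially, few channels
    rng.standard_normal((8, 4, 4)),   # middle layer
    rng.standard_normal((16, 2, 2)),  # late layer: coarse spatially, many channels
]

hc = hypercolumns(layers, out_h=8, out_w=8)
descriptor = hc[:, 3, 5]  # hypercolumn at pixel (row 3, column 5)
print(hc.shape, descriptor.shape)
```

The descriptor concatenates the spatially precise early-layer values with the coarser late-layer values that cover the same pixel, which is the combination of localization and semantics the abstract describes.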

This publication has 24 references indexed in Scilit:

Detect What You Can: Detecting and Representing Objects Using Holistic Models and Body Parts
Published by Institute of Electrical and Electronics Engineers (IEEE), 2014
DeepPose: Human Pose Estimation via Deep Neural Networks
Published by Institute of Electrical and Electronics Engineers (IEEE), 2014
Pedestrian Parsing via Deep Decompositional Network
Published by Institute of Electrical and Electronics Engineers (IEEE), 2013
Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items
Published by Institute of Electrical and Electronics Engineers (IEEE), 2013
Pedestrian Detection with Unsupervised Multi-stage Feature Learning
Published by Institute of Electrical and Electronics Engineers (IEEE), 2013
Learning Hierarchical Features for Scene Labeling
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2012
The truth about cats and dogs
Published by Institute of Electrical and Electronics Engineers (IEEE), 2011
Object Detection with Discriminatively Trained Part-Based Models
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Robust computation of optical flow in a multi-scale differential framework
International Journal of Computer Vision, 1995
Preattentive texture discrimination with early vision mechanisms
Journal of the Optical Society of America A, 1990

Cited by 886 articles