Beyond active noun tagging: Modeling contextual interactions for multi-class active learning
- 1 June 2010
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 2979-2986
- https://doi.org/10.1109/cvpr.2010.5540044
Abstract
We present an active learning framework to simultaneously learn appearance and contextual models for scene understanding tasks (multi-class classification). Existing multi-class active learning approaches have focused on utilizing classification uncertainty of regions to select the most ambiguous region for labeling. These approaches, however, ignore the contextual interactions between different regions of the image and the fact that knowing the label for one region provides information about the labels of other regions. For example, the knowledge of a region being sea is informative about regions satisfying the “on” relationship with respect to it, since they are highly likely to be boats. We explicitly model the contextual interactions between regions and select the question which leads to the maximum reduction in the combined entropy of all the regions in the image (image entropy). We also introduce a new methodology of posing labeling questions, mimicking the way humans actively learn about their environment. In these questions, we utilize the regions linked to a concept with high confidence as anchors, to pose questions about the uncertain regions. For example, if we can recognize water in an image then we can use the region associated with water as an anchor to pose questions such as “what is above water?”. Our active learning framework also introduces questions which help in actively learning contextual concepts. For example, our approach asks the annotator: “What is the relationship between boat and water?” and utilizes the answer to reduce the image entropies throughout the training dataset and obtain more relevant training examples for appearance models.
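The selection criterion in the abstract (ask about the region whose label is expected to reduce the image entropy the most) can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the toy class set (sea, boat, sky), the `compat` table, and the `neighbors` map are all hypothetical, and a single one-step conditional update stands in for whatever inference procedure the authors actually use.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (in nats) of one region's class distribution."""
    p = np.clip(p, 1e-12, 1.0)
    return float(-(p * np.log(p)).sum())

def image_entropy(marginals):
    """Image entropy, taken here as the sum of per-region entropies."""
    return sum(entropy(p) for p in marginals)

def expected_entropy_after_query(marginals, compat, neighbors, i):
    """Expected image entropy if the annotator labels region i.

    Assumption: compat[c] is a hypothetical conditional distribution over
    the label of a region that is "on" an anchor region with label c.
    """
    n_classes = marginals.shape[1]
    expected = 0.0
    for c, p_c in enumerate(marginals[i]):
        updated = marginals.copy()
        updated[i] = np.eye(n_classes)[c]       # region i is now known
        for j in neighbors.get(i, []):          # propagate to linked regions
            post = marginals[j] * compat[c]
            updated[j] = post / post.sum()
        expected += p_c * image_entropy(updated)
    return expected

def select_query(marginals, compat, neighbors):
    """Return the region with the largest expected entropy reduction."""
    base = image_entropy(marginals)
    gains = [base - expected_entropy_after_query(marginals, compat, neighbors, i)
             for i in range(len(marginals))]
    return int(np.argmax(gains)), gains

# Toy image with three regions; classes are [sea, boat, sky].
marginals = np.array([
    [0.5, 0.1, 0.4],   # region 0: sea or sky?
    [0.3, 0.4, 0.3],   # region 1: lies "on" region 0, most uncertain alone
    [0.6, 0.2, 0.2],   # region 2: no contextual links
])
compat = np.array([
    [0.05, 0.90, 0.05],  # things on sea are mostly boats
    [1/3, 1/3, 1/3],     # uninformative for boat anchors
    [1/3, 1/3, 1/3],     # uninformative for sky anchors
])
neighbors = {0: [1]}     # region 1 satisfies "on" with respect to region 0

best, gains = select_query(marginals, compat, neighbors)
most_uncertain = int(np.argmax([entropy(p) for p in marginals]))
```

In this toy setup, region 1 has the highest individual entropy, so plain uncertainty sampling would ask about it. The context-aware criterion instead selects region 0, because resolving it also sharpens the distribution of the region lying on it.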