Active learning for large multi-class problems

1 June 2009

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919,p. 762-769
https://doi.org/10.1109/cvpr.2009.5206651

Abstract

Scarcity and infeasibility of human supervision for large scale multi-class classification problems necessitates active learning. Unfortunately, existing active learning methods for multi-class problems are inherently binary methods and do not scale up to a large number of classes. In this paper, we introduce a probabilistic variant of the K-nearest neighbor method for classification that can be seamlessly used for active learning in multi-class scenarios. Given some labeled training data, our method learns an accurate metric/kernel function over the input space that can be used for classification and similarity search. Unlike existing metric/kernel learning methods, our scheme is highly scalable for classification problems and provides a natural notion of uncertainty over class labels. Further, we use this measure of uncertainty to actively sample training examples that maximize discriminating capabilities of the model. Experiments on benchmark datasets show that the proposed method learns appropriate distance metrics that lead to state-of-the-art performance for object categorization problems. Furthermore, our active learning method effectively samples training examples, resulting in significant accuracy gains over random sampling for multi-class problems involving a large number of classes.

Keywords

This publication has 9 references indexed in Scilit:

Entropy-based active learning for object recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2008
Active kernel learning
Published by Association for Computing Machinery (ACM) ,2008
Representing shape with a spatial pyramid kernel
Published by Association for Computing Machinery (ACM) ,2007
Information-theoretic metric learning
Published by Association for Computing Machinery (ACM) ,2007
Geometric blur for template matching
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
The pyramid match kernel: discriminative classification with sets of image features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Selective Sampling for Nearest Neighbor Classifiers
Machine Learning, 2004
Automatically labeling video data using multi-class active learning
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Information-Based Objective Functions for Active Data Selection
Neural Computation, 1992

Cited by 94 articles