Domain Adaptation From Multiple Sources: A Domain-Dependent Regularization Approach

Top Cited Papers

23 January 2012

journal article
research article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks and Learning Systems

Vol. 23 (3), 504-518
https://doi.org/10.1109/tnnls.2011.2178556

Abstract

In this paper, we propose a new framework called domain adaptation machine (DAM) for the multiple source domain adaption problem. Under this framework, we learn a robust decision function (referred to as target classifier) for label prediction of instances from the target domain by leveraging a set of base classifiers which are prelearned by using labeled instances either from the source domains or from the source domains and the target domain. With the base classifiers, we propose a new domain-dependent regularizer based on smoothness assumption, which enforces that the target classifier shares similar decision values with the relevant base classifiers on the unlabeled instances from the target domain. This newly proposed regularizer can be readily incorporated into many kernel methods (e.g., support vector machines (SVM), support vector regression, and least-squares SVM (LS-SVM)). For domain adaptation, we also develop two new domain adaptation methods referred to as FastDAM and UniverDAM. In FastDAM, we introduce our proposed domain-dependent regularizer into LS-SVM as well as employ a sparsity regularizer to learn a sparse target classifier with the support vectors only from the target domain, which thus makes the label prediction on any test instance very fast. In UniverDAM, we additionally make use of the instances from the source domains as Universum to further enhance the generalization ability of the target classifier. We evaluate our two methods on the challenging TRECIVD 2005 dataset for the large-scale video concept detection task as well as on the 20 newsgroups and email spam datasets for document retrieval. Comprehensive experiments demonstrate that FastDAM and UniverDAM outperform the existing multiple source domain adaptation methods for the two applications.

Keywords

This publication has 23 references indexed in Scilit:

A theory of learning from different domains
Machine Learning, 2009
Cross-domain video concept detection using adaptive svms
Published by Association for Computing Machinery (ACM) ,2007
Integrating structured biological data by Kernel Maximum Mean Discrepancy
Bioinformatics, 2006
Inference with the Universum
Published by Association for Computing Machinery (ACM) ,2006
A tutorial on support vector regression
Statistics and Computing, 2004
Improving SVM accuracy by training on auxiliary data sources
Published by Association for Computing Machinery (ACM) ,2004
Benchmarking Least Squares Support Vector Machine Classifiers
Machine Learning, 2004
Detecting Change in Data Streams
Published by Elsevier ,2004
An introduction to kernel-based learning algorithms
IEEE Transactions on Neural Networks, 2001
Combining labeled and unlabeled data with co-training
Published by Association for Computing Machinery (ACM) ,1998

Cited by 285 articles