Binarized Support Vector Machines

Open Access

1 February 2010

journal article
Published by Institute for Operations Research and the Management Sciences (INFORMS) in INFORMS Journal on Computing

Vol. 22 (1), 154-167
https://doi.org/10.1287/ijoc.1090.0317

Abstract

The widely used support vector machine (SVM) method has shown to yield very good results in supervised classification problems. Other methods such as classification trees have become more popular among practitioners than SVM thanks to their interpretability, which is an important issue in data mining.In this work, we propose an SVM-based method that automatically detects the most important predictor variables and the role they play in the classifier. In particular, the proposed method is able to detect those values and intervals that are critical for the classification. The method involves the optimization of a linear programming problem in the spirit of the Lasso method with a large number of decision variables. The numerical experience reported shows that a rather direct use of the standard column generation strategy leads to a classification method that, in terms of classification ability, is competitive against the standard linear SVM and classification trees. Moreover, the proposed method is robust; i.e., it is stable in the presence of outliers and invariant to change of scale or measurement units of the predictor variables.When the complexity of the classifier is an important issue, a wrapper feature selection method is applied, yielding simpler but still competitive classifiers.

Keywords

All Related Versions

Version 1, 2007-01-01, RePEc (Unconfirmed version)

This publication has 24 references indexed in Scilit:

Multi-group support vector machines with measurement costs: A biobjective approach
Discrete Applied Mathematics, 2008
Comprehensible credit scoring models using rule extraction from support vector machines
European Journal of Operational Research, 2007
A Feature Selection Newton Method for Support Vector Machine Classification
Computational Optimization and Applications, 2004
Support vector machines with different norms: motivation, formulations and results
Pattern Recognition Letters, 2001
Massive data discrimination via linear support vector machines
Optimization Methods and Software, 2000
Structural risk minimization over data-dependent hierarchies
IEEE Transactions on Information Theory, 1998
Classification by pairwise coupling
The Annals of Statistics, 1998
Using neural networks for data mining
Future Generation Computer Systems, 1997
Survey and critique of techniques for extracting rules from trained artificial neural networks
Knowledge-Based Systems, 1995
Support-vector networks
Machine Learning, 1995

Cited by 28 articles