Imbalanced learning with a biased minimax probability machine
- 17 July 2006
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
- Vol. 36 (4), 913-923
- https://doi.org/10.1109/tsmcb.2006.870610
Abstract
Imbalanced learning is a challenged task in machine learning. In this context, the data associated with one class are far fewer than those associated with the other class. Traditional machine learning methods seeking classification accuracy over a full range of instances are not suitable to deal with this problem, since they tend to classify all the data into a majority class, usually the less important class. In this correspondence, the authors describe a new approach named the biased minimax probability machine (BMPM) to deal with the problem of imbalanced learning. This BMPM model is demonstrated to provide an elegant and systematic way for imbalanced learning. More specifically, by controlling the accuracy of the majority class under all possible choices of class-conditional densities with a given mean and covariance matrix, this model can quantitatively and systematically incorporate a bias for the minority class. By establishing an explicit connection between the classification accuracy and the bias, this approach distinguishes itself from the many current imbalanced-learning methods; these methods often impose a certain bias on the minority data by adapting intermediate factors via the trial-and-error procedure. The authors detail the theoretical foundation, prove its solvability, propose an efficient optimization algorithm, and perform a series of experiments to evaluate the novel model. The comparison with other competitive methods demonstrates the effectiveness of this new modelKeywords
This publication has 21 references indexed in Scilit:
- A Comparison of State‐of‐the‐Art Classification Techniques for Expert Automobile Insurance Claim Fraud DetectionJournal of Risk and Insurance, 2002
- A comparison of methods for multiclass support vector machinesIEEE Transactions on Neural Networks, 2002
- Bayesian classification for data from the same unknown classIEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 2002
- A Comparison of State-of-the-Art Classification Techniques with Application to CytogeneticsNeural Computing & Applications, 2001
- Sparse pixel vectorization: an algorithm and its performance evaluationIEEE Transactions on Pattern Analysis and Machine Intelligence, 1999
- Building Detection and Description from a Single Intensity ImageComputer Vision and Image Understanding, 1998
- The use of the area under the ROC curve in the evaluation of machine learning algorithmsPattern Recognition, 1997
- Combination of multiple classifiers using local accuracy estimatesIEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
- Analyzing a Portion of the ROC CurveMedical Decision Making, 1989
- Measuring the Accuracy of Diagnostic SystemsScience, 1988