A COMPREHENSIVE STUDY OF THE EFFECT OF CLASS IMBALANCE ON THE PERFORMANCE OF CLASSIFIERS

Rodica Potolea, Camelia Lemnaru

2011

Abstract

Class imbalance is one of the significant issues which affect the performance of classifiers. In this paper we systematically analyze the effect of class imbalance on some standard classification algorithms. The study is performed on benchmark datasets, in relationship with concept complexity, size of the training set, and ratio between number of instances and number of attributes of the training set data. In the evaluation we considered six different metrics. The results indicate that the multilayer perceptron is the most robust to the imbalance in training data, while the support vector machine’s performance is the most affected. Also, we found that unpruned C4.5 models work better than the pruned versions.

Download


Paper Citation


in Harvard Style

Potolea R. and Lemnaru C. (2011). A COMPREHENSIVE STUDY OF THE EFFECT OF CLASS IMBALANCE ON THE PERFORMANCE OF CLASSIFIERS . In Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS, ISBN 978-989-8425-53-9, pages 14-21. DOI: 10.5220/0003415800140021

in Bibtex Style

@conference{iceis11,
author={Rodica Potolea and Camelia Lemnaru},
title={A COMPREHENSIVE STUDY OF THE EFFECT OF CLASS IMBALANCE ON THE PERFORMANCE OF CLASSIFIERS},
booktitle={Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS,},
year={2011},
pages={14-21},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003415800140021},
isbn={978-989-8425-53-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 13th International Conference on Enterprise Information Systems - Volume 1: ICEIS,
TI - A COMPREHENSIVE STUDY OF THE EFFECT OF CLASS IMBALANCE ON THE PERFORMANCE OF CLASSIFIERS
SN - 978-989-8425-53-9
AU - Potolea R.
AU - Lemnaru C.
PY - 2011
SP - 14
EP - 21
DO - 10.5220/0003415800140021