arXiv Analytics

Sign in

arXiv:1907.12363 [cs.LG]AbstractReferencesReviewsResources

A comparison of Deep Learning performances with others machine learning algorithms on credit scoring unbalanced data

Louis Marceau, Lingling Qiu, Nick Vandewiele, Eric Charton

Published 2019-07-25Version 1

Training models on highly unbalanced data is admitted to be a challenging task for machine learning algorithms. Current studies on deep learning mainly focus on data sets with balanced class labels, or unbalanced data but with massive amount of samples available, like in speech recognition. However, the capacities of deep learning on imbalanced data with little samples is not deeply investigated in literature, while it is a very common application context, in numerous industries. To contribute to fill this gap, this paper compares the performances of several popular machine learning algorithms previously applied with success to unbalanced data set with deep learning algorithms. We conduct those experiments on an highly unbalanced data set, used for credit scoring. We evaluate various configuration including neural network optimisation techniques and try to determine their capacities when they operate with imbalanced corpora.

Related articles: Most relevant | Search more
arXiv:2007.12475 [cs.LG] (Published 2020-07-12)
Predicting and Mapping of Soil Organic Carbon Using Machine Learning Algorithms in Northern Iran
arXiv:1506.00852 [cs.LG] (Published 2015-06-02)
Peer Grading in a Course on Algorithms and Data Structures: Machine Learning Algorithms do not Improve over Simple Baselines
arXiv:2008.13690 [cs.LG] (Published 2020-08-31)
Evaluation of machine learning algorithms for Health and Wellness applications: a tutorial