Skip to main navigation Skip to search Skip to main content

Empirical evaluation of ensemble learning for credit scoring

  • Gang Wang*
  • , Jin Xing Hao
  • , Jian Ma
  • , Li Hua Huang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

Credit scoring is an important finance activity. Both statistical techniques and Artificial Intelligence (AI) techniques have been explored for this topic. But different techniques have different advantages and disadvantages on different datasets. Recent studies draw no consistent conclusions to show that one technique is superior to the other, while they suggest combining multiple classifiers, i.e., ensemble learning, may have a better performance. In this study, we conduct an empirical evaluation of the performance of three popular ensemble methods, i.e., bagging, boosting, and stacking, based on four base learners, i.e., Logistic Regression Analysis (LRA), Decision Tree (DT), Artificial Neural Network (ANN) and Support Vector Machine (SVM). The experiment uses the credit dataset including 239 companies' financial records from China, collected by the Industrial and Commercial Bank of China. Results reveal that ensemble learning can substantially improve individual base learners. Stacking, in our experiments, gets the best performance in terms of all six performance indicators, i.e., type I error, type II error, average accuracy, precision, recall, and F-value.

Original languageEnglish
Title of host publicationMachine Learning
Subtitle of host publicationConcepts, Methodologies, Tools and Applications
PublisherIGI Global
Pages1108-1127
Number of pages20
Volume2-3
ISBN (Electronic)9781609608194
ISBN (Print)9781609608187
DOIs
StatePublished - 31 Jul 2011
Externally publishedYes

Fingerprint

Dive into the research topics of 'Empirical evaluation of ensemble learning for credit scoring'. Together they form a unique fingerprint.

Cite this