Evolving Transparent Credit Risk Models: A Symbolic Regression Approach Using Genetic Programming
Created by W.Langdon from
gp-bibliography.bib Revision:1.8512
- @Article{sotiropoulos:2024:Electronics,
-
author = "Dionisios N. Sotiropoulos and Gregory Koronakos and
Spyridon V. Solanakis",
-
title = "Evolving Transparent Credit Risk Models: A Symbolic
Regression Approach Using Genetic Programming",
-
journal = "Electronics",
-
year = "2024",
-
volume = "13",
-
number = "21",
-
pages = "Article No. 4324",
-
keywords = "genetic algorithms, genetic programming",
-
ISSN = "2079-9292",
-
URL = "
https://www.mdpi.com/2079-9292/13/21/4324",
-
DOI = "
doi:10.3390/electronics13214324",
-
abstract = "Credit scoring is a cornerstone of financial risk
management, enabling financial institutions to assess
the likelihood of loan default. However, widely
recognised contemporary credit risk metrics, like FICO
(Fair Isaac Corporation) or Vantage scores, remain
proprietary and inaccessible to the public. This study
aims to devise an alternative credit scoring metric
that mirrors the FICO score, using an extensive dataset
from Lending Club. The challenge lies in the limited
available insights into both the precise analytical
formula and the comprehensive suite of credit-specific
attributes integral to the FICO score's calculation.
Our proposed metric leverages basic information
provided by potential borrowers, eliminating the need
for extensive historical credit data. We aim to
articulate this credit risk metric in a closed
analytical form with variable complexity. To achieve
this, we employ a symbolic regression method anchored
in genetic programming (GP). Here, the Occam's razor
principle guides evolutionary bias toward simpler, more
interpretable models. To ascertain our method's
efficacy, we juxtapose the approximation capabilities
of GP-based symbolic regression with established
machine learning regression models, such as Gaussian
Support Vector Machines (GSVMs), Multilayer Perceptrons
(MLPs), Regression Trees, and Radial Basis Function
Networks (RBFNs). Our experiments indicate that
GP-based symbolic regression offers accuracy comparable
to these benchmark methodologies. Moreover, the
resultant analytical model offers invaluable insights
into credit risk evaluation mechanisms, enabling
stakeholders to make informed credit risk assessments.
This study contributes to the growing demand for
transparent machine learning models by demonstrating
the value of interpretable, data-driven credit scoring
models.",
-
notes = "also known as \cite{electronics13214324}",
- }
Genetic Programming entries for
Dionisios N Sotiropoulos
Gregory Koronakos
Spyridon V Solanakis
Citations