Feature Engineering for Credit Risk Evaluation in Online P2P Lending

Feature Engineering for Credit Risk Evaluation in Online P2P Lending

Shuxia Wang, Bin Fu, Hongzhi Liu, Zhengshen Jiang, Zhonghai Wu, D. Frank Hsu
DOI: 10.4018/IJSSCI.2017040101
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

The rise of online P2P lending, as a novel economic lending model, brings new opportunities and challenges for the research of credit risk evaluation. This paper aims to mine information from different data sources to improve the performance of credit risk evaluation models. Be-sides the personal financial and demographic data used in traditional models, the authors collect in-formation from (1) text description, (2) social network and (3) macro-economic data. They de-sign methods to extract features from unstructured data. To avoid the curse of dimensionality caused by too many features and identify the key factors in credit risk, the authors remove the irrelevant and redundant features by feature selection. Using the data provided by Prosper.com, one of the biggest P2P lending platforms in the world, they show that: (1) it can achieve better performance, measured by both AUC (area under the receiver operating characteristic curve) and classification accuracy, by fusion of information from different data sources; (2) it requires only ten features from different data sources to get better performance.
Article Preview
Top

Personal financial data is the main information source of traditional credit risk evaluation models. Puro et al. (2010) studied the relationship between loan amount, interest rate and the funding success. Their experimental results showed that lower interest rates decrease the chances of getting the loan funded, while lower loan amounts increase the chance of funded. Emekter et al. (2015) studied the relation between various financial factors and the default rate. Their results showed that credit grade, debt-to-income ratio, FICO score and revolving line utilization play an important role in loan defaults. Loans with lower credit grade and longer duration as associated with high default rate.

Complete Article List

Search this Journal:
Reset
Volume 16: 1 Issue (2024)
Volume 15: 1 Issue (2023)
Volume 14: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 13: 4 Issues (2021)
Volume 12: 4 Issues (2020)
Volume 11: 4 Issues (2019)
Volume 10: 4 Issues (2018)
Volume 9: 4 Issues (2017)
Volume 8: 4 Issues (2016)
Volume 7: 4 Issues (2015)
Volume 6: 4 Issues (2014)
Volume 5: 4 Issues (2013)
Volume 4: 4 Issues (2012)
Volume 3: 4 Issues (2011)
Volume 2: 4 Issues (2010)
Volume 1: 4 Issues (2009)
View Complete Journal Contents Listing