Skip to main navigation Skip to search Skip to main content

P2P lending analysis using the most relevant graph-based features

  • Lixin Cui
  • , Lu Bai*
  • , Yue Wang
  • , Xiao Bai
  • , Zhihong Zhang
  • , Edwin R. Hancock
  • *Corresponding author for this work
  • Central University of Finance and Economics
  • Xiamen University
  • University of York

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Peer-to-Peer (P2P) lending is an online platform to facilitate borrowing and investment transactions. A central problem for these P2P platforms is how to identify the most influential factors that are closely related to the credit risks. This problem is inherently complex due to the various forms of risks and the numerous influencing factors involved. Moreover, raw data of P2P lending are often high-dimension, highly correlated and unstable, making the problem more untractable by traditional statistical and machine learning approaches. To address these problems, we develop a novel filter-based feature selection method for P2P lending analysis. Unlike most traditional feature selection methods that use vectorial features, the proposed method is based on graphbased features and thus incorporates the relationships between pairwise feature samples into the feature selection process. Since the graph-based features are by nature completed weighted graphs, we use the steady state random walk to encapsulate the main characteristics of the graphbased features. Specifically, we compute a probability distribution of the walk visiting the vertices. Furthermore, we measure the discriminant power of each graph-based feature with respect to the target feature, through the Jensen-Shannon divergence measure between the probability distributions from the random walks. We select an optimal subset of features based on the most relevant graph-based features, through the Jensen-Shannon divergence measure. Unlike most existing state-of-theart feature selection methods, the proposed method can accommodate both continuous and discrete target features. Experiments demonstrate the effectiveness and usefulness of the proposed feature selection algorithm on the problem of P2P lending platforms in China.

Original languageEnglish
Title of host publicationStructural, Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshop S+SSPR 2016, Proceedings
EditorsBattista Biggio, Richard Wilson, Marco Loog, Francisco Escolano, Antonio Robles-Kelly
PublisherSpringer Verlag
Pages3-14
Number of pages12
ISBN (Print)9783319490540
DOIs
StatePublished - 2016
EventJoint IAPR International Workshops on Structural and Syntactic Pattern Recognition, SSPR 2016 - Merida, Mexico
Duration: 29 Nov 20162 Dec 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10029 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceJoint IAPR International Workshops on Structural and Syntactic Pattern Recognition, SSPR 2016
Country/TerritoryMexico
CityMerida
Period29/11/162/12/16

Fingerprint

Dive into the research topics of 'P2P lending analysis using the most relevant graph-based features'. Together they form a unique fingerprint.

Cite this