Protein function prediction using kernal logistic regresssion with ROC curves

  • Jingwei Liu*
  • , Minping Qian
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

To avoid the "over-fitting" problem in protein function prediction based on protein-protein interactions (PPI), we propose a pattern recognition strategy that all the features of PPI observation data are divided into three sets, training set, learning set and testing set. The employed classifiers are trained on training sets, the receiver operating characteristic (ROC) curve and optimal operating point (OOP) is calculated on learning set, and the accuracy rate is reported on the testing set with OOP. Under this framework, we compare the performances of logistic regression (LR) model with kernel logistic regression (KLR) model on two different feature selection sets, 1-order feature and 2-order feature according to PPI data. The experiment results on a standard PPI data show that KLR model performs better than LR model on training sets of both 1-order feature set and 2-order feature set, and the 2-order feature outperforms 1-order feature set with KLR model on training set . The predictive rates on testing set of both 1-order feature and 2-order feature with LR and KLR can achieve 95%.

Original languageEnglish
Title of host publicationComputing and Intelligent Systems - International Conference, ICCIC 2011, Proceedings
Pages491-502
Number of pages12
EditionPART 4
DOIs
StatePublished - 2011
Event2011 International Conference on Computing, Information and Control, ICCIC 2011 - Wuhan, China
Duration: 17 Sep 201118 Sep 2011

Publication series

NameCommunications in Computer and Information Science
NumberPART 4
Volume234 CCIS
ISSN (Print)1865-0929

Conference

Conference2011 International Conference on Computing, Information and Control, ICCIC 2011
Country/TerritoryChina
CityWuhan
Period17/09/1118/09/11

Keywords

  • kernel logistic regression
  • logistic regression
  • optimal operating point
  • protein-protein interaction
  • receiver operating characteristic

Fingerprint

Dive into the research topics of 'Protein function prediction using kernal logistic regresssion with ROC curves'. Together they form a unique fingerprint.

Cite this