TY - GEN
T1 - Predicting gene function with positive and unlabeled examples
AU - Chen, Yiming
AU - Li, Zhoujun
AU - Hu, Xiaohua
AU - Diao, Hongxiang
AU - Liu, Junwan
PY - 2009
Y1 - 2009
N2 - Predicting gene function is usually formulated as binary classification problem. However, we only know which gene has some function while we are not sure that it doesn't belong to a function class, which means that only positive examples are given. Therefore, selecting a good training example set becomes a key step. In this paper, we cluster the genes on integrated weighted graph by generalizing the cluster coefficient of unweighted graph to weighted one, and identify the reliable negative samples based on distance between a gene and centroid of positive clusters. Then, the tri-training algorithm is used to learn three classifiers from labeled and unlabeled examples to predict the gene function by combining three prediction result. The experiment results show that our approach outperforms several classic prediction methods.
AB - Predicting gene function is usually formulated as binary classification problem. However, we only know which gene has some function while we are not sure that it doesn't belong to a function class, which means that only positive examples are given. Therefore, selecting a good training example set becomes a key step. In this paper, we cluster the genes on integrated weighted graph by generalizing the cluster coefficient of unweighted graph to weighted one, and identify the reliable negative samples based on distance between a gene and centroid of positive clusters. Then, the tri-training algorithm is used to learn three classifiers from labeled and unlabeled examples to predict the gene function by combining three prediction result. The experiment results show that our approach outperforms several classic prediction methods.
UR - https://www.scopus.com/pages/publications/70450043258
U2 - 10.1109/GRC.2009.5255161
DO - 10.1109/GRC.2009.5255161
M3 - 会议稿件
AN - SCOPUS:70450043258
SN - 9781424448319
T3 - 2009 IEEE International Conference on Granular Computing, GRC 2009
SP - 89
EP - 94
BT - 2009 IEEE International Conference on Granular Computing, GRC 2009
T2 - 2009 IEEE International Conference on Granular Computing, GRC 2009
Y2 - 17 August 2009 through 19 August 2009
ER -