摘要
Ratio-based algorithms are proven to be effective methods for removing batch effects that exist among micro array expression data from different data sources. They are outperforming than other methods in the enhancement of cross-batch prediction, especially for cancer data sets. However, their overall power is limited by: (1) Not every batch has control samples. The original method uses all negative samples to calculate the subtrahend. (2) Micro array experimental data may not have clear labels, especially in the prediction application, the labels of test data set are unknown. In this paper, we propose an Improved Ratio-Based (IRB) method to relieve these two constraints for cross-batch prediction applications. For each batch in a single study, we select one reference sample based on the idea of aligning probability density functions (pdfs) of each gene in different batches. Moreover, for data sets without label information, we transfer the problem of finding reference sample to the dense sub graph problem in graph theory. Our newly-proposed IRB method is straightforward and efficient, and can be extended for integrating large volume micro array data sets. The experiments show that our method is stable and has high performance in tumor/non-tumor prediction.
| 源语言 | 英语 |
|---|---|
| 主期刊名 | Proceedings - IEEE 14th International Conference on Bioinformatics and Bioengineering, BIBE 2014 |
| 编辑 | Reda Alhajj, Taghi M. Khoshgoftaar, Nikolaos G. Bourbakis, Xingquan Zhu |
| 出版商 | Institute of Electrical and Electronics Engineers Inc. |
| 页 | 212-219 |
| 页数 | 8 |
| ISBN(电子版) | 9781479975013 |
| DOI | |
| 出版状态 | 已出版 - 5 2月 2014 |
| 已对外发布 | 是 |
| 活动 | 14th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2014 - Boca Raton, 美国 期限: 10 11月 2014 → 12 11月 2014 |
出版系列
| 姓名 | Proceedings - IEEE 14th International Conference on Bioinformatics and Bioengineering, BIBE 2014 |
|---|
会议
| 会议 | 14th IEEE International Conference on BioInformatics and BioEngineering, BIBE 2014 |
|---|---|
| 国家/地区 | 美国 |
| 市 | Boca Raton |
| 时期 | 10/11/14 → 12/11/14 |
联合国可持续发展目标
此成果有助于实现下列可持续发展目标:
-
可持续发展目标 3 良好健康与福祉
指纹
探究 'An improved ratio-based (IRB) batch effects removal algorithm for cancer data in a co-analysis framework' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver