Skip to main navigation Skip to search Skip to main content

Biclustering of linear patterns in gene expression data

  • Qinghui Gao
  • , Christine Ho
  • , Yingmin Jia
  • , Jingyi Jessica Li
  • , Haiyan Huang*
  • *Corresponding author for this work
  • Beihang University
  • University of California at Berkeley

Research output: Contribution to journalArticlepeer-review

Abstract

Identifying a bicluster, or submatrix of a gene expression dataset wherein the genes express similar behavior over the columns, is useful for discovering novel functional gene interactions. In this article, we introduce a new algorithm for finding biClusters with Linear Patterns (CLiP). Instead of solely maximizing Pearson correlation, we introduce a fitness function that also considers the correlation of complementary genes and conditions. This eliminates the need for a priori determination of the bicluster size. We employ both greedy search and the genetic algorithm in optimization, incorporating resampling for more robust discovery. When applied to both real and simulation datasets, our results show that CLiP is superior to existing methods. In analyzing RNA-seq fly and worm time-course data from modENCODE, we uncover a set of similarly expressed genes suggesting maternal dependence. Supplementary Material is available online (at www.liebertonline.com/cmb).

Original languageEnglish
Pages (from-to)619-631
Number of pages13
JournalJournal of Computational Biology
Volume19
Issue number6
DOIs
StatePublished - 1 Jun 2012

Keywords

  • algorithms
  • gene clusters
  • probability

Fingerprint

Dive into the research topics of 'Biclustering of linear patterns in gene expression data'. Together they form a unique fingerprint.

Cite this