
Convex clustering method for compositional data via sparse group lasso

  • Beihang University
  • Beijing Advanced Innovation Center for Big Data and Brain Computing
  • Beijing Jiaotong University

Research output: Contribution to journal › Article › peer-review

Abstract

High-dimensional sparse clustering of compositional data is of great practical importance, for example in the analysis of high-throughput gene expression profiles. In this paper, we develop a compositional clustering framework based on convex clustering, a convex relaxation of hierarchical clustering that incorporates a fused penalty term on the cluster prototypes. To deal explicitly with high dimensionality and sparsity, we propose Compositional Convex Clustering with Sparse Group Lasso (CCC-SGL). The isometric logratio (ilr) transformation is first applied to map compositions from the simplex to standard Euclidean space. Then a group lasso penalty and a lasso penalty are imposed on the cluster centers, which together select informative features and promote within-feature sparsity. The resulting convex clustering problem is solved efficiently with a proximal gradient descent algorithm within the Alternating Direction Method of Multipliers (ADMM) framework. Simulation studies evaluate the performance of the proposed methodology, and a real microbiome sequencing data set is analyzed.
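
The two building blocks named in the abstract, the ilr transformation and the combined lasso and group lasso penalty, can be illustrated with a minimal sketch. This is not the authors' implementation: the ilr basis used here (pivot coordinates) is an assumption, since the abstract does not say which orthonormal basis the paper uses, and the function names `ilr` and `prox_sparse_group_lasso` as well as the parameters `lam_l1` and `lam_group` are hypothetical. The second function is the standard closed-form proximal operator of a sparse group lasso penalty on a single group, the kind of step a proximal gradient update would apply; how the paper groups the cluster-center entries is not specified here.

```python
import math


def ilr(x):
    """Map a strictly positive composition (D parts) to D-1 ilr coordinates.

    Assumption: pivot coordinates are used as the orthonormal basis.
    """
    if any(v <= 0 for v in x):
        raise ValueError("ilr requires strictly positive parts")
    logs = [math.log(v) for v in x]
    coords = []
    for i in range(len(logs) - 1):
        rest = logs[i + 1:]              # log-parts after the i-th one
        k = len(rest)
        mean_rest = sum(rest) / k        # log of their geometric mean
        coords.append(math.sqrt(k / (k + 1)) * (logs[i] - mean_rest))
    return coords


def prox_sparse_group_lasso(v, lam_l1, lam_group):
    """Proximal operator of lam_l1*||v||_1 + lam_group*||v||_2 for one group."""
    # Step 1: elementwise soft-thresholding (lasso part).
    u = [math.copysign(max(abs(a) - lam_l1, 0.0), a) for a in v]
    # Step 2: shrink the whole group toward zero (group lasso part).
    norm = math.sqrt(sum(a * a for a in u))
    if norm <= lam_group:
        return [0.0] * len(v)            # the entire group is zeroed out
    scale = 1.0 - lam_group / norm
    return [scale * a for a in u]
```

For instance, `ilr([0.2, 0.3, 0.5])` returns two coordinates that are unchanged if the input is rescaled to `[2, 3, 5]`, and `prox_sparse_group_lasso` sets a whole group to zero once its soft-thresholded norm falls below `lam_group`, which is the mechanism behind feature selection with within-feature sparsity.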

Original language: English
Pages (from-to): 23-36
Number of pages: 14
Journal: Neurocomputing
Volume: 425
State: Published - 15 Feb 2021

Keywords

  • ADMM
  • Compositional data
  • Convex clustering
  • Sparse-group-lasso
