Incremental modelling for compositional data streams

Research output: Contribution to journalReview articlepeer-review

Abstract

Incremental modelling of data streams is of great practical importance, as shown by its applications in advertising and financial data analysis. We propose two incremental covariance matrix decomposition methods for a compositional data type. The first method, exact incremental covariance decomposition of compositional data (C-EICD), gives an exact decomposition result. The second method, covariance-free incremental covariance decomposition of compositional data (C-CICD), is an approximate algorithm that can efficiently compute high-dimensional cases. Based on these two methods, many frequently used compositional statistical models can be incrementally calculated. We take multiple linear regression and principle component analysis as examples to illustrate the utility of the proposed methods via extensive simulation studies.

Original languageEnglish
Pages (from-to)2229-2243
Number of pages15
JournalCommunications in Statistics Part B: Simulation and Computation
Volume48
Issue number8
DOIs
StatePublished - 14 Sep 2019

Keywords

  • Compositional data
  • Covariance matrix
  • Data stream
  • Eigen decomposition

Fingerprint

Dive into the research topics of 'Incremental modelling for compositional data streams'. Together they form a unique fingerprint.

Cite this