A Selective Overview of Quantile Regression for Large-Scale Data

  • Shanshan Wang
  • , Wei Cao
  • , Xiaoxue Hu*
  • , Hanyu Zhong
  • , Weixi Sun
  • *Corresponding author for this work

Research output: Contribution to journalReview articlepeer-review

Abstract

Large-scale data, characterized by heterogeneity due to heteroskedastic variance or inhomogeneous covariate effects, arises in diverse fields of scientific research and technological development. Quantile regression (QR) is a valuable tool for detecting heteroskedasticity, and numerous QR statistical methods for large-scale data have been rapidly developed. This paper provides a selective review of recent advances in QR theory, methods, and implementations, particularly in the context of massive and streaming data. We focus on three key strategies for large-scale QR analysis: (1) distributed computing, (2) subsampling methods, and (3) online updating. The main contribution of this paper is a comprehensive review of existing work and advancements in these areas, addressing challenges such as managing the non-smooth QR loss function, developing distributed and online updating formulations, and conducting statistical inference. Finally, we highlight several issues that require further study.

Original languageEnglish
Article number837
JournalMathematics
Volume13
Issue number5
DOIs
StatePublished - Mar 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • distributed computing
  • large-scale data
  • quantile regression
  • renewable estimation
  • subsampling methods

Fingerprint

Dive into the research topics of 'A Selective Overview of Quantile Regression for Large-Scale Data'. Together they form a unique fingerprint.

Cite this