跳到主要导航 跳到搜索 跳到主要内容

HCIndex: a Hilbert-Curve-based clustering index for efficient multi-dimensional queries for cloud storage systems

  • Xinyang Wang
  • , Yu Sun
  • , Qiao Sun*
  • , Weiwei Lin*
  • , James Z. Wang
  • , Wei Li
  • *此作品的通讯作者
  • Beijing Forestry University
  • National Forestry and Grassland Administration Engineering Research Center for Forestry-Oriented Intelligent Information Processing
  • South China University of Technology
  • Peng Cheng Laboratory
  • Clemson University

科研成果: 期刊稿件文章同行评审

摘要

With the rapid development of the Internet of Things and cloud computing, HBase has become a good choice for massive data storage, and is efficient in reading and writing data. However, HBase is not supportive for multi-dimensional query of non-rowkey data, unconducive to data analysis and processing. To address this issue, we first analyze the constitution principle and deficiency of secondary index and clustering index, and select clustering index as the basis of optimization. Then, we choose the Hilbert curve in the space filling curve as the linearization technology, design the pre-partition algorithm and subspace partition algorithm, and realize the Hilbert-curve-based clustering index (HCIndex) which supports multi-dimensional point query and range query. Finally, the performance of HCIndex is verified by comparison experiments with HBase Scan, HiBase and CCIndex. The experimental results show that the query efficiency of HCIndex has been greatly improved at the expense of very limited storage space, which is necessary for storing index data and only 1.7 times the size of the original data table of HBase. Compared with HBase scan, the query efficiency of HCIndex’s multi-dimensional point query and range query has been increased to more than 4 times and more than 2 times, respectively. Therefore, the proposed HCIndex is well suited for efficient multi-dimensional and complex queries of massive data in cloud storage systems.

源语言英语
页(从-至)2011-2025
页数15
期刊Cluster Computing
26
3
DOI
出版状态已出版 - 6月 2023
已对外发布

指纹

探究 'HCIndex: a Hilbert-Curve-based clustering index for efficient multi-dimensional queries for cloud storage systems' 的科研主题。它们共同构成独一无二的指纹。

引用此