跳到主要导航 跳到搜索 跳到主要内容

Network-based clustering and embedding for high-dimensional data visualization

  • Hengyuan Zhang
  • , Xiaowu Chen
  • Beihang University

科研成果: 会议稿件论文同行评审

摘要

We present a novel method to visualize high-dimensional dataset as a landscape. The goal is to provide clear and compact representation to reveal the structure of high-dimensional datasets in a way that the size and distinctiveness of clusters can be easily discerned, and the relationships among single points can be preserved. Our method is network-based, and consists of two main steps: clustering and embedding. First of all, the similarity graph of high-dimensional dataset is constructed based on the Euclidean distances between data points. For clustering, we propose a new network community detection algorithm to calculate the membership-degree of each vertex belonging to each community. For embedding, we bring forward a practical algorithm to obtain an evenly distributed and regularly shaped layout of data points, in a way that the original relationships among single points are preserved. Finally, the landscape-like visualization is produced by assigning altitudes to data points according to their membership-degrees and by inserting control points. In our high-dimensional data visualization, clusters form highlands, and border data points among clusters show up as valleys. The area and altitude of highland indicate the size and distinctiveness of data cluster respectively.

源语言英语
290-297
页数8
DOI
出版状态已出版 - 2013
活动13th International Conference on Computer-Aided Design and Computer Graphics, CAD/Graphics 2013 - Hong Kong, 中国
期限: 16 11月 201318 11月 2013

会议

会议13th International Conference on Computer-Aided Design and Computer Graphics, CAD/Graphics 2013
国家/地区中国
Hong Kong
时期16/11/1318/11/13

指纹

探究 'Network-based clustering and embedding for high-dimensional data visualization' 的科研主题。它们共同构成独一无二的指纹。

引用此