跳到主要导航 跳到搜索 跳到主要内容

Dirichlet aggregation: Unsupervised learning towards an optimal metric for proportional data

  • Hua Yan Wang*
  • , Hongbin Zha
  • , Hong Qin
  • *此作品的通讯作者
  • Peking University
  • Stony Brook University

科研成果: 会议稿件论文同行评审

摘要

Proportional data (normalized histograms) have been frequently occurring in various areas, and they could be mathematically abstracted as points residing in a geometric simplex. A proper distance metric on this simplex is of importance in many applications including classification and information retrieval. In this paper, we develop a novel framework to learn an optimal metric on the simplex. Major features of our approach include: 1) its flexibility to handle correlations among bins/dimensions; 2) widespread applicability without being limited to ad hoc backgrounds; and 3) a "real" global solution in contrast to existing traditional local approaches. The technical essence of our approach is to fit a parametric distribution to the observed empirical data in the simplex. The distribution is parameterized by affinities between simplex vertices, which is learned via maximizing likelihood of observed data. Then, these affinities induce a metric on the simplex, defined as the earth mover's distance equipped with ground distances derived from simplex vertex affinities.

源语言英语
959-966
页数8
DOI
出版状态已出版 - 2007
已对外发布
活动24th International Conference on Machine Learning, ICML 2007 - Corvalis, OR, 美国
期限: 20 6月 200724 6月 2007

会议

会议24th International Conference on Machine Learning, ICML 2007
国家/地区美国
Corvalis, OR
时期20/06/0724/06/07

指纹

探究 'Dirichlet aggregation: Unsupervised learning towards an optimal metric for proportional data' 的科研主题。它们共同构成独一无二的指纹。

引用此