跳到主要导航 跳到搜索 跳到主要内容

Graph Regularized Non-negative Matrix Factorization with Long-tail Constraint

  • Lu You
  • , Rui Liu
  • , He Zhang
  • , Z. M. Shan
  • Beihang University
  • Tencent

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

How to dig out long tail topics is a great challenge in text mining. In previous research, most of non-hierarchical topic models were based on a hypothesis that the topics in documents follow polynomial distribution, ignoring the topics at the tail of distribution curve. Hierarchical topic model have the ability to mine long tail topics by introducing the hierarchical relationship among topics, but leading to a higher computational complexity. In this article, we propose a new method to mine long tail topics, which is called graph regularized non-negative matrix factorization with long-tail constraint. It uses KL divergence to measure the difference between matrices, and use neighbor graph to preserve the intrinsic geometrical and discriminating structure between original samples in low-dimensional space. Experiment shows, the algorithm we proposed can mine more long tail topic information in document, and make improvement in the task of data mining, comparing to other method, such as classical dirichlet distribution, non-negative matrix, hierarchical matrix, hierarchical latent dirichlet distribution.

源语言英语
主期刊名2019 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PACRIM 2019 - Proceedings
出版商Institute of Electrical and Electronics Engineers Inc.
ISBN(电子版)9781728127941
DOI
出版状态已出版 - 8月 2019
活动2019 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PACRIM 2019 - Victoria, 加拿大
期限: 21 8月 201923 8月 2019

出版系列

姓名2019 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PACRIM 2019 - Proceedings

会议

会议2019 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, PACRIM 2019
国家/地区加拿大
Victoria
时期21/08/1923/08/19

指纹

探究 'Graph Regularized Non-negative Matrix Factorization with Long-tail Constraint' 的科研主题。它们共同构成独一无二的指纹。

引用此