跳到主要导航 跳到搜索 跳到主要内容

Attend and select: A segment selective transformer for microblog hashtag generation

  • Qianren Mao
  • , Xi Li
  • , Bang Liu
  • , Shu Guo
  • , Peng Hao
  • , Jianxin Li*
  • , Lihong Wang
  • *此作品的通讯作者
  • Beihang University
  • University of Montreal
  • CENCERT/CC

科研成果: 期刊稿件文章同行评审

摘要

Hashtag generation aims to generate short and informal topical tags from a microblog post, in which tokens or phrases form the hashtags. These tokens or phrases may originate from primary fragmental textual pieces (e.g., segments) in the original text and are separated into different segments. However, conventional sequence-to-sequence generation methods are hard to filter out secondary information from different textual granularity and are not good at selecting crucial tokens. Thus, they are suboptimal in generating more condensed hashtags. In this work, we propose a modified Transformer-based generation model with adding a segments-selection procedure for the original encoding and decoding phases. The segments-selection phase is based on a novel Segments Selection Mechanism (SSM) to model different textual granularity on global text, local segments, and tokens, contributing to generate condensed hashtags. Specifically, it first attends to primary semantic segments and then transforms discontinuous segments from the source text into a sequence of hashtags by selecting crucial tokens. Extensive evaluations on the two datasets reveal our approach's superiority with significant improvements to the extraction and generation baselines. The code and datasets are available at https://github.com/OpenSUM/HashtagGen.

源语言英语
文章编号109581
期刊Knowledge-Based Systems
254
DOI
出版状态已出版 - 27 10月 2022

指纹

探究 'Attend and select: A segment selective transformer for microblog hashtag generation' 的科研主题。它们共同构成独一无二的指纹。

引用此