Unsupervised template mining for semantic category understanding

  • Lei Shi
  • , Shuming Shi
  • , Chin Yew Lin
  • , Yi Dong Shen
  • , Yong Rui

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We propose an unsupervised approach to constructing templates from a large collection of semantic category names, and use the templates as the semantic representation of categories. The main challenge is that many terms have multiple meanings, resulting in a lot of wrong templates. Statistical data and semantic knowledge are extracted from a web corpus to improve template generation. A nonlinear scoring function is proposed and demonstrated to be effective. Experiments show that our approach achieves significantly better results than baseline methods. As an immediate application, we apply the extracted templates to the cleaning of a category collection and see promising results (precision improved from 81% to 89%).

Original languageEnglish
Title of host publicationEMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference
PublisherAssociation for Computational Linguistics (ACL)
Pages799-809
Number of pages11
ISBN (Electronic)9781937284961
DOIs
StatePublished - 2014
Externally publishedYes
Event2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014 - Doha, Qatar
Duration: 25 Oct 201429 Oct 2014

Publication series

NameEMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference

Conference

Conference2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014
Country/TerritoryQatar
CityDoha
Period25/10/1429/10/14

Fingerprint

Dive into the research topics of 'Unsupervised template mining for semantic category understanding'. Together they form a unique fingerprint.

Cite this