Skip to main navigation Skip to search Skip to main content

How contents influence clustering features in the web

  • Cheng Xueqi*
  • , Ren Fuxin
  • , Cao Xianbin
  • , Ma Jing
  • *Corresponding author for this work
  • CAS - Institute of Computing Technology
  • University of Science and Technology of China

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In World Wide Web, contents of web documents play important roles in the evolution process because of their effects on linking preference. A majority of topological properties are content-related, and among them the clustering features are sensitive to contents of Web documents. In this paper, we first observe the impacts of content similarity on web links by introducing a metric called Linkage Probability. Then we investigate how contents influence the formation mechanism of the most basic cluster, triangle, with a metric named Triangularization Probability. Experimental results indicate that content similarity has a positive function in the process of cluster formation in the Web. Theoretical analysis predicts the contents influence on the clustering features in the Web very well.

Original languageEnglish
Title of host publicationProceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007
Pages81-84
Number of pages4
DOIs
StatePublished - 2007
Externally publishedYes
EventIEEE/WIC/ACM International Conference on Web Intelligence, WI 2007 - Silicon Valley, CA, United States
Duration: 2 Nov 20075 Nov 2007

Publication series

NameProceedings of the IEEE/WIC/ACM International Conference on Web Intelligence, WI 2007

Conference

ConferenceIEEE/WIC/ACM International Conference on Web Intelligence, WI 2007
Country/TerritoryUnited States
CitySilicon Valley, CA
Period2/11/075/11/07

Fingerprint

Dive into the research topics of 'How contents influence clustering features in the web'. Together they form a unique fingerprint.

Cite this