Skip to main navigation Skip to search Skip to main content

An incremental Algorithm for clustering search results

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

When internet users are facing massive search results, document clustering techniques are very helpful. Generally, existing clustering methods start with a known set of data objects, measured against a known set of attributes. However, there are numerous applications where the attribute set can only obtained gradually as processing data objects incrementally. This paper presents an incremental clustering algorithm (ICA) for clustering search results, which relies on pair-wise search result similarity calculated using Jaccard method. We use a measure namely, Cluster Average Similarity Area to score cluster cohesiveness. Experimental results show that our algorithm leads to less computational time than traditional clustering method while achieving a comparable or better clustering quality.

Original languageEnglish
Title of host publicationSITIS 2008 - Proceedings of the 4th International Conference on Signal Image Technology and Internet Based Systems
Pages112-117
Number of pages6
DOIs
StatePublished - 2008
Event4th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2008 - Bali, Indonesia
Duration: 30 Nov 20083 Dec 2008

Publication series

NameSITIS 2008 - Proceedings of the 4th International Conference on Signal Image Technology and Internet Based Systems

Conference

Conference4th International Conference on Signal Image Technology and Internet Based Systems, SITIS 2008
Country/TerritoryIndonesia
CityBali
Period30/11/083/12/08

Fingerprint

Dive into the research topics of 'An incremental Algorithm for clustering search results'. Together they form a unique fingerprint.

Cite this