Adaptive focused crawler based on tunneling and link analysis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

At present, using focused crawler becomes a way to seek the needed information. The main characteristic of a focused web crawler is to select and retrieve only relevant web pages in each crawling process. In this paper, we propose a learnable algorithm that combines link analysis with web content in order to retrieve specific web documents, and it can predict the next URL through learning. The algorithm also uses an adaptive tunneling to overcome some of the limitations of normal focused crawlers. We apply three metrics to compare its efficiency with other weD-known web crawling techniques based.

Original languageEnglish
Title of host publication11th International Conference on Advanced Communication Technology, ICACT 2009 - Proceedings
Pages2225-2230
Number of pages6
StatePublished - 2009
Event11th International Conference on Advanced Communication Technology, ICACT 2009 - Phoenix Park, Korea, Republic of
Duration: 15 Feb 200918 Feb 2009

Publication series

NameInternational Conference on Advanced Communication Technology, ICACT
Volume3
ISSN (Print)1738-9445

Conference

Conference11th International Conference on Advanced Communication Technology, ICACT 2009
Country/TerritoryKorea, Republic of
CityPhoenix Park
Period15/02/0918/02/09

Fingerprint

Dive into the research topics of 'Adaptive focused crawler based on tunneling and link analysis'. Together they form a unique fingerprint.

Cite this