Improving Efficiency of the Focused Web Crawler by Link Score Calculation

N. Senthil Kumar

Improving Efficiency of the Focused Web Crawler by Link Score Calculation

Authors

N. Senthil Kumar

Abstract

Abstract
The overwhelming size of the Web enables continuous support and recurring update of Web based information retrieval systems. The Crawler can narrow down the entire process to allow the hyperlinks in Web pages to download a part view of the Web. Meanwhile few systems absolutely depend on crawlers that exclusively crawls the web, and extract the topic specific collections. A focused crawler has the tendency to accumulate particular topics and target to gather relevant segments but not to consume resources on irrelevant material. The sole objective of the focused crawler is to fetch the maximal set of relevant and quality pages. In the proposed approach, the classifier task is to categorize the unvisited URL based on visited URLs attribute score and eliminating the unvisited URLs which are not relevant to the specific domain.

Keywords: crawlers, information retrieval, web mining, link score, robot

Downloads

Requires Subscription PDF

Published

2019-07-09

Issue

Vol. 4 No. 1 (2013)

Section

Articles

License

Declaration and Copyright Transfer Form

(to be completed by authors)

I/ We, the undersigned author(s) of the submitted manuscript, hereby declare, that the above manuscript which is submitted for publication in the STM Journals(s), is not published already in part or whole (except in the form of abstract) in any journal or magazine for private or public circulation, and, is not under consideration of publication elsewhere.

I/We will not withdraw the manuscript after 1 week of submission as I have read the Author Guidelines and will adhere to the guidelines.
I/We Author(s ) have niether given nor will give this manuscript elsewhere for publishing after submitting in STM Journal(s).
I/ We have read the original version of the manuscript and am/ are responsible for the thought contents embodied in it. The work dealt in the manuscript is my/ our own, and my/ our individual contribution to this work is significant enough to qualify for authorship.
I/We also agree to the authorship of the article in the following order:

Author’s name

1. ________________

2. ________________

3. ________________

4. ________________

We Author(s) tick this box and would request you to consider it as our signature as we agree to the terms of this Copyright Notice, which will apply to this submission if and when it is published by this journal.

Improving Efficiency of the Focused Web Crawler by Link Score Calculation

Authors

Abstract

Downloads

Published

Issue

Section

License

Developed By

Subscription

Language

Information