Retrieval of Data from Hidden Web using Web Agent

Das N. N.; Kumar E.

Retrieval of Data from Hidden Web using Web Agent

Authors

Das N. N.
Kumar E.

Abstract

Abstract
There is an increase in the number of data sources that can be queried across the World Wide Web (WWW). Such sources typically support HTML forms-based interfaces and search engines query collections of suitably indexed data. The data are displayed via a browser. As the Web is growing in recent years, huge amount of data are available under dynamic forms of publication, which are accessed by an HTML Form, known as legacy databases (known as hidden Web). To handle such types of situation fast generation of agents is required that can automatically fetch pages for further processing. In this paper, a method is explained for automatically generating agents to collect hidden Web pages (deep web pages). In this method a preexisting data repository is used for identifying the contents of these web pages. Number of experiments has been carried out with sites from different domains to show the accuracy of method.

Keywords: Wrapper, hidden web, web agent, WWW, repository

Downloads

Requires Subscription PDF

Published

2019-07-09

Issue

Vol. 5 No. 2 (2014)

Section

Articles

License

Declaration and Copyright Transfer Form

(to be completed by authors)

I/ We, the undersigned author(s) of the submitted manuscript, hereby declare, that the above manuscript which is submitted for publication in the STM Journals(s), is not published already in part or whole (except in the form of abstract) in any journal or magazine for private or public circulation, and, is not under consideration of publication elsewhere.

I/We will not withdraw the manuscript after 1 week of submission as I have read the Author Guidelines and will adhere to the guidelines.
I/We Author(s ) have niether given nor will give this manuscript elsewhere for publishing after submitting in STM Journal(s).
I/ We have read the original version of the manuscript and am/ are responsible for the thought contents embodied in it. The work dealt in the manuscript is my/ our own, and my/ our individual contribution to this work is significant enough to qualify for authorship.
I/We also agree to the authorship of the article in the following order:

Author’s name

1. ________________

2. ________________

3. ________________

4. ________________

We Author(s) tick this box and would request you to consider it as our signature as we agree to the terms of this Copyright Notice, which will apply to this submission if and when it is published by this journal.

Retrieval of Data from Hidden Web using Web Agent

Authors

Abstract

Downloads

Published

Issue

Section

License

Developed By

Subscription

Language

Information