ABSTRACT
Form mapping is the key problem that needs to be solved in order to get access to the hidden web. Currently available solutions for fully automatic mapping are not ready for commercial meta-search engines, which still have to rely on hand crafted code and are hard to maintain.
We believe that a thorough formal description of the problem with semantic web technologies provides a promising perspective to develop a new class of vertical search engines that is more robust and easier to maintain than existing solutions.
In this paper, instead of trying to tackle the mapping problem, we model the interaction necessary to fill out a web form. First, during a user-assisted phase, the connection from the visible elements on the form to the domain concepts is established. Then, with help from background knowledge about the possible interaction steps, a plan for filling out the form is derived.
- Baumgartner, R., Ceresna, M. and Ledermueller, G. (2005). Deep Web Navigation in Web Data Extraction. In CIMCA--IAWTIC, p. 698--703. Google Scholar
Digital Library
- Chickenfoot for Firefox -- Rewrite the Web. http://groups.csail.mit.edu/uid/chickenfoot.Google Scholar
- He, B., Zhang Z. and Chen-Chuan, K. (2005). Towards building a metaquerier: Extracting and matching web query interfaces. In ICDE, p. 1098--1099. Google Scholar
Digital Library
- Meng, W., Peng, Q. (2004). Clustering e-commerce search engines based on their search interface pages using WISE-cluster. In WIDM, p. 231--246.Google Scholar
Index Terms
Exploiting semantic web technologies to model web form interactions
Recommendations
A flight meta-search engine with metamorph
WWW '09: Proceedings of the 18th international conference on World wide webWe demonstrate a flight meta-search engine that is based on the Metamorph framework. Metamorph provides mechanisms to model web forms together with the interactions which are needed to fulfil a request, and can generate interaction sequences that pose ...
A QIIIEP based domain specific hidden web crawler
ICWET '11: Proceedings of the International Conference & Workshop on Emerging Trends in TechnologyFor context based surfing of World Wide Web in a systematic and automatic manner, a web crawler is required. The World Wide Web consists interlinked documents and resources that are easily crawled by general web crawler, known as surface web crawler. ...
Semantic web technologies for the adaptive web
The adaptive webOntologies and reasoning are the key terms brought into focus by the semantic web community. Formal representation of ontologies in a common data model on the web can be taken as a foundation for adaptive web technologies as well. This chapter describes ...





Comments