Abstract
Keyword queries are a user-friendly mechanism for retrieving useful information from XML data in Web and scientific applications. Inspired by the performance benefits of exploiting materialized views when processing structured queries, we investigate the feasibility of answering XML keyword queries using materialized views and present a general framework for doing so. We then develop an XML keyword search engine that leverages materialized views for query evaluation and maintains them incrementally when the XML data is updated. Experimental evaluation demonstrates the significance and efficiency of our approach.
Exploiting and Maintaining Materialized Views for XML Keyword Queries