Abstract
During the life cycle of an XML application, both schemas and queries may change from one version to another. Schema evolutions may affect query results and potentially the validity of produced data. A current challenge is to assess and accommodate the impact of these changes in evolving XML applications. Such questions arise naturally in XML static analyzers, which often rely on decision procedures such as inclusion between XML schemas, query containment, and satisfiability. However, existing decision procedures cannot be used directly in this context, because they are unable to distinguish information related to the evolution from information corresponding to bugs. This paper proposes a predicate language within a logical framework that can be used to make this distinction.
We present a system for monitoring the effect of schema evolutions on the set of admissible documents and on the results of queries. The system can analyze a variety of scenarios in which the result of a query may no longer be what was expected. Specifically, it is based on a set of predicates that allow a fine-grained analysis of a wide range of forward and backward compatibility issues. Moreover, the system can produce counterexamples and witness documents, which are useful for debugging. The current implementation has been tested with realistic use cases, where it identifies queries that must be reformulated in order to produce the expected results across successive schema versions.
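To make the notion of a witness document concrete, here is a minimal Python sketch. It is not the system described in the abstract, which reasons statically over all documents admitted by a schema. It only evaluates one query on two hand-written documents. The documents, the element names (`book`, `meta`, `title`), and the helper functions `select` and `is_witness` are all hypothetical and chosen purely for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical documents (assumptions, not taken from the paper):
# DOC_V1 stands for a document valid under an old schema version,
# DOC_V2 for one valid under a new version that wraps <title> in <meta>.
DOC_V1 = "<book><title>XPath</title><chapter/></book>"
DOC_V2 = "<book><meta><title>XPath</title></meta><chapter/></book>"

# A query written against the old schema version.
QUERY = "./title"


def select(doc, path):
    """Return the text of every element that `path` selects from the root."""
    root = ET.fromstring(doc)
    return [node.text for node in root.findall(path)]


def is_witness(old_doc, new_doc, path):
    """True when the query returns different results on the two documents."""
    return select(old_doc, path) != select(new_doc, path)


old_result = select(DOC_V1, QUERY)               # query on the old version
new_result = select(DOC_V2, QUERY)               # same query on the new version
reformulated = select(DOC_V2, "./meta/title")    # query adapted to the new version

print(old_result, new_result, reformulated)
print("query needs reformulation:", is_witness(DOC_V1, DOC_V2, QUERY))
```

In this toy case the unchanged query silently selects nothing on the new-version document, which is the kind of situation a witness document is meant to expose. A static analyzer would aim to find such a document automatically from the two schemas and the query rather than rely on hand-picked inputs.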
Identifying query incompatibilities with evolving XML schemas
ICFP '09: Proceedings of the 14th ACM SIGPLAN international conference on Functional programming