Abstract
We consider the problem of XML Schema evolution. In the ever-changing context of the web, XML schemas continuously change in order to cope with the natural evolution of the entities they describe. Schema changes have important consequences. First, existing documents valid with respect to the original schema are no longer guaranteed to fulfill the constraints described by the evolved schema. Second, the evolution also impacts programs, manipulating documents whose structure is described by the original schema.
We propose a unifying framework for determining the effects of XML Schema evolution both on the validity of documents and on queries. The system is very powerful in analyzing various scenarios in which forward/backward compatibility of schemas is broken, and in which the result of a query may no longer be what was expected. Specifically, the system offers a predicate language that allows one to formulate properties related to schema evolution. The system then relies on exact reasoning techniques to perform a fine-grained analysis. This yields either a formal proof of the property or a counter-example that can be used for debugging purposes. The system has been fully implemented and tested with real-world use cases, in particular with the main standard document formats used on the web, as defined by W3C. The system precisely identifies compatibility relations between document formats. In case these relations do not hold, the system can identify queries that must be reformulated in order to produce the expected results across successive schema versions.
- Benedikt, M. and Koch, C. 2009. XPath leashed. ACM Comput. Surv. 41, 3:1--3:54. Google Scholar
Digital Library
- Benedikt, M., Fan, W., and Geerts, F. 2005. XPath satisfiability in the presence of DTDs. In Proceedings of the 24th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS’05). ACM, New York, 25--36. Google Scholar
Digital Library
- Benzaken, V., Castagna, G., and Frisch, A. 2003. CDuce: An XML-centric general-purpose language. In Proceedings of the 8th ACM SIGPLAN International Conference on Functional Programming (ICFP’03). ACM, New York, 51--63. Google Scholar
Digital Library
- Beyer, K., Özcan, F., Saiprasad, S., and der Linden, B. V. 2005. DB2/XML: Designing for evolution. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’05). ACM, New York. Google Scholar
Digital Library
- Castagna, G. and Nguyen, K. 2008. Typed iterators for XML. In Proceedings of the ACM SIGPLAN International Conference on Functional Programming (ICFP’08). ACM, New York, 15--26. Google Scholar
Digital Library
- Clark, J. and DeRose, S. 1999. XML path language (XPath) version 1.0, W3C recommendation. http://www.w3.org/TR/1999/REC-xpath-19991116.Google Scholar
- Colazzo, D., Ghelli, G., Manghi, P., and Sartiani, C. 2004. Types for path correctness of XML queries. In Proceedings of the 9th ACM SIGPLAN International Conference on Functional Programming (ICFP’04). ACM, New York, 126--137. Google Scholar
Digital Library
- Colazzo, D., Ghelli, G., Manghi, P., and Sartiani, C. 2006. Static analysis for path correctness of XML queries. J. Functional Prog. 16, 4-5, 621--661. Google Scholar
Digital Library
- Gapeyev, V., Garillot, F., and Pierce, B. C. 2006. Statically typed document transformation: An Xtatic experience. In Proceedings of the International Workshop on Programming Language Technologies for XML (PLAN-X 2006). BRICS Notes Series NS-05-6, BRICS, Aarhus, Denmark, 2--13.Google Scholar
- Genevès, P. 2006. Logics for XML. Ph.D. dissertation, Institut National Polytechnique de Grenoble. http://www.pierresoft.com/pierre.geneves/phd.htm.Google Scholar
- Genevès, P. and Layaïda, N. 2009. The XML reasoning solver project. http://wam.inrialpes.fr/xml.Google Scholar
- Genevès, P., Layaïda, N., and Schmitt, A. 2007. Efficient static analysis of XML paths and types. In Proceedings of the ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI’07). ACM, New York, 342--351. Google Scholar
Digital Library
- Genevès, P., Layaïda, N., and Schmitt, A. 2008. Efficient static analysis of XML paths and types. Res. rep. 6590, INRIA.Google Scholar
- Genevès, P., Layaïda, N., and Quint, V. 2009. Identifying query incompatibilities with evolving XML schemas. In Proceedings of the ACM SIGPLAN International Conference on Functional Programming (ICFP’09). ACM, New York, 221--230. Google Scholar
Digital Library
- Groppe, J. and Groppe, S. 2008. Filtering unsatisfiable XPath queries. Data Knowl. Eng. 64, 1, 134--169. Google Scholar
Digital Library
- Groppe, S., Bottcher, S., and Groppe, J. 2006. XPath query simplification with regard to the elimination of intersect and except operators. In Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW’06). IEEE, Los Alamitos, CA, 86. Google Scholar
Digital Library
- Hosoya, H. and Pierce, B. C. 2003. XDuce: A statically typed XML processing language. ACM Trans. Internet Technol. 3, 2, 117--148. Google Scholar
Digital Library
- Hosoya, H., Vouillon, J., and Pierce, B. C. 2005. Regular expression types for XML. ACM Trans. Program. Lang. Syst. 27, 1, 46--90. Google Scholar
Digital Library
- Møller, A. and Schwartzbach, M. I. 2005. The design space of type checkers for XML transformation languages. In Proceedings of the 10th International Conference on Database Theory (ICDT’05). Lecture Notes in Computer Science, vol. 3363, Springer, Berlin, 17--36. Google Scholar
Digital Library
- Moon, H. J., Curino, C. A., Deutsch, A., and Hou, C.-Y. 2008. Managing and querying transaction-time databases under schema evolution. Proc. VLDB Endow. 1, 1. 882--895. Google Scholar
Digital Library
- Moro, M. M., Malaika, S., and Lim, L. 2007. Preserving XML queries during schema evolution. In Proceedings of the 16th International Conference on the World Wide Web (WWW’07). ACM, 1341--1342. Google Scholar
Digital Library
- Murata, M., Lee, D., Mani, M., and Kawaguchi, K. 2005. Taxonomy of XML schema languages using formal language theory. ACM Trans. Internet Technol. 5, 4, 660--704. Google Scholar
Digital Library
- Pietriga, E. 2005. MathML content 2 presentation transformation. http://www.lri.fr/~pietriga/mathmlc2p/mathmlc2p.html.Google Scholar
- Rose, K. H. 2004. The XML worldview. In Proceedings of the ACM Symposium on Document Engineering (DocEng’04). ACM, New York, 34--34. Google Scholar
Digital Library
- Sedlar, E. 2005. Managing structure in bits & pieces: The killer use case for XML. In Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD’05). ACM, New York. 818--821. Google Scholar
Digital Library
- Thomas, W. 1990. Automata on infinite objects. In Handbook of Theoretical Computer Science: Formal Models and Semantics (vol. B). MIT Press, Cambridge, MA, 133--191. Google Scholar
Digital Library
- Wadler, P. 2000. Two semantics for XPath. Internal tech. note of the W3C XSL Working Group, http://homepages.inf.ed.ac.uk/wadler/papers/xpath-semantics/xpath-semantics.pdf.Google Scholar
- Yu, C. and Popa, L. 2005. Semantic adaptation of schema mappings when schemas evolve. In Proceedings of the 31st International Conference on Very large Data Bases (VLDB’05). VLDB Endowment, 1006--1017. Google Scholar
Digital Library
Index Terms
Impact of XML Schema Evolution
Recommendations
XML materialized views and schema evolution in VIREX
Web-based databases are gaining increased popularity. This has positively influenced the availability of structured and semi-structured databases for access by a variety of users ranging from professionals to naive users. The number of users accessing ...
Impact of XML schema evolution on valid documents
WIDM '05: Proceedings of the 7th annual ACM international workshop on Web information and data managementIn this paper we investigate the problem of XML Schema evolution. We first discuss the different kinds of changes that may be needed on an XML Schema. Then, we investigate how to minimize document revalidation, that is, detecting the document parts ...
Regular expression transformations to extend regular languages (with application to a Datalog XML schema validator)
An XML schema is a set of rules for defining the allowed sub-elements of any element in an XML document. These rules use regular expressions to define the language of the element's children. Updates to an XML schema are updates to the regular ...






Comments