ABSTRACT
In the study of data exchange one usually assumes an open-world semantics, making it possible to extend instances of target schemas. An alternative closed-world semantics only moves 'as much data as needed' from the source to the target to satisfy constraints of a schema mapping. It avoids some of the problems exhibited by the open-world semantics, but limits the expressivity of schema mappings. Here we propose a mixed approach: one can designate different attributes of target schemas as open or closed, to combine the additional expressivity of the open-world semantics with the better behavior of query answering in closed worlds.
We define such schema mappings, and show that they cover a large space of data exchange solutions, with the two extremes being the known open-world and closed-world semantics. We investigate the problems of query answering and schema mapping composition, and prove two trichotomy theorems, classifying their complexity based on the number of open attributes. We find conditions under which schema mappings compose, extending known results to a wide range of closed-world mappings. We also provide results for restricted classes of queries and mappings that guarantee lower complexity.