skip to main content
10.1145/1265530.1265549acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
Article

On reconciling data exchange, data integration, and peer data management

Published:11 June 2007Publication History

ABSTRACT

Data exchange and virtual data integration have been the subject of several investigations in the recent literature. At the same time, the notion of peer data management has emerged as a powerful abstraction of many forms of flexible and dynamic data-centere ddistributed systems. Although research on the above issues has progressed considerably in the last years, a clear understanding on how to combine data exchange and data integration in peer data management is still missing. This is the subject of the present paper. We start our investigation by first proposing a novel framework for peer data exchange, showing that it is a generalization of the classical data exchange setting. We also present algorithms for all the relevant data exchange tasks, and show that they can all be done in polynomial time with respect to data complexity. Based on the motivation that typical mappings and integrity constraints found in data integration are not captured by peer data exchange, we extend the framework to incorporate these features. One of the main difficulties is that the constraints of this new class are not amenable to materialization. We address this issue by resorting to a suitable combination of virtual and materialized data exchange, showing that the resulting framework is a generalization of both classical data exchange and classical data integration, and that the new setting incorporates the most expressive types of mapping and constraints considered in the two contexts. Finally, we present algorithms for all the relevant data management tasks also in the new setting, and show that, again, their data complexity is polynomial.

References

  1. S. Abiteboul and O. Duschka. Complexity of answering queries using materialized views. In Proc. of PODS'98, pages 254--265, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison Wesley Publ. Co., 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Arenas, P. Barcelo, R. Fagin, and L. Libkin. Locally consistent transformations and query answering in data exchange. In Proc. of PODS 2004, pages 229--240, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Arenas, V. Kantere, A. Kementsietsidis, I. Kiringa, R. J. Miller, and J. Mylopoulos. The Hyperion pro ject: from data integration to data coordination. SIGMOD Record, 32(3):53--58, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Arenas and L. Libkin. XML data exchange: consistency and query answering. In Proc. of PODS 2005, pages 13--24, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. A. Bernstein, F. Giunchiglia, A. Kementsietsidis, J. Mylopoulos, L. Serafini, and I. Zaihrayeu. Data management for peer-to-peer computing: A vision. In Proc. of WebDB 2002, 2002.Google ScholarGoogle Scholar
  7. L. Bravo and L. Bertossi. Logic programming for consistently querying data integration systems. In Proc. of IJCAI 2003, pages 10--15, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. A. Calì, D. Calvanese, G. De Giacomo, and M. Lenzerini. Data integration under integrity constraints. Information Systems, 29:147--163, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Calì, D. Lembo, and R. Rosati. On the decidability and complexity of query answering over inconsistent and incomplete databases. In Proc. of PODS 2003, pages 260--271, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Calì, D. Lembo, and R. Rosati. Query rewriting and answering under constraints in data integration systems. In Proc. of IJCAI 2003, pages 16--21, 2003.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Calvanese, G. De Giacomo, D. Lembo, M. Lenzerini, and R. Rosati. Inconsistency tolerance in P2P data integration: an epistemic logic approach. In Proc. of DBPL 2005, pages 90--105, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Calvanese, G. De Giacomo, M. Lenzerini, and R. Rosati. Logical foundations of peer-to-peer data integration. In Proc. of PODS 2004, pages 241--251, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. P. Chatalic, G. -H. Nguyen, and M. -C. Rousset. Reasoning with inconsistencies in propositional Peer-to-Peer inference systems. In Proc. of ECAI 2006, pages 352--357, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Fagin, P. G. Kolaitis, R. J. Miller, and L. Popa. Data exchange: Semantics and query answering. Theor. Comp. Sci., 336(1):89--124, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Fagin, P. G. Kolaitis, and L. Popa. Data exchange: Getting to the core. ACM Trans. on Database Systems, 30(1):174--210, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Fagin, P. G. Kolaitis, L. Popa, and W. -C. Tan. Composing schema mappings: Second-order dependencies to the rescue. ACM Trans. on Database Systems, 30(4):994--1055, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. E. Franconi, G. Kuper, A. Lopatenko, and L. Serafini. A robust logical and computational characterisation of peer-to-peer database systems. In Proc. of the VLDB International Workshop On Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2003), 2003.Google ScholarGoogle Scholar
  18. A. Fuxman, P. G. Kolaitis, R. Miller, and W. C. Tan. Peer data exchange. In Proc. of PODS 2005, pages 160--171, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. G. Gottlob. Computing cores for data exchange: New algorithms and practical solutions. In Proc. of PODS 2005, pages 148--159, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. G. Gottlob and A. Nash. Data exchange: computing cores in polynomial time. In Proc. of PODS 2006, pages 40--49, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Gribble, A. Halevy, Z. Ives, M. Rodrig, and D. Suciu. What can databases do for peer-to-peer? In Proc. of WebDB 2001, 2001.Google ScholarGoogle Scholar
  22. A. Halevy, Z. Ives, D. Suciu, and I. Tatarinov. Schema mediation in peer data management systems. In Proc. of ICDE 2003, pages 505--516, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  23. A. Y. Halevy. Answering queries using views: A survey. VLDB Journal, 10(4):270--294, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Y. Halevy, A. Ra jaraman, and J. Ordille. Data integration: The teenage years. In Proc. of VLDB 2006, pages 9--16, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. P. G. Kolaitis. Schema mappings, data exchange, and metadata management. In Proc. of PODS 2005, pages 61--75, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. P. G. Kolaitis, J. Pantta ja, and W. C. Tan. The complexity of data exchange. In Proc. of PODS 2006, pages 30--39, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. M. Lenzerini. Data integration: A theoretical perspective. In Proc. of PODS 2002, pages 233--246, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. L. Libkin. Data exchange and incomplete information. In Proc. of PODS 2006, pages 60--69, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. J. Madhavan and A. Y. Halevy. Composing mappings among data sources. In Proc. of VLDB 2003, pages 572--583, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. R. Rosati. On the decidability and finite controllability of query processing in databases with incomplete information. In Proc. of PODS 2006, pages 356--365, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. I. Tatarinov and A. Halevy. Efficient query reformulation in peer data management. In Proc. of ACM SIGMOD, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. On reconciling data exchange, data integration, and peer data management

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              PODS '07: Proceedings of the twenty-sixth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
              June 2007
              328 pages
              ISBN:9781595936851
              DOI:10.1145/1265530

              Copyright © 2007 ACM

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 11 June 2007

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              Overall Acceptance Rate476of1,835submissions,26%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader
            About Cookies On This Site

            We use cookies to ensure that we give you the best experience on our website.

            Learn more

            Got it!