skip to main content
10.1145/2213556.2213582acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Query-based data pricing

Published:21 May 2012Publication History

ABSTRACT

Data is increasingly being bought and sold online, and Web-based marketplace services have emerged to facilitate these activities. However, current mechanisms for pricing data are very simple: buyers can choose only from a set of explicit views, each with a specific price. In this paper, we propose a framework for pricing data on the Internet that, given the price of a few views, allows the price of any query to be derived automatically. We call this capability "query-based pricing." We first identify two important properties that the pricing function must satisfy, called arbitrage-free and discount-free. Then, we prove that there exists a unique function that satisfies these properties and extends the seller's explicit prices to all queries. When both the views and the query are Unions of Conjunctive Queries, the complexity of computing the price is high. To ensure tractability, we restrict the explicit prices to be defined only on selection views (which is the common practice today). We give an algorithm with polynomial time data complexity for computing the price of any chain query by reducing the problem to network flow. Furthermore, we completely characterize the class of Conjunctive Queries without self-joins that have PTIME data complexity (this class is slightly larger than chain queries), and prove that pricing all other queries is NP-complete, thus establishing a dichotomy on the complexity of the pricing problem when all views are selection queries.

References

  1. http://gnip.com.Google ScholarGoogle Scholar
  2. http://www.patientslikeme.com.Google ScholarGoogle Scholar
  3. S. Abiteboul and O. M. Duschka. Complexity of answering queries using materialized views. In PODS, pages 254--263. ACM Press, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. S. Abiteboul, R. Hull, and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. F. N. Afrati. Rewriting conjunctive queries determined by views. In MFCS, pages 78--89, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. http://www.aggdata.com/.Google ScholarGoogle Scholar
  7. Using Amazon S3 Requester Pays with DevPay. http://docs.amazonwebservices.com/AmazonDevPay/latest/DevPayDeveloperGuide/index.html?S3RequesterPays.html.Google ScholarGoogle Scholar
  8. http://www.apollomapping.com/.Google ScholarGoogle Scholar
  9. https://datamarket.azure.com/.Google ScholarGoogle Scholar
  10. M. Balazinska, B. Howe, and D. Suciu. Data markets in the cloud: An opportunity for the database community. Proc. of the VLDB Endowment, 4(12), 2011.Google ScholarGoogle Scholar
  11. D. Calvanese, G. D. Giacomo, M. Lenzerini, and M. Y. Vardi. Lossless regular views. In L. Popa, editor, PODS, pages 247--258. ACM, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms, Second Edition. The MIT Press and McGraw-Hill Book Company, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. http://www.customlists.net/.Google ScholarGoogle Scholar
  14. D. Dash, V. Kantere, and A. Ailamaki. An economic model for self-tuned cloud caching. In Proc. of the 25th ICDE Conf., pages 1687--1693, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Dwork. A firm foundation for private data analysis. Commun. ACM, 54(1):86--95, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. S. Grumbach and L. Tininini. On the content of materialized aggregate views. J. Comput. Syst. Sci., 66(1):133--168, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. http://www.infochimps.com/.Google ScholarGoogle Scholar
  18. S. Jain and P. K. Kannan. Pricing of information products on online servers: Issues, models, and analysis. Management Science, 48(9):1123--1142, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Koutris, P. Upadhyaya, M. Balazinska, B. Howe, and D. Suciu. Query-based data pricing. Research Report UW-CSE-12-03-02, University of Washington, 2012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. L. Libkin. Elements of Finite Model Theory. Springer, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. M. Marx. Queries determined by views: pack your views. In L. Libkin, editor, PODS, pages 23--30. ACM, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Meliou, W. Gatterbauer, K. F. Moore, and D. Suciu. The complexity of causality and responsibility for query answers and non-answers. PVLDB, 4(1):34--45, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. A. Nash, L. Segoufin, and V. Vianu. Determinacy and rewriting of conjunctive queries using views: A progress report. In ICDT, pages 59--73, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Nash, L. Segoufin, and V. Vianu. Views and queries: Determinacy and rewriting. ACM Trans. Database Syst., 35(3), 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. B. Schneier. Secrets & Lies, Digital Security in a Networked World. John Wiley & Sons, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. L. Segoufin and V. Vianu. Views and queries: determinacy and rewriting. In C. Li, editor, PODS, pages 49--60. ACM, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. C. Shapiro and H. R. Varian. Versioning: The smart way to sell information. Harvard Business Review, 76:106--114, November-December 1998.Google ScholarGoogle Scholar
  28. Stonebraker et al. Mariposa: a wide-area distributed database system. VLDB Journal, 5(1):048--063, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. http://webscaled.com/.Google ScholarGoogle Scholar
  30. Web marketing. Google group forum post, http://groups.google.com/group/webmarketing/msg/c6643da409802f85.Google ScholarGoogle Scholar
  31. http://www.xignite.com/.Google ScholarGoogle Scholar
  32. Z. Zhang and A. O. Mendelzon. Authorization views and conditional query containment. In ICDT, pages 259--273, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Query-based data pricing

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        PODS '12: Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI symposium on Principles of Database Systems
        May 2012
        332 pages
        ISBN:9781450312486
        DOI:10.1145/2213556

        Copyright © 2012 ACM

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 21 May 2012

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        Overall Acceptance Rate476of1,835submissions,26%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!