skip to main content
10.1145/2213556.2213558acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
keynote

What next?: a half-dozen data management research goals for big data and the cloud

Published:21 May 2012Publication History

ABSTRACT

In this short paper, I describe six data management research challenges relevant for Big Data and the Cloud. Although some of these problems are not new, their importance is amplified by Big Data and Cloud Computing.

Skip Supplemental Material Section

Supplemental Material

p1-chaudhuri.mp4

References

  1. Acharya, S., Gibbons, P., Poosala, V., Ramaswamy, S.: Join Synopses for Approximate Query Answering. SIGMOD Conference 1999: 275--286. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Chaiken R. et. al.: SCOPE: easy and efficient parallel processing of massive data sets. PVLDB 1(2), 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Chaudhuri, S., Motwani, R., Narasayya, V..: On Random Sampling over Joins. SIGMOD Conference 1999: 263--274. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chaudhuri, S.: Query optimizers: time to rethink the contract? SIGMOD Conference 2009: 961--968. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Chaudhuri, S., Dayal, U., Narasayya, V. An Overview of Business Intelligence Technology. Communications of the ACM Vol. 54 No. 8, Pages 88--98. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Cheng T., Lauw H.W., Paparizos S.: Entity Synonyms for Structured Web Search, IEEE Trans. Knowledge and Data Eng., 2011.Google ScholarGoogle Scholar
  7. Dageville, B., Zait, M. SQL Memory Management in Oracle 9i. In Proceedings of VLDB 2002, Hong Kong, China. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Dean, J., Ghemawat, S.: MapReduce: a flexible data processing tool. Communications of the ACM 53(1): 72--77 (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Dwork, C., Differential Privacy. 33rd International Colloquium on Automata, Languages and Programming, part II (ICALP 2006), Springer Verlag, Venice, Italy, July 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Dwork,C., McSherry, F., Nissim,K., Smith, A. Calibrating noise to sensitivity in private data analysis. In Proceedings of the 3rd Theory of Cryptography Conference, pages 265--284, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Gonzalez, H., Halevy, A.Y., Jensen, C.S., Langen,A., Madhavan, J., Shapley, R., Shen, R., Goldberg-Kidon, J.: Google fusion tables: web-centered data management and collaboration. SIGMOD Conference 2010: 1061--1066 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Haas, P.J., Hellerstein, J.M.: Ripple Joins for Online Aggregation. SIGMOD Conference 1999: 287--298 Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Hellerstein, J.M., Haas, P.J., Wang, H.J.: Online Aggregation. SIGMOD Conference 1997: 171--182 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Hoffart J. et.al.: Robust Disambiguation of Named Entities in Text, EMNLP 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Kulkarni K., Singh A., Ramakrishnan G., Chakrabarti, S.: Collective Annotation of Wikipedia Entities in Web Text. KDD 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Lampson, B.: Privacy and security - Usable security: how to get it. Communications of the ACM 52(11): 25--27 (2009). Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. McSherry, F.: Privacy integrated queries: an extensible platform for privacy-preserving data analysis. SIGMOD Conference 2009: 19--30. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Olston C. et.al.: Pig Latin: a not-so-Foreign Language for Data Processing. SIGMOD'08. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Oracle Virtual Private Database (VPD). http://www.oracle.com.Google ScholarGoogle Scholar
  20. Stonebraker, M., Abadi, D.A., DeWitt, D.J., Madden, S., Paulson, E., Pavlo, A., Rasin, A.: MapReduce and parallel DBMSs: friends or foes? Communications of the ACM 53(1): 64--71 (2010). Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Storm et al. Adaptive Self-Tuning Memory in IBM DB2. In Proceedings of VLDB 2006, Seoul, Korea. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Thusoo, A. et al. Hive: a Warehousing Solution over a Map-Reduce Framework. PVLDB 2(2), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Wang C., Chakrabarti K, Cheng T., Chaudhuri S.: Targeted Disambiguation of Ad-hoc, Homogeneous Sets of Named Entities, WWW 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. What next?: a half-dozen data management research goals for big data and the cloud

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      PODS '12: Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI symposium on Principles of Database Systems
      May 2012
      332 pages
      ISBN:9781450312486
      DOI:10.1145/2213556

      Copyright © 2012 ACM

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 21 May 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • keynote

      Acceptance Rates

      Overall Acceptance Rate476of1,835submissions,26%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!