DOI: 10.1145/1516360.1516398

Exploiting the power of relational databases for efficient stream processing

Online: 24 March 2009

ABSTRACT

Stream applications have gained significant popularity in recent years, which has led to the development of specialized stream engines. These systems are designed from scratch with a different philosophy than today's database engines in order to cope with the requirements of stream applications. However, this means that they lack the power and sophisticated techniques of a full-fledged database system, which exploits techniques and algorithms accumulated over many years of database research.

In this paper, we take the opposite route and design a stream engine directly on top of a database kernel. Incoming tuples are stored directly upon arrival in a new kind of system table, called a basket. A continuous query can then be evaluated over its relevant baskets as a typical one-time query, exploiting the power of the relational engine. Once a tuple has been seen by all relevant queries/operators, it is dropped from its basket. A basket can be the input to a single query plan or to multiple similar query plans. Furthermore, a query plan can be split into multiple parts, each with its own input/output baskets, allowing for flexible load-sharing query scheduling. Unlike traditional stream engines, which process one tuple at a time, this model allows batch processing of tuples, e.g., querying a basket only after x tuples arrive or after a time threshold has passed. Furthermore, we are not restricted to processing tuples in the order they arrive. Instead, we can selectively pick tuples from a basket based on the query requirements, exploiting a novel query component, the basket expression.
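The basket model described above can be illustrated with a minimal in-memory sketch. This is not the DataCell implementation, which runs inside a column-oriented DBMS kernel. The names `Basket`, `consume`, `pending`, and `run_if_ready` are hypothetical, the predicate argument stands in loosely for a basket expression, and the sketch assumes that a query "sees" every pending tuple each time it runs, whether or not the tuple matches its predicate.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List, Optional, Set

Row = Dict[str, Any]


@dataclass
class Basket:
    """Hypothetical stand-in for a basket: a table holding not-yet-consumed tuples."""
    readers: Set[str]                                   # queries that must see each tuple
    rows: List[Row] = field(default_factory=list)
    seen_by: List[Set[str]] = field(default_factory=list)

    def append(self, row: Row) -> None:
        """Store an incoming tuple upon arrival."""
        self.rows.append(row)
        self.seen_by.append(set())

    def pending(self, query: str) -> int:
        """Number of tuples this query has not seen yet."""
        return sum(1 for seen in self.seen_by if query not in seen)

    def consume(self, query: str, predicate: Callable[[Row], bool]) -> List[Row]:
        """Evaluate one batch: mark pending tuples as seen, return those matching."""
        picked = []
        for row, seen in zip(self.rows, self.seen_by):
            if query in seen:
                continue
            seen.add(query)          # assumption: running the query counts as seeing the tuple
            if predicate(row):
                picked.append(row)
        # Drop tuples that every registered reader has now seen.
        kept = [(r, s) for r, s in zip(self.rows, self.seen_by) if not self.readers <= s]
        self.rows = [r for r, _ in kept]
        self.seen_by = [s for _, s in kept]
        return picked


def run_if_ready(basket: Basket, query: str,
                 predicate: Callable[[Row], bool], min_batch: int) -> Optional[List[Row]]:
    """Batch trigger: run the query only once at least min_batch tuples are pending."""
    if basket.pending(query) < min_batch:
        return None
    return basket.consume(query, predicate)


# Two queries share one basket; each fires only after three tuples are pending.
basket = Basket(readers={"q_fast", "q_all"})
for speed in (30, 95, 50, 120):
    basket.append({"speed": speed})

fast = run_if_ready(basket, "q_fast", lambda r: r["speed"] > 90, min_batch=3)
everything = run_if_ready(basket, "q_all", lambda r: True, min_batch=3)
```

After `q_fast` runs, all four tuples stay in the basket because `q_all` has not yet seen them. Once `q_all` runs, every tuple has been seen by both readers and the basket is emptied. A time-based trigger could replace the count threshold in `run_if_ready` without changing the rest of the sketch.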

We investigate the opportunities and challenges that arise with such a direction and show that it carries significant advantages. We propose a complete architecture, the DataCell, which we implemented on top of an open-source column-oriented DBMS. A detailed analysis and experimental evaluation of the core algorithms, using both micro-benchmarks and the standard Linear Road benchmark, demonstrate the potential of this new approach.

Published in

EDBT '09: Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
March 2009
1180 pages
ISBN: 9781605584225
DOI: 10.1145/1516360

Copyright © 2009 ACM

Publisher: Association for Computing Machinery, New York, NY, United States
