DOI: 10.1145/2795218.2795221
short-paper

Data Like This: Ranked Search of Genomic Data Vision Paper

Published: 31 May 2015

ABSTRACT

High-throughput genetic sequencing produces the ultimate "big data": a human genome sequence contains more than 3B base pairs, and more and more characteristics, or annotations, are being recorded at the base-pair level. Locating areas of interest within the genome is a challenge for researchers, limiting their investigations. We describe our vision of adapting "big data" ranked search to the problem of searching the genome. Our goal is to make searching for data as easy for scientists as searching the Internet.


        • Published in

          ExploreDB '15: Proceedings of the Second International Workshop on Exploratory Search in Databases and the Web
          May 2015
          37 pages
          ISBN: 9781450337403
          DOI: 10.1145/2795218

          Copyright © 2015 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States



          Qualifiers

          • short-paper
          • Research
          • Refereed limited

          Acceptance Rates

          ExploreDB '15 Paper Acceptance Rate: 6 of 10 submissions, 60%
          Overall Acceptance Rate: 11 of 21 submissions, 52%
