Abstract
With the availability of reliable and low-cost DNA sequencing, human genomics is relevant to a growing number of end-users, including biologists and clinicians. Typical interactions require applying comparative data analysis to huge repositories of genomic information for building new knowledge, taking advantage of the latest findings in applied genomics for healthcare. Powerful technology for data extraction and analysis is available, but broad use of the technology is hampered by the complexity of accessing such methods and tools.
This work presents GeCoAgent, a big-data service for clinicians and biologists. GeCoAgent uses a dialogic interface, animated by a chatbot, for supporting the end-users’ interaction with computational tools accompanied by multi-modal support. While the dialogue progresses, the user is accompanied in extracting the relevant data from repositories and then performing data analysis, which often requires the use of statistical methods or machine learning. Results are returned using simple representations (spreadsheets and graphics), while at the end of a session the dialogue is summarized in textual format. The innovation presented in this article is concerned with not only the delivery of a new tool but also our novel approach to conversational technologies, potentially extensible to other healthcare domains or to general data science.
- [1] . 2015. GenoMetric query language: A novel approach to large-scale genomic data management. Bioinformatics 31, 12 (2015), 1881–1888.Google Scholar
Cross Ref
- [2] . 2019. Processing of big heterogeneous genomic datasets for tertiary analysis of next generation sequencing data. Bioinformatics 35, 5 (2019), 729–736.Google Scholar
Cross Ref
- [3] . 2020. META-BASE: A novel architecture for large-scale genomic metadata integration. IEEE/ACM Transactions on Computational Biology and Bioinformatics (2020), 1–1.
DOI: https://doi.org/10.1109/TCBB.2020.2998954Google Scholar - [4] . 2019. GenoSurf: Metadata driven semantic search system for integrated genomic datasets. Database: The Journal of Biological Databases and Curation 2019 (2019).
DOI: https://doi.org/10.1093/database/baz132Google Scholar - [5] . 2020. Bioinformatics. John Wiley & Sons.Google Scholar
- [6] . 2010. A hitchhiker’s guide to Next Generation Sequencing - Part 2. Retrieved May 1, 2021, from https://blog.goldenhelix.com/a-hitchhikers-guide-to-next-generation-sequencing-part-2/.Google Scholar
- [7] . 2021. The road towards data integration in human genomics: Players, steps and interactions. Briefings in Bioinformatics 22, 1 (2021), 30–44. http://dx.doi.org/10.1093/bib/bbaa080Google Scholar
- [8] . 2017. Overview of GeCo: A project for exploring and integrating signals from the genome. In International Conference on Data Analytics and Management in Data Intensive Domains. Springer, 46–57.Google Scholar
- [9] . 2015. Who qualifies to be a bioinformatician?Frontiers in Genetics 6 (2015), 164.Google Scholar
- [10] . 2013. Orange: Data mining toolbox in python. Journal of Machine Learning Research 14 (2013), 2349–2353. http://jmlr.org/papers/v14/demsar13a.html.Google Scholar
Digital Library
- [11] . 2020. Visualizing and interpreting cancer genomics data via the Xena platform. Nature Biotechnology 38 (2020), 675–678.Google Scholar
- [12] . 2014. Experiences building globus genomics: A next-generation sequencing analysis service using galaxy, globus, and amazon web services. Concurrency and Computation: Practice and Experience 26, 13 (2014), 2266–2279.Google Scholar
Digital Library
- [13] . 2009. Better bioinformatics through usability analysis. Bioinformatics 25, 3 (2009), 406–412.Google Scholar
Digital Library
- [14] . 2018. Conversational agents in healthcare: A systematic review. Journal of the American Medical Informatics Association 25, 9 (2018), 1248–1258.Google Scholar
Cross Ref
- [15] . 1950. Mind. Mind 59, 236 (1950), 433–460.Google Scholar
Cross Ref
- [16] . 1966. ELIZA—A computer program for the study of natural language communication between man and machine. Communications of the ACM 9, 1 (1966), 36–45.Google Scholar
Digital Library
- [17] . 2009. The anatomy of ALICE. In Parsing the Turing Test. Springer, 181–210.Google Scholar
- [18] . 1975. Artificial Paranoia: A Computer Simulation of Paranoid Process. Pergamon Press.Google Scholar
- [19] . 2003. The elements of AIML style. Alice AI Foundation 139 (2003).Google Scholar
- [20] . 2017. Rasa: Open source language understanding and dialogue management. ArXiv
DOI: https://arxiv.org/abs/1712.05181.Google Scholar - [21] . 2019. Would you like a chart with that? Incorporating visualizations into conversational interfaces. In 2019 IEEE Visualization Conference (VIS’19). IEEE, 1–5.Google Scholar
- [22] . 2007. Plow: A collaborative task learning agent. In AAAI, Vol. 7. Association for the Advancement of Artificial Intelligence, 1514–1519. https://www.semanticscholar.org/paper/PLOW%3A-A-Collaborative-Task-Learning-Agent-Allen-Chambers/431e61648a59abcd05411503ead56de8aa97906b.Google Scholar
Digital Library
- [23] . 2017. Why people use chatbots. In International Conference on Internet Science. Springer, 377–392.Google Scholar
- [24] . 2010. Evaluating the usability of natural language query languages and interfaces to semantic web knowledge bases. Journal of Web Semantics 8, 4 (2010), 377–393.Google Scholar
Digital Library
- [25] . 2017. Chatbots and the new world of HCI. Interactions 24, 4 (2017), 38–42.Google Scholar
Digital Library
- [26] . 1999. Ten myths of multimodal interaction. Communications of the ACM 42, 11 (1999), 74–81.Google Scholar
Digital Library
- [27] . 2019. PUMICE: A multi-modal agent that learns concepts and conditionals from natural language and demonstrations. In Proceedings of the 32nd Annual ACM Symposium on User Interface Software and Technology. 577–589.Google Scholar
Digital Library
- [28] . 2015. Datatone: Managing ambiguity in natural language interfaces for data visualization. In Proceedings of the 28th Annual ACM Symposium on User Interface Software & Technology. 489–500.Google Scholar
Digital Library
- [29] . 2017. Analyza: Exploring data with conversation. In Proceedings of the 22nd International Conference on Intelligent User Interfaces. 493–504.Google Scholar
- [30] . 2017. Applying pragmatics principles for interaction with visual analytics. IEEE Transactions on Visualization and Computer Graphics 24, 1 (2017), 309–318.Google Scholar
- [31] . 2019. Do what I mean, not what I say! Design considerations for supporting intent and context in analytical conversation. In 2019 IEEE Conference on Visual Analytics Science and Technology (VAST’19). IEEE, 93–103.Google Scholar
- [32] . 1999. Microsoft English query 7.5: Automatic extraction of semantics from relational databases and OLAP cubes. In VLDB, Vol. 99. 247–248.Google Scholar
- [33] . 2003. Towards a theory of natural language interfaces to databases. In Proceedings of the 8th International Conference on Intelligent User Interfaces. 149–157.Google Scholar
Digital Library
- [34] . 2016. ATHENA: An ontology-driven system for natural language querying over relational data stores. Proceedings of the VLDB Endowment 9, 12 (2016), 1209–1220.Google Scholar
- [35] . 2017. BioGraphBot: A conversational assistant for bioinformatics graph databases. In International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing. Springer, 135–146.Google Scholar
- [36] . 2016. BioGraphDB: A new GraphDB collecting heterogeneous data for bioinformatics analysis. In Proceedings of BIOTECHNO.Google Scholar
- [37] . 2019. Development and usability evaluation of a prototype conversational interface for biological information retrieval via bioinformatics. In International Conference on Human-Computer Interaction. Springer, 575–593.Google Scholar
- [38] . 2017. Ava: From data to insights through conversations.. In CIDR.Google Scholar
- [39] . 1995. Specifying logic programs in controlled natural language. arXiv preprint cmp-lg/9507009 (1995).Google Scholar
- [40] . 2018. Iris: A conversational agent for complex tasks. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. 1–12.Google Scholar
- [41] . 1990. Meaning and Speech Acts: Volume 1, Principles of Language Use. Cambridge University Press.Google Scholar
Digital Library
- [42] . 2016. Modeling and interoperability of heterogeneous genomic big data for integrative processing and querying. Methods 111 (2016), 3–11.Google Scholar
Cross Ref
- [43] . 2017. Conceptual modeling for genomics: Building an integrated repository of open data. In Conceptual Modeling, , , , and (Eds.). Springer International Publishing, Cham, 325–339.Google Scholar
Cross Ref
- [44] . 2019. From a conceptual model to a knowledge graph for genomic datasets. In Conceptual Modeling, , , , and (Eds.). Springer International Publishing, Cham, 352–360.Google Scholar
Cross Ref
- [45] . 2018. Building chatbots with python. In Using Natural Language Processing and Machine Learning. Apress.Google Scholar
- [46] . 2018. Towards a french smart-home voice command corpus: Design and NLU experiments. In International Conference on Text, Speech, and Dialogue. Springer, 509–517.Google Scholar
- [47] . 2020. Context-aware conversational agent for a closed domain task. In Proceedings of the Global AI Congress 2019. Springer, 303–318.Google Scholar
- [48] . 2019. PyGMQL: Scalable data extraction and analysis for heterogeneous genomic datasets. BMC Bioinformatics 20, 1 (2019), 560.Google Scholar
- [49] . 2000. Thinking aloud: Reconciling theory and practice. IEEE Transactions on Professional Communication 43, 3 (2000), 261–278.Google Scholar
Cross Ref
- [50] . 2011. Applied Thematic Analysis. Sage Publications.Google Scholar
Index Terms
GeCoAgent: A Conversational Agent for Empowering Genomic Data Extraction and Analysis
Recommendations
Conversational Agents: Acting on the Wave of Research and Development
CHI EA '19: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing SystemsIn the last five years, work on software that interacts with people via typed or spoken natural language, called chatbots, intelligent assistants, social bots, virtual companions, non-human players, and so on, increased dramatically. Chatbots burst into ...
Embodied conversational agents in Wizard-of-Oz and multimodal interaction applications
COST 2102'07: Proceedings of the 2007 COST action 2102 international conference on Verbal and nonverbal communication behavioursEmbodied conversational agents employed in multimodal interaction applications have the potential to achieve similar properties as humans in faceto-face conversation. They enable the inclusion of verbal and nonverbal communication. Thus, the degree of ...
Multiagent system for joke generation: Humor and emotions combined in human-agent conversation
In this paper we present an innovative work on a multiagent joking conversational system. In our research so far we have shown that implementing humor into a chatterbot can visibly improve its performance. The results presented in this paper are the ...






Comments