Abstract
Event extraction is one of the crucial tasks in biomedical text mining that aims to extract specific information concerning incidents embedded in the texts. In this article, we propose a deep learning framework that aims to identify the attributes (severity, course, temporal expression, and document creation time) associated with the medical concepts extracted from electronic medical records. The bi-directional long short-term memory network assisted by the attention mechanism is utilized to uncover the important aspects of the patient’s medical conditions. The attention mechanism specific to the medical disorder mention can focus on various parts of the sentence when different disorders are considered as input. The proposed methodology is evaluated on benchmark ShARe/CLEF eHealth Evaluation Lab 2014 shared task 2 datasets. In addition to the CLEF dataset, we also used the social media text, especially the medical blog posts. Experimental results of the proposed approach illustrate that our proposed approach achieves significant performance improvements over the state-of-the-art techniques and the highly competitive deep learning--based baseline methods.
Supplemental Material
Available for Download
Supplemental movie, appendix, image and software files for, Exploring Disorder-Aware Attention for Clinical Event Extraction
- Susannah Fox. 2011. The Social Life of Health Information, 2011. Pew Internet 8 American Life Project, Washington, DC.Google Scholar
- Mark Dredze and Michael J. Paul. 2014. Natural language processing for health and social media. IEEE Intelligent Systems 29, 2 (2014), 64--67.Google Scholar
- Yang Liu, Songhua Xu, Hong-Jun Yoon, and Georgia Tourassi. 2014. Extracting patient demographics and personal medical information from online health forums. In AMIA Annual Symposium Proceedings, Vol. 2014. American Medical Informatics Association, Bethesda, MD, 1825--1834.Google Scholar
- Per E. Kummervold, Deede Gammon, Svein Bergvik, Jan-Are K. Johnsen, Toralf Hasvold, and Jan H. Rosenvinge. 2002. Social support in a wired world: Use of online mental health forums in Norway. Nordic Journal of Psychiatry 56, 1 (2002), 59--65.Google Scholar
Cross Ref
- George Hripcsak, Nicholas D. Soulakis, Li Li, Frances P. Morrison, Albert M. Lai, Carol Friedman, Neil S. Calman, and Farzad Mostashari. 2009. Syndromic surveillance using ambulatory electronic health records. Journal of the American Medical Informatics Association 16, 3 (2009), 354--361.Google Scholar
Cross Ref
- Carlo Combi and Yuval Shahar. 1997. Temporal reasoning and temporal data maintenance in medicine: issues and challenges. Computers in Biology and Medicine 27, 5 (1997), 353--368.Google Scholar
Cross Ref
- Jennifer D’Souza and Vincent Ng. 2014. Knowledge-rich temporal relation identification and classification in clinical notes. Database 2014 (2014), bau109.Google Scholar
- Luis Fernandez-Luque, Randi Karlsen, and Jason Bonander. 2011. Review of extracting information from the Social Web for health personalization. Journal of Medical Internet Research 13, 1 (2011), e15.Google Scholar
Cross Ref
- Liadh Kelly, Lorraine Goeuriot, Hanna Suominen, Tobias Schreck, Gondy Leroy, Danielle L. Mowery, Sumithra Velupillai, et al. 2014. Overview of the ShARe/CLEF eHealth Evaluation Lab 2014. In Proceedings of the International Conference of the Cross-Language Evaluation Forum for European Languages. 172--191.Google Scholar
Cross Ref
- Li Zhou, Carol Friedman, Simon Parsons, and George Hripcsak. 2005. System architecture for temporal information extraction, representation and reasoning in clinical narrative reports. In AMIA Annual Symposium Proceedings, Vol. 2005. American Medical Informatics Association, Bethesda, MD, 869--873.Google Scholar
- Buzhou Tang, Yonghui Wu, Min Jiang, Yukun Chen, Joshua C. Denny, and Hua Xu. 2013. A hybrid system for temporal information extraction from clinical text. Journal of the American Medical Informatics Association 20, 5 (2013), 828--835.Google Scholar
- Chen Lin, Dmitriy Dligach, Timothy A. Miller, Steven Bethard, and Guergana K. Savova. 2015. Multilayered temporal modeling for the clinical domain. Journal of the American Medical Informatics Association 23, 2 (2015), 387--395.Google Scholar
Cross Ref
- Weiyi Sun, Anna Rumshisky, and Ozlem Uzuner. 2013. Annotating temporal information in clinical narratives. Journal of Biomedical Informatics 46 (2013), S5--S12.Google Scholar
Digital Library
- Nishikant Johri, Yoshiki Niwa, and Veera Raghavendra Chikka. 2014. Optimizing Apache cTAKES for disease/disorder template filling: Team HITACHI in 2014 ShARe/CLEF eHealth Evaluation Lab. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF’14).Google Scholar
- S. V. Ramanan and P. Senthil Nathan. 2014. Cocoa: Extending a rule-based system to tag disease attributes in clinical records. In Working Notes for the CLEF 2014 Conference.Google Scholar
- Azadeh Nikfarjam, Abeed Sarker, Karen O’Connor, Rachel Ginn, and Graciela Gonzalez. 2015. Pharmacovigilance from social media: Mining adverse drug reaction mentions using sequence labeling with word embedding cluster features. Journal of the American Medical Informatics Association 22, 3 (2015), 671--681.Google Scholar
Cross Ref
- Glen Coppersmith, Mark Dredze, Craig Harman, Kristy Hollingshead, and Margaret Mitchell. 2015. CLPsych 2015 shared task: Depression and PTSD on Twitter. In Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality. 31--39.Google Scholar
Cross Ref
- Donghee Yvette Wohn, Casey Fiesler, Libby Hemphill, Munmun De Choudhury, and J. Nathan Matias. 2017. How to handle online risks? Discussing content curation and moderation in social media. In Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems. ACM, New York, NY, 1271--1276.Google Scholar
- Alex Graves. 2013. Generating sequences with recurrent neural networks. arXiv:1308.0850.Google Scholar
- Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv:1409.0473.Google Scholar
- Shweta Yadav, Asif Ekbal, Sriparna Saha, Ankit Kumar, and Pushpak Bhattacharyya. 2019. Feature assisted stacked attentive shortest dependency path based Bi-LSTM model for protein--protein interaction. Knowledge-Based Systems 166 (2019), 18--29.Google Scholar
Cross Ref
- Dhanachandra Ningthoujam, Shweta Yadav, Pushpak Bhattacharyya, and Asif Ekbal. 2019. Relation extraction between the clinical entities based on the shortest dependency path based LSTM. arXiv:1903.09941.Google Scholar
- Jason P. C. Chiu and Eric Nichols. 2016. Named entity recognition with bidirectional LSTM-CNNs. Transactions of the Association for Computational Linguistics 4 (2016), 357--370.Google Scholar
Cross Ref
- Peng Zhou, Zhenyu Qi, Suncong Zheng, Jiaming Xu, Hongyun Bao, and Bo Xu. 2016. Text classification improved by integrating bidirectional LSTM with two-dimensional max pooling. arXiv:1611.06639.Google Scholar
- Yequan Wang, Minlie Huang, Xiaoyan Zhu, and Li Zhao. 2016. Attention-based LSTM for aspect-level sentiment classification. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. 606--615. DOI:http://dx.doi.org/10.18653/v1/D16-1058Google Scholar
Cross Ref
- Deepak Gupta, Rajkumar Pujari, Asif Ekbal, Pushpak Bhattacharyya, Anutosh Maitra, Tom Jain, and Shubhashis Sengupta. 2018. Can taxonomy help? Improving semantic question matching using question taxonomy. In Proceedings of the 27th International Conference on Computational Linguistics. 499--513. https://www.aclweb.org/anthology/C18-1042.Google Scholar
- Anutosh Maitra, Shubhashis Sengupta, Abhisek Mukhopadhyay, Deepak Gupta, Rajkumar Pujari, Pushpak Bhattacharya, Asif Ekbal, and Tom Geo Jain. 2018. Semantic question matching in data constrained environment. In Text, Speech, and Dialogue, P. Sojka, A. Horák, I. Kopeček, and K. Pala (Eds.). Springer International Publishing, Cham, Switzerland, 267--276.Google Scholar
- Danielle L. Mowery, Sumithra Velupillai, Brett R. South, Lee Christensen, David Martinez, Liadh Kelly, Lorraine Goeuriot, et al. 2014. Task 2: ShARe/CLEF eHealth Evaluation Lab 2014. In Proceedings of the Conference and Labs of the Evaluation Forum (CLEF’14).Google Scholar
- Thierry Hamon, Cyril Grouin, and Pierre Zweigenbaum. 2014. Disease and disorder template filling using rule-based and statistical approaches. In Working Notes for the CLEF 2014 Conference. 79--90.Google Scholar
- Tigran Mkrtchyan and Daniel Sonntag. 2014. Deep parsing at the CLEF2014 IE task (DFKI-Medical). In CEUR Workshop Proceedings, Vol. 1180. 138--146.Google Scholar
- Konrad Herbst, Cindy Fähnrich, Mariana L. Neves, and Matthieu-P. Schapranow. 2014. Applying in-memory technology for automatic template filling in the clinical domain. In Working Notes for the CLEF 2014 Conference. 91--102.Google Scholar
- Noémie Elhadad, Sameer Pradhan, Sharon Gorman, Suresh Manandhar, Wendy Chapman, and Guergana Savova. 2015. SemEval-2015 task 14: Analysis of clinical text. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 303--310.Google Scholar
Cross Ref
- Jun Xu, Yaoyun Zhang, Jingqi Wang, Yonghui Wu, Min Jiang, Ergin Soysal, and Hua Xu. 2015. UTH-CCB: The participation of the SemEval 2015 challenge—Task 14. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 311--314.Google Scholar
Cross Ref
- Parth Pathak, Pinal Patel, Vishal Panchal, Sagar Soni, Kinjal Dani, Amrish Patel, and Narayan Choudhary. 2015. ezDI: A supervised NLP system for clinical narrative analysis. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 412--416.Google Scholar
Cross Ref
- Weiyi Sun, Anna Rumshisky, and Ozlem Uzuner. 2013. Evaluating temporal relations in clinical text: 2012 i2b2 challenge. Journal of the American Medical Informatics Association 20, 5 (2013), 806--813.Google Scholar
- Steven Bethard, Guergana Savova, Wei-Te Chen, Leon Derczynski, James Pustejovsky, and Marc Verhagen. 2016. SemEval-2016 Task 12: Clinical TempEval. In Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval’16). 1052--1062.Google Scholar
Cross Ref
- Steven Bethard, Leon Derczynski, Guergana Savova, James Pustejovsky, and Marc Verhagen. 2015. SemEval-2015 Task 6: Clinical TempEval. In Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval’15). 806--814.Google Scholar
Cross Ref
- Sean MacAvaney, Arman Cohan, and Nazli Goharian. 2017. GUIR at SemEval-2017 Task 12: A framework for cross-domain clinical temporal information extraction. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval’17). 1024--1029.Google Scholar
Cross Ref
- James Pustejovsky and Amber Stubbs. 2011. Increasing informativeness in temporal annotation. In Proceedings of the 5th Linguistic Annotation Workshop. 152--160.Google Scholar
Digital Library
- Weiyi Sun, Anna Rumshisky, and Ozlem Uzuner. 2013. Temporal reasoning over clinical text: The state of the art. Journal of the American Medical Informatics Association 20, 5 (2013), 814--819.Google Scholar
- Yu Long, Zhijing Li, Xuan Wang, and Chen Li. 2017. XJNLP at SemEval-2017 Task 12: Clinical temporal information ex-traction with a hybrid model. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval’17). 1014--1018.Google Scholar
Cross Ref
- Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, and Hannaneh Hajishirzi. 2016. Bidirectional attention flow for machine comprehension. arXiv:1611.01603.Google Scholar
- Deepak Gupta, Sarah Kohail, and Pushpak Bhattacharyya. 2018. Combining graph-based dependency features with convolutional neural network for answer triggering. arXiv:1808.01650.Google Scholar
- Deepak Gupta, Pabitra Lenka, Asif Ekbal, and Pushpak Bhattacharyya. 2018. Uncovering code-mixed challenges: A framework for linguistically driven question generation and neural based question answering. In Proceedings of the 22nd Conference on Computational Natural Language Learning. 119--130. DOI:http://dx.doi.org/10.18653/v1/K18-1012Google Scholar
Cross Ref
- Deepak Gupta, Surabhi Kumari, Asif Ekbal, and Pushpak Bhattacharyya. 2018. MMQA: A multi-domain multi-lingual question-answering framework for English and Hindi. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC’18). https://www.aclweb.org/anthology/L18-1440.Google Scholar
- Mahmood Yousefi Azar, Kairit Sirts, Diego Mollá Aliod, and Len Hamey. 2015. Query-based single document summarization using an ensemble noisy auto-encoder. In Proceedings of the Australasian Language Technology Association Workshop 2015. 2--10. https://www.aclweb.org/anthology/U15-1001.Google Scholar
- Johan Hasselqvist, Niklas Helmertz, and Mikael Kågebäck. 2017. Query-based abstractive summarization using neural networks. arXiv:1712.06100.Google Scholar
- Ilya Sutskever, Oriol Vinyals, and Quoc V. Le. 2014. Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems. 3104--3112.Google Scholar
- Preksha Nema, Mitesh M. Khapra, Anirban Laha, and Balaraman Ravindran. 2017. Diversity driven attention model for query-based abstractive summarization. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1063--1072. DOI:http://dx.doi.org/10.18653/v1/P17-1098Google Scholar
Cross Ref
- Ming Tan, Cicero dos Santos, Bing Xiang, and Bowen Zhou. 2015. LSTM-based deep learning models for non-factoid answer selection. arXiv:1511.04108.Google Scholar
- Qin Chen, Qinmin Hu, Jimmy Xiangji Huang, Liang He, and Weijie An. 2017. Enhancing recurrent neural networks with positional attention for question answering. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, NY, 993--996.Google Scholar
Digital Library
- Chuanqi Tan, Furu Wei, Nan Yang, Bowen Du, Weifeng Lv, and Ming Zhou. 2017. S-Net: From answer extraction to answer generation for machine reading comprehension. arXiv:1706.04815.Google Scholar
- Tim Rocktäschel, Edward Grefenstette, Karl Moritz Hermann, Tomáš Kočiskỳ, and Phil Blunsom. 2015. Reasoning about entailment with neural attention. arXiv:1509.06664.Google Scholar
- Mohammed Saeed, Christine Lieu, Greg Raber, and Roger G. Mark. 2002. MIMIC II: A massive temporal ICU patient database to support research in intelligent patient monitoring. In Proceedings of Computers in Cardiology. 641--644.Google Scholar
- Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv:1412.6980.Google Scholar
- Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. GloVe: Global vectors for word representation. In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP’14). 1532--1543. http://www.aclweb.org/anthology/D14-1162.Google Scholar
- Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Retrieved February 13, 2020 from https://www.tensorflow.org/ (Software available from tensorflow.org.)Google Scholar
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long and Short Papers). 4171--4186. DOI:http://dx.doi.org/10.18653/v1/N19-1423Google Scholar
- Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2019. Biobert: Pre-trained biomedical language representation model for biomedical text mining. arXiv:1901.08746.Google Scholar
- Munmun De Choudhury, Sanket S. Sharma, Tomaz Logar, Wouter Eekhout, and René Clausen Nielsen. 2017. Gender and cross-cultural differences in social media disclosures of mental illness. In Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing. ACM, New York, NY, 353--369.Google Scholar
Digital Library
- Sindhu Kiranmai Ernala, Tristan Labetoulle, Fred Bane, Michael L. Birnbaum, Asra F. Rizvi, John M. Kane, and Munmun De Choudhury. 2018. Characterizing audience engagement and assessing its impact on social media disclosures of mental illnesses. In Proceedings of the 12th International AAAI Conference on Web and Social Media.Google Scholar
- Michael L. Birnbaum, Sindhu Kiranmai Ernala, Asra F. Rizvi, Munmun De Choudhury, and John M. Kane. 2017. A collaborative approach to identifying social media markers of schizophrenia by employing machine learning and clinical appraisals. Journal of Medical Internet Research 19, 8 (2017), e289.Google Scholar
Cross Ref
- Eva Sharma and Munmun De Choudhury. 2018. Mental health support and its relationship to linguistic accommodation in online communities. In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. ACM, New York, NY, 641.Google Scholar
Digital Library
- Sarmistha Dutta, Jennifer Ma, and Munmun De Choudhury. 2018. Measuring the impact of anxiety on online social interactions. In Proceedings of the 12th International AAAI Conference on Web and Social Media.Google Scholar
- Shweta Yadav, Asif Ekbal, Sriparna Saha, and Pushpak Bhattacharyya. 2019. A unified multi-task adversarial learning framework for pharmacovigilance mining. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. 5234--5245.Google Scholar
Cross Ref
- Shweta Yadav, Joy Sain, Amit Sheth, Asif Ekbal, Sriparna Saha, and Pushpak Bhattacharyya. 2018. Leveraging medical sentiment to understand patients health on social media. arXiv:1807.11172.Google Scholar
- Shweta Yadav, Asif Ekbal, Sriparna Saha, and Pushpak Bhattacharyya. 2018. Medical sentiment analysis using social media: Towards building a patient assisted system. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC’18).Google Scholar
- Shweta Yadav, Asif Ekbal, Sriparna Saha, Pushpak Bhattacharyya, and Amit Sheth. 2018. Multi-task learning framework for mining crowd intelligence towards clinical treatment. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers). 271--277.Google Scholar
Cross Ref
- Silvio Amir, Mark Dredze, and John W. Ayers. 2019. Mental health surveillance over social media with digital cohorts. In Proceedings of the 6th Workshop on Computational Linguistics and Clinical Psychology. 114--120.Google Scholar
- Munmun De Choudhury, Emre Kiciman, Mark Dredze, Glen Coppersmith, and Mrinal Kumar. 2016. Discovering shifts to suicidal ideation from mental health content in social media. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, New York, NY, 2098--2110.Google Scholar
Digital Library
Index Terms
Exploring Disorder-Aware Attention for Clinical Event Extraction
Recommendations
Social Event Extraction: Task, Challenges and Techniques
ASONAM '15: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015Social media (e.g., Facebook and Twitter) serves as a popular platform for online communication and information dissemination, where users can effectively share information such as their recent activities and plans. This kind of information is extremely ...
Extraction and Compilation of Events and Sub-events from Twitter
WI-IAT '12: Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01Twitter has emerged as a great source to provide insights about upcoming planned and unplanned events of social, economic and political relevance. Big events are publicized and known in advance, but smaller, unplanned sub-events around them are not ...
Unsupervised event exploration from social text streams
Social media provides unprecedented opportunities for people to disseminate information and share their opinions and views online. Extracting events from social media platforms such as Twitter could help in understanding what is being discussed. ...






Comments