Abstract
In this article, we propose a new encoding scheme for named entity recognition (NER) called Joined Type-Length encoding (JoinedTL). Unlike most existing named entity encoding schemes, which focus on flat entities, JoinedTL can label nested named entities in a single sequence. JoinedTL uses a packed encoding to represent both type and span of a named entity, which not only results in less tagged tokens compared to existing encoding schemes, but also enables it to support nested NER. We evaluate the effectiveness of JoinedTL for nested NER on three nested NER datasets: GENIA in English, GermEval in German, and PerNest, our newly created nested NER dataset in Persian. We apply CharLSTM+WordLSTM+CRF, a three-layer sequence tagging model on three datasets encoded using JoinedTL and two existing nested NE encoding schemes, i.e., JoinedBIO and JoinedBILOU. Our experiment results show that CharLSTM+WordLSTM+CRF trained with JoinedTL encoded datasets can achieve competitive F1 scores as the ones trained with datasets encoded by two other encodings, but with 27%–48% less tagged tokens. To leverage the power of three different encodings, i.e., JoinedTL, JoinedBIO, and JoinedBILOU, we propose an encoding-based ensemble method for nested NER. Evaluation results show that the ensemble method achieves higher F1 scores on all datasets than the three models each trained using one of the three encodings. By using nested NE encodings including JoinedTL with CharLSTM+WordLSTM+CRF, we establish new state-of-the-art performance with an F1 score of 83.7 on PerNest, 74.9 on GENIA, and 70.5 on GermEval, surpassing two recent neural models specially designed for nested NER.
- [1] . 2020. Named entity recognition and classification for Punjabi shahmukhi. ACM Transactions on Asian and Low-Resource Language Information Processing 19, 4 (2020), 1–13. Google Scholar
Digital Library
- [2] . 2007. Recognising nested named entities in biomedical text. In Proceedings of the Biological, Translational, and Clinical Language Processing. Association for Computational Linguistics, 65–72. Retrieved from https://www.aclweb.org/anthology/W07-1009. Google Scholar
Digital Library
- [3] . 2014. GermEval 2014 named entity recognition shared task: Companion paper. Retrieved 25 Nov., 2021. from https://hildok.bsz-bw.de/frontdoor/index/index/docId/283.Google Scholar
- [4] . 2014. NoSta-D named entity annotation for German: Guidelines and dataset. In Proceedings of the 9th International Conference on Language Resources and Evaluation. European Language Resources Association, 2524–2531. Retrieved from http://www.lrec-conf.org/proceedings/lrec2014/pdf/276_Paper.pdf.Google Scholar
- [5] . 2011. Lessons from building a Persian written corpus: Peykare. Language Resources and Evaluation 45, 2 (2011), 143–164. Google Scholar
Digital Library
- [6] . 1999. A Maximum Entropy Approach to Named Entity Recognition. Ph.D. Dissertation. New York University, New York, NY. UMI
Order Number:AAI 9945252. Google ScholarDigital Library
- [7] . 2016. Named entity recognition with bidirectional LSTM-CNNs. Transactions of the Association for Computational Linguistics 4 (2016), 357–370.
DOI: https://doi.org/10.1162/tacl_a_00104Google ScholarCross Ref
- [8] . 2017. Named entity recognition with word embeddings and wikipedia categories for a low-resource language. ACM Transactions on Asian and Low-Resource Language Information Processing 16, 3 (2017), 1–19. Google Scholar
Digital Library
- [9] . 2009. Nested named entity recognition. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 141–150. Retrieved from https://www.aclweb.org/anthology/D09-1015. Google Scholar
Digital Library
- [10] . 2019. Merge and label: A novel neural network architecture for nested NER. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 5840–5850.
DOI: https://doi.org/10.18653/v1/P19-1585.Google ScholarCross Ref
- [11] . 2017. A’laam corpus: A standard corpus of named entity for persian language. Signal and Data Processing 14, 3 (
Dec 2017), 127–142.DOI: https://doi.org/10.29252/jsdp.14.3.127arXiv:http://jsdp.rcisp.ac.ir/article-1-477-en.pdf .Google ScholarCross Ref
- [12] . 2015. Bidirectional LSTM-CRF Models for Sequence Tagging. CoRR abs/1508.01991. http://arxiv.org/abs/1508.01991.Google Scholar
- [13] . 2018. A neural layered model for nested named entity recognition. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1. Association for Computational Linguistics, 1446–1459.
DOI: https://doi.org/10.18653/v1/N18-1131Google ScholarCross Ref
- [14] . 2019. Urdu named entity recognition: Corpus generation and deep learning applications. ACM Transactions on Asian and Low-Resource Language Information Processing 19, 1 (
June 2019), Article 8, 13 pages.DOI: https://doi.org/10.1145/3329710 Google ScholarDigital Library
- [15] . 2018. Nested named entity recognition revisited. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Vol. 1. Association for Computational Linguistics, 861–871.
DOI: https://doi.org/10.18653/v1/N18-1079Google ScholarCross Ref
- [16] . 2003. GENIA corpus’a semantically annotated corpus for bio-textmining. Bioinformatics 19, suppl_1 (
July 2003), i180–i182.DOI: https://doi.org/10.1093/bioinformatics/btg1023arXiv:https://academic.oup.com/bioinformatics/article-pdf/19/suppl_1/i180/614820/btg1023.pdf .Google Scholar - [17] . 2001. Chunking with support vector machines. In Proceedings of the 2nd Meeting of the North American Chapter of the Association for Computational Linguistics. Retrieved from https://www.aclweb.org/anthology/N01-1025. Google Scholar
Digital Library
- [18] . 2016. Neural architectures for named entity recognition. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 260–270.
DOI: https://doi.org/10.18653/v1/N16-1030Google ScholarCross Ref
- [19] . 2020. A survey on deep learning for named entity recognition. IEEE Transactions on Knowledge and Data Engineering PP, 99 (2020), 1–1.Google Scholar
Cross Ref
- [20] . 2019. A unified MRC framework for named entity recognition. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 5849–5859.Google Scholar
- [21] . 2018. Empower sequence labeling with task-aware neural language model. In Proceedings of the 32nd AAAI Conference on Artificial Intelligence. Google Scholar
Digital Library
- [22] . 2015. Joint mention extraction and classification with mention hypergraphs. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. 857–867.Google Scholar
Cross Ref
- [23] . 2020. Bipartite flat-graph network for nested named entity recognition. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. , , , and (Eds.). Association for Computational Linguistics, 6408–6418.
DOI: https://doi.org/10.18653/v1/2020.acl-main.571Google ScholarCross Ref
- [24] . 2017. Urdu named entity recognition and classification system using artificial neural network. ACM Transactions on Asian and Low-Resource Language Information Processing 17, 1 (2017), 1–13. Google Scholar
Digital Library
- [25] . 2007. Ancora: Multilingual and multilevel annotated corpora. MS, Universitat de Barcelona (2007).Google Scholar
- [26] . 2012. Interrater reliability: The kappa statistic. Biochemia Medica: Biochemia Medica 22, 3 (2012), 276–282.Google Scholar
Cross Ref
- [27] . 2006. Named entity recognition for question answering. In Proceedings of the Australasian Language Technology Workshop 2006, 51–58. Retrieved from https://www.aclweb.org/anthology/U06-1009.Google Scholar
- [28] . 2017. Labeling gaps between words: Recognizing overlapping mentions with mention separators. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2608–2618.
DOI: https://doi.org/10.18653/v1/D17-1276Google ScholarCross Ref
- [29] . 2019. Error Analysis for Vietnamese Named Entity Recognition on Deep Neural Network Models. CoRR abs/1911.07228. http://arxiv.org/abs/1911.07228.Google Scholar
- [30] . 2018. BiLSTM-CRF for persian named-entity recognition ArmanPersoNERCorpus: The first entity-annotated persian dataset. In Proceedings of the 11th International Conference on Language Resources and Evaluation. European Language Resources Association. Retrieved from https://www.aclweb.org/anthology/L18-1701.Google Scholar
- [31] . 2015. SANAPHOR: Ontology-based coreference resolution. In Proceedings of the Semantic Web - ISWC 2015. , , , , , , , , , , , and (Eds.). Springer International Publishing, Cham, 458–473. Google Scholar
Digital Library
- [32] . 1995. Text chunking using transformation-based learning. In Proceedings of the 3rd Workshop on Very Large Corpora. Retrieved from https://www.aclweb.org/anthology/W95-0107.Google Scholar
- [33] . 2009. Design challenges and misconceptions in named entity recognition. In Proceedings of the 13th Conference on Computational Natural Language Learning. Association for Computational Linguistics, 147–155. Retrieved from https://www.aclweb.org/anthology/W09-1119. Google Scholar
Digital Library
- [34] . 1998. Maximum Entropy Models for Natural Language Ambiguity Resolution. Ph.D. Dissertation.
UMI Order Number: AAI 9840230. Google ScholarDigital Library
- [35] . 2017. Semi-supervised multitask learning for sequence labeling. CoRR abs/1704.07156.Google Scholar
- [36] . 1999. Representing text chunks. In Proceedings of the 9th Conference on European Chapter of the Association for Computational Linguistics . Association for Computational Linguistics, 173–179.
DOI: https://doi.org/10.3115/977035.977059 Google ScholarDigital Library
- [37] . 2018. PEYMA: A tagged corpus for Persian named entities. CoRR.Google Scholar
- [38] . 2005. Voting between multiple data representations for text chunking. In Proceedings of the 18th Canadian Society Conference on Advances in Artificial Intelligence. Springer-Verlag, Berlin, 389–400.
DOI: https://doi.org/10.1007/11424918_40 Google ScholarDigital Library
- [39] . 2020. Nested named entity recognition via second-best sequence learning and decoding. Transactions of the Association for Computational Linguistics 8 (2020), 605–620.Google Scholar
Cross Ref
- [40] . 2018. Deep exhaustive model for nested named entity recognition. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2843–2849.
DOI: https://doi.org/10.18653/v1/D18-1309Google ScholarCross Ref
- [41] . 2012. BRAT: A web-based tool for NLP-assisted text annotation. In Proceedings of the Demonstrations at the 13th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, 102–107. Google Scholar
Digital Library
- [42] . 2019. Neural architectures for nested NER through linearization. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 5326–5331.
DOI: https://doi.org/10.18653/v1/P19-1527Google ScholarCross Ref
- [43] . 2013. Recognizing and encoding disorder concepts in clinical text using machine learning and vector space model. CEUR Workshop Proceedings 1179 (
Jan. 2013).Google Scholar - [44] . 2003. Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition. In Proceedings of the 7th Conference on Natural Language Learning at HLT-NAACL 2003. 142–147. Retrieved from https://www.aclweb.org/anthology/W03-0419. Google Scholar
Digital Library
- [45] . 2018. Neural segmental hypergraphs for overlapping mention recognition. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 204–214.
DOI: https://doi.org/10.18653/v1/D18-1019Google ScholarCross Ref
- [46] . 2018. A neural transition-based model for nested mention recognition. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1011–1017.
DOI: https://doi.org/10.18653/v1/D18-1124Google ScholarCross Ref
- [47] . 2020. Pyramid: A layered model for nested named entity recognition. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 5918–5928.
DOI: https://doi.org/10.18653/v1/2020.acl-main.525Google ScholarCross Ref
- [48] . 2020. HIT: Nested named entity recognition via head-tail pair and token interaction. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 6027–6036.
DOI: https://doi.org/10.18653/v1/2020.emnlp-main.486Google ScholarCross Ref
- [49] . 2019. Named entity recognition and normalization applied to large-scale information extraction from the materials science literature. Journal of Chemical Information and Modeling 59, 9 (2019), 3692–3702.
DOI: https://doi.org/10.1021/acs.jcim.9b00470arXiv:https://doi.org/10.1021/acs.jcim.9b00470 Google ScholarCross Ref
- [50] . 2018. Deep affix features improve neural named entity recognizers. In Proceedings of the 7th Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics, 167–172.
DOI: https://doi.org/10.18653/v1/S18-2021Google ScholarCross Ref
- [51] . 2018. Design challenges and misconceptions in neural sequence labeling. In Proceedings of the 27th International Conference on Computational Linguistics. Association for Computational Linguistics, 3879–3889. Retrieved from https://www.aclweb.org/anthology/C18-1327.Google Scholar
- [52] . 2018. NCRF++: An open-source neural sequence labeling toolkit. In Proceedings of the ACL 2018, System Demonstrations. Association for Computational Linguistics, 74–79.
DOI: https://doi.org/10.18653/v1/P18-4013Google ScholarCross Ref
- [53] . 2020. Named entity recognition as dependency parsing. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 6470–6476.
DOI: https://doi.org/10.18653/v1/2020.acl-main.577Google ScholarCross Ref
- [54] . 2019. A boundary-aware neural model for nested named entity recognition. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. Association for Computational Linguistics, 357–366.
DOI: https://doi.org/10.18653/v1/D19-1034Google ScholarCross Ref
Index Terms
Joined Type Length Encoding for Nested Named Entity Recognition
Recommendations
Nested Named Entity Recognition: A Survey
With the rapid development of text mining, many studies observe that text generally contains a variety of implicit information, and it is important to develop techniques for extracting such information. Named Entity Recognition (NER), the first step of ...
Nested named entity recognition
EMNLP '09: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1Many named entities contain other named entities inside them. Despite this fact, the field of named entity recognition has almost entirely ignored nested named entity recognition, but due to technological, rather than ideological reasons. In this paper, ...
Handling negative samples problems in span-based nested named entity recognition
AbstractNamed Entity Recognition (NER) is generally regarded as a sequence labeling task, and faces a serious problem when the named entities are nested. Span-based model, which enumerates all possible spans as potential entity mentions in a ...






Comments