Abstract
Cross-lingual dependency parsing approaches have been employed to develop dependency parsers for the languages for which little or no treebanks are available using the treebanks of other languages. A language for which the cross-lingual parser is developed is usually referred to as the target language and the language whose treebank is used to train the cross-lingual parser model is referred to as the source language. The cross-lingual parsing approaches for dependency parsing may be broadly classified into three categories: model transfer, annotation projection, and treebank translation. This survey provides an overview of the various aspects of the model transfer approach of cross-lingual dependency parsing. In this survey, we present a classification of the model transfer approaches based on the different aspects of the method. We discuss some of the challenges associated with cross-lingual parsing and the techniques used to address these challenges. In order to address the difference in vocabulary between two languages, some approaches use only non-lexical features of the words to train the models while others use shared representations of the words. Some approaches address the morphological differences by chunk-level transfer rather than word-level transfer. The syntactic differences between the source and target languages are sometimes addressed by transforming the source language treebanks or by combining the resources of multiple source languages. Besides cross-lingual transfer parser models may be developed for a specific target language or it may be trained to parse sentences of multiple languages. With respect to the above-mentioned aspects, we look at the different ways in which the methods can be classified. We further classify and discuss the different approaches from the perspective of the corresponding aspects. We also demonstrate the performance of the transferred models under different settings corresponding to the classification aspects on a common dataset.
- Željko Agić. 2017. Cross-lingual parser selection for low-resource languages. In Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW’17). Association for Computational Linguistics, Gothenburg, Sweden, 1--10. https://www.aclweb.org/anthology/W17-0401.Google Scholar
- Željko Agić, Dirk Hovy, and Anders Søgaard. 2015. If all you have is a bit of the Bible: Learning POS taggers for truly low-resource languages. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). Association for Computational Linguistics, Beijing, China, 268--272. DOI:https://doi.org/10.3115/v1/P15-2044Google Scholar
- Željko Agić, Anders Johannsen, Barbara Plank, Héctor Martínez Alonso, Natalie Schluter, and Anders Søgaard. 2016. Multilingual projection for parsing truly low-resource languages. Transactions of the Association for Computational Linguistics 4 (2016), 301--312. DOI:https://doi.org/10.1162/tacl_a_00100Google Scholar
Cross Ref
- Wasi Uddin Ahmad, Zhisong Zhang, Zuezhe Ma, Eduard Hovy, Kai-Wei Chang, and Nanyun Peng. 2019. On difficulties of cross-lingual transfer with order differences: A case study on dependency parsing. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.Google Scholar
Cross Ref
- Waleed Ammar, George Mulcaire, Miguel Ballesteros, Chris Dyer, and Noah A. Smith. 2016. Many languages, one parser. Transactions of the Association for Computational Linguistics 4 (2016), 431--444.Google Scholar
Cross Ref
- Lauriane Aufrant and Guillaume Wisniewski. 2016. PanParser: A modular implementation for efficient transition-based dependency parsing. Prague Bull. Math. Linguistics 111 (2016), 57--86.Google Scholar
Cross Ref
- Lauriane Aufrant, Guillaume Wisniewski, and François Yvon. 2016. Zero-resource dependency parsing: Boosting delexicalized cross-lingual transfer with linguistic knowledge. In COLING 2016, Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers, December 11-16, 2016, Osaka, Japan. 119--130. http://aclweb.org/anthology/C/C16/C16-1012.pdf.Google Scholar
- Lauriane Aufrant, Guillaume Wisniewski, and François Yvon. 2017. [email protected]’17: UD shared task. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 163--173. DOI:https://doi.org/10.18653/v1/K17-3017Google Scholar
Cross Ref
- Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).Google Scholar
- James K. Baker. 1979. Trainable grammars for speech recognition. The Journal of the Acoustical Society of America 65, S1 (1979), S132–S132.Google Scholar
Cross Ref
- Anders Björkelund, Agnieszka Falenska, Xiang Yu, and Jonas Kuhn. 2017. IMS at the CoNLL 2017 UD shared task: CRFs and perceptrons meet neural networks. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. 40--51.Google Scholar
Cross Ref
- Anders Björkelund and Joakim Nivre. 2015. Non-deterministic oracles for unrestricted non-projective transition-based dependency parsing. In Proceedings of the 14th International Conference on Parsing Technologies. 76--86.Google Scholar
Cross Ref
- Bernd Bohnet. 2010. Very high accuracy and fast dependency parsing is not a contradiction. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING’10). Association for Computational Linguistics, Stroudsburg, PA, 89--97. http://dl.acm.org/citation.cfm?id=1873781.1873792.Google Scholar
Digital Library
- Bernd Bohnet, Ryan McDonald, Emily Pitler, and Ji Ma. 2016. Generalized transition-based dependency parsing via control parameters. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Berlin, Germany, 150--160. DOI:https://doi.org/10.18653/v1/P16-1015Google Scholar
Cross Ref
- Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2017. Enriching word vectors with subword information. Transactions of the Association for Computational Linguistics 5 (2017), 135--146.Google Scholar
Cross Ref
- Eugene Charniak. 2000. A maximum-entropy-inspired parser. In Proceedings of the 1st North American Chapter of the Association for Computational Linguistics Conference (NAACL’00). Association for Computational Linguistics, Stroudsburg, PA, USA, 132--139. http://dl.acm.org/citation.cfm?id=974305.974323.Google Scholar
Digital Library
- Eugene Charniak and Mark Johnson. 2005. Coarse-to-fine N-best parsing and MaxEnt discriminative reranking. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics (ACL’05). Association for Computational Linguistics, Stroudsburg, PA, 173--180. DOI:https://doi.org/10.3115/1219840.1219862Google Scholar
Digital Library
- Wanxiang Che, Jiang Guo, Yuxuan Wang, Bo Zheng, Huaipeng Zhao, Yang Liu, Dechuan Teng, and Ting Liu. 2017. The HIT-SCIR system for end-to-end parsing of universal dependencies. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 52--62. DOI:https://doi.org/10.18653/v1/K17-3005Google Scholar
Cross Ref
- Wanxiang Che, Yijia Liu, Yuxuan Wang, Bo Zheng, and Ting Liu. 2018. Towards better UD parsing: Deep contextualized word embeddings, ensemble, and treebank concatenation. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 55--64. DOI:https://doi.org/10.18653/v1/K18-2005Google Scholar
- Danqi Chen and Christopher Manning. 2014. A fast and accurate dependency parser using neural networks. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 740--750.Google Scholar
Cross Ref
- Kyunghyun Cho, Bart van Merriënboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, and Yoshua Bengio. 2014. Learning phrase representations using RNN encoder–decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, 1724--1734. DOI:https://doi.org/10.3115/v1/D14-1179Google Scholar
Cross Ref
- Y. J. Chu and T. H. Liu. 1965. On the shortest arborescence of a directed graph. Science Sinica 14 (1965).Google Scholar
- Shay B. Cohen, Dipanjan Das, and Noah A. Smith. 2011. Unsupervised structure prediction with non-parallel multilingual guidance. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 50--61.Google Scholar
- Michael A. Covington. 2001. A fundamental algorithm for dependency parsing. In Proceedings of the 39th Annual ACM Southeast Conference. 95--102.Google Scholar
- Hang Cui, Renxu Sun, Keya Li, Min-Yen Kan, and Tat-Seng Chua. 2005. Question answering passage retrieval using dependency relations. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 400--407.Google Scholar
Digital Library
- Ayan Das, Agnivo Saha, and Sudeshna Sarkar. 2016. Cross-lingual transfer parser from Hindi to Bengali using delexicalization and chunking. In Proceedings of the 13th International Conference on Natural Language Processing. 99--108.Google Scholar
- Ayan Das and Sudeshna Sarkar. 2019. A little perturbation makes a difference: Treebank augmentation by perturbation improves transfer parsing. In Proceedings of the 16th International Conference on Natural Language Processing. Hyderabad, India.Google Scholar
- Ayan Das and Sudeshna Sarkar. 2019. Transform, combine, and transfer: Delexicalized transfer parser for low-resource languages. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 19, 1, Article 4 (June 2019), 30 pages. DOI:https://doi.org/10.1145/3325886Google Scholar
- Ayan Das and Sudeshna Sarkar. 2020. Improving cross-lingual model transfer by chunking. arxiv:cs.CL/2002.12097.Google Scholar
- Ayan Das, Affan Zaffar, and Sudeshna Sarkar. 2017. Delexicalized transfer parsing for low-resource languages using transformed and combined treebanks. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 182--190. DOI:https://doi.org/10.18653/v1/K17-3019Google Scholar
Cross Ref
- Dipanjan Das, Nathan Schneider, Desai Chen, and Noah A. Smith. 2010. Probabilistic frame-semantic parsing. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT’10). Association for Computational Linguistics, Stroudsburg, PA, 948--956. http://dl.acm.org/citation.cfm?id=1857999.1858136.Google Scholar
- Éric de La Clergerie, Benoît Sagot, and Djamé Seddah. 2017. The ParisNLP entry at the ConLL UD shared task 2017: A tale of a #ParsingTragedy. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 243--252. DOI:https://doi.org/10.18653/v1/K17-3026Google Scholar
Cross Ref
- Marie-Catherine de Marneffe, Timothy Dozat, Natalia Silveira, Katri Haverinen, Filip Ginter, Joakim Nivre, and Christopher D. Manning. 2014. Universal Stanford dependencies: A cross-linguistic typology. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC-2014). European Languages Resources Association (ELRA), Reykjavik, Iceland, 4585--4592. http://www.lrec-conf.org/proceedings/lrec2014/pdf/1062_Paper.pdf.Google Scholar
- Marie-Catherine de Marneffe, Bill MacCartney, and Christopher D. Manning. 2006. Generating typed dependency parses from phrase structure parses. In Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC’06). European Language Resources Association (ELRA), Genoa, Italy. http://www.lrec-conf.org/proceedings/lrec2006/pdf/440_pdf.pdf.Google Scholar
- Marie-Catherine de Marneffe and Christopher D. Manning. 2008. The Stanford typed dependencies representation. In COLING 2008: Proceedings of the Workshop on Cross-Framework and Cross-Domain Parser Evaluation (CrossParser’08). Association for Computational Linguistics, Stroudsburg, PA, 1--8. http://dl.acm.org/citation.cfm?id=1608858.1608859.Google Scholar
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186. DOI:https://doi.org/10.18653/v1/N19-1423.Google Scholar
- Ludmila Dimitrova, Tomaz Erjavec, Nancy Ide, Heiki Jaan Kaalep, Vladimir Petkevic, and Dan Tufis. 1998. Multext-east: Parallel and comparable corpora and lexicons for six central and eastern European languages. In COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics. https://www.aclweb.org/anthology/C98-1049.Google Scholar
- Timothy Dozat and Christopher D. Manning. 2016. Deep biaffine attention for neural dependency parsing. CoRR abs/1611.01734 (2016). arxiv:1611.01734 http://arxiv.org/abs/1611.01734.Google Scholar
- Timothy Dozat, Peng Qi, and Christopher D. Manning. 2017. Stanford’s graph-based neural dependency parser at the CoNLL 2017 shared task. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 20--30. DOI:https://doi.org/10.18653/v1/K17-3002Google Scholar
- Long Duong, Trevor Cohn, Steven Bird, and Paul Cook. 2015. Cross-lingual transfer for unsupervised dependency parsing without parallel data. In Proceedings of the 19th Conference on Computational Natural Language Learning. Association for Computational Linguistics, Beijing, China, 113--122. DOI:https://doi.org/10.18653/v1/K15-1012Google Scholar
Cross Ref
- Long Duong, Trevor Cohn, Steven Bird, and Paul Cook. 2015. A neural network model for low-resource universal dependency parsing. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Lisbon, Portugal, 339--348. DOI:https://doi.org/10.18653/v1/D15-1040Google Scholar
Cross Ref
- Greg Durrett, Adam Pauls, and Dan Klein. 2012. Syntactic transfer using a bilingual lexicon. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. Association for Computational Linguistics, Jeju Island, Korea, 1--11. https://www.aclweb.org/anthology/D12-1001.Google Scholar
Digital Library
- Elie Duthoo and Olivier Mesnard. 2018. CEA LIST: Processing low-resource languages for CoNLL 2018. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 34--44. DOI:https://doi.org/10.18653/v1/K18-2003Google Scholar
- Jack Edmonds. 1968. Optimum branchings. Mathematics and the Decision Sciences, Part 1 (1968), 335--345.Google Scholar
- Jason M. Eisner. 1996. Three new probabilistic models for dependency parsing: An exploration. In COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics, Vol. 1.Google Scholar
Digital Library
- Tomaž Erjavec. 2012. MULTEXT-East: Morphosyntactic resources for central and eastern European languages. Lang. Resour. Eval. 46, 1 (March 2012), 131--142. DOI:https://doi.org/10.1007/s10579-011-9174-8Google Scholar
Digital Library
- Manaal Faruqui and Chris Dyer. 2014. Improving vector space word representations using multilingual correlation. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, Gothenburg, Sweden, 462--471. DOI:https://doi.org/10.3115/v1/E14-1049Google Scholar
Cross Ref
- Kuzman Ganchev, Jennifer Gillenwater, and Ben Taskar. 2009. Dependency grammar induction via bitext projection constraints. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1-Volume 1. Association for Computational Linguistics, 369--377.Google Scholar
Digital Library
- Yoav Goldberg and Michael Elhadad. 2010. An efficient algorithm for easy-first non-directional dependency parsing. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (HLT’10). Association for Computational Linguistics, Stroudsburg, PA, 742--750. http://dl.acm.org/citation.cfm?id=1857999.1858114.Google Scholar
Digital Library
- Jiang Guo, Wanxiang Che, David Yarowsky, Haifeng Wang, and Ting Liu. 2015. Cross-lingual dependency parsing based on distributed representations. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Vol. 1. 1234--1244.Google Scholar
Cross Ref
- Jiang Guo, Wanxiang Che, David Yarowsky, Haifeng Wang, and Ting Liu. 2016. A distributed representation-based framework for cross-lingual transfer parsing. Journal of Artificial Intelligence Research 55 (2016), 995--1023.Google Scholar
Digital Library
- Jiang Guo, Wanxiang Che, David Yarowsky, Haifeng Wang, and Ting Liu. 2016. A representation learning framework for multi-source transfer parsing. In Proceedings of the 30th AAAI Conference on Artificial Intelligence.Google Scholar
- Martin Haspelmath. 2005. The World Atlas of Language Structures / Edited by Martin Haspelmath ... [et al.]. Oxford University Press Oxford. xv, 695 p. : pages.Google Scholar
- Junxian He, Graham Neubig, and Taylor Berg-Kirkpatrick. 2018. Unsupervised learning of syntactic structure with invertible neural projections. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 1292--1302.Google Scholar
Cross Ref
- Junxian He, Zhisong Zhang, Taylor Berg-Kirkpatrick, and Graham Neubig. 2019. Cross-lingual syntactic transfer through unsupervised adaptation of invertible projections. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 3211--3223. https://www.aclweb.org/anthology/P19-1311.Google Scholar
Cross Ref
- Johannes Heinecke and Munshi Asadullah. 2017. Multi-model and crosslingual dependency analysis. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 111--118. DOI:https://doi.org/10.18653/v1/K17-3011Google Scholar
Cross Ref
- Rebecca Hwa, Philip Resnik, Amy Weinberg, Clara Cabezas, and Okan Kolak. 2005. Bootstrapping parsers via syntactic projection across parallel texts. Natural Language Engineering 11 (2005), 11--311.Google Scholar
Digital Library
- Richard Johansson and Pierre Nugues. 2008. Dependency-based semantic role labeling of PropBank. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 69--78.Google Scholar
Digital Library
- Eliyahu Kiperwasser and Yoav Goldberg. 2016. Simple and accurate dependency parsing using bidirectional LSTM feature representations. Transactions of the Association for Computational Linguistics 4 (2016), 313--327.Google Scholar
Cross Ref
- Ömer Kırnap, Erenay Dayanık, and Deniz Yuret. 2018. Tree-stack LSTM in transition based dependency parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 124--132. DOI:https://doi.org/10.18653/v1/K18-2012Google Scholar
- Ömer Kırnap, Berkay Furkan Önder, and Deniz Yuret. 2017. Parsing with context embeddings. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 80--87. DOI:https://doi.org/10.18653/v1/K17-3008Google Scholar
Cross Ref
- Dan Klein and Christopher Manning. 2004. Corpus-based induction of syntactic structure: Models of dependency and constituency. In Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04). Barcelona, Spain, 478--485. DOI:https://doi.org/10.3115/1218955.1219016Google Scholar
Digital Library
- Daniel Kondratyuk and Milan Straka. 2019. 75 languages, 1 model: Parsing universal dependencies universally. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). Hong Kong, China.Google Scholar
Cross Ref
- Marco Kuhlmann, Carlos Gómez-Rodríguez, and Giorgio Satta. 2011. Dynamic programming algorithms for transition-based dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1 (HLT’11). Association for Computational Linguistics, Stroudsburg, PA, 673--682. http://dl.acm.org/citation.cfm?id=2002472.2002558.Google Scholar
- Ophélie Lacroix, Lauriane Aufrant, Guillaume Wisniewski, and François Yvon. 2016. Frustratingly easy cross-lingual transfer for transition-based dependency parsing. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, San Diego, California, 1058--1063. http://www.aclweb.org/anthology/N16-1121.Google Scholar
Cross Ref
- Guillaume Lample, Alexis Conneau, Marc’Aurelio Ranzato, Ludovic Denoyer, and Hervé Jégou. 2018. Word translation without parallel data. In Proceedings of the International Conference on Learning Representations. https://openreview.net/forum?id=H196sainb.Google Scholar
- Tao Lei, Yuan Zhang, Regina Barzilay, and Tommi Jaakkola. 2014. Low-rank tensors for scoring dependency structures. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 1381–1391. DOI:10.3115/v1/P14-1130Google Scholar
Cross Ref
- KyungTae Lim, Cheoneum Park, Changki Lee, and Thierry Poibeau. 2018. SEx BiST: A multi-source trainable parser with deep contextualized lexical representations. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 143--152. DOI:https://doi.org/10.18653/v1/K18-2014Google Scholar
- Yu-Hsiang Lin, Chian-Yu Chen, Jean Lee, Zirui Li, Yuyan Zhang, Mengzhou Xia, Shruti Rijhwani, Junxian He, Zhisong Zhang, Xuezhe Ma, Antonios Anastasopoulos, Patrick Littell, and Graham Neubig. 2019. Choosing transfer languages for cross-lingual learning. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 3125--3135. https://www.aclweb.org/anthology/P19-1301.Google Scholar
Cross Ref
- Xuezhe Ma, Zecong Hu, Jingzhou Liu, Nanyun Peng, Graham Neubig, and Eduard Hovy. 2018. Stack-pointer networks for dependency parsing. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 1403--1414.Google Scholar
Cross Ref
- Xuezhe Ma and Fei Xia. 2014. Unsupervised dependency parsing with transferring distribution via parallel guidance and entropy regularization. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Baltimore, Maryland, 1337--1348. http://www.aclweb.org/anthology/P/P14/P14-1126.Google Scholar
Cross Ref
- Ryan McDonald, Koby Crammer, and Fernando Pereira. 2005. Online large-margin training of dependency parsers. In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05). 91--98.Google Scholar
Digital Library
- Ryan McDonald, Joakim Nivre, Yvonne Quirmbach-Brundage, Yoav Goldberg, Dipanjan Das, Kuzman Ganchev, Keith Hall, Slav Petrov, Hao Zhang, Oscar Täckström, Claudia Bedini, Núria Bertomeu Castelló, and Jungmee Lee. 2013. Universal dependency annotation for multilingual parsing. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers). Association for Computational Linguistics, Sofia, Bulgaria, 92--97. https://www.aclweb.org/anthology/P13-2017.Google Scholar
- Ryan Mcdonald, Fernando Pereira, Kiril Ribarov, and Jan Hajič. 2005. Non-projective dependency parsing using spanning tree algorithms. In Proceedings of HLT Conference and Conference on EMNLP. 523--530.Google Scholar
Digital Library
- Ryan McDonald, Slav Petrov, and Keith Hall. 2011. Multi-source transfer of delexicalized dependency parsers. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 62--72.Google Scholar
Digital Library
- Tao Meng, Nanyun Peng, and Kai-Wei Chang. 2019. Target language-aware constrained inference for cross-lingual dependency parsing. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). Hong Kong, China.Google Scholar
Cross Ref
- Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems. 3111--3119.Google Scholar
- Tahira Naseem, Regina Barzilay, and Amir Globerson. 2012. Selective sharing for multilingual dependency parsing. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1 (ACL’12). Association for Computational Linguistics, Stroudsburg, PA, 629--637. http://dl.acm.org/citation.cfm?id=2390524.2390613.Google Scholar
- Joakim Nivre. 2003. An efficient algorithm for projective dependency parsing. In Proceedings of the 8th International Conference on Parsing Technologies. Nancy, France, 149--160. https://www.aclweb.org/anthology/W03-3017.Google Scholar
- Joakim Nivre. 2005. Dependency Grammar and Dependency Parsing. Technical Report. Växjö University.Google Scholar
- Joakim Nivre. 2008. Algorithms for deterministic incremental dependency parsing. Computational Linguistics 34, 4 (2008), 513--553. DOI:https://doi.org/10.1162/coli.07-056-R1-07-027Google Scholar
Digital Library
- Joakim Nivre. 2009. Non-projective dependency parsing in expected linear time. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1-Volume 1. Association for Computational Linguistics, 351--359.Google Scholar
Cross Ref
- Joakim Nivre. 2016. Universal dependencies: A cross-linguistic perspective on grammar and lexicon. In Proceedings of the Workshop on Grammar and Lexicon: Interactions and Interfaces (GramLex). The COLING 2016 Organizing Committee, Osaka, Japan, 38--40. https://www.aclweb.org/anthology/W16-3806.Google Scholar
- Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, and Daniel Zeman. 2016. Universal dependencies v1: A multilingual treebank collection. In Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC’16). European Language Resources Association, Portorož, Slovenia, 1659--1666.Google Scholar
- Matthew Peters, Mark Neumann, Luke Zettlemoyer, and Wen-tau Yih. 2018. Dissecting contextual word embeddings: Architecture and representation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 1499--1509.Google Scholar
Cross Ref
- Slav Petrov, Dipanjan Das, and Ryan McDonald. 2012. A universal part-of-speech tagset. In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC’12) (23-25), Nicoletta Calzolari (Conference Chair), Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, and Stelios Piperidis (Eds.). European Language Resources Association (ELRA), Istanbul, Turkey.Google Scholar
- Edoardo Maria Ponti, Roi Reichart, Anna Korhonen, and Ivan Vulić. 2018. Isomorphic transfer of syntactic structures in cross-lingual NLP. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, 1531--1542. DOI:https://doi.org/10.18653/v1/P18-1142Google Scholar
Cross Ref
- Peng Qi, Timothy Dozat, Yuhao Zhang, and Christopher D. Manning. 2018. Universal dependency parsing from scratch. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 160--170. DOI:https://doi.org/10.18653/v1/K18-2016Google Scholar
- Chris Quirk and Simon Corston-Oliver. 2006. The impact of parse quality on syntactically-informed statistical machine translation. In Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. 62--69.Google Scholar
Digital Library
- Mohammad Sadegh Rasooli and Michael Collins. 2015. Density-driven cross-lingual transfer of dependency parsers. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Lisbon, Portugal, 328--338. http://aclweb.org/anthology/D15-1039.Google Scholar
Cross Ref
- Mohammad Sadegh Rasooli and Michael Collins. 2017. Cross-lingual syntactic transfer with limited resources. Transactions of the Association for Computational Linguistics 5 (2017), 279--293.Google Scholar
Cross Ref
- Mohammad Sadegh Rasooli and Michael Collins. 2019. Low-resource syntactic transfer with unsupervised source reordering. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 3845--3856. DOI:https://doi.org/10.18653/v1/N19-1385Google Scholar
Cross Ref
- Mohammad Sadegh Rasooli and Joel Tetreault. 2015. Yara Parser: A Fast and Accurate Dependency Parser. arxiv:cs.CL/1503.06733Google Scholar
- Rudolf Rosa, Ondřej Dušek, David Mareček, and Martin Popel. 2012. Using parallel features in parsing of machine-translated sentences for correction of grammatical errors. In Proceedings of 6th Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST-6), ACL. Association for Computational Linguistics, Jeju, Korea, 39--48. http://hdl.handle.net/11858/00-097C-0000-0023-7AEB-4.Google Scholar
- Rudolf Rosa and David Mareček. 2018. CUNI x-ling: Parsing under-resourced languages in CoNLL 2018 UD shared task. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 187--196. DOI:https://doi.org/10.18653/v1/K18-2019Google Scholar
- Rudolf Rosa and Zdenek Zabokrtsky. 2015. Klcpos3-a language similarity measure for delexicalized parser transfer. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Vol. 2. 243--249.Google Scholar
Cross Ref
- Rudolf Rosa and Zdeněk Žabokrtský. 2015. MSTParser model interpolation for multi-source delexicalized transfer. In Proceedings of the 14th International Conference on Parsing Technologies. Association for Computational Linguistics, Bilbao, Spain, 71--75. DOI:https://doi.org/10.18653/v1/W15-2209Google Scholar
Cross Ref
- Piotr Rybak and Alina Wróblewska. 2018. Semi-supervised neural system for tagging, parsing and lematization. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 45--54. DOI:https://doi.org/10.18653/v1/K18-2004Google Scholar
Cross Ref
- Kenji Sagae and Alon Lavie. 2006. Parser combination by reparsing. In Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers. Association for Computational Linguistics, New York City, USA, 129--132. https://www.aclweb.org/anthology/N06-2033.Google Scholar
Digital Library
- Michael Schlichtkrull and Anders Søgaard. 2017. Cross-lingual dependency parsing with late decoding for truly low-resource languages. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers. 220--229.Google Scholar
Cross Ref
- Manon Scholivet, Franck Dary, Alexis Nasr, Benoit Favre, and Carlos Ramisch. 2019. Typological features for multilingual delexicalised dependency parsing. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 3919--3930. DOI:https://doi.org/10.18653/v1/N19-1393Google Scholar
Cross Ref
- Tal Schuster, Ori Ram, Regina Barzilay, and Amir Globerson. 2019. Cross-lingual alignment of contextual word embeddings, with applications to zero-shot dependency parsing. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 1599--1613. DOI:https://doi.org/10.18653/v1/N19-1162Google Scholar
Cross Ref
- Peter Shaw, Jakob Uszkoreit, and Ashish Vaswani. 2018. Self-attention with relative position representations. CoRR abs/1803.02155 (2018). arxiv:1803.02155 http://arxiv.org/abs/1803.02155.Google Scholar
- Tianze Shi, Felix G. Wu, Xilun Chen, and Yao Cheng. 2017. Combining global models for parsing universal dependencies. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. 31--39.Google Scholar
Cross Ref
- Aaron Smith, Bernd Bohnet, Miryam de Lhoneux, Joakim Nivre, Yan Shao, and Sara Stymne. 2018. 82 treebanks, 34 models: Universal dependency parsing with multi-treebank models. In Proceedings of the {C}o{NLL} 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, 113–123. DOI:https://doi.org/10.18653/v1/k18-2011Google Scholar
Cross Ref
- Samuel L. Smith, David H. P. Turban, Steven Hamblin, and Nils Y. Hammerla. 2017. Offline bilingual word vectors, orthogonal transformations and the inverted softmax. In Proceedings of the 5th International Conference on Learning Representations (ICLR 2017), (Toulon, France, April 24-26, 2017). Conference Track Proceedings. https://openreview.net/forum?id=r1Aab85gg.Google Scholar
- Anders Søgaard. 2011. Data point selection for cross-language adaptation of dependency parsers. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers-Volume 2. Association for Computational Linguistics, 682--686.Google Scholar
Digital Library
- Anders Søgaard and Julie Wulff. 2012. An empirical study of non-lexical extensions to delexicalized transfer. In Proceedings of COLING 2012: Posters. The COLING 2012 Organizing Committee, Mumbai, India, 1181--1190. https://www.aclweb.org/anthology/C12-2115.Google Scholar
- Milan Straka. 2018. UDPipe 2.0 prototype at CoNLL 2018 UD shared task. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 197--207. DOI:https://doi.org/10.18653/v1/K18-2020Google Scholar
- Milan Straka, Jan Hajic, Jana Straková, and Jan Hajic, Jr. 2015. Parsing universal dependency treebanks using neural networks and search-based oracle. In Proceedings of the International Workshop on Treebanks and Linguistic Theories (TLT14). 208--220.Google Scholar
- Emma Strubell, Patrick Verga, Daniel Andor, David Weiss, and Andrew McCallum. 2018. Linguistically-informed self-attention for semantic role labeling. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 5027--5038.Google Scholar
Cross Ref
- Mihai Surdeanu, Massimiliano Ciaramita, and Hugo Zaragoza. 2011. Learning to rank answers to non-factoid questions from web collections. Computational Linguistics 37, 2 (2011), 351--383.Google Scholar
Digital Library
- Oscar Täckström, Ryan McDonald, and Jakob Uszkoreit. 2012. Cross-lingual word clusters for direct transfer of linguistic structure. In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 477--487.Google Scholar
- Oscar Täckström, Ryan T. McDonald, and Joakim Nivre. 2013. Target language adaptation of discriminative transfer parsers. In Human Language Technologies: Proceedings of the Conference of the North American Chapter of the Association of Computational Linguistics, (Westin Peachtree Plaza Hotel, Atlanta, Georgia, June 9-14, 2013). 1061--1071. http://aclweb.org/anthology/N/N13/N13-1126.pdf.Google Scholar
- Jörg Tiedemann, Željko Agić, and Joakim Nivre. 2014. Treebank translation for cross-lingual parser induction. In Proceedings of the 18th Conference on Computational Natural Language Learning. Association for Computational Linguistics, Ann Arbor, Michigan, 130--140. http://www.aclweb.org/anthology/W/W14/W14-1614.Google Scholar
Cross Ref
- Jörg Tiedemann and Zeljko Agic. 2016. Synthetic treebanking for cross-lingual dependency parsing. J. Artif. Intell. Res. (JAIR) 55 (2016), 209--248. DOI:https://doi.org/10.1613/jair.4785Google Scholar
Digital Library
- Jakob Uszkoreit and Thorsten Brants. 2008. Distributed word clustering for large scale class-based language modeling in machine translation. In Proceedings of ACL-08: HLT. 755--762.Google Scholar
- Clara Vania, Yova Kementchedjhieva, Anders Søgaard, and Adam Lopez. 2019. A systematic comparison of methods for low-resource dependency parsing on genuinely low-resource languages. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing (EMNLP). Hong Kong, China.Google Scholar
Cross Ref
- Clara Vania, Xingxing Zhang, and Adam Lopez. 2017. UParse: The Edinburgh system for the CoNLL 2017 UD shared task. In Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Vancouver, Canada, 100--110. DOI:https://doi.org/10.18653/v1/K17-3010Google Scholar
Cross Ref
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems 30, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Eds.). Curran Associates, Inc., 5998--6008. http://papers.nips.cc/paper/7181-attention-is-all-you-need.pdf.Google Scholar
Digital Library
- Hui Wan, Tahira Naseem, Young-Suk Lee, Vittorio Castelli, and Miguel Ballesteros. 2018. IBM research at the CoNLL 2018 shared task on multilingual parsing. In Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. Association for Computational Linguistics, Brussels, Belgium, 92--102. DOI:https://doi.org/10.18653/v1/K18-2009Google Scholar
- Dingquan Wang and Jason Eisner. 2016. The galactic dependencies treebanks: Getting more data by synthesizing new languages. Transactions of the Association for Computational Linguistics 4 (2016), 491--505.Google Scholar
Cross Ref
- Dingquan Wang and Jason Eisner. 2018. Surface statistics of an unknown language indicate how to parse it. Transactions of the Association for Computational Linguistics 6 (Dec. 2018), 667--685. DOI:https://doi.org/10.1162/tacl_a_00248Google Scholar
Cross Ref
- Dingquan Wang and Jason Eisner. 2018. Synthetic data made to order: The case of parsing. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. 1325--1337.Google Scholar
Cross Ref
- Mengqiu Wang and Christopher D. Manning. 2010. Probabilistic tree-edit models with structured latent variables for textual entailment and question answering. In Proceedings of the 23rd International Conference on Computational Linguistics (COLING’10). Association for Computational Linguistics, Stroudsburg, PA, USA, 1164--1172. http://dl.acm.org/citation.cfm?id=1873781.1873912.Google Scholar
- Tsung-Hsien Wen, Milica Gasic, Nikola Mrksic, Pei-Hao Su, David Vandyke, and Steve Young. 2015. Semantically conditioned LSTM-based natural language generation for spoken dialogue systems. arXiv preprint arXiv:1508.01745 (2015).Google Scholar
- Min Xiao and Yuhong Guo. 2014. Distributed word representation learning for cross-lingual dependency parsing. In Proceedings of the 18th Conference on Computational Natural Language Learning. 119--129.Google Scholar
Cross Ref
- David Yarowsky, Grace Ngai, and Richard Wicentowski. 2001. Inducing multilingual text analysis tools via robust projection across aligned corpora. In Proceedings of the 1st International Conference on Human Language Technology Research. Association for Computational Linguistics, 1--8.Google Scholar
Digital Library
- Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský, and Jan Hajič. 2012. HamleDT: To parse or not to parse? In Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC-2012). European Languages Resources Association (ELRA), Istanbul, Turkey, 2735--2741. http://www.lrec-conf.org/proceedings/lrec2012/pdf/429_Paper.pdf.Google Scholar
- Daniel Zeman and Philip Resnik. 2008. Cross-language parser adaptation between related languages. In Proceedings of the IJCNLP-08 Workshop on NLP for Less Privileged Languages. Asian Federation of Natural Language Processing.Google Scholar
- Yuan Zhang and Regina Barzilay. 2015. Hierarchical low-rank tensors for multilingual transfer parsing. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 1857–1867.Google Scholar
Cross Ref
- Hai Zhao, Yan Song, Chunyu Kit, and Guodong Zhou. 2009. Cross language dependency parsing using a bilingual lexicon. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. Association for Computational Linguistics, Suntec, Singapore, 55--63. https://www.aclweb.org/anthology/P09-1007.Google Scholar
Index Terms
A Survey of the Model Transfer Approaches to Cross-Lingual Dependency Parsing
Recommendations
Transform, Combine, and Transfer: Delexicalized Transfer Parser for Low-resource Languages
Transfer parsing has been used for developing dependency parsers for languages with no treebank by using transfer from treebanks of other languages (source languages). In delexicalized transfer, parsed words are replaced by their part-of-speech tags. ...
Improving Telugu Dependency Parsing using Combinatory Categorial Grammar Supertags
We show that Combinatory Categorial Grammar (CCG) supertags can improve Telugu dependency parsing. In this process, we first extract a CCG lexicon from the dependency treebank. Using both the CCG lexicon and the dependency treebank, we create a CCG ...
Dependency Parsing on Source Language with Reordering Information in SMT
IALP '12: Proceedings of the 2012 International Conference on Asian Language ProcessingIn statistical machine translation, many translation errors may easily occur especially when the word orders are very different between source language and target language, especially with asymmetric morphological structures. The paper investigates ...






Comments