Abstract
Natural Language Processing (NLP) is an important area of research within Artificial Intelligence (AI). Its objective is to train machines to imitate and manipulate human text and speech. Ongoing research examines how humans use the languages of everyday communication, referred to as natural languages, and various tools have been developed for processing the natural languages widely used by the public. NLP draws on several disciplines, including text processing, Information Retrieval, and AI. One such tool, used to check the grammatical correctness of a sentence in a given language, is the grammar checker: it detects grammatical errors (if any) and suggests corrections. This capability is provided by Natural Language Processing using Computational Linguistics. We justify the need for a Machine Learning technique by critically evaluating the previously developed Punjabi grammar checker in light of certain real-time cases. We evaluate the output of each phase and identify the component responsible for the most errors and false alarms. Based on this analysis, we propose a hybrid framework as an efficient way of analyzing and correcting sentences, which combines Machine Learning, specifically Deep Neural Networks, with the existing rule-based approach. The approach is novel in that no earlier work has applied machine learning to Punjabi grammar checking.
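One way to picture a hybrid of rule-based and learned components is sketched below. It is only an illustration under stated assumptions, not the framework proposed in the paper: the names `Flag`, `hybrid_check`, `repeated_word_rule`, and `toy_scorer` are hypothetical, the rule is a toy repeated-word check rather than a Punjabi grammar rule, and the scorer is a hand-written stand-in for a trained neural network. The idea shown is that a rule-based flag is kept only when a learned scorer also considers an error likely at that position, which is one plausible way to reduce false alarms.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Flag:
    """A suspected grammatical error reported by a rule (hypothetical type)."""
    position: int  # token index where the rule fired
    rule: str      # name of the rule that fired


def hybrid_check(
    tokens: List[str],
    rules: List[Callable[[List[str]], List[Flag]]],
    error_probability: Callable[[List[str], int], float],
    threshold: float = 0.5,
) -> List[Flag]:
    """Keep a rule-based flag only if the scorer rates it at or above threshold."""
    kept = []
    for rule in rules:
        for flag in rule(tokens):
            if error_probability(tokens, flag.position) >= threshold:
                kept.append(flag)
    return kept


# Toy stand-ins (assumptions): not real grammar rules and not a trained model.
def repeated_word_rule(tokens: List[str]) -> List[Flag]:
    """Flag every token that repeats the token immediately before it."""
    return [
        Flag(i, "repeated_word")
        for i in range(1, len(tokens))
        if tokens[i] == tokens[i - 1]
    ]


def toy_scorer(tokens: List[str], position: int) -> float:
    """Placeholder for a neural scorer: high score only for short tokens."""
    return 0.9 if len(tokens[position]) <= 3 else 0.1


flags = hybrid_check(
    ["the", "the", "cat", "very", "very"],
    [repeated_word_rule],
    toy_scorer,
)
print(flags)
```

In this toy run the rule fires twice, and the stand-in scorer suppresses the second flag. In a real system the scorer would be a trained model and the threshold would be tuned on held-out data.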
Index Terms
Critical Analysis of Existing Punjabi Grammar Checker and a Proposed Hybrid Framework Involving Machine Learning and Rule-Base Criteria