skip to main content
research-article

Critical Analysis of Existing Punjabi Grammar Checker and a Proposed Hybrid Framework Involving Machine Learning and Rule-Base Criteria

Published:29 April 2022Publication History
Skip Abstract Section

Abstract

An important area of research involving Artificial Intelligence (AI) is Natural Language Processing (NLP). The objective of training a machine is to imitate and manipulate text and speech of humans. Progressive research is undertaken to find connections between humans and their usage of language commonly used being referred as Natural Language. Various tools for different languages have been developed for operating the natural languages widely used by public. NLP integrates various disciplines and works cohesively for processing text, Information Retrieval, AI and so on. One such tool used for checking the accuracy of a given sentence in any language is referred to as a Grammar Checker. So a Grammar checker of a particular language explores grammatical errors (if any) and provides remedial suggestions for correction of the same. Such feature is imbibed by virtue of Natural Language Processing using Computational Linguistics. We have justified the need of an emerging Machine Learning technique by critically evaluating the existing Punjabi Grammar checker that was developed earlier in light of certain real-time cases. This process is accomplished by critically evaluating the output of each phase and identifying the component accountable for generating maximum errors and false alarms. Based on this analysis, we have proposed a hybrid framework as an efficient way of analyzing correction in sentences. This is attainable through the said booming technique of Machine Learning explicitly using Deep Neural Networks in combination with the existing rule-based approach. It's a novel approach as no work using machine learning has been done earlier in Punjabi Grammar Checker.

REFERENCES

  1. [1] Kabir H., Zaman J., Nayyer S., and Hussain S.. 2002. Two-pass parsing implementation for Urdu grammar checker. In Proceedings of the IEEE International Multi Topic Conference. 18.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Alam M., UzZaman Naushad, and Khan Mumit. 2006. N-gram based bangla and english statistical grammar checker. In Proceedings of the 9th International Conference on Computer and Information Technology.Google ScholarGoogle Scholar
  3. [3] Gill M. S. and Lehal G. S., and Joshi S. S.. 2008. A punjabi grammar checker. In Proceedings of the 3rd International Joint Conference on Natural Language Processing.Google ScholarGoogle Scholar
  4. [4] Bopche L., Dhopavkar G., and Kshirsagar M.. 2012. Rule based morphological process grammar checking system for an indian language. In Proceedings of the Global Trends in Information Systems and Software Applications. Springer, Berlin, 524531.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Gill M. S. and Lehal G. S.. 2008. A grammar checking system for Punjabi. In Proceedings of the 22nd International Conference on Computational Linguistics: Demonstration Papers. Association for Computational Linguistics, 149152.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Sharma S. K. and Lehal G. S.. 2016. Improving existing punjabi grammar checker. In Proceedings of the International Conference on Computational Techniques in Information and Communication Technologies. IEEE, 445449.Google ScholarGoogle ScholarCross RefCross Ref
  7. [7] Goodfellow I., Bengio Y., and Courville A.. 2016. Deep learning. s.l. MIT Press.Google ScholarGoogle Scholar
  8. [8] Gers F., Schimidhuber J., and Cummins F.. 2000. Learning to forget: Continual prediction with LSTM. Neural Computation 12, 10 (2000), 24512471.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Y. Kim. 2014. Convolutional neural networks for sentence classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, 1746--1751. Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Vaswani Ashish, Parmar Niki, Shazeer Noam, Uszkoreit Jakob, Gomez Aidan N., Jones Llion, Polosukhin Illia, and Kaiser Lukasz. 2017. Attention is all you need. In Proceedings of the Advances in Neural Information Processing Systems. 59986008.Google ScholarGoogle Scholar
  11. [11] Liu Z. R. and Liu Y.. 2017. Exploiting unlabeled data for neural grammatical error detection. 2017. Journal of Computer Science and Technology 32, 4 (2017), 758767.Google ScholarGoogle ScholarCross RefCross Ref
  12. [12] Yuan Z. and Briscoe T.. 2016. Grammatical error correction using NMT neural machine translation. In Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 380386.Google ScholarGoogle Scholar
  13. [13] BenlIgIray Burak.. 2018. Correcting writing errors in turkish with a character-level neural language model. In Proceedings of the 2018 26th Signal Processing and Communications Applications Conference. IEEE.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Ahmadi S., Roux J. Le, and Tomeh N.. 2017. Attention-based encoder-decoder networks for spelling and grammatical error correction. Master's Thesis. Department of Mathematics and Computer Science. Paris Descartes University.Google ScholarGoogle Scholar
  15. [15] Zhang Gang. 2020. A study of Grammar Analysis in English Teaching with Deep Learning Algorithm. In Proceedings of the International Journal of Emerging Technologies in Learning 15, 18 (2020), 2029.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Cheng Yong and Duan Mofan. 2020. Chinese grammatical error detection based on BERT model. In Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications. Suzhou, China, 108113.Google ScholarGoogle Scholar
  17. [17] Liu Yajun, Zan Hongying, Zhong Mengjie, and Ma Hongchao. 2018. Detecting simultaneously chinese grammar errors based on a BiLSTM-CRF Model. In Proceedings of the 5th Workshop on Natural Language Processing Techniques for Educational Applications. Melbourne, Australia, 188193.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Pouyanfar Samira, Sadiq Saad, Yan Yilin, Tian Haiman, Tao Yudong, Reyes Maria Presa, Shyu Mei-Ling, Chen Shu-Ching, and Iyengar S. S.. 2018. A survey on deep learning: Algorithms, techniques, and applications. ACM Computing Surveys 51, 5 (2018), 136.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Goller Christoph and Kuchler Andreas. 1996. Learning task-dependent distributed representations by backpropagation through structure. In Proceedings of the IEEE International Conference on Neural Networks. 347352.Google ScholarGoogle ScholarCross RefCross Ref
  20. [20] Socher Richard, Lin Cliff C., Manning Chris, and Ng Andrew Y.. 2011. Natural scenes Parsing and natural language with neural networks. In Proceedings of the International Conference on Machine Learning. 129136.Google ScholarGoogle Scholar
  21. [21] Cho Kyunghyun, van Merrienboer Bart, Gulcehre Caglar, Bahdanau Dzmitry, Bougares Fethi, Schwenk Holger, and Bengio Yoshua. 2014. Phrase representations Learning using RNN encoder-decoder for statistical machine translation. In Proceedings of the Conference on Empirical Methods Natural Language Processing. 17241734.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Noh Hyeonwoo, Hong Seunghoon, and Han Bohyung. 2015. Learning deconvolution network for semantic segmentation. In Proceedings of the IEEE International Conference on Computer Vision. IEEE, 15201528.Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. [23] Li Xiangang and Wu Xihong. 2015. Constructing LSTM long short-term memory based deep recurrent neural networks for large vocabulary speech recognition. In Proceedings of the IEEE International Conference on ASSP Acoustics, Speech and Signal Processing. IEEE, 45204524.Google ScholarGoogle Scholar
  24. [24] Cho K., Bahdanau D., and Bengio Y.. 2015. Neural machine translation by jointly learning to align and translate. In Proceedings of the 3rd International Conference on Learning Representations.Google ScholarGoogle Scholar
  25. [25] A. Schmaltz, Y. Kim, A. M. Rush, and S. M. Shieber. 2016. Sentence level grammatical error identification as sequence-to-sequence correction. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications. San Diego, CA, 242--251.Google ScholarGoogle Scholar
  26. [26] Xie Z., Avati A., Arivazhagan N., Jurafsky D., and Ng A. Y.. 2016. Neural language correction with character-based attention. Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Ji J., Wang Q., Truong S., Toutanova K., Gong Y., and Gao J.. 2017. A nested atention neural hybrid model for grammatical error correction. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. 753762.Google ScholarGoogle Scholar
  28. [28] Rei M. and Yannakoudakis H.. 2016. Compositional sequence labeling models for error detection in learner writing. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 11811191.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Rei M., Crichton G. K. O., and Pyysalo S.. 2016. Attending to characters in neural sequence labeling models. In Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers. 309318.Google ScholarGoogle Scholar
  30. [30] Yannakoudakis H., Rei M., Andersen O. E., and Yuan Z. 2017. Neural sequence-labelling models for grammatical error correction GEC. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 27952806.Google ScholarGoogle Scholar
  31. [31] Marcin Junczys-Dowmunt, Roman Grundkiewicz, Shubha Guha, and Kenneth Heafield. 2018. Approaching neural grammatical error correction as a low-resource machine translation task. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers). Association for Computational Linguistics. New Orleans, Louisiana. 595--606. Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] Wu Y., Schuster M., Chen Z., Le QV, Norouzi M., Macherey W., Krikun M., Cao Y., Gao Q., Macherey K., and Klingner J.. 2016. Google's neural machine NM translation system: Bridging gap between human machine translation. Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Chollampatt Shamil and Ng Hwee Tou. 2018. A multilayer convolutional encoder-decoder NN for grammatical error correction. In Proceedings of the 32nd AAAI Conference on AI Artificial Intelligence. 57555762.Google ScholarGoogle Scholar
  34. [34] Zhao Wei, Shen Kewei, Wang Liang, Jia Ruoyu, and Liu Jingming. 2019. Improving grammatical error correction via pre-training copy-augmented architecture unlabeled data. In Proceedings of the NAACL-HLT 2019. Minneapolis, Minnesota, 156165.Google ScholarGoogle ScholarCross RefCross Ref
  35. [35] Stankovic Ranka, Sandrih Branislava, Krstev Cvetana, MilošUtvić , and Mihailo Skoric. 2020. Machine learning and deep neural network-based lemmatization and morphosyntactic tagging for serbian. In Proceedings of the 12th Conference on Language Resources and Evaluation. 39543962.Google ScholarGoogle Scholar
  36. [36] Tezcan A., Hoste V., and Macken L.. 2017. A neural network architecture for detecting grammatical errors in statistical machine translation. Prague Bulletin of Mathematical Linguistics 108 (2017), 133145.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Luo Yikang, Bao Zuyi, Li Chen, and Wang Rui. 2020. Chinese grammatical error diagnosis with graph convolution network and multi-task learning. In Proceedings of the 6th Workshop on Natural Language Processing Techniques for Educational Applications. Suzhou, China, 4448.Google ScholarGoogle Scholar
  38. [38] Ng Hwee Tou, Hadiwinoto Christian, Wu Siew Mei, Wu Yuanbin, and Tetreault Joel. 2013. The CoNLL-2013 shared task on grammatical error correction. In Proceedings of the 7th Conference on Computational Natural Language Learning: Shared Task. Sofia, Bulgaria, 112.Google ScholarGoogle Scholar
  39. [39] Ng Hwee Tou, Bryant Christopher, Wu Siew Mei, Hadiwinoto Christian, Briscoe Ted, and Susanto Raymond Hendy. 2014. The CoNLL-2014 shared task on grammatical error correction. In Proceedings of the 8th Conference on Computational Natural Language Learning: Shared Task. Baltimore, Maryland, 114.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] Yannakoudakis Rei. 2017. Auxiliary objectives for neural error detection models. In Proceedings of the 12th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications. Copenhagen, Denmark: ACL Association for Computational Linguistics, 3343.Google ScholarGoogle Scholar
  41. [41] Gulordava Kristina, Bojanowski Piotr, Grave Edouard, Linzen Tal, and Baroni Marco. 2018. Colorless green recurrent networks dream hierarchically. In Procedings of the 2018 Conference of North American CACL Chapter of the Association for Computational Linguistics: Human Language Technologies. New Orleans, Louisiana: ACL, 11951205.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Kuncoro Adhiguna, Dyer Chris, Hale John, Yogatama Dani, Clark Stephen, and Blunsom Phil. 2018. LSTMs can learn syntax-sensitive dependencies well, but modeling structure makes them better. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. Melbourne, Australia: ACL, 14261436.Google ScholarGoogle ScholarCross RefCross Ref
  43. [43] Linzen Tal, Dupoux Emmanuel, and Goldberg Yoav. 2016. Assessing the ability of LSTMs to learn syntax-sensitive dependencies. Transactions of the Association for Computational Linguistics 4, 521535.Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Lin Nankai, Chen Boyu, Lin Xiaotian, Wattanachote Kanoksak, and Jiang Shengyi. 2021. A framework for indonesian grammar error correction. ACM Transactions on Asian and Low-Resource Language Information Processing 20, 4 (2021), 112.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Omelianchuk Kostiantyn, Chernodub Artem, Atrasevych Vitaliy, and Skurzhanskyi Oleksandr. 2020. GECToR –grammatical error correction: Tag, not rewrite. Procedings of the 15th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications. 163170.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Grundkiewicz Roman, Junczys-Dowmunt Marcin, and Heafield Kenneth. 2019. Neural grammatical error correction systems with unsupervised pre-training on synthetic data. In Procedings of the 14th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications. 252263.Google ScholarGoogle ScholarCross RefCross Ref
  47. [47] Bryant Christopher, Felice Mariano, Andersen Oistein E., and Briscoe Ted. 2019. The BEA-2019 shared task on grammatical error correction. In Procedings of the 14th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications. Florence, Italy, 5275.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Bell Samuel, Rei Marek, and Yannakoudakis Helen. 2019. Context is key: grammatical error detection with contextual word representations. arXiv:1906.06593. Retrieved from https://arxiv.org/abs/1906.06593.Google ScholarGoogle Scholar
  49. [49] Hao1a Senyue and Hao Gang. 2020. A research on online grammar checker system based on neural network model. In Proceedings of the Journal of Physics: Conference Series. IOP Publishing, 1651, 1 012135.Google ScholarGoogle Scholar
  50. [50] Zheng Yuan, Stahlberg Felix, Rei Marek, Byrne Bill, and Yannakoudakis Helen. 2019. Neural and FST-based approaches to GEC grammatical error correction. In Procedings of the 14th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications. 228239.Google ScholarGoogle Scholar
  51. [51] Zewei Zhao and Wang Houfeng. 2020. Maskgec: Improving neural grammatical error correction via dynamic masking. In Proceedings of the AAAI Conference on Artificial Intelligence 34, 01 (2020), 12261233.Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Sakaguchi Matt Post, Keisuke , and Durme Benjamin Van. 2017. GEC Grammatical error correction with NRL neural reinforcement learning. arXiv:1707.00299. Retrieved from https://arxiv.org/abs/1707.00299.Google ScholarGoogle Scholar
  53. [53] Choe Jiyeon Ham, Park Kyubyong, Joong Yo, and Yoon Yeoil. 2019. A neural grammatical error correction system built on better pre-training and sequential transfer learning. arXiv:1907.01256. Retrieved from https://arxiv.org/abs/1907.01256.Google ScholarGoogle Scholar
  54. [54] Sayami Sarbin, Bahadur Shahi Tej, and Shakya Subarna. 2019. Nepali POS Tagging Using Deep Learning Approaches. No. 2073. EasyChair.Google ScholarGoogle Scholar
  55. [55] Kumar S., Kumar M. Anand, and Soman K. P.. 2019. Deep learning based part-of-speech tagging for Malayalam Twitter data (Special issue: deep learning techniques for natural language processing). Journal of Intelligent Systems 28, 3 (2019), 423435.Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Deshmukh Rushali Dhumal and Kiwelekar Arvind. 2020. Deep learning techniques for part of speech tagging by natural language processing. In Proceedings of the 2020 2nd International Conference on Innovative Mechanisms for Industry Applications. IEEE, 7681.Google ScholarGoogle ScholarCross RefCross Ref
  57. [57] Todi Ketan Kumar, Mishra Pruthwik, and Sharma Dipti Misra. 2018. Building a kannada pos tagger using machine learning and neural network models. arXiv:1808.03175. Retrieved from https://arxiv.org/abs/1808.03175.Google ScholarGoogle Scholar
  58. [58] Anbukkarasi S. and Varadhaganapathy S.. 2021. Deep learning based tamil parts of speech (POS) Tagger. Bulletin of the Polish Academy of Sciences: Technical Sciences: e138820e138820.Google ScholarGoogle Scholar

Index Terms

  1. Critical Analysis of Existing Punjabi Grammar Checker and a Proposed Hybrid Framework Involving Machine Learning and Rule-Base Criteria

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 21, Issue 5
      September 2022
      486 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3533669
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 April 2022
      • Online AM: 23 March 2022
      • Revised: 1 January 2022
      • Accepted: 1 January 2022
      • Received: 1 September 2021
      Published in tallip Volume 21, Issue 5

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed
    • Article Metrics

      • Downloads (Last 12 months)125
      • Downloads (Last 6 weeks)5

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!