Abstract
Literary compositions are very often analyzed using various constituent units like words, phrases, sentences, and paragraphs. Unlike the conventional research that focuses on the aforementioned constituent units, our task is a statistical effort carried out on the most fundamental unit of any literary composition called varna, or character, followed by automated classification using learning algorithms. This article is a case study on the Hindi adaptations of two significant literary pieces, namely, Jana-Gaṇa-Mana and Vande-Mātaram, and acknowledging that the two songs being studied belong to different classes based on their bhava, i.e., the inherent emotion of the poem. The present task is the first of its kind that uses the concept of komala and kaṭhora varna to establish diversity between the two. The two-proportion Z-test is successfully applied to statistical data pertaining to the candidate songs, thereby reestablishing the theoretical assertions by investigating real pieces of literature. Taking the statistical verification as ground, a learning-based classification system is designed to yield the best accuracy of 85%, which further compliments the theory reestablished statistically.
- . 2007. Hindi: An Essential Grammar. Routledge Publication, New York.Google Scholar
- . 2014. Mathematics, poetry and beauty. Journal of Mathematics and the Arts 8, 1-2 (2014), 5–12.Google Scholar
Cross Ref
- . 2020. On exhaustive evaluation of eager machine learning algorithms for classification of Hindi verses. International Journal of Advanced Computer Science and Applications 11, 2 (2020), 181–185.Google Scholar
Cross Ref
- . 2013. Vande Mataram, the Biography of a Song. Primus Books, Delhi.Google Scholar
- . 2014. Poetic machine: Computational creativity for automatic poetry generation in Bengali. In 5th International Conference on Computational Creativity.Google Scholar
- R. A. Deshmukh, S. Kore, N. Chavan, S. Gole, and K. Adarsh. 2019. Marathi poem classification using machine learning. International Journal of Recent Technology and Engineering 8, 2 (2019), 2723-2727.Google Scholar
- . 2011. Rashmi Lok. Lokbharti Prakashan, Allahabad.Google Scholar
- . 2013. India: A Sacred Geography. Harmony Books, Delhi.Google Scholar
- . 2014. Guest editor's introduction. Journal of Mathematics and the Arts 8, 1-2 (2014), 1–4.Google Scholar
Cross Ref
- . 2002. Fundamentals of Mathematical Statistics. Sultan Chand & Sons, New Delhi.Google Scholar
- . 1995. Introduction to Hindi Grammar. Center for South and Southeast, University of California at Berkeley.Google Scholar
- . 2016. Sandhi: the rule-based word formation in Hindi. International Journal of Computer Science and Information Security 14, 12 (2016), 781–785.Google Scholar
- Kavita Kosh. 2023. Retrieved January, 2023 from http://kavitakosh.org/kk.Google Scholar
- . 2020. Designing Punjabi poetry classifiers using machine learning and different textual features. International Arab Journal of Information Technology 17, 1 (2020), 38–44.Google Scholar
Cross Ref
- V. Kumar. 2018. Vrihat Vyakarana Bhaskar (2nd ed.). Bharati Bhawan, Patna.Google Scholar
- . 2019. Schwa deletion in Hindi language speech synthesis. International Journal of Engineering and Advanced Technology 8, 6S (2019), 211–214.Google Scholar
Cross Ref
- National Portal of India. 2020. National anthem. Retrieved December 12, 2020 from Https://www.india.gov.in/india-glance/national-symbols.Google Scholar
- . 2018. Modern poetic trends of Odisha versus the neoteric strives of Guruprasad. International Journal of Trends in Scientific Research and Development 11, 2 (2018), 1140–1160.Google Scholar
Cross Ref
- . 2020. Model for classification of poems in Hindi language based on ras. In Smart Systems and IoT: Innovations in Computing, (Eds). Springer, Singapore, 655–661.Google Scholar
- . 2020. Author identification of Hindi poetry. International Journal of Scientific and Technology Research 9, 3 (2020), 3791–3795.Google Scholar
- . 2021. “The Road not taken” statistically taken: A probabilistic analysis of Frost's poem. Journal of Statistics and Management Systems 24, 8 (2021), 1733–1752.Google Scholar
Cross Ref
- . 2022. Automatic extraction of similar poetry for study of literary texts: An experiment on Hindi poetry. ETRI Journal (2022), 413–425.Google Scholar
Cross Ref
- . 2015. Automated analysis of Bangla poetry for classification and poet identification. In 12th International Conference on Natural Language Processing, D. S. Sharma (Ed.). NLP Association of India, Trivandrum, India, 247–253.Google Scholar
- . 2021. Kavya Ke Tatva (1st ed.). Lokbharti Prakashan, Allahabad.Google Scholar
- . 2016. Dhwani Siddhant: Ek naya moolyankan. Shubhadra Publishers and Distributors, Delhi.Google Scholar
- . 2005. Constructing Post-Colonial India: National Character and the Doon School. Routledge, London.Google Scholar
Cross Ref
- . 2022. Alankaar Parijaat. 18th ed. Lakshmi Narayan Agrawal, Agra.Google Scholar
- The Daily Star. 2011. Bankim Chandra: The first prominent Bengali novelist.Google Scholar
Index Terms
Komala and Kaṭhora: A Novel Approach Towards Classification of Hindi Poetry
Recommendations
Punjabi Poetry Classification: The Test of 10 Machine Learning Algorithms
ICMLC 2017: Proceedings of the 9th International Conference on Machine Learning and ComputingLiterature of country represents the prosperity of that country. India, being a multilingual country, is having a rich heritage and literature. In order to retrieve literature pieces easily, it must be classified. In this research work, vocabulary-...
An unsupervised Hindi stemmer with heuristic improvements
AND '08: Proceedings of the second workshop on Analytics for noisy unstructured text dataStemmers are used to convert inflected words into their root or stem. Stem does not necessarily correspond to linguistic root of a word. Stemming improve performance by reducing morphologically variants into same words. This paper presents an approach ...
Intelligent Part of Speech tagger for Hindi
AbstractEnglish Part of Speech like noun, verb, adverb, adjective, pronoun, preposition, interjection, conjunction is somewhat similar in Hindi but not exactly the same. Hindi grammar has different Part of Speech (POS) based on its morphological features ...






Comments