skip to main content
research-article

Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey

Published:17 June 2023Publication History
Skip Abstract Section

Abstract

A good benchmark dataset is a primary requirement in the offline handwritten character recognition (HCR) process. Only three handwritten numerals and alphabet datasets from Odia are publicly accessible for study, although many writers have used several datasets in their experiments. In this article, two tasks are done to address this issue. Those are the following: First, an extensive survey focused on various datasets is provided with the methodologies used in chronological order. The second factor is a solution to the lack of publicly available handwritten characters and numeral datasets. A new dataset of handwritten Odia characters with numerals has been developed. Anyone can access this dataset by sending an email to the authors of the article. This dataset was created with the help of 150 volunteers of various age groups, races, and qualifications. Some homogeneous experiments are conducted using deep learning models to evaluate the consistency of the dataset. One heterogeneous trial has also been performed to estimate the complexities of the characters present in the dataset by comparing them with the existing benchmark datasets.

REFERENCES

  1. [1] Acharya Shailesh, Pant Ashok Kumar, and Gyawali Prashnna Kumar. 2015. Deep learning based large scale handwritten Devanagari character recognition. In 9th International Conference on Software, Knowledge, Information Management and Applications (SKIMA). IEEE, 16.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Bhattacharya Ujjwal and Chaudhuri B. B.. 2005. Databases for research on recognition of handwritten characters of Indian scripts. In 8th International Conference on Document Analysis and Recognition (ICDAR’05). IEEE, 789793.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. [3] Bhattacharya Ujjwal, Shridhar Malayappan, Parui Swapan K., Sen P. K., and Chaudhuri B. B.. 2012. Offline recognition of handwritten Bangla characters: An efficient two-stage approach. Pattern Anal. Applic. 15, 4 (2012), 445458.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Bhowmik Tapan K., Parui Swapan K., Bhattacharya Ujjwal, and Shaw Bikash. 2006. An HMM based recognition scheme for handwritten Oriya numerals. In 9th International Conference on Information Technology (ICIT’06). IEEE, 105110.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. [5] Cao Zhong, Lu Jiang, Cui Sen, and Zhang Changshui. 2020. Zero-shot handwritten Chinese character recognition with hierarchical decomposition embedding. Pattern Recog. 107 (2020), 107488.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Chanda Sukalpa, Franke Katrin, and Pal Umapada. 2012. Text independent writer identification for Oriya script. In 10th IAPR International Workshop on Document Analysis Systems. IEEE, 369373.Google ScholarGoogle Scholar
  7. [7] Das Dibyasundar, Dash Ratnakar, and Majhi Banshidhar. 2018. Optimization based feature generation for handwritten Odia-numeral recognition. In 14th International Conference on Information Processing (ICINPRO). IEEE, 15.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Das Dibyasundar, Nayak Deepak Ranjan, Dash Ratnakar, and Majhi Banshidahar. 2019. An empirical evaluation of extreme learning machine: Application to handwritten character recognition. Multim. Tools Applic. 78, 14 (2019), 1949519523.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Das Dibyasundar, Nayak Deepak Ranjan, Dash Ratnakar, and Majhi Banshidhar. 2020. MJCN: Multi-objective Jaya convolutional network for handwritten optical character recognition. Multim. Tools Applic. 79, 43 (2020), 3302333042.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Dash Bhabani, Pradhan Shibashis, and Rana Debaraj. 2016. Odia offline character recognition using DWT features. IOSR J. Electron. Commun. Eng. (2016), 3137.Google ScholarGoogle Scholar
  11. [11] Dash Kalyan S., Puhan N. B., and Panda Ganapati. 2014. A hybrid feature and discriminant classifier for high accuracy handwritten Odia numeral recognition. In IEEE Region 10 Symposium. IEEE, 531535.Google ScholarGoogle Scholar
  12. [12] Dash Kalyan Sourav, Puhan Niladri B., and Panda Ganapati. 2015. Handwritten numeral recognition using non-redundant Stockwell transform and bio-inspired optimal zoning. IET Image Process. 9, 10 (2015), 874882.Google ScholarGoogle ScholarCross RefCross Ref
  13. [13] Dash Kalyan S., Puhan Niladri B., and Panda Ganapati. 2015. On extraction of features for handwritten Odia numeral recognition in transformed domain. In 8th International Conference on Advances in Pattern Recognition (ICAPR). IEEE, 16.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Dash Kalyan S., Puhan Niladri B., and Panda Ganapati. 2017. Odia character recognition: A directional review. Artif. Intell. Rev. 48, 4 (2017), 473497.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Dash Kalyan S., Puhan Niladri B., and Panda Ganapati. 2020. Sparse concept coded tetrolet transform for unconstrained Odia character recognition. arXiv preprint arXiv:2004.01551.Google ScholarGoogle Scholar
  16. [16] Campos Teófilo Emídio De, Babu Bodla Rakesh, Varma Manik et al. 2009. Character recognition in natural images. VISAPP 7, 2 (2009).Google ScholarGoogle Scholar
  17. [17] Deng Jia, Dong Wei, Socher Richard, Li Li-Jia, Li Kai, and Fei-Fei Li. 2009. ImageNet: A large-scale hierarchical image database. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 248255.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Dey Raghunath and Balabantaray Rakesh Chandra. 2019. A novel sliding window approach for offline handwritten character recognition. In International Conference on Information Technology (ICIT). IEEE, 178183.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Dey Raghunath, Balabantaray Rakesh Chandra, and Mohanty Sanghamitra. 2021. Sliding window based off-line handwritten text recognition using edit distance. Multim. Tools Applic. 81, 16 (2021), 128.Google ScholarGoogle Scholar
  20. [20] Dey Raghunath, Balabantaray Rakesh Chandra, and Piri Jayashree. 2020. A robust handwritten digit recognition system based on sliding window with edit distance. In IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT). IEEE, 16.Google ScholarGoogle ScholarCross RefCross Ref
  21. [21] Dey Raghunath, Balabantaray Rakesh Chandra, Piri Jayashree, and Singh Debabrata. 2021. Offline natural scene character recognition using VGG16 neural networks. In 3rd International Conference on Inventive Research in Computing Applications (ICIRCA). IEEE, 946951.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Guha Riya, Das Nibaran, Kundu Mahantapas, Nasipuri Mita, and Santosh K. C.. 2020. DevNet: An efficient CNN architecture for handwritten Devanagari character recognition. Int. J. Pattern Recog. Artif. Intell. 34, 12 (2020), 2052009.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Jindal Tarun and Bhattacharya Ujjwal. 2013. Recognition of offline handwritten numerals using an ensemble of MLPs combined by AdaBoost. In 4th International Workshop on Multilingual OCR. ACM, 15.Google ScholarGoogle Scholar
  24. [24] Justusson B. I.. 1981. Median filtering: Statistical properties. In Two-dimensional Digital Signal Processing II. Springer, 161196.Google ScholarGoogle Scholar
  25. [25] Krizhevsky Alex, Sutskever Ilya, and Hinton Geoffrey E.. 2017. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 6 (2017), 8490.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Li Zhiyuan, Wu Qi, Xiao Yi, Jin Min, and Lu Huaxiang. 2020. Deep matching network for handwritten Chinese character recognition. Pattern Recog. 107 (2020), 107471.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Mahto Manoj Kumar, Kumari Archana, and Panigrahi S.. 2011. Asystem for Oriya handwritten numeral recognization for Indian postal automation. Int. J. Appl. Sci. Technol. Res. Excell. 1, 1 (2011), 1723.Google ScholarGoogle Scholar
  28. [28] Majhi Babita and Pujari Puspalata. 2018. On development and performance evaluation of novel Odia handwritten digit recognition methods. Arab. J. Sci. Eng. 43, 8 (2018), 38873901.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Meher Sukadev and Basa Debasish. 2011. An intelligent scanner with handwritten Odia character recognition capability. In 5th International Conference on Sensing Technology. IEEE, 5359.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Mishra Tusar Kanti, Majhi Banshidhar, and Panda Sandeep. 2013. A comparative analysis of image transformations for handwritten Odia numeral recognition. In International Conference on Advances in Computing, Communications and Informatics (ICACCI). IEEE, 790793.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Mishra Tusar Kanti, Majhi Banshidhar, Sa Pankaj K., and Panda Sandeep. 2014. Model based Odia numeral recognition using fuzzy aggregated features. Front. Comput. Sci. 8, 6 (2014), 916922.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Mohapatra Ramesh Kumar, Majhi Banshidhar, and Jena Sanjay Kumar. 2015. Classification of handwritten Odia basic character using Stockwell transform. Int. J. Appl. Pattern Recog. 2, 3 (2015), 235254.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Mohapatra Ramesh Kumar, Mishra Tusar Kanti, Panda Sandeep, and Majhi Banshidhar. 2015. OHCS: A database for handwritten atomic Odia Character Recognition. In 5th National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG). IEEE, 14.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Nayak Mamata and Nayak Ajit Kumar. 2017. Odia character recognition using backpropagation network with binary features. Int. J. Computat. Vis. Robot. 7, 5 (2017), 588604.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Nongmeikapam Kishorjit, Wahengbam Kanan, Meetei Oinam Nickson, and Tuithung Themrichon. 2019. Handwritten Manipuri Meetei-Mayek classification using convolutional neural network. ACM Trans. Asian Low-resour. Lang. Inf. Process. 18, 4 (2019), 123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Otsu Nobuyuki. 1979. A threshold selection method from gray-level histograms. IEEE Trans. Syst., Man, Cybern. 9, 1 (1979), 6266.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Padhi Debananda and Senapati Debabrata. 2013. Zone centroid distance and standard deviation based feature matrix for Odia handwritten character recognition. In International Conference on Frontiers of Intelligent Computing: Theory and Applications (FICTA). Springer, 649658.Google ScholarGoogle ScholarCross RefCross Ref
  38. [38] Pal Umapada, Jayadevan Ramachandran, and Sharma Nabin. 2012. Handwriting recognition in Indian regional scripts: A survey of offline techniques. ACM Trans. Asian Lang. Inf. Process. 11, 1 (2012), 135.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Pal Umapada, Sharma Nabin, and Kimura Fumitaka. 2007. Oriya off-line handwritten character recognition. In Advances in Pattern Recognition. World Scientific, 123128.Google ScholarGoogle Scholar
  40. [40] Pal Umapada, Sharma Nabin, Wakabayashi Tetsushi, and Kimura Fumitaka. 2007. Handwritten numeral recognition of six popular Indian scripts. In 9th International Conference on Document Analysis and Recognition (ICDAR’07), Vol. 2. IEEE, 749753.Google ScholarGoogle ScholarCross RefCross Ref
  41. [41] Pal Umapada, Wakabayashi Tetsushi, and Kimura Fumitaka. 2007. A system for off-line Oriya handwritten character recognition using curvature feature. In 10th International Conference on Information Technology (ICIT’07). IEEE, 227229.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Panda Smruti Rekha and Tripathy Jogeswar. 2015. Odia offline typewritten character recognition using template matching with unicode mapping. In International Symposium on Advanced Computing and Communication (ISACC). IEEE, 109115.Google ScholarGoogle Scholar
  43. [43] Roy Kaushik, Pal Tandra, Pal Umapada, and Kimura Fumitaka. 2005. Oriya handwritten numeral recognition system. In 8th International Conference on Document Analysis and Recognition (ICDAR’05). IEEE, 770774.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Rushiraj Indugu, Kundu Souvik, and Ray Baidyanath. 2016. Handwritten character recognition of Odia script. In International Conference on Signal Processing, Communication, Power and Embedded System (SCOPES). IEEE, 764767.Google ScholarGoogle ScholarCross RefCross Ref
  45. [45] Sandler Mark, Howard Andrew, Zhu Menglong, Zhmoginov Andrey, and Chen Liang-Chieh. 2018. MobileNetV2: Inverted residuals and linear bottlenecks. In IEEE Conference on Computer Vision and Pattern Recognition. 45104520.Google ScholarGoogle ScholarCross RefCross Ref
  46. [46] Sarangi Pradeepta K. and Ahmed Parvez. 2013. Recognition of handwritten Odia numerals using artificial intelligence techniques. Int. J. Comput. Sci. 2, 02 (2013), 3538.Google ScholarGoogle Scholar
  47. [47] Sarangi Pradeepta K., Ahmed P., and Ravulakollu Kiran K.. 2014. Naïve Bayes classifier with Lu factorization for recognition of handwritten Odia numerals. Ind. J. Sci. Technol. 7, 1 (2014), 3538.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Sarangi Pradeepta K., Sahoo Ashok K., and Ahmed P.. 2012. Recognition of isolated handwritten oriya numerals using Hopfield neural network. Int. J. Comput. Applic. 40, 8 (2012), 3642.Google ScholarGoogle ScholarCross RefCross Ref
  49. [49] Satpathy Subhashree, Nayak Ajit Kumar, Nayak Mamata, and Anusandhan Siksha O.. 2018. Use of stacked autoencoder in Odia numeral image recognition. Int. J. Pure Appl. Math. 118, 24 (2018).Google ScholarGoogle Scholar
  50. [50] Sethy Abhisek and Patra Prashanta Kumar. 2016. Off-line Odia handwritten numeral recognition using neural network: A comparative analysis. In International Conference on Computing, Communication and Automation (ICCCA). IEEE, 10991103.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Sethy Abhisek and Patra Prashanta Kumar. 2019. Off-line Odia handwritten character recognition: An axis constellation model based research. Int. J. Innov. Technol. Explor. Eng. 8, 9S2 (2019), 788793.Google ScholarGoogle ScholarCross RefCross Ref
  52. [52] Sethy Abhisek and Patra Prashanta Kumar. 2020. R-HOG feature-based off-line Odia handwritten character recognition. In Examining Fractal Image Processing and Analysis. IGI Global, 196210.Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Sethy Abhisek, Patra Prashanta Kumar, and Nayak Deepak Ranjan. 2018. Off-line handwritten Odia character recognition using DWT and PCA. In Progress in Advanced Computing and Intelligent Engineering. Springer, 187195.Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Sethy Abhisek, Patra Prashanta K., and Nayak Deepak Ranjan. 2019. Gray-level co-occurrence matrix and random forest based off-line Odia handwritten character recognition. Recent Patents Eng. 13, 2 (2019), 136141.Google ScholarGoogle ScholarCross RefCross Ref
  55. [55] Sethy Abhisek, Patra Prashanta Kumar, Nayak SoumyaRanjan, and Jena Pyari Mohan. 2017. Symmetric axis based off-line Odia handwritten character and numeral recognition. In 3rd International Conference on Computational Intelligence and Networks (CINE). IEEE, 8387.Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Sethy Abhisek, Patra Prashanta Kumar, and Nayak Soumya Ranjan. 2020. Offline handwritten numeral recognition using convolution neural network. Mach. Vis. Inspect. Syst.: Image Process., Concepts, Methodol. Applic. 1 (2020), 197212.Google ScholarGoogle Scholar
  57. [57] Singh Pawan Kumar, Chatterjee Iman, Sarkar Ram, Smith Elisa Barney, and Nasipuri Mita. 2021. A new feature extraction approach for script invariant handwritten numeral recognition. Expert Syst. 38, 6 (2021), e12699.Google ScholarGoogle Scholar
  58. [58] Singh Sunny, Sarangi Pradeepta Kumar, Singla Chaitanya, and Sahoo Ashok Kumar. 2021. Odia character recognition system: A study on feature extraction and classification techniques. Mater. Today: Proc. 34 (2021), 742747.Google ScholarGoogle ScholarCross RefCross Ref
  59. [59] Subburaman Dhivya and Gandhi Usha Devi. 2020. Study on automated approach to generate character recognition for handwritten and historical documents. ACM Trans. Asian Low-resour. Lang. Inf (2020).Google ScholarGoogle Scholar
  60. [60] Tripathy Nilamadhaba and Pal Umapada. 2003. System for Oriya handwritten numeral recognition. In Document Recognition and Retrieval XI, Vol. 5296. International Society for Optics and Photonics, IEEE, 174181.Google ScholarGoogle Scholar
  61. [61] Tripathy Nilamadhaba and Pal Umapada. 2006. Handwriting segmentation of unconstrained Oriya text. Sadhana 31, 6 (2006), 755769.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Development of a Benchmark Odia Handwritten Character Database for an Efficient Offline Handwritten Character Recognition with a Chronological Survey

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian and Low-Resource Language Information Processing
      ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 22, Issue 6
      June 2023
      635 pages
      ISSN:2375-4699
      EISSN:2375-4702
      DOI:10.1145/3604597
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 June 2023
      • Online AM: 21 February 2023
      • Accepted: 3 February 2023
      • Revised: 31 October 2022
      • Received: 14 April 2021
      Published in tallip Volume 22, Issue 6

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)120
      • Downloads (Last 6 weeks)47

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text
    About Cookies On This Site

    We use cookies to ensure that we give you the best experience on our website.

    Learn more

    Got it!