TY - JOUR
T1 - Handwritten Odia Digit Recognition using Learning Systems: A Comparison of Neural Networks and Support Vector Machine Models
AU - Sharma, Urva
AU - Bansal, Rajat
AU - Sarangi, Pradeepta Kumar
AU - Gupta, Deepali
AU - Rani, Shalli
AU - Khan, Fazl Ullah
AU - Srivastava, Gautam
PY - 2023/10/9
Y1 - 2023/10/9
N2 - The Odia language is one of the many regional languages spoken in India. It is the official language of Odisha, a State in eastern India. The Odia language carries a 1500-year-old history and worldwide is spoken by more than 50 million people. The Odia digits are complex due to the presence of many curves in each character. Handwritten scripts are even more complex due to free-style writing. However, the development of an innovative machine learning model is essential because Odia scripts consist of a huge number of historical documents of more than 1000 years old. A robust automation method will help in converting historical documents into digital form and will help to preserve the documents. This will solve a big problem in society. This work experiments with handwritten Odia numerals by implementing two different classifiers. The first one is the implementation of a Convolutional Neural Network (CNN) and the second experiment implements a Support Vector Machine (SVM). Finally, results from both experiments have been compared. The dataset has been generated through software by writing the digits on MS Paint. Both CNN and SVM models have been implemented through Python programming to recognize the inputs into a particular class. Both training and testing of the models have been done using this dataset. The accuracy from the CNN Model is obtained to be 94.999% which is ≈95% and for SVM, the model accuracy is 86%. Comparing both results, it is concluded that the CNN model is comparatively better than the SVM classifier in the case of the proposed work.
AB - The Odia language is one of the many regional languages spoken in India. It is the official language of Odisha, a State in eastern India. The Odia language carries a 1500-year-old history and worldwide is spoken by more than 50 million people. The Odia digits are complex due to the presence of many curves in each character. Handwritten scripts are even more complex due to free-style writing. However, the development of an innovative machine learning model is essential because Odia scripts consist of a huge number of historical documents of more than 1000 years old. A robust automation method will help in converting historical documents into digital form and will help to preserve the documents. This will solve a big problem in society. This work experiments with handwritten Odia numerals by implementing two different classifiers. The first one is the implementation of a Convolutional Neural Network (CNN) and the second experiment implements a Support Vector Machine (SVM). Finally, results from both experiments have been compared. The dataset has been generated through software by writing the digits on MS Paint. Both CNN and SVM models have been implemented through Python programming to recognize the inputs into a particular class. Both training and testing of the models have been done using this dataset. The accuracy from the CNN Model is obtained to be 94.999% which is ≈95% and for SVM, the model accuracy is 86%. Comparing both results, it is concluded that the CNN model is comparatively better than the SVM classifier in the case of the proposed work.
U2 - 10.1145/3626524
DO - 10.1145/3626524
M3 - Article
JO - ACM Transactions on Asian and Low-Resource Language Information Processing
JF - ACM Transactions on Asian and Low-Resource Language Information Processing
ER -