论文标题

使用卷积神经网络和中风标识的儿童手写的阿拉伯角色识别

Handwritten Arabic Character Recognition for Children Writ-ing Using Convolutional Neural Network and Stroke Identification

论文作者

Alheraki, Mais, Al-Matham, Rawan, Al-Khalifa, Hend

论文摘要

自动阿拉伯语手写识别是机器学习领域最近研究的问题之一。与拉丁语不同,阿拉伯语是一种闪族语言,构成了更严重的挑战,尤其是由于作者年龄等因素引起的模式的变化。大多数研究都集中在成年人上,只有一项关于儿童的研究。此外,许多最近的机器学习方法都集中在使用卷积神经网络,这是一种强大的神经网络,可以从图像中提取复杂的功能。在本文中,我们提出了一个卷积神经网络(CNN)模型,该模型在Hijja数据集上以91%的精度识别儿童手写,这是一个最近通过收集儿童编写的阿拉伯字符的图像和阿拉伯语手写字符数据构建的数据集。结果表明,Hijja数据集作者对所提出的模型有很好的改进,但它揭示了为儿童阿拉伯语手写角色识别而解决的更大挑战。此外,我们根据字符中的笔触数量提出了一种使用多模型而不是单个模型的新方法,并将Hijja与AHCD合并,该方法达到了平均的预测准确性为96%。

Automatic Arabic handwritten recognition is one of the recently studied problems in the field of Machine Learning. Unlike Latin languages, Arabic is a Semitic language that forms a harder challenge, especially with variability of patterns caused by factors such as writer age. Most of the studies focused on adults, with only one recent study on children. Moreover, much of the recent Machine Learning methods focused on using Convolutional Neural Networks, a powerful class of neural networks that can extract complex features from images. In this paper we propose a convolutional neural network (CNN) model that recognizes children handwriting with an accuracy of 91% on the Hijja dataset, a recent dataset built by collecting images of the Arabic characters written by children, and 97% on Arabic Handwritten Character Dataset. The results showed a good improvement over the proposed model from the Hijja dataset authors, yet it reveals a bigger challenge to solve for children Arabic handwritten character recognition. Moreover, we proposed a new approach using multi models instead of single model based on the number of strokes in a character, and merged Hijja with AHCD which reached an averaged prediction accuracy of 96%.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源