SOTAVerified

Ekush: A Multipurpose and Multitype Comprehensive Database for Online Off-Line Bangla Handwritten Characters

2019-07-17Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2018. 2019Code Available0· sign in to hype

Akm Shahariar Azad Rabby, Sadeka Haque, Md. Sanzidul Islam, Sheikh Abujar, Sayed Akhter Hossain

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Ekush is the largest dataset of handwritten Bangla characters for research on handwritten Bangla character recognition. In recent years Machine learning and deep learning application-based researchers have achieved interest and one of the most significant applications is handwritten recognition. Because it has tremendous applications such as in Bangla OCR. Also, Bangla's writing script is one of the most popular in the world. For that reason, we are introducing a multipurpose comprehensive dataset for Bangla Handwritten Characters. The proposed dataset contains Bangla modifiers, vowels, consonants, compound letters, and numerical digits that consist of 367,018 isolated handwritten characters written by 3086 unique writers which were collected within Bangladesh. This dataset can be used for other problems i.e.: gender, age, district base handwritten related research, because the samples were collected include a variety of the district, age group, and the equal number of males and females. It is intended to fabricate acknowledgment techniques for handwritten Bangla characters. This dataset is unreservedly accessible for any sort of scholarly research work. The Ekush dataset is trained and validated with EkushNet and indicated attractive acknowledgment precision 97.73% for the Ekush dataset, which is up until this point, the best exactness for Bangla character acknowledgment. The Ekush dataset and relevant code can be found at this link: https://github.com/ShahariarRabby/ekush.

Tasks

Reproductions