An efficient way for segmentation of Bangla characters in printed document using curved scanning

2016-05-13 · Code Available

Ahnaf Farhan Rownak; Md. Fazle Rabby; Sabir Ismail; Md. Saiful Islam


Abstract

The main cause of poor output in Optical Character Recognition (OCR) for Bangla text is segmentation-related error. Varied character shapes, connected characters, modifiers at the top and bottom, and overlapping regions between consecutive characters are the main obstacles to effective segmentation of printed Bangla text. This paper introduces an efficient strategy for segmenting characters whose regions overlap with those of other characters. The proposed strategy achieves 99.8% accuracy in line segmentation, 99.5% accuracy in word segmentation, and 99% accuracy in character segmentation. The remaining errors occur when two consecutive characters have multiple touching points.
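For readers unfamiliar with the line segmentation stage mentioned in the abstract, the following is a minimal sketch of a common baseline: splitting a binarized page into text lines wherever a row contains no ink (a horizontal projection profile). It is not the curved scanning method of the paper, which the abstract does not describe in detail. The function name `segment_lines` and the toy `page` array are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple


def segment_lines(image: List[List[int]]) -> List[Tuple[int, int]]:
    """Split a binary image (1 = ink, 0 = background) into text lines.

    Returns a list of (start_row, end_row_exclusive) pairs, one per
    maximal run of consecutive rows that contain at least one ink pixel.
    This is a generic projection-profile baseline, not the paper's method.
    """
    lines: List[Tuple[int, int]] = []
    start = None  # first row of the line currently being scanned, if any
    for y, row in enumerate(image):
        has_ink = any(row)
        if has_ink and start is None:
            start = y                  # a new text line begins here
        elif not has_ink and start is not None:
            lines.append((start, y))   # a blank row closes the line
            start = None
    if start is not None:
        lines.append((start, len(image)))  # line runs to the bottom edge
    return lines


# Toy 5x4 "page": two text lines separated by one blank row.
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 1],
]
print(segment_lines(page))  # [(1, 3), (4, 5)]
```

A straight-row scan like this fails exactly where the abstract says segmentation is hard: when modifiers or overlapping glyphs leave no fully blank row or column between neighbors, which is the case a curved scan path is meant to handle.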
