Character Spotting and Autonomous Tagging: Offline Handwriting Recognition for Bangla, Korean and Other Alphabetic Scripts

Nishatul Majid, Elisa H. Barney Smith

Research output: Contribution to journalArticlepeer-review

2 Downloads (Pure)

Abstract

This paper demonstrates a framework for offline handwriting recognition using character spotting and autonomous tagging which works for any alphabetic script. Character spotting builds on the idea of object detection to find character elements in unsegmented word images. An autonomous tagging approach is introduced which automates the production of a character image training set by estimating character locations in a word based on typical character size. Although scripts can vary vividly from each other, our proposed approach provides a simple and powerful workflow for unconstrained offline recognition that should work for any alphabetic script with few adjustments. Here we demonstrate this approach with handwritten Bangla, obtaining a character recognition accuracy (CRA) of 94.8% and 91.12% with precision and autonomous tagging, respectively. Furthermore, we explained how character spotting and autonomous tagging can be implemented for other alphabetic scripts. We demonstrated that with handwritten Hangul/Korean obtaining a Jamo recognition accuracy (JRA) of 93.16% using a tiny fraction of the PE92 training set. The combination of character spotting and autonomous tagging takes away one of the biggest frustrations—data annotation by hand, and thus, we believe this has the potential to revolutionize the growth of offline recognition development.

Original languageAmerican English
JournalElectrical and Computer Engineering Faculty Publications and Presentations
StatePublished - 1 Dec 2022

Keywords

  • Bangla handwriting recognition
  • Korean handwriting recognition
  • autonomous tagging
  • character spotting
  • offline handwriting recognition

EGS Disciplines

  • Electrical and Computer Engineering

Fingerprint

Dive into the research topics of 'Character Spotting and Autonomous Tagging: Offline Handwriting Recognition for Bangla, Korean and Other Alphabetic Scripts'. Together they form a unique fingerprint.

Cite this