Kurdish Optical Character Recognition.
By: Rasty Yaseen, Hossein Hassani.
2018.
Currently, no offline tool is available for Optical Character Recognition (OCR) in Kurdish. Kurdish is spoken in different dialects and uses several scripts for writing. The Persian/Arabic script is widely used among these dialects. The Persian/Arabic script is written from Right to Left (RTL), it is cursive, and it uses unique diacritics. These features, particularly the last two, affect the segmentation stage in developing a Kurdish OCR. In this article, we introduce an enhanced character segmentation based method which addresses the mentioned characteristics. We applied the method to text-only images and tested the Kurdish OCR using documents of different fonts, font sizes, and image resolutions. The results of the experiments showed that the accuracy rate of character recognition of the proposed method was 90.82% on average. [1]
=KTML_Link_External_Begin=https://www.kurdipedia.org/docviewer.aspx?id=445069&document=0001.PDF=KTML_Link_External_Between= Click to read the article: Kurdish Optical Character Recognition=KTML_Link_External_End=