Practical vision based degraded text recognition system

Khader Mohammad, Sos Agaian, Hani Saleh

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Rapid growth and progress in the medical, industrial, security and technology fields means more and more consideration for the use of camera based optical character recognition (OCR) Applying OCR to scanned documents is quite mature, and there are many commercial and research products available on this topic. These products achieve acceptable recognition accuracy and reasonable processing times especially with trained software, and constrained text characteristics. Even though the application space for OCR is huge, it is quite challenging to design a single system that is capable of performing automatic OCR for text embedded in an image irrespective of the application. Challenges for OCR systems include; images are taken under natural real world conditions, Surface curvature, text orientation, font, size, lighting conditions, and noise. These and many other conditions make it extremely difficult to achieve reasonable character recognition. Performance for conventional OCR systems drops dramatically as the degradation level of the text image quality increases. In this paper, a new recognition method is proposed to recognize solid or dotted line degraded characters. The degraded text string is localized and segmented using a new algorithm. The new method was implemented and tested using a development framework system that is capable of performing OCR on camera captured images. The framework allows parameter tuning of the image-processing algorithm based on a training set of camera-captured text images. Novel methods were used for enhancement, text localization and the segmentation algorithm which enables building a custom system that is capable of performing automatic OCR which can be used for different applications. The developed framework system includes: new image enhancement, filtering, and segmentation techniques which enabled higher recognition accuracies, faster processing time, and lower energy consumption, compared with the best state of the art published techniques. The system successfully produced impressive OCR accuracies (90% -to- 93%) using customized systems generated by our development framework in two industrial OCR applications: water bottle label text recognition and concrete slab plate text recognition. The system was also trained for the Arabic language alphabet, and demonstrated extremely high recognition accuracy (99%) for Arabic license name plate text recognition with processing times of 10 seconds. The accuracy and run times of the system were compared to conventional and many states of art methods, the proposed system shows excellent results.

Original languageBritish English
Title of host publicationProceedings of SPIE-IS and T Electronic Imaging - Multimedia on Mobile Devices 2011; and Multimedia Content Access
Subtitle of host publicationAlgorithms and Systems V
DOIs
StatePublished - 2011
EventMultimedia on Mobile Devices 2011; and Multimedia Content Access: Algorithms and Systems V - San Francisco, CA, United States
Duration: 25 Jan 201126 Jan 2011

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume7881
ISSN (Print)0277-786X

Conference

ConferenceMultimedia on Mobile Devices 2011; and Multimedia Content Access: Algorithms and Systems V
Country/TerritoryUnited States
CitySan Francisco, CA
Period25/01/1126/01/11

Keywords

  • Angled text
  • Curved text
  • Degraded characters
  • Dotted text
  • OCR in camera captured images
  • Optical character recognition
  • Recognition
  • Segmentation

Fingerprint

Dive into the research topics of 'Practical vision based degraded text recognition system'. Together they form a unique fingerprint.

Cite this