Journal of Computer and Communications

Volume 5, Issue 10 (August 2017)

ISSN Print: 2327-5219   ISSN Online: 2327-5227

Google-based Impact Factor: 1.12  Citations  

An Automatic Text Region Positioning Method for the Low-Contrast Image

HTML  XML Download Download as PDF (Size: 6702KB)  PP. 36-49  
DOI: 10.4236/jcc.2017.510005    1,089 Downloads   1,889 Views  

ABSTRACT

Text extraction is the key step in the character recognition; its accuracy highly relies on the location of the text region. In this paper, we propose a new method which can find the text location automatically to solve some regional problems such as incomplete, false position or orientation deviation occurred in the low-contrast image text extraction. Firstly, we make some pre-processing for the original image, including color space transform, contrast-limited adaptive histogram equalization, Sobel edge detector, morphological method and eight neighborhood processing method (ENPM) etc., to provide some results to compare the different methods. Secondly, we use the connected component analysis (CCA) method to get several connected parts and non-connected parts, then use the morphology method and CCA again for the non-connected part to erode some noises, obtain another connected and non-connected parts. Thirdly, we compute the edge feature for all connected areas, combine Support Vector Machine (SVM) to classify the real text region, obtain the text location coordinates. Finally, we use the text region coordinate to extract the block including the text, then binarize, cluster and recognize all text information. At last, we calculate the precision rate and recall rate to evaluate the method for more than 200 images. The experiments show that the method we proposed is robust for low-contrast text images with the variations in font size and font color, different language, gloomy environment, etc.

Share and Cite:

Liu, G. , Jiang, M. , Cun, H. , Shi, Z. and Hao, J. (2017) An Automatic Text Region Positioning Method for the Low-Contrast Image. Journal of Computer and Communications, 5, 36-49. doi: 10.4236/jcc.2017.510005.

Cited by

No relevant information.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.