TY - GEN
T1 - A robust algorithm for text detection in color images
AU - Liu, Yangxing
AU - Goto, Satoshi
AU - Ikenaga, Takeshi
PY - 2005
Y1 - 2005
N2 - Text detection in color images has become an active research area since recent decades. In this paper, we present a novel approach to accurately detect text in color images possibly with a complex background. First, we use an elaborate edge detection algorithm to extract all possible text edge pixels. Second connected component analysis is employed to construct text candidate region and classify part non-text regions. Third each text candidate region is verified with texture features derived from wavelet domain. Finally, the Expectation maximization algorithm is introduced to binarize text regions to prepare data for recognition. In contrast to previous approach, our algorithm combines both the efficiency of connected component based method and robustness of texture based analysis. Experimental results show that our algorithm is robust in text detection with respect to different character size, orientation, color and language and can provide reliable text binarization result.
AB - Text detection in color images has become an active research area since recent decades. In this paper, we present a novel approach to accurately detect text in color images possibly with a complex background. First, we use an elaborate edge detection algorithm to extract all possible text edge pixels. Second connected component analysis is employed to construct text candidate region and classify part non-text regions. Third each text candidate region is verified with texture features derived from wavelet domain. Finally, the Expectation maximization algorithm is introduced to binarize text regions to prepare data for recognition. In contrast to previous approach, our algorithm combines both the efficiency of connected component based method and robustness of texture based analysis. Experimental results show that our algorithm is robust in text detection with respect to different character size, orientation, color and language and can provide reliable text binarization result.
UR - http://www.scopus.com/inward/record.url?scp=33846431096&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33846431096&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2005.29
DO - 10.1109/ICDAR.2005.29
M3 - Conference contribution
AN - SCOPUS:33846431096
SN - 0769524206
SN - 9780769524207
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 399
EP - 403
BT - Proceedings of the Eighth International Conference on Document Analysis and Recognition
T2 - 8th International Conference on Document Analysis and Recognition
Y2 - 31 August 2005 through 1 September 2005
ER -