题名

自適性閥值運算與應用條件性連通物件於影像文字辨識

并列篇名

Apply Adaptive Threshold Operation and Conditional Connected-Component to Image Text Recognition

DOI

10.6285/MIC.2(1).16

作者

黃純敏;林郁凱;張日威

关键词

文字辨識 ; 影像前處理 ; 閥值運算 ; 光學字元辨識 ; text recognition ; OCR ; image preprocessing ; grayscale threshold

期刊名称

管理資訊計算

卷期/出版年月

2卷1期(2013 / 08 / 01)

页次

221 - 232

内容语文

繁體中文

中文摘要

在文字辨識的領域裡,清楚且正確的將文字從圖像中擷取出來辨識,是個非常關鍵的議題。經過影像前處理後,擷取出的字詞完整與否將影響著文字辨識的準確率。圖像中的文字是人們感興趣且具有意義的一部分,但一張圖像中往往存在許多干擾元素,如不同的光線強弱或複雜的背景,而這些元素常常會增加字元辨識的困難度。在本研究中,運用自適性閥值運算以及機率性連通物件來解決不同光線的強弱和複雜的背景所造成的影像問題。實驗結果顯示,經由本研究影像前處理方法,在字元辨識的結果中,正確性有著顯著提升,文字辨識率與識別率分別高達81.17%與91.30%。

英文摘要

How to effectively extract texts from an image is a critical issue in text recognition domain. After image preprocessing, the wholeness of the extracted text region will profoundly affect the accuracy of further OCR processing. A well-planned image preprocessing is believed to produce better OCR results. Due to the variety of background components, for example, different kind of colors, texture, or brightness in an image will deteriorate the problem of text recognition. In this research, we applied ”conditional connected-component” and ”adaptive threshold operation” to deal with complicated background and non-uniform lightness images. With this approach, we successfully identified and recognized texts from an image. The result shows that the rate of object identification and recognition achieves 81.17% and 91.30%, respectively.

主题分类 基礎與應用科學 > 資訊科學
社會科學 > 管理學
参考文献
  1. Abdulkader, A.,Casey, M. R.(2009).Low cost correction of OCR errors using learning in a multi-engine Eenvironment.Proceedings of the 10th International Conference on Document Analysis and Recognition,Barcelona, Spain:
  2. Besag, J.(1989).Digital image processing.Journal of Applied Statistics,16,395-407.
  3. Chang, L. Z.,ZhiYing, S. Z.(2009).Robust pre-processing techniques for OCR applications on mobile devices.Proceedings of the 6th International Conference on Mobile Technology, Application & Systems,New York, NY, USA:
  4. Chowdhury, S. P.,Dhar, S.,Das, A. K.,Chanda, B.,McMenemy, K.(2009).Robust extraction of text from camera images.Proceedings of the 10th International Conference on Document Analysis and Recognition,Barcelona, Spain:
  5. Egyul, K.,Seonghun, L.,Jinhyung, K.(2009).Scene Text Extraction Using Focus of Mobile Camera.Proceedings of the 10th International Conference on Document Analysis and Recognition,Barcelona, Spain:
  6. Garcia, C.,Apostolidis, X.(2000).Text detection and segmentation in complex color images.Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing,İstanbul, Türkiye:
  7. Kim, K. C.,Byun, H. R.,Song, Y. J.,Choi, Y. W.,Chi, S. Y.,Kim, K. K.,Chung, Y. K.(2004).Scene text extraction in natural scene images using hierarchical feature combining and verification.Proceedings of the 17th International Conference on Pattern Recognition,Cambridge, England, UK:
  8. King-Sun, F.,Rosenfeld, A.(1976).Pattern recognition and image processing.IEEE Transactions on Computers,25(12),1336-1346.
  9. Laine, M.,Nevalainen, O. S.(2006).A standalone OCR system for mobile cameraphones.Proceedings of the IEEE 17th International Symposium on Personal, Indoor and Mobile Radio Communications,Helsinki, Finland:
  10. Linlin, L.,Tan, C. L.(2006).Improving OCR text categorization accuracy with electronic abstracts.Proceedings of the Second International Conference on Document Image Analysis for Libraries,Lyon, France:
  11. Liu, X.,Samarabandu, J.(2006).Multiscale edge-based text extraction from complex images.Proceedings of the 2006 IEEE International Conference on Multimedia and Expo,Toronto, Ontario, Canada:
  12. Marqués, F.,Vilaplana, V.(2002).Face segmentation and tracking based on connected operators and partition projection.Pattern Recognition,35,601-614.
  13. McAndrew, A.(2010).Introduction to Digital Image Processing with MATLAB.Cengage.
  14. Minetto, R.,Thome, N.,Cord, M.,Stolfi, J.,Precioso, F.,Guyomard, J.,Leite, N. J.(2011).Text detection and recognition in urban scenes.Proceedings of the IEEE International Conference on Computer Vision Workshops,Barcelona, Spain:
  15. Mori, S.,Suen, C. Y.,Yamamoto, K.(1992).Historical review of OCR research and development.Proceedings of the IEEE,80(7),1029-1058.
  16. Nakajima, H.,Matsuo, Y.,Nagata, M.,Saito, K.(2005).Portable translator capable of recognizing characters on signboard and menu captured by built-in camera.Proceedings of the ACL 2005 on Interactive poster and demonstration sessions,Stroudsburg, PA, USA:
  17. Otsu, N.(1979).A threshold selection method from gray-level histograms.IEEE Transactions on Systems, Man and Cybernetics,9(1),62-66.
  18. Rafael, C. G.,Richard, E. W.(2008).Digital Image Processing.Prentice Hall.
  19. Raza, M. U.,Ullah, A.,Ghori, K. M.,Haider, S.(2001).Text extraction using artificial neural networks.Proceedings of the International Conference on Networked Computing and Advanced Information Management,Gyeongju, Gyeongsangbuk-do, South Korea:
  20. Shuqing, Z.,Qiaoning, Y.(2006).Microarray images processing based on mathematical morphology.Proceedings of the 8th International Conference on Signal Processing,Beijing, China:
  21. Shutao, L.,Kwok, J. T.(2004).Text extraction using edge detection and morphological dilation.Proceedings of the International Symposium on Intelligent Multimedia, Video and Speech,Hong Kong:
  22. Yakobov, V.,Mash, L.,Thirer, N.(2010).On chip implementation of an OCR algorithm for musical notation.Proceedings of the IEEE 26th Convention of Electrical and Electronics Engineers,Eliat, Israel:
  23. Ye, Q.,Huang, Q.,Gao, W.,Zhao, D.(2005).Fast and robust text detection in images and video frames.Image Vision Comput,23(6),565-576.
  24. 葉榮木、李宗岳、蔡俊明(2006)。自適性閥值的人臉偵測系統。2006 年現代電機科技研討會,台灣,嘉義: