GBIB Beta

Transform invariant text extraction
Xin Zhang, Zhouchen Lin, Fuchun Sun, Yi Ma
In The Visual Computer, 30(4), April 2014.

Abstract: Automatically extracting texts from natural images is very useful for many applications such as augmented reality. Most of the existing text detection systems require that the texts to be detected (and recognized) in an image are taken from a nearly frontal viewpoint. However, texts in most images taken naturally by a camera or a mobile phone can have a significant affine or perspective deformation, making the existing text detection and the subsequent OCR engines prone to failures. In this paper, based on stroke width transform and texture invariant low-rank transform, we propose a framework that can detect and rectify texts in arbitrary orientations in the image against complex backgrounds, so that the texts can be correctly recognized by common OCR engines. Extensive experiments show the advantage of our method when compared to the state of art text detection systems.

Article URL: http://dx.doi.org/10.1007/s00371-013-0864-7

BibTeX format:

@article{Zhang:2014:TIT,
  author = {Xin Zhang and Zhouchen Lin and Fuchun Sun and Yi Ma},
  title = {Transform invariant text extraction},
  journal = {The Visual Computer},
  volume = {30},
  number = {4},
  pages = {401--415},
  month = apr,
  year = {2014},
}

Search for more articles by Xin Zhang.
Search for more articles by Zhouchen Lin.
Search for more articles by Fuchun Sun.
Search for more articles by Yi Ma.

Return to the search page.