TUTCRIS - Tampereen teknillinen yliopisto

TUTCRIS

Towards visual words to words

Tutkimustuotosvertaisarvioitu

Yksityiskohdat

AlkuperäiskieliEnglanti
Otsikko2015 13th International Conference on Document Analysis and Recognition (ICDAR)
KustantajaIEEE
Sivut641-645
Sivumäärä5
ISBN (painettu)978-1-4799-1805-8
DOI - pysyväislinkit
TilaJulkaistu - 1 elokuuta 2015
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaINTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION -
Kesto: 1 tammikuuta 1900 → …

Conference

ConferenceINTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION
Ajanjakso1/01/00 → …

Tiivistelmä

We address the problem of text localization and retrieval in real world images. We are first to study the retrieval of text images, i.e. the selection of images containing text in large collections at high speed. We propose a novel representation, textual visual words, which describe text by generic visual words that geometrically consistently predict bottom and top lines of text. The visual words are discretized SIFT descriptors of Hessian features. The features may correspond to various structures present in the text - character fragments, individual characters or their arrangements. The textual words representation is invariant to affine transformation of the image and local linear change of intensity. Experiments demonstrate that the proposed method outperforms the state-of-the-art on the MS dataset. The proposed method detects blurry, small font, low contrast, noisy text from real world images.

Tutkimusalat

Julkaisufoorumi-taso