Word Spotting and Recognition with Embedded Attributes

Welcome to the Word Representation with Attributes library, a software for the retrieval and recognition of word images.

This code is written in Matlab and is the basis of the following project:

Jon Almazán, Albert Gordo, Alicia Fornés, Ernest Valveny. Word Spotting and Recognition with Embedded Attributes. Project Page

Abstract

We deal with the problems of word spotting and word recognition on images. In word spotting, the goal is to find all instances of a query word in a dataset of images. In recognition, the goal is to recognize the content of the word image, usually aided by a dictionary or lexicon. We propose a formulation for word representation and matching based on embedded attributes that jointly addresses these two problems. Contrary to most other existing methods, our representation has a fixed length, is low dimensional, and is very fast to compute and, especially, to compare.

We propose to use character attributes to learn a semantic representation of the word images and then perform a calibration of the scores with CCA that puts images and text strings in a common subspace. After that, spotting and recognition become simple nearest neighbor problems in a very low dimensional space. We test our approach on four public datasets of both document and natural images showing results comparable or better than the state-of-the-art on spotting and recognition tasks.

This word spotting library uses some great open-source software:

MATLAB Quick Start Guide

To get started, you need to install MATLAB and download the code from GitHub. This code has been tested on Mac and Linux and some pre-compiled Mex files are included.

Download source code

$ cd ~/your_projects/
$ git clone git://github.com/almazan/words-att.git

Download and uncompress the IIIT5K datasets

$ cd words-att/datasets
$ wget http://cvit.iiit.ac.in/projects/SceneTextUnderstanding/IIIT5K-Word_V3.0.tar.gz
$ tar -xvzf IIIT5K-Word_V3.0.tar.gz

Run the program with the default parameters

>> main

Note: The parameters set by default can be modified in the prepare_opts.m script

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
data		data
datasets		datasets
util		util
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compute_GMM_PCA_models.m		compute_GMM_PCA_models.m
embed_labels_PHOC.m		embed_labels_PHOC.m
evaluate.m		evaluate.m
extract_FV_features.m		extract_FV_features.m
extract_features.m		extract_features.m
extract_lexicon.m		extract_lexicon.m
learn_attributes.m		learn_attributes.m
learn_attributes_bagging.m		learn_attributes_bagging.m
learn_common_subspace.m		learn_common_subspace.m
load_dataset.m		load_dataset.m
main.m		main.m
prepare_data_learning.m		prepare_data_learning.m
prepare_opts.m		prepare_opts.m

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Word Spotting and Recognition with Embedded Attributes

Jon Almazán, Albert Gordo, Alicia Fornés, Ernest Valveny. Word Spotting and Recognition with Embedded Attributes. Project Page

Abstract

MATLAB Quick Start Guide

Download source code

Download and uncompress the IIIT5K datasets

Run the program with the default parameters

About

Uh oh!

Releases 7

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

License

almazan/watts

Folders and files

Latest commit

History

Repository files navigation

Word Spotting and Recognition with Embedded Attributes

Jon Almazán, Albert Gordo, Alicia Fornés, Ernest Valveny. Word Spotting and Recognition with Embedded Attributes. Project Page

Abstract

MATLAB Quick Start Guide

Download source code

Download and uncompress the IIIT5K datasets

Run the program with the default parameters

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 7

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages