PyLaia

PyLaia is a device-agnostic, PyTorch-based deep learning toolkit specialized for handwritten document analysis. It is the successor to Laia.

Disclaimer: The easiest way to learn to use PyLaia is to follow the IAM example for HTR. Apologies for not having better documentation at the moment; I will keep improving it and adding other examples.

Installation

To install PyLaia, run the following commands:

git clone https://github.com/jpuigcerver/PyLaia
cd PyLaia
pip install -r requirements.txt
python setup.py install

The following Python scripts will be installed in your system:

pylaia-htr-create-model: Create a VGG-like model with BLSTMs on top for handwritten text recognition. The script has several options to customize the model. The architecture is based on the paper "Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition?" (2017) by J. Puigcerver.

pylaia-htr-decode-ctc: Decode text-line images using a trained model and the CTC algorithm. It can also output the segmentation boundaries of the recognized characters/words. For example, to obtain the recognized hypotheses along with the corresponding word segmentation boundaries:

pylaia-htr-decode-ctc \
  --print_args True \
  --train_path ./model \
  --model_filename model \
  --batch_size 24 \
  --print_img_ids \
  --use_letters \
  --separator=" " \
  --space "<space>" \
  --print_word_segm \
  --join_str " " \
  symbols.txt Lines-Processed test.lst | less

pylaia-htr-train-ctc: Train a model using the CTC algorithm and a set of text-line images and their transcripts.

pylaia-htr-netout: Dump the output of the model for a set of text-line images in order to decode using an external language model.
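To give an intuition for what CTC decoding with segmentation boundaries means, here is a minimal sketch of greedy (best-path) CTC decoding in plain Python. It is not PyLaia's implementation: the function name, the blank index 0, and the toy symbol table are assumptions made for illustration only. The input is the most likely label for each frame (image column); repeated labels are merged and blanks are dropped, and each surviving character keeps the frame range it covered.

```python
from typing import List, Sequence, Tuple

BLANK = 0  # assumed index of the CTC blank symbol (illustrative choice)


def ctc_greedy_decode(
    frame_labels: Sequence[int], blank: int = BLANK
) -> List[Tuple[int, int, int]]:
    """Collapse per-frame best labels into (label, start, end) segments.

    `start` is inclusive and `end` is exclusive, both in frame indices.
    """
    segments: List[Tuple[int, int, int]] = []
    prev = blank
    for t, label in enumerate(frame_labels):
        if label != blank and label == prev:
            # Same non-blank label as the previous frame: extend the segment.
            lab, start, _ = segments[-1]
            segments[-1] = (lab, start, t + 1)
        elif label != blank:
            # A new character starts at this frame.
            segments.append((label, t, t + 1))
        prev = label
    return segments


# Hypothetical symbol table; index 0 is the blank.
symbols = ["<ctc>", "a", "b"]
frames = [0, 1, 1, 0, 1, 2, 2, 0]
decoded = ctc_greedy_decode(frames)
text = "".join(symbols[label] for label, _, _ in decoded)
print(text, decoded)
```

Note that the blank between the two runs of label 1 is what allows the same character to be emitted twice in a row.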

Some examples need additional tools and packages that are not installed by pip install -r requirements.txt. For instance, ImageMagick is typically used to process images, and Kaldi is used to perform Viterbi decoding (and lattice generation), combining the output of the neural network with an n-gram language model.
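Before running an example, it can help to check which commands are actually available on your PATH. The sketch below uses only the Python standard library. The script names are the ones listed above; the external tool name `convert` (the classic ImageMagick command) is an assumption and may differ on your system, and the helper names are hypothetical.

```python
import shutil
from typing import Dict, Iterable, List, Optional

# Scripts installed by PyLaia, as listed in this README.
PYLAIA_SCRIPTS = [
    "pylaia-htr-create-model",
    "pylaia-htr-decode-ctc",
    "pylaia-htr-train-ctc",
    "pylaia-htr-netout",
]

# Assumption: ImageMagick exposes a `convert` command on this system.
EXTERNAL_TOOLS = ["convert"]


def locate_tools(names: Iterable[str]) -> Dict[str, Optional[str]]:
    """Map each command name to its full path, or None if not on PATH."""
    return {name: shutil.which(name) for name in names}


def missing_tools(names: Iterable[str]) -> List[str]:
    """Return the command names that could not be found on PATH."""
    return [name for name, path in locate_tools(names).items() if path is None]


if __name__ == "__main__":
    for name, path in locate_tools(PYLAIA_SCRIPTS + EXTERNAL_TOOLS).items():
        print(f"{name}: {path or 'NOT FOUND'}")
```

Kaldi binaries could be checked the same way by adding their names to the list.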
