Vietnamese handwritten text recognition system
Install dependencies with conda:
conda env create -f environment.yml
Note: PyTorch Lightning's default --max_epochs is 100 and training runs on CPU unless GPUs are requested, so the two following options should usually be set explicitly.
To train the Transformer model (see config params in model/model_tf.py):
python train.py tf config/base.yaml --gpus -1 --max_epochs 50 --deterministic True
To train the RNN model (see config params in model/model_rnn.py):
python train.py rnn config/base.yaml --gpus -1 --max_epochs 50 --deterministic True
Example: training the Transformer model
python train.py tf config/base.yaml --gpus -1 --max_epochs 50 --deterministic True --attn_size 512 --dim_feedforward 4096 --encoder_nlayers 2 --decoder_nlayers 2 --seed 9498 --stn --pe_text --pe_image
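For readability, the Transformer-specific options in the command above can be grouped into a single config object. The sketch below is illustrative only: the real options live in config/base.yaml and model/model_tf.py, the class name is hypothetical, and the comments on --stn are an assumption.

```python
from dataclasses import dataclass

# Hypothetical config mirroring the Transformer CLI flags above;
# the project's actual config lives in config/base.yaml and model/model_tf.py.
@dataclass
class TransformerConfig:
    attn_size: int = 512          # attention / model dimension
    dim_feedforward: int = 4096   # feed-forward layer size
    encoder_nlayers: int = 2      # number of encoder layers
    decoder_nlayers: int = 2      # number of decoder layers
    seed: int = 9498              # random seed for reproducibility
    stn: bool = True              # presumably a spatial-transformer preprocessing toggle
    pe_text: bool = True          # positional encoding on text tokens
    pe_image: bool = True         # positional encoding on image features

cfg = TransformerConfig()
print(cfg.attn_size, cfg.encoder_nlayers)  # 512 2
```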
See the PyTorch Lightning Trainer configuration in the PyTorch Lightning documentation. Some useful flags:
--fast_dev_run True # Run 1 batch on train, 1 batch on val to debug
--profiler True # Profile execution time (may slow training)
--max_epochs 50 # Train for 50 epochs
--resume_from_checkpoint [PATH_TO_CKPT_FILE] # Resume training from checkpoint
--deterministic True # Reproducible
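These flags follow PyTorch Lightning's Trainer argument conventions. As a rough, self-contained illustration (not the project's actual parser, which likely delegates to Lightning's own argument handling), here is how such string flags might be parsed into Trainer keyword arguments:

```python
import argparse

def str2bool(s: str) -> bool:
    # Lightning-style CLIs pass booleans as the strings "True"/"False"
    return s == "True"

# Illustrative parser only; the real train.py wires these flags to
# PyTorch Lightning's Trainer itself.
parser = argparse.ArgumentParser()
parser.add_argument("--gpus", type=int, default=None)               # -1 = use all available GPUs
parser.add_argument("--max_epochs", type=int, default=100)          # Lightning's default is 100
parser.add_argument("--deterministic", type=str2bool, default=False)
parser.add_argument("--fast_dev_run", type=str2bool, default=False)
parser.add_argument("--resume_from_checkpoint", type=str, default=None)

args = parser.parse_args(
    ["--gpus", "-1", "--max_epochs", "50", "--deterministic", "True"]
)
trainer_kwargs = vars(args)  # these would be passed as Trainer(**trainer_kwargs)
print(trainer_kwargs["max_epochs"])  # 50
```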
To evaluate a trained checkpoint:
python test.py {tf, rnn} CKPT_FILE
Run jupyter:
jupyter lab
Open the respective notebooks for further visualization.
References:
- Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
- Image Captioning
- DeepSpeech
- Seq2Seq-Pytorch
If you find this code useful, please cite our paper:
@INPROCEEDINGS{9335877,
author={Ly, Vinh-Loi and Doan, Tuan and Ly, Ngoc Quoc},
booktitle={2020 7th NAFOSTED Conference on Information and Computer Science (NICS)},
title={Transformer-based model for Vietnamese Handwritten Word Image Recognition},
year={2020},
volume={},
number={},
pages={163-168},
doi={10.1109/NICS51282.2020.9335877}}