Integrate synthetic data for text recognition training #10

robertknight · 2024-02-07T07:33:55Z

The main improvement needed for Ocrs to be more useful is higher text recognition accuracy / lower error rate, especially with longer lines. Also for multilingual support, examples in more languages will be needed. The main plan to improve this is to expand the training data with synthetic images. There are a number of existing text generation projects that might be useful:

https://github.com/ankush-me/SynthText
https://github.com/Belval/TextRecognitionDataGenerator (forked here to add Pillow v10 support)
https://github.com/clovaai/synthtiger

robertknight · 2024-03-03T07:14:10Z

The main improvement needed for Ocrs to be more useful is higher text recognition accuracy / lower error rate, especially with longer lines.

This was partly addressed in robertknight/ocrs#32. Long line images are still squashed to 800px during training though, which needs to be fixed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Integrate synthetic data for text recognition training #10

Integrate synthetic data for text recognition training #10

robertknight commented Feb 7, 2024 •

edited

Loading

robertknight commented Mar 3, 2024

Integrate synthetic data for text recognition training #10

Integrate synthetic data for text recognition training #10

Comments

robertknight commented Feb 7, 2024 • edited Loading

robertknight commented Mar 3, 2024

robertknight commented Feb 7, 2024 •

edited

Loading