Integrate synthetic data for text recognition training #10

Open
robertknight opened this issue Feb 7, 2024 · 1 comment
Comments

robertknight commented Feb 7, 2024

The main improvement needed for Ocrs to be more useful is higher text recognition accuracy / lower error rate, especially with longer lines. Multilingual support will also need training examples in more languages. The main plan for addressing this is to expand the training data with synthetic images. There are a number of existing synthetic text image generation projects that might be useful:

  1. https://github.com/ankush-me/SynthText
  2. https://github.com/Belval/TextRecognitionDataGenerator (forked here to add Pillow v10 support)
  3. https://github.com/clovaai/synthtiger

robertknight commented

> The main improvement needed for Ocrs to be more useful is higher text recognition accuracy / lower error rate, especially with longer lines.

This was partly addressed in robertknight/ocrs#32. Long line images are still squashed to 800px during training though, which needs to be fixed.
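
To make the squashing concrete, here is a rough arithmetic sketch of what a fixed 800px cap does to a long line. It is not taken from the Ocrs training code: the function name `resized_width` is hypothetical, and the 64px target height is an assumption chosen only for illustration. Only the 800px limit comes from the comment above.

```python
def resized_width(orig_w: int, orig_h: int, target_h: int = 64, max_w: int = 800):
    """Return (final_width, squash_factor) for a text line image.

    target_h is an assumed model input height (hypothetical); max_w is the
    800px cap mentioned in this issue.
    """
    # Width the line would have if scaled to target_h with aspect ratio preserved.
    natural_w = round(orig_w * target_h / orig_h)
    # Widths above the cap are clamped, which compresses the glyphs horizontally.
    final_w = min(natural_w, max_w)
    # 1.0 means no distortion; smaller values mean stronger horizontal squashing.
    squash = final_w / natural_w
    return final_w, squash


# A long 3000x60 line would need 3200px at height 64 but is clamped to 800px.
print(resized_width(3000, 60))  # (800, 0.25)
# A short 300x60 line fits under the cap and is not distorted.
print(resized_width(300, 60))   # (320, 1.0)
```

Under these assumptions, the long line's characters end up a quarter of their natural width, which is the kind of distortion that would plausibly hurt accuracy on longer lines.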
