Skip to content

Integrate synthetic data for text recognition training #10

Description

@robertknight

The main improvement needed for Ocrs to be more useful is higher text recognition accuracy / lower error rate, especially with longer lines. Also for multilingual support, examples in more languages will be needed. The main plan to improve this is to expand the training data with synthetic images. There are a number of existing text generation projects that might be useful:

  1. https://github.com/ankush-me/SynthText
  2. https://github.com/Belval/TextRecognitionDataGenerator (forked here to add Pillow v10 support)
  3. https://github.com/clovaai/synthtiger

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions