OCR Web Application

A web application for Optical Character Recognition (OCR) that supports both printed and handwritten text recognition. The application uses PaddleOCR for handwritten text and Tesseract.js for printed text.

Features

Support for both printed and handwritten text recognition
Multiple language support
PDF and image file processing
Text summarization
Export functionality (PDF and DOCX)
Real-time processing status
Detailed OCR configuration options

Tech Stack

Frontend

React
TypeScript
Tesseract.js
PDF.js
React Dropzone
React Toastify

Backend

FastAPI
PaddleOCR
Python
OpenCV
NumPy

Installation

Prerequisites

Python 3.8+
Node.js 14+
npm or yarn

Backend Setup

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install Python dependencies:

pip install -r requirements.txt

Start the backend server:

uvicorn app.main:app --reload

Frontend Setup

Install dependencies:

cd frontend
npm install

Start the development server:

npm run dev

Usage

Open the application in your browser (default: http://localhost:5173)
Upload an image or PDF file
Select the document language
Choose between handwritten or printed text
Click "Extract Text" to process the document
View and edit the extracted text
Export the result as PDF or DOCX

Configuration

OCR Settings

Page Segmentation Mode (PSM)
OCR Engine Mode (OEM)
Character whitelist/blacklist
Language selection
Handwritten text detection

Performance Settings

Image preprocessing
Batch processing
Confidence thresholds
Processing retries

License

MIT

Author

Christopher Loklindt (christopher@loklindt.dk)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.vscode		.vscode
app		app
frontend		frontend
uploads		uploads
.gitignore		.gitignore
README.md		README.md
ocr_app.log		ocr_app.log
ocr_app_20250406_144120.log		ocr_app_20250406_144120.log
ocr_app_20250406_144230.log		ocr_app_20250406_144230.log
ocr_app_20250406_144416.log		ocr_app_20250406_144416.log
ocr_app_20250406_145459.log		ocr_app_20250406_145459.log
ocr_app_20250406_145619.log		ocr_app_20250406_145619.log
requirements.txt		requirements.txt
test_image.png		test_image.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OCR Web Application

Features

Tech Stack

Frontend

Backend

Installation

Prerequisites

Backend Setup

Frontend Setup

Usage

Configuration

OCR Settings

Performance Settings

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

OCR Web Application

Features

Tech Stack

Frontend

Backend

Installation

Prerequisites

Backend Setup

Frontend Setup

Usage

Configuration

OCR Settings

Performance Settings

License

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages