Skip to content

auscode/QtStreamOCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

QtStreamOCR

A multimedia application to perform OCR on video streams using Qt 6, OpenCV, and Tesseract.

Features

  • Video playback from file or webcam.
  • Monochrome filter with 4 selectable color palettes.
  • Real-time video rendering using VideoItem (QQuickPaintedItem).
  • OCR on captured frames (F4 key) using Tesseract in a worker thread.
  • UI built with QML, logic in C++.

Prerequisites

  • Qt 6.5+ (Core, Quick, Multimedia, Widgets)
  • OpenCV 4.x
  • Tesseract OCR (libtesseract-dev, libleptonica-dev)
  • CMake 3.16+
  • C++17 compiler

Installation (Linux/Ubuntu/Debian)

You need to install the development libraries for OpenCV and Tesseract. Run the following commands:

sudo apt update
sudo apt install libopencv-dev tesseract-ocr libtesseract-dev libleptonica-dev pkg-config

Build Instructions

mkdir build
cd build
cmake ..
make

Usage

  1. Run the application: ./appQtStreamOCR
  2. Click "Upload Video" to select a video file, or "Webcam" to start the camera.
  3. Use the Color Palette on the left to change the monochrome filter.
  4. Press F4 to capture the current frame.
    • The frame will appear in the bottom list.
    • OCR will be performed, and text will appear in the right panel.

About

watch its demo

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published