Screen Reader Application

A screen reader application that captures on-screen content, extracts its text with Optical Character Recognition (OCR), and reads the result aloud with Text-to-Speech. It includes an image analysis mode and a CustomTkinter-based UI with theme support. It can tell code snippets, programming questions, and regular text apart, which makes it particularly useful for developers and technical users.

Features

Screen text capture and reading
Advanced image analysis mode
Text-to-Speech functionality with toggle controls
Automatic code detection
Modern CustomTkinter-based UI with theme support
Customizable hotkeys
Enhanced button interactions
Support for programming-related content

Prerequisites

Python 3.x
Tesseract OCR (Windows installer available from UB-Mannheim)

Installation

Clone or download this repository
Run the setup script to install dependencies:
```
python setup.py
```
This will automatically:
- Create requirements.txt if not present
- Install required Python packages:
  - Pillow (10.2.0)
  - pytesseract (0.3.10)
  - keyboard (0.13.5)
  - requests (2.31.0)
  - pyttsx3 (latest)
  - customtkinter (latest)
- Check for Tesseract OCR installation
Make sure Tesseract OCR is installed and that its install directory is on your system PATH.
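
If you want to confirm the PATH setup yourself, the check can be sketched in a few lines of standard-library Python. This is only an illustration and not necessarily how setup.py performs its own check; the function names `find_tesseract` and `tesseract_version` are hypothetical.

```python
import shutil
import subprocess


def find_tesseract():
    """Return the path of the tesseract executable found on PATH, or None."""
    return shutil.which("tesseract")


def tesseract_version(executable):
    """Return the first line printed by `<executable> --version`, or None on failure."""
    try:
        result = subprocess.run(
            [executable, "--version"],
            capture_output=True,
            text=True,
            timeout=10,
        )
    except (OSError, subprocess.TimeoutExpired):
        return None
    # Some Tesseract builds print the version to stderr instead of stdout.
    output = (result.stdout or result.stderr).strip()
    return output.splitlines()[0] if output else None


if __name__ == "__main__":
    path = find_tesseract()
    if path is None:
        print("Tesseract not found on PATH; install it and reopen the terminal.")
    else:
        print(f"Found Tesseract at {path}: {tesseract_version(path)}")
```

If the script reports that Tesseract is missing right after installation, opening a new terminal usually picks up the updated PATH.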

Usage

Start the application using the provided batch file:
```
run_screen_reader.bat
```
Or run directly with Python:
```
python run.py
```
The application will initialize with a modern CustomTkinter interface featuring:
- Dark/Light theme support
- Intuitive button layouts
- Real-time OCR status indicators
- Text-to-Speech controls with toggle functionality
- Image analysis mode selection

Features in Detail

OCR Processing

Captures screen content and converts it to text
Specialized detection for code snippets and programming content
Handles various programming languages and syntax
Advanced image analysis mode for enhanced text recognition
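
As a rough illustration of the capture-then-classify flow described above, the sketch below grabs the screen, runs OCR, and applies a simple line-based heuristic to decide whether the text looks like code. The patterns, the threshold, and the names `looks_like_code` and `capture_and_read` are assumptions made for this example; the actual rules in app/core/ocr.py may differ.

```python
import re

# Hypothetical patterns; the project's real detection rules are not documented here.
CODE_PATTERNS = [
    re.compile(r"^\s*(def|class|import|from|return|if|for|while)\b"),     # Python-style keywords
    re.compile(r"^\s*(function|const|let|var|public|private|static)\b"),  # JS/Java-style keywords
    re.compile(r"[{};]\s*$"),              # line ends with a brace or semicolon
    re.compile(r"^\s*(#include|//|/\*)"),  # C-style includes and comments
]


def looks_like_code(text, threshold=0.5):
    """Return True when at least `threshold` of the non-empty lines match a code pattern."""
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return False
    hits = sum(1 for line in lines if any(p.search(line) for p in CODE_PATTERNS))
    return hits / len(lines) >= threshold


def capture_and_read():
    """Grab the full screen and return the OCR text.

    Requires Pillow, pytesseract, and a working Tesseract installation;
    the imports are deferred so the heuristic above can be used without them.
    """
    from PIL import ImageGrab
    import pytesseract

    image = ImageGrab.grab()
    return pytesseract.image_to_string(image)


if __name__ == "__main__":
    sample = "def add(a, b):\n    return a + b\n"
    print("code" if looks_like_code(sample) else "text")
```

A keyword heuristic like this misfires on prose lines that happen to start with words such as "for" or "if", which is why the sketch requires a share of matching lines rather than a single hit.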

Text-to-Speech

Natural-sounding voice output
Toggle functionality for easy control
Adjustable speech rate and volume
Support for multiple languages
Pause/Resume functionality
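
Adjusting rate and volume with pyttsx3 might look like the sketch below. The helper names and the 50 to 400 words-per-minute bounds are assumptions chosen for the example, not values taken from app/core/speech.py.

```python
def clamp_speech_settings(rate, volume):
    """Clamp rate to an assumed 50-400 wpm range and volume to pyttsx3's 0.0-1.0 range."""
    return {
        "rate": max(50, min(400, int(rate))),
        "volume": max(0.0, min(1.0, float(volume))),
    }


def speak(text, rate=175, volume=1.0):
    """Speak `text` once with the given settings, blocking until playback finishes."""
    import pyttsx3  # deferred so the clamping helper works without the package

    settings = clamp_speech_settings(rate, volume)
    engine = pyttsx3.init()
    engine.setProperty("rate", settings["rate"])      # words per minute
    engine.setProperty("volume", settings["volume"])  # 0.0 (silent) to 1.0 (full)
    engine.say(text)
    engine.runAndWait()


if __name__ == "__main__":
    print(clamp_speech_settings(1000, 2))
```

Because `runAndWait()` blocks, a GUI application would typically call `speak` from a worker thread so the interface stays responsive.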

Image Analysis

Advanced mode for complex image processing
Enhanced text recognition accuracy
Support for various image formats
Optimized for technical content

Modern CustomTkinter UI

Responsive design that adapts to window size
Dark and light theme support
Smooth animations and transitions
Enhanced button feedback and interactions
Accessibility-focused design elements
Selection window for different modes

Configuration

Customizable settings through config.ini
Adjustable hotkeys for various functions
Theme preferences
Speech settings customization
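
To show how such settings could be read, here is a minimal sketch using Python's `configparser`. The section names, keys, and default values are hypothetical; check the config.ini shipped with the repository for the real layout.

```python
import configparser

# Hypothetical config.ini contents, used only for illustration.
SAMPLE_CONFIG = """
[hotkeys]
capture = ctrl+shift+r
toggle_speech = ctrl+shift+s

[ui]
theme = dark

[speech]
rate = 175
volume = 0.9
"""


def load_settings(text):
    """Parse INI text and return a flat dict, falling back to defaults for missing entries."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return {
        "capture_hotkey": parser.get("hotkeys", "capture", fallback="ctrl+shift+r"),
        "toggle_speech_hotkey": parser.get("hotkeys", "toggle_speech", fallback="ctrl+shift+s"),
        "theme": parser.get("ui", "theme", fallback="dark"),
        "rate": parser.getint("speech", "rate", fallback=175),
        "volume": parser.getfloat("speech", "volume", fallback=1.0),
    }


if __name__ == "__main__":
    print(load_settings(SAMPLE_CONFIG))
```

Using `fallback` for every lookup means a partial or empty config.ini still yields a complete set of settings.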

Project Structure

├── app/
│   ├── core/              # Core functionality
│   │   ├── ocr.py         # OCR processing
│   │   ├── speech.py      # Text-to-speech handling
│   │   ├── api.py         # API integrations
│   │   ├── screenshot.py  # Screen capture
│   │   └── image_analysis.py # Image analysis
│   ├── ui/                # User interface components
│   │   ├── ctk_main_window.py    # Main application window
│   │   ├── ctk_selection_window.py # Mode selection window
│   │   ├── ctk_theme.py          # Theme management
│   │   └── dialogs.py            # Dialog windows
│   ├── utils/             # Utility functions
│   │   ├── config.py      # Configuration handling
│   │   └── hotkey.py      # Hotkey management
│   └── main.py           # Application entry point
├── setup.py              # Dependency installation
├── run.py               # Runner script
└── run_screen_reader.bat # Windows batch launcher

Contributing

Feel free to submit issues, fork the repository, and create pull requests for any improvements.

License

This project is open source and available under the MIT License.
