Echo-voice

Minimal setup for speech-to-text transcription and text-to-speech generation.

Requirements

Python 3.9 (pynput requires ≤3.9, other libraries require ≥3.8)
ffmpeg (see installation instructions below)
A Python package manager such as uv (the commands below assume uv).

Instructions

Install the ffmpeg command-line tool, which is required for speech-to-text transcription. It is available from most package managers:

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on macOS using Homebrew (https://brew.sh/)
brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg
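
To confirm that the ffmpeg install is visible to Python, a small check like the following may help. It is a minimal sketch using only the standard library; the function names `find_ffmpeg` and `ffmpeg_version_line` are hypothetical and not part of this repository.

```python
import shutil
import subprocess
from typing import Optional


def find_ffmpeg() -> Optional[str]:
    """Return the full path of the ffmpeg executable, or None if it is not on PATH."""
    return shutil.which("ffmpeg")


def ffmpeg_version_line() -> Optional[str]:
    """Return the first line printed by `ffmpeg -version`, or None if ffmpeg is missing."""
    path = find_ffmpeg()
    if path is None:
        return None
    result = subprocess.run(
        [path, "-version"], capture_output=True, text=True, check=False
    )
    lines = result.stdout.splitlines()
    return lines[0] if lines else None


if __name__ == "__main__":
    version = ffmpeg_version_line()
    if version is None:
        print("ffmpeg was not found on PATH; install it with one of the commands above.")
    else:
        print(version)
```

If the script reports that ffmpeg was not found, opening a new terminal after installation is often enough for the updated PATH to take effect.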

Install Python dependencies:
```
uv sync
```
Run the application:
```
uv run python main.py
```
Controls:
- Press Space to start/stop recording
- Press ESC to exit the application
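
The controls above could be wired up with pynput roughly as follows. This is a sketch under assumptions, not the contents of `keyboard_listener.py`: the `RecordingToggle` class and `run_listener` function are hypothetical names, and ESC is assumed to stop any active recording before exiting.

```python
class RecordingToggle:
    """Tracks whether recording is active and whether the app should keep running."""

    def __init__(self) -> None:
        self.recording = False
        self.running = True

    def handle(self, key_name: str) -> bool:
        """Apply one key press; return False once the listener should stop."""
        if key_name == "space":
            # Space flips between recording and idle.
            self.recording = not self.recording
        elif key_name == "esc":
            # Assumption: ESC also ends any recording in progress.
            self.recording = False
            self.running = False
        return self.running


def run_listener(toggle: RecordingToggle) -> None:
    """Block until ESC is pressed, forwarding Space and ESC presses to the toggle."""
    # Imported lazily so RecordingToggle can be used without pynput installed.
    from pynput import keyboard

    names = {keyboard.Key.space: "space", keyboard.Key.esc: "esc"}

    def on_press(key):
        name = names.get(key)
        if name is None:
            return None  # ignore every other key
        if not toggle.handle(name):
            return False  # returning False stops a pynput listener
        return None

    with keyboard.Listener(on_press=on_press) as listener:
        listener.join()
```

Calling `run_listener(RecordingToggle())` would block until ESC is pressed. Keeping the state in a plain class separates the key-handling logic from pynput, so it can be exercised without a keyboard attached.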

Tech Stack

Python - Core language
OpenAI Whisper API - Speech-to-text transcription
PyAudio - Audio recording from microphone
pynput - Keyboard event handling
Bark - Text-to-speech generation
uv - Python package management
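
As an illustration of the recording step in this stack, the sketch below captures microphone audio with PyAudio and writes it to a WAV file that a transcription step could read. It is not taken from `recorder.py`: the function names, the 16 kHz mono 16-bit format, and the chunk size are all assumptions chosen for the example.

```python
import wave
from typing import Iterable, List

SAMPLE_RATE = 16000  # assumed sample rate in Hz
CHANNELS = 1         # assumed mono input
SAMPLE_WIDTH = 2     # bytes per sample for 16-bit PCM


def save_wav(path: str, frames: Iterable[bytes], rate: int = SAMPLE_RATE) -> None:
    """Write raw 16-bit PCM chunks to a WAV file at the given path."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(CHANNELS)
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(rate)
        wav.writeframes(b"".join(frames))


def record_seconds(seconds: float, rate: int = SAMPLE_RATE, chunk: int = 1024) -> List[bytes]:
    """Record roughly `seconds` of audio from the default microphone."""
    # Imported lazily so save_wav can be used without PyAudio installed.
    import pyaudio

    pa = pyaudio.PyAudio()
    stream = pa.open(
        format=pyaudio.paInt16,
        channels=CHANNELS,
        rate=rate,
        input=True,
        frames_per_buffer=chunk,
    )
    try:
        # Each read returns `chunk` samples as raw bytes.
        return [stream.read(chunk) for _ in range(int(rate / chunk * seconds))]
    finally:
        stream.stop_stream()
        stream.close()
        pa.terminate()
```

For example, `save_wav("clip.wav", record_seconds(3))` would record about three seconds and store them in `clip.wav` (a hypothetical file name).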
