ZeldaChat 🧠

A local, browser-based chat UI wired to a small FastAPI backend that talks to OpenAI.

It gives you:

A friendly persona using gpt-5-mini for chat.
Natural-sounding TTS using gpt-4o-mini-tts (voice: nova) with prosody shaping so it feels more alive.
Whisper-based speech-to-text so you can talk to Zelda via microphone instead of just typing.
A simple avatar front-end (PNG + SadTalker MP4 clips) that reacts in sync with the audio (as best as pre-rendered video allows).

🌐 Project Structure

The repo is intentionally minimal:

├── backend/
│ ├── main.py
│ ├── voice.py
│ ├── prosody.py
│ ├── transcribe.py
│ ├── zelda_key.env # your local API key (ignored)
│ ├── audio/ # generated audio (ignored)
│ └── video/ # pre-generated reaction videos
├── frontend/
│ ├── index.html
│ └── zelda.PNG

main.py – FastAPI app exposing:
- POST /chat – chat endpoint returning text + audio_url + tone
- POST /transcribe – transcription endpoint for microphone audio
- Static /audio mount for generated MP3 TTS files
voice.py – TTS helper using OpenAI gpt-4o-mini-tts with the nova voice and prosody shaping via prosody.py.
prosody.py – detects emotional tone of replies and reshapes text for more natural speech delivery, plus exposes detect_tone() used by the backend.
transcribe.py – audio transcription via Whisper (whisper-1).
index.html – the front-end chat UI with:
- Typing box + history
- Mic button (records audio, calls /transcribe)
- “Play Zelda’s voice” toggle
- Zelda avatar (PNG + pre-generated SadTalker MP4 clips)
audio/ – auto-created folder for generated MP3 files (served at /audio/...).
video/ – (optional) SadTalker MP4 clips used by the avatar (e.g. zelda_happy.mp4, zelda_neutral.mp4, etc.).
zelda_key.env – not committed: a single-line file containing your OpenAI API key, read by both main.py and voice.py.

✨ Features

🗣️ Conversational AI using gpt-5-mini
🔊 Natural Text-to-Speech via gpt-4o-mini-tts
🎤 Speech-to-Text using gpt-4o-transcribe
🎭 Emotion-based avatar reactions
🌐 Pure local HTML/JS frontend with optional web setup
⚡ Lightweight FastAPI backend (localhost:8000)
🧩 Three Personality Modes (Friendly, Therapist, Balanced)

🔧 Requirements

Python 3.11+ (3.10 will likely work, but 3.11 is recommended).
An OpenAI API key with access to:
- gpt-4.1-mini (chat)
- gpt-4o-mini-tts (TTS)
- whisper-1 (STT)
A modern browser (Chrome, Brave, etc.) with microphone access.

🚀 Quick Start

1. Install Dependencies

pip install -r requirements.txt

2. Add your API key

Create a file in /backend named: zelda_key.env and copy/paste your OpenAI key inside

3. Load the backend

cd backend
uvicorn main:app --reload

4. Start the frontend

Open frontend/index.html with your browser (Chrome recommended)

🤝 Contributing

Pull requests welcome. This is an evolving project; improvements and ideas are appreciated.

⭐ Acknowledgements

Built with:

Python / FastAPI
OpenAI APIs
HTML / JavaScript
Videos generated with SadTalker: https://github.com/OpenTalker/SadTalker

Name		Name	Last commit message	Last commit date
Latest commit History 67 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
license.txt		license.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ZeldaChat 🧠

🌐 Project Structure

✨ Features

🔧 Requirements

🚀 Quick Start

1. Install Dependencies

2. Add your API key

3. Load the backend

4. Start the frontend

🤝 Contributing

⭐ Acknowledgements

About

Uh oh!

Releases 6

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ZeldaChat 🧠

🌐 Project Structure

✨ Features

🔧 Requirements

🚀 Quick Start

1. Install Dependencies

2. Add your API key

3. Load the backend

4. Start the frontend

🤝 Contributing

⭐ Acknowledgements

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages