🎙️ Arabic Dubbing Demo

An end-to-end pipeline that transcribes English audio, translates the transcript to Arabic, and generates dubbed Arabic audio with two-speaker voice synthesis.

🚀 What It Does

This project takes an English audio file and produces a fully dubbed Arabic version through a multi-stage pipeline:

1. Transcription: converts English speech to text using Whisper
2. Translation: translates the transcript to Arabic
3. Tashkeel (diacritization): adds Arabic diacritics for accurate pronunciation
4. Text-to-Speech: synthesizes natural Arabic audio using Edge TTS with two-speaker support
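
The stages above run in sequence, each consuming the previous stage's output. The sketch below illustrates that data flow only. It assumes each stage can be wrapped as a plain function; the `DubbingPipeline` class and its field names are hypothetical and are not taken from the repository's scripts.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class DubbingPipeline:
    """Hypothetical wrapper that chains the four stages in order."""

    transcribe: Callable[[str], str]   # audio path -> English text
    translate: Callable[[str], str]    # English text -> Arabic text
    diacritize: Callable[[str], str]   # Arabic text -> Arabic text with tashkeel
    synthesize: Callable[[str], str]   # diacritized text -> output audio path

    def run(self, audio_path: str) -> str:
        english = self.transcribe(audio_path)
        arabic = self.translate(english)
        vocalized = self.diacritize(arabic)
        # Diacritization happens before TTS so the voice reads vowelled text.
        return self.synthesize(vocalized)
```

In practice each callable would wrap one of the real tools (for example Whisper for `transcribe` and Edge TTS for `synthesize`); stub functions are enough to check the ordering.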

🗂️ Project Structure

dubbing_demo/
│
├── transcribe.py                   # Transcribe audio to English text
├── transcribe_all_demo.py          # Batch transcription
├── transcribe_chunks.py            # Chunk-based transcription
├── transcribe_en.py                # English-specific transcription
│
├── translate.py                    # Core translation script
├── translate_to_ar.py              # Translate English text to Arabic
├── translate_dialogue_to_ar.py     # Dialogue-aware translation
├── fix_translation.py              # Post-process and fix translations
│
├── tashkeel_ar.py                  # Arabic diacritization
│
├── tts_ar.py                       # Arabic TTS (base)
├── tts_ar_edge.py                  # Arabic TTS using Edge TTS
├── tts_ar_edge_male.py             # Male voice TTS
├── tts_ar_two_speakers_edge.py     # Two-speaker Arabic dubbing
│
├── make_dialogue_demo.py           # Full pipeline demo runner
│
├── demo.wav                        # Sample input audio
├── transcript.txt                  # Generated transcript
├── translated.txt                  # Generated Arabic translation
├── final_demo_arabic.wav           # Final dubbed Arabic output
│
├── input/                          # Input audio files
├── tts_out/                        # TTS output audio files
├── work/                           # Intermediate working files
└── piper_voices/                   # Local TTS voice models

🛠️ Tech Stack

Tool	Purpose
OpenAI Whisper	Speech-to-text transcription
Microsoft Edge TTS	Neural Arabic voice synthesis
Piper TTS	Local offline TTS engine
Python 3	Core language

⚙️ Setup & Installation

# Clone the repository
git clone https://github.com/bedouhammad/dubbing-demo.git
cd dubbing-demo

# Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r requirements.txt

▶️ How to Run

Full pipeline demo:

python make_dialogue_demo.py

Step by step:

# Step 1: Transcribe
python transcribe.py

# Step 2: Translate to Arabic
python translate_to_ar.py

# Step 3: Generate dubbed audio
python tts_ar_two_speakers_edge.py
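
The three steps above can also be chained from a small Python wrapper that stops at the first failure. This is a sketch, not part of the repository: the helper names `build_commands` and `run_steps` are hypothetical, and it assumes the scripts are run from the project root with no command-line arguments, as in the commands above.

```python
import subprocess
import sys

# Script names as listed in the step-by-step commands.
STEPS = ["transcribe.py", "translate_to_ar.py", "tts_ar_two_speakers_edge.py"]


def build_commands(steps=STEPS, python=sys.executable):
    """Return one argv list per pipeline step."""
    return [[python, script] for script in steps]


def run_steps(commands):
    """Run each command in order; a non-zero exit raises CalledProcessError."""
    for cmd in commands:
        print("Running:", " ".join(cmd))
        subprocess.run(cmd, check=True)
```

Calling `run_steps(build_commands())` from the project root, inside the activated virtual environment, runs the steps in order. Using `sys.executable` keeps every step on the same interpreter.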

🎧 Example Output

Input: demo.wav — English audio
Output: final_demo_arabic.wav — Dubbed Arabic audio with two speakers

📌 Notes

This is a local demo pipeline built and tested on macOS
Arabic diacritization (tashkeel) is applied before TTS for more natural pronunciation
Two-speaker mode simulates dialogue dubbing with distinct male/female voices
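
One simple way to approximate the two-speaker mode described above is to alternate a female and a male voice across dialogue lines. The sketch below is an assumption about how this could be done and is not the code in `tts_ar_two_speakers_edge.py`. The voice names are Arabic voices offered by Edge TTS, but which voices the project actually uses is not stated; the strict turn-by-turn alternation, the helper names, and the output file naming are all hypothetical.

```python
import asyncio
from itertools import cycle
from pathlib import Path

# Assumed voices: one female and one male Egyptian Arabic Edge TTS voice.
VOICES = ("ar-EG-SalmaNeural", "ar-EG-ShakirNeural")


def assign_voices(lines, voices=VOICES):
    """Pair each non-empty line with a voice, alternating speakers per turn."""
    turns = [line.strip() for line in lines if line.strip()]
    return list(zip(cycle(voices), turns))


async def synthesize(turns, out_dir="tts_out"):
    """Write one audio file per turn (Edge TTS produces MP3 data)."""
    import edge_tts  # imported lazily so assign_voices works without it

    Path(out_dir).mkdir(exist_ok=True)
    for index, (voice, text) in enumerate(turns):
        target = Path(out_dir) / f"line_{index:03d}.mp3"
        await edge_tts.Communicate(text, voice=voice).save(str(target))
```

For example, `asyncio.run(synthesize(assign_voices(open("translated.txt", encoding="utf-8"))))` would produce one clip per line. Joining the clips into a single output file is a separate step not shown here.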

👤 Author

Abdelrahman Hammad
GitHub @bedouhammad
