Skip to content

Njihia413/sonic_ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

110 Commits
 
 
 
 
 
 

Repository files navigation

Sonic AI 🎙️

A powerful web application that provides instant voice cloning and synthesis through a simple, accessible interface. Transform any voice into a digital clone and generate natural-sounding speech with just a few clicks.

Overview

Sonic AI leverages advanced machine learning to capture the essence of any voice and reproduce it with remarkable accuracy. Whether you're creating content, developing accessibility tools, or exploring voice technology, Sonic AI makes voice cloning accessible to everyone.

How It Works: The 4 C's

1. 📥 Capture - Preserve Vocal Identity

Record or upload any voice sample (30+ seconds recommended) to begin the cloning process. The longer and clearer the sample, the better the results.

2. 🧬 Clone - Extract Vocal DNA

Our advanced AI analyzes and extracts the unique characteristics of the voice:

  • Tone patterns and pitch
  • Emotional cadence and rhythm
  • Unique mannerisms and speech patterns
  • Accent and pronunciation nuances

3. ✍️ Create - Define Your Message

Simply type any text you want the cloned voice to speak. No limits on length or complexity.

4. 🎵 Craft - Generate Authentic Speech

Instantly produce natural-sounding speech in the target voice with realistic intonation and emotion.

Technology Stack

  • Voice Cloning Engine: Coqui TTS - An advanced text-to-speech and voice cloning toolkit
  • Frontend: Next.js - React framework for production-grade web applications
  • Backend: FastAPI - Modern, fast Python web framework for building APIs

Project Structure

sonic_ai/
├── backend/        # API server and voice processing logic
├── frontend/       # Web interface
└── README.md       # Project documentation

Features

  • 🎤 Voice Recording: Built-in audio recorder for capturing voice samples
  • 📤 File Upload: Support for various audio formats
  • 🔄 Real-time Processing: Fast voice cloning and synthesis
  • 🎨 Intuitive Interface: User-friendly design for all skill levels
  • 🔊 High-Quality Output: Natural-sounding, authentic voice reproduction
  • 💾 Export Options: Download generated audio in multiple formats

Getting Started

Prerequisites

  • Node.js (v16 or higher)
  • Python (v3.8 or higher)
  • FFmpeg for audio processing

Installation

# Clone the repository
git clone https://github.com/Athens-AI/sonic_ai.git
cd sonic_ai

# Install backend dependencies
cd backend
pip install -r requirements.txt

# Install frontend dependencies
cd ../frontend
npm install

Running the Application

# Start the backend server
cd backend
uvicorn main:app --reload

# Start the frontend (in a new terminal)
cd frontend
npm run dev

Visit http://localhost:3000 to access the application.

Usage

  1. Upload or Record: Provide a voice sample of at least 30 seconds
  2. Wait for Processing: The AI will analyze the voice characteristics
  3. Enter Text: Type the message you want to be spoken
  4. Generate: Click the generate button to create your audio
  5. Download: Save your generated audio file

Use Cases

  • 🎬 Content Creation: Generate voiceovers for videos and podcasts
  • 📚 Audiobooks: Create personalized narrations
  • 🎮 Gaming: Generate character voices
  • Accessibility: Help those who have lost their voice
  • 🌍 Language Learning: Practice with different accents
  • 🎭 Entertainment: Create fun voice messages

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Coqui TTS for the powerful voice cloning engine
  • The open-source community for continuous support and inspiration

Disclaimer

This tool is designed for legitimate and ethical use cases. Users are responsible for ensuring they have proper consent and rights to clone and use any voice. Misuse of voice cloning technology may violate laws and regulations.


Built with ❤️ by Njihia

About

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors