Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
-
Updated
Aug 16, 2024 - Python
Simple Gradio application integrated with Hugging Face Multimodals to support visual question answering chatbot and more features
Run XTTS with Docker/Podman for voice fine-tuning in Gradio's Web UI
Saya Voice Assistant for Discord AI voice bot: listens, detects keywords, chats via LM Studio, and replies with TTS or voice cloning.
Book to MP3 converter. Convert e-books (FB2, EPUB, TXT) to MP3 audiobooks using various Text-to-Speech technologies.
Automatic video translator and dubber using Whisper, XTTS v2 for voice cloning, and Ollama for local LLM translation. Supports 100+ languages.
Dubbing english videos into russian.
XTTS fine-tuning via CLI
A Streamlit web app for AI-powered voice cloning using Coqui XTTS v2. Record or upload reference voices, clone speech in multiple languages, and generate natural audio outputs.
This program is designed to provide a graphical user interface for the xtts_api_server project: https://github.com/daswer123/xtts-api-server
Personal voice cloning CLI tool using XTTS-v2
This project aims to find a solution to make the xtts v2 model accessible via an API.
🌍 Transform videos by automatically transcribing, translating, and dubbing in multiple languages using AI-powered tools for seamless global reach.
🎙️ Build high-quality, self-hosted Text-to-Speech applications with voice cloning and multi-language support using the XTTS-v2 API.
Add a description, image, and links to the xtts-v2 topic page so that developers can more easily learn about it.
To associate your repository with the xtts-v2 topic, visit your repo's landing page and select "manage topics."