Viddie is an AI-powered platform for automated video creation, utilizing advanced machine learning models. It streamlines video production by combining GPT-4 for script generation, Conquis TTS for speech synthesis, DALL-E for image generation, and SadTalker for avatar animation. This allows users to generate high-quality videos with minimal manual effort.
There are two ways to run the project: manually or using Docker.
- Recommended Python version: 3.9 - 3.11
- Install the following dependencies:
- FFmpeg (Required for video rendering) - Installation Guide
- ImageMagick (Required for subtitles) - Download
- eSpeak-NG (Only required for some voice models) - Download MSI
- If you don't want to install extra voice packages, use only:
tts_models/en/ljspeech/tacotron2-DDC
- If you don't want to install extra voice packages, use only:
-
Navigate to the viddie folder and install dependencies:
pip install -r requirements/requirements.txt
-
Create a
checkpointsfolder and download required files from: Google Drive - Checkpoints -
Inside the viddie folder, create a
.envfile and add the following API keys:OPEN_API_KEYSEARCH_ENGINE_IDAPI_KEY
To find your Google search engine ID and API key, refer to this YouTube Guide.
-
Run the following commands to set up the database and start the server:
py manage.py makemigrations py manage.py migrate py manage.py loaddata fixtures/fixtures.json py manage.py setup_media py manage.py runserver
-
(Optional) To enable ElevenLabs voices, add your
XI_API_KEYin the.envfile and run:py manage.py setup_elevenlabs
- Create the
.envfile and add your OPEN_API_KEY, SEARCH_ENGINE_ID, API_KEY as per.env_example. - Download and place the required checkpoint files in the
checkpointsfolder from: Google Drive - Checkpoints - Navigate to the src folder and run:
docker-compose up --build
- URL: http://localhost:8000/admin
- Login Credentials:
- Username:
admin - Password:
pass
- Username:
You can find all API endpoints in Swagger: http://localhost:8000/swagger/
- Convert all
moviepyfunctions toFFmpegfor better performance. - Add Celery support for asynchronous and scheduled tasks.
- Implement unit tests for models, functions, and views.
For any inquiries or support, feel free to reach out:
- University Email: kodamia@cs.ihu.gr
- Personal Email: kostas2372@gmail.com
✅ Fixed Conquis TTS Docker issue
✅ Added support for Gemini and Claude AI models
✅ Integrated ElevenLabs API voices
✅ Enabled compilation video creation from Twitch (by game or streamer)
✅ Added OpenAI voices
✅ Integrated MidJourney and Stable Diffusion as image providers (Change providers in ``)
✅ Dockerized the application for easier deployment