Welcome to Asymptotic Cuteness, an innovative project developed during the lablab.ai hackathon using the rhymes.ai platform. This project explores the synergy between advanced AI models—Aria and Allegro—to iteratively enhance the cuteness of cat videos, approaching the asymptote of ultimate adorableness.
- Introduction
- Project Goal
- How It Works
- Technologies Used
- Usage
- Project Structure
- Results
- Potential Applications
- Contributing
- License
- Acknowledgments
Asymptotic Cuteness leverages cutting-edge AI technologies to create a self-improving system that enhances the appeal of cat videos through iterative optimization. By combining Aria, a multimodal large language model, and Allegro, an advanced generative video model, we demonstrate how AI can approach ideal outcomes through continuous refinement.
The primary objective is to generate a video of a cute cat and make it progressively cuter with each iteration. By forming an optimization loop, we aim to approach the asymptote of ultimate cuteness: perfection may be unattainable, but continuous improvement can bring us arbitrarily close.
The project follows a systematic process:
- Generate Initial Video: Use Allegro to create a video from a textual prompt (e.g., "a cute cat wearing glasses, sitting at a laptop with code on the screen").
- Analyze Video with Aria: Aria evaluates the video's cuteness, providing a rating and suggestions for enhancement.
- Enhance Key Frames: Extract key frames from the video and enhance them using Aria, focusing on features that increase appeal.
- Generate New Video: Feed the enhanced images back into Allegro to create a new, improved video.
- Iterative Optimization: Repeat the process, with each iteration aiming to produce a cuter video than the last.
This loop mirrors reinforcement learning principles:
- State: The current version of the video.
- Action: Enhancing images and regenerating the video.
- Reward: The cuteness rating provided by Aria.
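The state/action/reward framing above can be sketched as a small Python loop. This is a minimal illustration, not the project's actual code: `optimize`, `Step`, `generate`, and `rate` are hypothetical names, and the two callables stand in for whatever wrappers call Allegro and Aria. As a simplifying assumption, the action here appends Aria's suggestion to the prompt (in the spirit of catception_new_prompt.py) rather than enhancing frames.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Step:
    prompt: str    # state: the prompt that produced this iteration's video
    rating: float  # reward: the cuteness score returned by the critic


def optimize(
    prompt: str,
    generate: Callable[[str], str],            # hypothetical Allegro wrapper: prompt -> video reference
    rate: Callable[[str], Tuple[float, str]],  # hypothetical Aria wrapper: video -> (rating, suggestion)
    iterations: int = 4,
) -> List[Step]:
    """Run the generate -> rate -> refine loop and return one Step per iteration."""
    history: List[Step] = []
    for _ in range(iterations):
        video = generate(prompt)              # produce the current state
        rating, suggestion = rate(video)      # observe the reward and feedback
        history.append(Step(prompt, rating))
        prompt = f"{prompt}, {suggestion}"    # action: fold the suggestion into the next prompt
    return history


if __name__ == "__main__":
    # Stand-in callables so the loop can be run without API access.
    def fake_generate(p: str) -> str:
        return f"video({p})"

    def fake_rate(video: str) -> Tuple[float, str]:
        return (min(100.0, 50.0 + len(video) / 4), "with bigger eyes")

    for step in optimize("a cute cat wearing glasses", fake_generate, fake_rate, iterations=3):
        print(step.rating, "|", step.prompt)
```

Passing the model calls in as functions keeps the loop testable offline; real wrappers would replace the stand-ins.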
- Aria: A multimodal large language model capable of understanding and processing both text and images.
- Allegro: An advanced generative model that creates videos from textual prompts.
- Python: For scripting and automation of the optimization loop.
- GitHub: Version control and collaboration.
- Markdown: Documentation and presentation.
Note: Access to Aria and Allegro models is required.
- Clone the Repository
    git clone https://github.com/bioAI0/rhymesai
- Install Dependencies
  Install the required Python packages:
    pip install -r requirements.txt
- Set Up API Access
  Ensure you have API keys or access tokens for Aria and Allegro. Set them as environment variables.
- Run the Optimization Loop
  Edit the prompt.txt file to set your prompt.
  Execute the main script to start the iterative process:
    make
- View Results
  The generated videos and enhanced images will be saved in the output/ directory for each iteration.
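For the "Set Up API Access" step, a script could check that the credentials are present before starting a run. The sketch below is only an illustration: the variable names `ARIA_API_KEY` and `ALLEGRO_API_KEY` and the helper `load_credentials` are assumptions, not names taken from this repository, so substitute whatever the scripts actually read.

```python
import os
from typing import Dict, Iterable, Mapping

# Hypothetical variable names; replace with the ones the scripts expect.
REQUIRED_VARS = ("ARIA_API_KEY", "ALLEGRO_API_KEY")


def load_credentials(
    names: Iterable[str] = REQUIRED_VARS,
    env: Mapping[str, str] = os.environ,
) -> Dict[str, str]:
    """Return the requested variables, or fail early listing the missing ones."""
    names = list(names)
    missing = [name for name in names if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in names}
```

Failing before the first API call avoids discovering a missing key partway through an iteration.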
asymptotic-cuteness/
├── README.md
├── Makefile
├── catception_generate_video1.py
├── catception_get_video2.py
├── catception_analyze_video.py
├── catception_new_prompt.py
├── prompt.txt
├── requirements.txt
├── output/
└── LICENSE
- catception_generate_video.py: Generate a cat video with Allegro.
- catception_get_video.py: Get the URL of the generated video from Allegro.
- catception_analyze_video.py: Analyze the cat video with Aria.
- catception_new_prompt.py: Generate a new prompt with Aria.
- output/: Directory where outputs of each iteration are stored.
Through iterative optimization, the video's appeal improved from one iteration to the next:
- Visual Improvements: The cat's features become more expressive, colors are more vibrant, and the background is more engaging.
- Cuteness Rating: Incremental increases in the cuteness rating with each iteration, approaching the asymptote of ultimate cuteness.
Example Progression:
- Iteration 1: Cuteness Rating – 77/100
- Iteration 2: Cuteness Rating – 80/100
- Iteration 3: Cuteness Rating – 87/100
- Iteration 4: Cuteness Rating – 92/100
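Because the rating is capped at 100, the per-iteration gains must eventually shrink, which suggests a simple stopping rule. The sketch below computes the gains for the example progression above and checks for a plateau; `gains`, `has_plateaued`, `min_gain`, and `patience` are hypothetical names and thresholds chosen for illustration, not part of the project.

```python
from typing import List, Sequence


def gains(ratings: Sequence[float]) -> List[float]:
    """Difference between each rating and the one before it."""
    return [later - earlier for earlier, later in zip(ratings, ratings[1:])]


def has_plateaued(ratings: Sequence[float], min_gain: float = 1.0, patience: int = 2) -> bool:
    """True when the last `patience` gains were all smaller than `min_gain`."""
    recent = gains(ratings)[-patience:]
    return len(recent) == patience and all(g < min_gain for g in recent)


ratings = [77, 80, 87, 92]     # example progression from this README
print(gains(ratings))          # [3, 7, 5]
print(has_plateaued(ratings))  # False
```

With these four ratings the gains are still well above the threshold, so the loop would continue.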
While this project focuses on enhancing cat videos, the methodology can be applied to various fields:
- Marketing: Iteratively enhancing advertisements to maximize consumer engagement.
- Entertainment: Refining animations or CGI elements for greater audience appeal.
- Design: Continuously improving product images for higher aesthetic value.
Contributions are welcome! Please follow these steps:
- Fork the Repository
- Create a Feature Branch
    git checkout -b feature/your-feature-name
- Commit Your Changes
    git commit -m "Add your message here"
- Push to the Branch
    git push origin feature/your-feature-name
- Open a Pull Request
This project is licensed under the GPL v3. See the LICENSE file for details.
- lablab.ai: For hosting the hackathon and providing a platform for innovation.
- rhymes.ai: For developing the Aria and Allegro models.
- [My Wife]: For the creative "hacker cat" and "cute cat in a kitchen making its favorite meal" prompt ideas.
- [My Cat]: For inspiration and support.
Feel free to explore, contribute, and reach out with any questions or suggestions. Together, let's push the boundaries of AI-driven content optimization!