Skip to content

bioAI0/rhymesai

Repository files navigation

Asymptotic Cuteness: The Infinite Cat Optimization Loop

Welcome to Asymptotic Cuteness, an innovative project developed during the lablab.ai hackathon using the rhymes.ai platform. This project explores the synergy between advanced AI models—Aria and Allegro—to iteratively enhance the cuteness of cat videos, approaching the asymptote of ultimate adorableness.

Table of Contents

Introduction

Asymptotic Cuteness leverages cutting-edge AI technologies to create a self-improving system that enhances the appeal of cat videos through iterative optimization. By combining Aria, a multimodal large language model, and Allegro, an advanced generative video model, we demonstrate how AI can approach ideal outcomes through continuous refinement.

Project Goal

The primary objective is to generate a video of a cute cat and make it progressively cuter with each iteration. By forming an optimization loop, we aim to approach the asymptote of ultimate cuteness—acknowledging that while perfection may be unattainable, continuous improvement can bring us infinitely close.

How It Works

The project follows a systematic process:

  1. Generate Initial Video: Use Allegro to create a video from a textual prompt (e.g., "a cute cat wearing glasses, sitting at a laptop with code on the screen").

  2. Analyze Video with Aria: Aria evaluates the video's cuteness, providing a rating and suggestions for enhancement.

  3. Enhance Key Frames: Extract key frames from the video and enhance them using Aria, focusing on features that increase appeal.

  4. Generate New Video: Feed the enhanced images back into Allegro to create a new, improved video.

  5. Iterative Optimization: Repeat the process, with each iteration aiming to produce a cuter video than the last.

This loop mirrors reinforcement learning principles:

  • State: The current version of the video.
  • Action: Enhancing images and regenerating the video.
  • Reward: The cuteness rating provided by Aria.

Technologies Used

  • Aria: A multimodal large language model capable of understanding and processing both text and images.
  • Allegro: An advanced generative model that creates videos from textual prompts.
  • Python: For scripting and automation of the optimization loop.
  • GitHub: Version control and collaboration.
  • Markdown: Documentation and presentation.

Usage

Note: Access to Aria and Allegro models is required.

  1. Clone the Repository

    git clone https://github.com/bioAI0/rhymesai
  2. Install Dependencies

    Install the required Python packages:

    pip install -r requirements.txt
  3. Set Up API Access

    Ensure you have API keys or access tokens for Aria and Allegro. Set them as environment variables.

  4. Run the Optimization Loop

    Edit the prompt.txt file to set your prompt.

    Execute the main script to start the iterative process:

    make
  5. View Results

    The generated videos and enhanced images will be saved in the output/ directory for each iteration.

Project Structure

asymptotic-cuteness/
├── README.md
├── Makefile
├── catception_generate_video1.py
├── catception_get_video2.py
├── catception_analyze_video.py
├── catception_new_prompt.py
├── prompt.txt
├── requirements.txt
├── output/
└── LICENSE
  • catception_generate_video.py: Generate cat video with Allegro
  • catception_get_video.py: Get the URL of the generated video with Allegro
  • catception_analyze_video.py: Analyze cat video with Aria
  • catception_new_prompt.py: Generate new prompt with Aria
  • output/: Directory where outputs of each iteration are stored.

Results

Through iterative optimization, the project achieves significant enhancements in the video's appeal:

  • Visual Improvements: The cat's features become more expressive, colors are more vibrant, and the background is more engaging.
  • Cuteness Rating: Incremental increases in the cuteness rating with each iteration, approaching the asymptote of ultimate cuteness.

Example Progression:

  • Iteration 1: Cuteness Rating – 77/100
  • Iteration 2: Cuteness Rating – 80/100
  • Iteration 3: Cuteness Rating – 87/100
  • Iteration 4: Cuteness Rating – 92/100

Potential Applications

While this project focuses on enhancing cat videos, the methodology can be applied to various fields:

  • Marketing: Iteratively enhancing advertisements to maximize consumer engagement.
  • Entertainment: Refining animations or CGI elements for greater audience appeal.
  • Design: Continuously improving product images for higher aesthetic value.

Contributing

Contributions are welcome! Please follow these steps:

  1. Fork the Repository

  2. Create a Feature Branch

    git checkout -b feature/your-feature-name
  3. Commit Your Changes

    git commit -m "Add your message here"
  4. Push to the Branch

    git push origin feature/your-feature-name
  5. Open a Pull Request

License

This project is licensed under the GPL v3. See the LICENSE file for details.

Acknowledgments

  • lablab.ai: For hosting the hackathon and providing a platform for innovation.
  • rhymes.ai: For developing the Aria and Allegro models.
  • [My Wife]: For the creative "hacker cat" and "cute cat in a kitchen making its favorite meal" prompt idea.
  • [My Cat]]: For inspiration and support

Feel free to explore, contribute, and reach out with any questions or suggestions. Together, let's push the boundaries of AI-driven content optimization!

About

rhymesai

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors