Asymptotic Cuteness

Created by team bioAI on November 02, 2024

**Asymptotic Cuteness: The Infinite Cat Optimization Loop** is an innovative project that leverages advanced artificial intelligence models to iteratively enhance the cuteness of a cat video. Developed during the lablab.ai hackathon, the project utilizes rhymes.ai's state-of-the-art models, **Aria** and **Allegro**, to create a self-improving system based on reinforcement learning principles.

The process begins with **Allegro**, an advanced generative model that creates videos from textual prompts. By inputting the simple prompt **"a cute cat,"** Allegro generates an initial video featuring an adorable feline. This video serves as the starting point for optimization.

Next, **Aria**, a multimodal large language model capable of understanding and processing both text and images, analyzes the video. Aria evaluates various aspects such as the cat's facial features, expressiveness, fur softness, color vibrancy, and overall emotional impact. It provides a cuteness rating and suggests areas for enhancement.

Key frames capturing the essence of cuteness are extracted from the video. Using Aria's capabilities, these frames are enhanced by focusing on elements that increase appeal: making the cat's eyes larger and more expressive, softening the fur texture, brightening colors, and adding playful poses or backgrounds. The enhanced images are then fed back into Allegro to generate a new, improved video.

This forms an optimization loop where each iteration aims to produce a cuter video than the last, approaching the asymptote of ultimate cuteness. The process mirrors reinforcement learning:

- **State**: The current version of the video.
- **Action**: Enhancing images and regenerating the video.
- **Reward**: The cuteness rating provided by Aria.
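The loop described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the team's implementation: `generate`, `rate`, and `enhance` are hypothetical callables standing in for Allegro, Aria, and the frame-enhancement step, the generation input is simplified to a text string, and the `min_gain` stopping rule is an added assumption to keep the loop finite.

```python
from typing import Callable, List, Tuple

Video = str  # placeholder type; a real pipeline would pass video files or frames


def cuteness_loop(
    generate: Callable[[str], Video],                  # hypothetical stand-in for Allegro
    rate: Callable[[Video], Tuple[float, List[str]]],  # hypothetical stand-in for Aria
    enhance: Callable[[str, List[str]], str],          # hypothetical enhancement step
    start: str = "a cute cat",
    max_iters: int = 5,
    min_gain: float = 0.5,                             # assumed plateau threshold
) -> Tuple[Video, float, List[float]]:
    """Run generate -> rate -> enhance and return the best video, its score, and all scores."""
    gen_input = start
    video = generate(gen_input)           # initial state
    score, suggestions = rate(video)      # initial reward
    best_video, best_score = video, score
    history = [score]
    for _ in range(max_iters):
        gen_input = enhance(gen_input, suggestions)  # action
        video = generate(gen_input)                  # new state
        score, suggestions = rate(video)             # reward
        history.append(score)
        gain = score - best_score
        if gain > 0:
            best_video, best_score = video, score
        if gain < min_gain:                          # rating has plateaued: stop
            break
    return best_video, best_score, history


# Toy stand-ins so the sketch runs without any model access.
def toy_generate(gen_input: str) -> Video:
    return f"video<{gen_input}>"


def toy_rate(video: Video) -> Tuple[float, List[str]]:
    # Score saturates toward 100 as enhancements (marked by "+") accumulate.
    n = video.count("+")
    return 100 * (1 - 0.5 ** (n + 1)), ["bigger eyes"]


def toy_enhance(gen_input: str, suggestions: List[str]) -> str:
    return f"{gen_input} +{suggestions[0]}"


best_video, best_score, history = cuteness_loop(toy_generate, toy_rate, toy_enhance)
print(best_score, history)
```

With the toy rater, each iteration halves the remaining distance to 100, so the scores rise but never reach the ceiling, which is the "asymptote" in the project's name. Keeping the best-scoring video rather than the latest one guards against an iteration that makes things worse.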

"Good application of technology. Leveraging Aria to create a loop effectively maximizes Allegro's capabilities."

Yacine Si Tayeb

"Asymptotic Cuteness, who doesn’t love cat videos! Although it’s not really reinforcement learning, the self-evolving idea is definitely very creative. Allegro will soon receive an I2V (image + text as input) upgrade; by then you can actually upgrade iteratively based on the previous video. And perhaps an M2V (motion as guide) upgrade as well. One thing: using LLMs as a judge to give a score from 0 to 100 is perhaps not the best idea. A more robust and reliable way is to give Aria two pictures and a list of judging criteria, and ask it to answer only which picture is better on a specific criterion."

Maazel maazel

Engineer
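The pairwise judging suggested in the feedback above could look roughly like the following sketch. Everything here is an assumption made for illustration: `judge` is a hypothetical callable that would wrap a call to Aria and answer only `"A"` or `"B"` for one criterion, images are represented as toy dictionaries, and the majority-vote rule is one possible way to combine the per-criterion answers.

```python
from typing import Callable, Dict, Sequence

Image = Dict[str, float]  # toy representation: criterion -> hidden quality value

# Criteria taken from the project description above.
CRITERIA = (
    "facial features",
    "expressiveness",
    "fur softness",
    "color vibrancy",
    "emotional impact",
)


def candidate_wins(
    judge: Callable[[Image, Image, str], str],  # hypothetical wrapper around Aria
    incumbent: Image,
    candidate: Image,
    criteria: Sequence[str] = CRITERIA,
) -> bool:
    """Return True if the candidate is preferred on a strict majority of criteria."""
    wins = sum(1 for c in criteria if judge(incumbent, candidate, c) == "B")
    return 2 * wins > len(criteria)


def toy_judge(image_a: Image, image_b: Image, criterion: str) -> str:
    # Toy stand-in: picks whichever image has the higher value for this criterion.
    return "B" if image_b[criterion] > image_a[criterion] else "A"


old = {c: 0.5 for c in CRITERIA}
new = dict(old, **{"expressiveness": 0.9, "fur softness": 0.8, "color vibrancy": 0.7})
print(candidate_wins(toy_judge, old, new))
```

A boolean "is the new one better" answer can replace the numeric reward in the loop: a new iteration is kept only when it beats the current best in the comparison.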

"Incredible idea! It's like a diffusion model at prompt level :)"
