Multimodal QA with Eval and Image Gen

Created by team Agent Artificial on December 27, 2023

LiteLLM manages a hot swap between two vision models, Gemini and GPT-4V by default, though any of the 100+ models LiteLLM supports can be substituted, so their behaviors can be compared side by side. Because LiteLLM exposes an OpenAI-compatible proxy server, even open-source models can call functions through it. The included function generates images via fal.ai's realtime LCM endpoint in response to user requests, creating a rapid iteration loop between the user and the vision model: generate an image, show it to the vision model along with the requested changes, and have the model regenerate it on the realtime endpoint. TruLens evals provide metric comparisons between the models on vision tasks throughout this loop. A minimal sketch of the hot-swap and function-calling pieces follows below.
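The sketch below illustrates the idea, not the project's exact code: the tool name `generate_image`, the fal.ai app id, the response shape, and the model id strings are all assumptions and should be swapped for whatever your setup actually uses.

```python
import litellm      # OpenAI-compatible interface over 100+ providers
import fal_client   # fal.ai Python client

# Hypothetical tool definition the vision model can call to request an image.
IMAGE_TOOL = {
    "type": "function",
    "function": {
        "name": "generate_image",
        "description": "Generate an image from a text prompt via a realtime LCM endpoint.",
        "parameters": {
            "type": "object",
            "properties": {"prompt": {"type": "string"}},
            "required": ["prompt"],
        },
    },
}

def generate_image(prompt: str) -> str:
    # Placeholder fal.ai app id and response shape -- substitute the realtime
    # LCM endpoint and result parsing for the endpoint you actually call.
    result = fal_client.run("fal-ai/fast-lcm-diffusion", arguments={"prompt": prompt})
    return result["images"][0]["url"]

def ask(model: str, messages: list):
    # litellm.completion speaks the OpenAI chat format regardless of provider,
    # so hot-swapping models is just a change of the `model` string.
    return litellm.completion(model=model, messages=messages, tools=[IMAGE_TOOL])

messages = [{"role": "user", "content": "Draw a watercolor fox, then critique it."}]
for model in ("gemini/gemini-pro-vision", "gpt-4-vision-preview"):  # assumed model ids
    response = ask(model, messages)
    print(model, response.choices[0].message)
```

In the full loop, the image URL returned by `generate_image` would be appended to `messages` as image content and sent back to the vision model with the user's requested changes, and a TruLens feedback function could wrap `ask` to log and score each model's responses for comparison.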

Category tags: