OpenAI Whisper Applications

ImmersiveCulturalExplorer

Harmony

This project is a Python script that downloads a YouTube video (or uses a local video file), transcribes it, translates the transcript into a target language, and generates a video with dual subtitles (original and translated).

GPT-3.5, ChatGPT, Whisper, Cohere, Gemini AI
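The dual-subtitle step described in this entry can be sketched as follows. This is a minimal illustration rather than the project's actual code: it assumes transcription and translation have already produced timed segments, and the `Segment` class and the `dual_srt` function are hypothetical names introduced here. Each cue is written in SRT form with the original text on the first line and the translation on the second.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed span (hypothetical structure, not the project's own)."""
    start: float       # start time in seconds
    end: float         # end time in seconds
    original: str      # text in the source language
    translated: str    # text in the target language


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def dual_srt(segments: list[Segment]) -> str:
    """Render SRT cues: original text on line one, translation on line two."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.original}\n{seg.translated}\n"
        )
    # A blank line separates consecutive cues.
    return "\n".join(cues)


if __name__ == "__main__":
    demo = [Segment(0.0, 2.5, "Bonjour à tous.", "Hello, everyone.")]
    print(dual_srt(demo))
```

The resulting text can be saved as an `.srt` file and burned into or attached to the video by whichever video tool the pipeline uses.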

VOCALYTICS - Intelligence Speech Transformation

VOCALYTICS offers cutting-edge solutions for intelligent audio and speech transformation: speech synthesis, voice conversion, audio processing, transcription, and voice biometrics.

Sanjoy Kumar

Cohere, OpenAI, Llama 3, Qdrant, Whisper, LangChain, Twelve Labs, SeamlessM4T, ElevenLabs

Semana

Our app uses the Sema API for seamless video and audio subtitles in 200+ languages. The Sema API is based on our custom translation model, capable of translating across 200 languages.

sema

Lolacap

Lolacap is an AI-powered local-language video captioner that breaks language barriers and enriches video content by adding subtitles in local languages (Fon and Yoruba).

Genesis

OpenAI, Whisper, Stable Dreamfusion

GBETCHE

- Fon subtitles for videos originally in French or English
- Image generation from text written in Fon
- Audio translation: French to Fon, Fon to French, English to Fon, Fon to English

GBETCHE

Noulimon

Our application is a calculator that can perform operations in local languages.

TAKA TEAM

Whisper, OpenAI

PLANTID

PlantID Bénin is an innovative mobile application that uses computer vision and AI to identify Benin's indigenous medicinal plants. PlantID empowers communities, preserves botanical heritage, and stimulates research.

AfriTech Innovs

Whisper, Mistral AI, Stable Dreamfusion

Transcribe Pro - Audio Transcription with AI

Transcribe Pro: AI-powered audio transcription tool for fast, accurate & cost-effective transcriptions. Supports 74 languages, scalable & user-friendly. Revolutionizing transcription for businesses, organizations & individuals.

Neural Nomads

Custom GPTs, Whisper, GPT-3.5, OpenGPTs, OpenAI

Ben Interactive Video

Ben Interactive Video lets you interact with videos in your preferred language through subtitles and dubs, and ask questions about the video. It also saves time, since you don’t have to watch the entire video.

AI Powerhouse

Vectara, OpenAI, Whisper

Dôkun

A mobile platform that aims to be the showcase of Benin's culture and tourism.

Imole

OpenAI, ElevenLabs, Whisper

GBECHEMIN

Translation and dubbing from French/English into local languages.

Zenith

alɔ Do mɛ tɔ: Multifunctional Voice Assistant

alɔ Do mɛ tɔ is a revolutionary mobile application that integrates voice recognition to control the phone, translate French/English text (written or in an image) into audio in a local language, and run internet searches by voice.

AIDA

Assistants API, Gemini AI, Whisper

CamLive

Cam Live makes it easy to navigate between videos from different sources, while integrating AI to detect and translate languages simultaneously and stream them online. It's software for masses, interviews, long speeches, online meetings and much more...

issoft

Whisper, ElevenLabs, EasyOCR, Reinforcement Learning

Multilingual video Dubbing

Our project provides multilingual video dubbing with emotional nuances, synchronized subtitles, lip-sync technology, and video supers. It also generates concise text and video summaries, ensuring accessible and engaging content for a global audience.

CodeMaestroes

Benin360

Explore Benin's tourist sites with a multilingual virtual tour guide and introductory tour videos adapted to your language.

neural nexus

OpenAI, Whisper, GPT-4 Vision, Pinecone, LangChain

FongBot - BENIN FONGBE COMPANION

FongBot helps you prepare for your next trip to Benin by showing you how to say things, translating from English, and generating images from Fon expressions.

KIBERNUM USA

GPT-4 Vision, DALL·E Image Generation API, Whisper, Vercel

VoiceChatApplication

Voice conversation using the Gemini Pro model, Microsoft SpeechT5, and Faster Whisper.

MuneebUllah

Assistants API, OpenAI, GPT-3.5, open-interpreter, privateGPT, ChatGPT, GPT-3, Whisper, LangChain

Consierge AI

Hyper-realistic concierge AI (FastAPI, long-term memory, TTS)

CAI

ElevenLabs, OpenAI, Whisper

Sara Palliative Care for The New Age

A compassionate companion for the terminally ill, offering personalized experiences like reliving memories through images, their favorite music, and connecting with loved ones. It's accessible and comforting for a more memorable end-of-life journey.

Object Recognizers

Mistral AI, ChatGPT, Whisper

EasyFastAI - Empower your business with AI

Empower your business with AI. Customize AI to automate your business enquiries via WhatsApp in under 5 minutes!

Easy Fast AI

Medcare

Medcare outlines a revolutionary AI-powered solution aimed at transforming healthcare efficiency. By automating medical documentation and streamlining patient communication, it addresses key challenges like professional burnout and staff shortages.

Monolith

OpenAI, GPT-3.5, ChatGPT, LlamaIndex, Whisper, LangChain, GPT-4 Vision, GPT-3

ShopAssist

An AI-powered assistant for influencers that indexes all their social content in a Q&A-style chatbot. Your followers can quickly search through your posts and find the products you reviewed, which helps you generate more sales.

Influencer Assist

GPT-3.5, OpenAI, Pinecone, GPT-4 Vision, Whisper, LangChain, TruLens

Multilingual Video Translator and Transcriber

An innovative multilingual video translator and transcriber that breaks language barriers with seamless translation and transcription for diverse audiences worldwide.

Taleemabad

Whisper, GPT-4 Vision

E-Mind Mirror App - Your Digital Thought Diary

E-Mind Mirror is a revolutionary CBT thought diary app that uses voice recording and AI to simplify journaling 10x, making therapy more accessible and effective. E-Mind Mirror App: Speak Don't Type!

EMindMirror

OpenAI, Whisper, DALL-E-2, TruLens

LinguaSync AI

Languify revolutionizes language learning with personalized, immersive experiences powered by LLM technology, offering tailored paths for each user's proficiency level and learning style.

EchoMinds

OpenAI, GPT-3.5, GPT-4 Vision, TruLens, Stable Diffusion, Reinforcement Learning, Whisper, GPT-4

NevrForget

A mobile app that aims to refresh elderly people's memories to help them cope with dementia.

Code Sync

OpenAI, GPT-3.5, Whisper, TruLens, LangChain, DALL·E Image Generation API

ResearchWriterGPT

ResearchWriterGPT helps draft research papers using both text and visual data analysis. Harnessing GPT-4, GPT-4 Vision, a RAG vector database, and TruLens LLM evaluation, it provides advanced research assistance.

TaskLife

GPT-4 Vision, GPT-4, Pinecone, TruLens, Whisper

Doweb AI

Doweb is a versatile AI platform offering a range of services including video and audio transcription, data analysis, multi-language translation and dubbing, speech synthesis, voice cloning, AI chatbots, code development, and image processing.

Doweb

Assistants API, Custom GPTs, OpenAI, GPT-3.5, LangChain, SeamlessM4T, ElevenLabs, GPT-4 Vision, Chroma, Whisper

TalkTriUnity

We have created a virtual assistant that is able to communicate via Sign Language, Text and Audio. Users can enter their query in Sign Language or Text or Audio and the assistant replies in text and audio.

rhineSaur

Clarifai, gpt4all, Whisper

Vidiator AI

Vidiator.ai: your personalized text-to-video chatbot. Effortlessly transform ideas into engaging videos with a user-friendly interface and rapid production capabilities. Simplify, create, captivate.

Techiee Hackers

Clarifai, GPT-4 Vision, DALL·E Image Generation API, OpenAI, Whisper

NOOBIES AI

Noobies.ai - Your Ultimate Content Creation Wingman.

Noobies

GPT-3.5, DALL·E Image Generation API, Clarifai, ElevenLabs, Whisper

Nivx Voice Multi Modal Digital Shopping Assistant

Nivx AI is a voice-first digital shopping assistant for small and mid-size e-commerce businesses. On the frontend, it harnesses multimodal generative AI to create an immersive shopping experience.

Nivx

Gemini AI, Weaviate, Whisper, OpenAI

Virtual Grandchild

An AI specialized in security, information, and care to support elderly people's relationship with technology.

Virtual Grandchild

Gemini AI, GPT-4 Vision, Whisper, TruLens

GENIS Voice and Visual Chat Messaging Application

Get visual explanations and text/number extraction by sending an image to our LINE Official Account.

GENIS

OpenAI, open-interpreter, Qdrant, GPT-3.5, Whisper, LlamaIndex

VectaServe

Vectara's RAG-Enhanced Customer Service Bot Platform

LJZ

GPT-3.5, Vectara, LangChain, Whisper

Legal AI II

PatentableClaimExtraction listens to the conversations that inventors and builders have, extracts the patentable claims, and writes them in patent claim format, reducing time to market from weeks and thousands of dollars to a few minutes.

Team Tonic

Whisper, OpenAI, BERT

FieldAssessment

This app lets you take a picture with your camera in the field for a real-time, on-the-fly assessment of your situation. Empower field teams with AI!

Team Tonic

OpenAI, Whisper

SightCom 2

Smart glasses software prototype for the visually impaired, utilizing OpenAI and Clarifai technologies

Louis

OpenAI, GPT-3.5, Clarifai, DALL-E-2, Whisper

Schrodinger ClarifaiLlama

Built "Schrödinger's ClarifaiLlama" at the Clarifai + Llama 2 hackathon by lablab.ai. Our app ingests multimedia data, indexes it with vector search, and generates custom content from user queries.

Schrödinger's ClarifaiLlama Hackathon

Clarifai, LangChain, OpenAI, Llama 2, Whisper, Chroma

Echo Ai

Elevate meetings with Echo AI – beyond transcripts. Autonomous agents turn discussions into actionable insights. Effortlessly summarise, organise tasks, update absentees, suggest follow up emails and actions, and provide resources.

Hacktolive

LangChain, OpenAI, Whisper, GPT-3.5

TrueCast

TrueCast: An AI-powered tool that monitors podcast conversations in real-time, instantly verifying claims against multiple sources and providing relevant media upon request, ensuring podcast accuracy and authenticity.

Just DO IT

OpenAI, GPT-3.5, GPT-4, Stable Diffusion, Whisper

InterviewerGPT

Put technical phone screens on autopilot. Can also be used by new grads for mock interviews.

Makeabot

OpenAI, GPT-3.5, LangChain, Whisper, ElevenLabs

OptiAgents

OptiAgents: autonomous agents for competitive intelligence. We explore three use cases: YouTube Intelligence, Wikipedia Intelligence, and PDF Intelligence.

OptiAgents

LangChain, OpenAI, GPT-3.5, GPT-3, ChatGPT, Whisper, GPT-4

EffiQuery Intelligent IT Support Automation

Our innovative RAG and LLM solution automates responses to well-documented helpdesk/IT requests. Enjoy faster query resolution, heightened productivity, and optimized costs with seamless integration.

THE MODELS

OpenAI, Weaviate, Whisper, LLaMA, Llama 2

Transforming Text into Tangible 3D Objects

This project covers the full cycle from 3D design to printing the object. OpenAI's Shap-E and gpt4all were used in a Streamlit application to generate an avocado-shaped flower vase, printed from recycled plastic filament.

RSLT

OpenAI, Shap-E, Whisper, GPT-4, Stable Dreamfusion, GET3D, Point-E

TalkSenseAI - AI Telephony Customer Support

Elevate telephony customer support with TalkSense.AI's cutting-edge platform, reducing waiting times and offering personalized interactions for a seamless caller experience.

TalkSense.AI

ElevenLabs, Whisper, OpenAI

Skeen

Skeen is an app that detects skin conditions from user-uploaded pictures and analyzes the user’s lifestyle and habits to identify potential causes. It also features an AI Assistant chatbot for skincare, which now also serves as a voice assistant.

KandM

GPT-3.5, Whisper, OpenAI, ElevenLabs

Kasuku AI

AI-powered customer service chatbot with text and audio capabilities.

afrineuron

GPT-3.5, ChatGPT, Whisper, LangChain, ElevenLabs

VoiceStoryBoard

A platform for automatic character identification & voice assignment for any children's storybook.

Character Mania

ElevenLabs, OpenAI, Whisper

SUMMA

Soma: Revolutionizing Audio Conversion & Translation! Convert long audio recordings to text effortlessly, translate into various languages, and access summarized content. With a vast market size targeting 1.35 billion English speakers and 480 million Arab

SUMMA

GPT-3.5, OpenAI, Whisper, Stable Diffusion, ElevenLabs

BlaBlaLand - your personal AI companion

An accessible, platform-agnostic voice interface using advanced AI tech like ChatGPT and Whisper. It enables personalized digital character creation, aiming to transform human-computer interaction and promote inclusive communication.

BlaBlaLand

Dreaming AI Language Tutor

We offer cheaper language learning experiences, available everywhere.

Dreaming AI

ElevenLabs, Whisper, OpenAI, ChatGPT

Voice verse Video Transcription

"Voila! Video Translator – the perfect companion to effortlessly translate any foreign language video into English. Enjoy an uninterrupted, comprehensive experience as this innovative tool breaks the language barrier for you!"

Voice verse

OpenAI, GPT-3.5, ElevenLabs, Whisper

ShortGPT - Open-Source Automated Content Creation

ShortGPT is an open-source AI framework designed to automate video and short content creation from scratch, including a user-friendly web interface for customization.

ShortGPT

Peroni Language Tutor

Learn languages at the speed of polyglot XiaomaNyc by following his method of immersive learning, and be tutored by your own voice!

Peroni

OpenAI, GPT-4, Whisper, ElevenLabs

Auto-Vid

A different, automatic way to generate educational, entertaining, and creative content: short-form videos of one minute or less, for platforms like YouTube Shorts.

Auto-Vid

OpenAI, GPT-3.5, Stable Diffusion, Whisper

Retriever AI

Retriever AI is an application that leverages artificial intelligence to let users navigate Windows with just their voice. It is powered by ElevenLabs speech synthesis, OpenAI's Whisper, and Google's PaLM 2 LLM.

Spill

OpenAI, PaLM, ElevenLabs, Whisper

Noter

With Noter, you're not just taking notes, you're freeing your mind to focus on what really matters.

Noter

GPT-3.5, ElevenLabs, Whisper

Speak Stream AI - Languista

Languista is a near real-time audio translator using OpenAI's GPT model side by side with ElevenLabs voice models. It enables multi-user sessions and broadcasts AI responses to all participants.

AI Driven Designers

OpenAI, GPT-3.5, ElevenLabs, Whisper

The most effective way to learn a new language

Imancity is an AI-powered language learning platform that makes it easy and effective for everyone to learn new languages. It is personalized, interactive, and cost-effective.

Imancity

OpenAI, ChatGPT, Vercel, ElevenLabs, Whisper, GPT-4

Habble

Habble is the one-stop solution for anyone who wants to improve their language as well as conversational skills. Habble is an AI-powered tool that simulates human conversations and provides real-time feedback to accelerate your learning experience.

Habble

OpenAI, GPT-3.5, Whisper, ElevenLabs

Patient Simulator

Practice communication for medical professionals by talking to AI patients.

We put AI in Medical EducAtIon

OpenAI, GPT-3.5, GPT-4, Whisper, ElevenLabs, Stable Diffusion

LanGo

LanGo is an innovative conversational app designed to help English and French speakers practice their language skills with the assistance of a patient and interactive native speaker.

LanGo

OpenAI, ElevenLabs, Whisper, GPT-3.5

MentalSync II

Unlimited voice assistant that uses GPT and speech-to-text (Whisper).

MentalSync

OpenAI, GPT-3.5, Whisper

Debate 101

Debate.lol is an app where you can debate the best speakers in the world on any topic of your choosing, powered by AI.

Plato

OpenAI, GPT-3.5, Whisper, ElevenLabs

Glocaster Breaking Language Barriers

Our Glocaster App revolutionizes video content by automating dubbing, providing high-quality synthesized voices, and breaking language barriers. With a user-friendly interface, it's perfect for content creators seeking a global audience.

Febus

OpenAI, ChatGPT, Whisper, ElevenLabs

Whispy - AI for Accessibility

Whispy is an accessibility tool built to enable users of any ability to join and use a voice chat. Leveraging the ElevenLabs API, we deliver TTS to the voice call. Speech in the call is transcribed per user by the Whisper model.

shrimple

ElevenLabs, Whisper

Laura AI-minds

A language teacher who can help you learn almost any language.

AI Minds

GPT-3.5, OpenAI, ElevenLabs, Whisper, ChatGPT

Assistant Store and Assistant Factory

Similar to an app store, the Assistant Store lets you buy assistants with realistic voices and descriptions, created by other users in the Assistant Factory.

Assistant Store

ElevenLabs, Whisper, OpenAI, LangChain, GAN

Verbify II

Speech-to-speech conversion. This tool converts speech (video or audio) from any foreign language into your local language or any language of your choice.

Verbify

OpenAI, Whisper, ElevenLabs, GPT-3.5, ChatGPT

Enchant AI

We have created an AI agent for call agencies and businesses.

Pak Falcons

GPT-3.5, Whisper, ElevenLabs, Generative Agents

Smart decision with AI and cognitive science

Strategic Thinking Systems (STS) lies at the convergence of AI, cognitive science, spatial and web3, and voice! It facilitates the organization and communication of thoughts in the context of important decisions, putting users in charge of their content.

Strategic Thinking Systems

ElevenLabs, GPT-3.5, Whisper

CloneDub

CloneDub lets you translate audio for podcasts or YouTube videos into different languages while keeping the same voices or using AI-generated voices.

CloneDub

Whisper, GPT-4, ElevenLabs, Vercel

Real Time Language Translation for video calls

Language barriers make it hard to communicate with another person. If I want to talk with someone in another country, I may not understand their local language.

Code sapphire

OpenAI, Whisper, AWS SageMaker, Text Generation Web UI

MentalSync

UNLIMITED AI Voice Assistant in your pocket 24/7 :)

MentalSync

GPT-3.5, OpenAI, Whisper

Research assistant

This project provides a powerful tool for researchers, enabling them to easily search for and download academic papers from Google Scholar. It lets users build a knowledge base from the downloaded papers and answers questions about their content.

RONG

PaLM, LangChain, Whisper

moviai - movie production erp

moviai: movie production ERP with generative AI. Generate an Oscar-standard screenplay from an audio recording. Speech to screenplay: https://www.youtube.com/watch?v=sNL5mVOdsAk

aimoovi

Whisper, PaLM

Cohesive AI

The CRM of the future is natural and transparent data gathering while surfacing critical information as needed.

Cohesive AI

Monday AI Assistant, ChatGPT, Whisper, GPT-3.5, Stable Diffusion

CALL ALL

Automated meeting artifacts to streamline documenting, analyzing, and acting on meeting discussions

Etrog

OpenAI, Whisper, GPT-3.5

MondayVox

A voice assistant for making changes to any board on Monday.com. Users can add, update, or delete any item on a Monday board.

LENS Corporation

Monday.com, Whisper, GPT-3.5

AutoRecruit

AutoRecruit is transforming the recruitment landscape. Our solution leverages AI to analyze interviews, predict candidate suitability, and generate unbiased candidate reports while automating manual tasks to expedite hiring.

AI Rebel

Auto-GPT, BabyAGI, OpenAI, Generative Agents, Whisper, LangChain, Monday AI Assistant

Multiagent

We will show you five different agents that we built:
1. AssemblyAI Agent
2. PandasAI Agent
3. Presentation Agent
4. README Agent
5. Webscraping generator Agent

RONG

OpenAI, Whisper, LangChain

Vocava

Your personal language tutor, powered by cutting-edge AI. Rather than relying on traditional, static methods, Vocava presents dynamic, context-based content tailored to the learner's interests and proficiency.

The Irrelevant Elephant

Cohere, Chroma, OpenAI, DALL-E-2, Whisper, Anthropic Claude

SwiftSearch

Swift Search: your ultimate YouTube video summarizer. Save time and effort by instantly generating concise summaries of any YouTube video. Extract key points and main ideas. Simplify your viewing experience and stay informed with Swift Search.

Codexai

Whisper, OpenAI, Anthropic Claude

Plutus

Start & Grow your Business Effortlessly via Speech Agent

Noisebridge AI

LangChain, Whisper, Anthropic Claude

WIM Whatd I Miss

Ask pointed questions about a given playlist and get back a summary, key points, and related timestamps generated via AI! 🤖

Vector

Anthropic Claude, BERT, Generative Agents, Whisper

Interview Assistant

AI Interview Assistant is an advanced AI system that conducts personalized interviews, provides targeted feedback, and optimizes performance over time based on user experience.

KIKIYO

Vercel, Redis, Whisper, Anthropic Claude

Language Tutor

A language tutor based on the latest second language acquisition research that provides conversational practice and comprehensible input to language students.

Ollie

OpenAI, Whisper, Anthropic Claude

FRAN AI Voice Aid for Psychology

FRAN: A 24/7 AI chatbot offering anonymous, personalized mental health support to students. Immediate, multilingual aid for stress, anxiety, sleep issues, and more.

FRAN

Redis, Whisper, GPT-4

Flow Genius

Don't miss the opportunity to elevate your business with Flow Genius, the ultimate conversational bot creation platform.

The Astro Cats-trophes

OpenAI, AI21 Labs, Whisper, Redis, LangChain

RubberDuck MVP

Have you ever felt like you couldn't retain all the information from a talk, class, or event? Introducing "Rubber Duck," your intelligent virtual assistant based on the concept of "rubber ducking"!

RubberDuck

Stable Diffusion, ChatGPT, Whisper

KlassNaut

KlassNaut is an AI-powered note generator for teachers. Using GPT-3.5 and Stability AI, it generates accurate notes and corresponding images for a convenient and high-quality learning experience.

iT Central

Stable Diffusion, Whisper, Redis, Vercel, GPT-3

DreamSteam

We developed a prototype that uses NLP and Stability.ai to regenerate your dreams and visualize them in a small booklet.

SteamDreamTeam

ChatGPT, Stable Diffusion, Whisper, GPT-3

joan-holloway

Video is not always the best format for educational content: it is not searchable, and watching takes longer than reading. We fixed this by having our knowledge management system turn videos into wiki pages on its own.

Joan Holloway

Whisper, ChatGPT, Redis, Cohere Embed

CareConnect

A smart assistant chatbot that educates teens with type 1 diabetes about their disease and how to manage it.

Health Hub

Whisper, LangChain, ChatGPT, Qdrant, GPT-3

Global Voice

Global Voice is a software tool that allows for the easy and accurate translation of videos and audio into multiple languages and dialects.

Night Owls

Whisper, ChatGPT, GPT-3

MULADIO

MULADIO enables its users to watch YouTube videos in their preferred language, removing the language barrier on the YouTube platform. MULADIO aims to increase the accessibility of YouTube content and expand the reach of content creators globally.

SEEKER

CHATTYRENTAL AI Room Assistant

FusionAI

Unify your media experience with ease! FusionAI is a platform for all your content needs.

FusionAI

Whisper, ChatGPT, DALL-E-2

Chattyrental

ChattyRental is an AI-powered platform revolutionizing room rentals with conversational booking, personalized recommendations, and intelligent search, streamlining agency operations and enhancing customer experiences.

Redis, ChatGPT, LangChain, Whisper, Vercel

PrepQuest

PrepQuest is an interview preparation app that helps job seekers boost their interview skills and land their dream job.

Team Vision

AI Alliance 4 Voice Analytics

Our seamless solution uses AI and LLMs to revolutionise the call centre quality assurance auditing process by providing smart, automated call summarisations, sentiment analysis, customer satisfaction evaluation, and insights to improve operators' performance.

AI Alliance

ChatGPT, Whisper, GPT-3

VOICE OUT AI Translation

Voice Out leverages the power of AI to revolutionize communication, providing accurate translations without the need for precise vocabulary or grammar. Our platform empowers seamless dialogue, bringing people together with ease.

Voice Out

Verbify

Imagine being able to translate any speech - your own or someone else's - into any language you desire within just a few minutes.

Verbify

Multilingual Voice Assistant

In short, it’s a multilingual voice assistant that helps reduce language barriers in everyday life and makes technology more accessible to people with disabilities.

Zion

Smart Notes Learn Better Fast

NoteMaster

Our platform uses artificial intelligence, GPT-4 and Whisper, to improve the academic performance of early college students by providing automatic transcription of lectures, summaries, personalized learning paths, and research and writing assistance.

Whisper, GPT-4

Clear Speak

Speech analysis SaaS product. Uses speech-to-text to identify areas for improvement in recorded audio. Gives real-time feedback and suggestions to reduce stuttering and enhance communication skills. Aimed at boosting confidence in communication.

Coders Legion

Whisper, ChatGPT, OpenAI gym, GPT-3

No code customer care bot

We have created a no-code platform that performs the responsibilities of a customer care executive: it figures out the problem with the product and takes action accordingly.

Clippy

Speechify

It's an AI-based audio-to-video converter that adds captions to your video based on your speech.

Night Owls

Room Booking AI Assistant

CHATTYRENTAL AI Room Rentals

Providing a personalized and efficient booking experience for room rental customers, addressing the industry's lack of seamless booking processes. With features like voice message and text support, we aim to stand out in a competitive market.

Redis, Whisper, GPT-3

MyQuiz AI

Introducing MyQuiz.AI - a trivia game that uses AI to generate custom questions based on your interests and skills. Simply use your voice to start a fun and challenging quiz journey.

Space Cats

ChatGPT, OpenAI gym, Redis, Whisper, GPT-3

Smart Lecture

Our app is designed to address common problems that students and learners face when trying to engage with lectures.

The bad batch

Health BOt

Develop a health app using ChatGPT API, Streamlit, speech-to-text, and Reddit to provide accurate health information to Ugandan users

BadaRama

ChatGPT, Whisper, Redis

QuizTube

QuizTube generates a multiple-choice quiz based on the content of a YouTube video. Users can enter the link to the video, and QuizTube will generate a quiz based on the content. With QuizTube, users can turn any YouTube video into an interactive quiz!

chatgpt5

ChatGPT, Whisper, DALL-E-2, Cohere Generate, Cohere Classify, Redis

WeCare Caretaker Assistant

“WeCare - The caretaker assistant” is an AI-based solution for agencies that provide caretaker services to parents looking for babysitters for their children.

TechnoCouple

Interview assistant

This project helps interviewers save time and work effectively.

Nathnenne

Curated Club

A subscription-based service that uses a personalized algorithm, ChatGPT API, and customer feedback to curate monthly deliveries of products tailored to the customer's individual interests and preferences.

TalkyAI

ChatGPT, Whisper, Codex

NotAlone

NotAlone is an app that provides a seamless writing environment to help individuals with dyslexia read and write better. Features include Whisper-based STT, a ChatGPT-based assistant, and a TTS service to make life easier for anyone struggling with dyslexia.

UncomplicateIT

Intelligent Health Assistant

Revolutionizing healthcare with a clinical AI solution that records and transcribes symptoms using Whisper and the ChatGPT API to guide patients on urgency, reducing delays in seeking medical attention and identifying potential illnesses at an early stage.

MASTERS

Whisper, ChatGPT, GPT-3

Translating Voices to Signs

This project aims to break down language barriers and empower the deaf and hard-of-hearing community. Leveraging the power of OpenAI and Whisper, we are developing an innovative solution that can translate speech into sign language in real-time.

Pac-Man

OpenAI gym, Whisper

AI Study Buddy

AI Study Buddy - an interactive AI study assistant helping you prepare for your exams or freshen up your knowledge on a topic of your choice. Enjoy generated summaries of video lectures, tailored exercises, and (hopefully soon) feedback.

Solorider881909

AI Rap Song Creation

Generate a rap song for any YouTube video by inputting the link, instrumentals, and rap style.

ProjectK AI

Whisper, DALL-E-2, GPT-3

AI video translator

Make any open-source video available to everyone, including people with disabilities. Help promote the conservation of endangered languages.

Tengri AI

PollyGlotica

Polly T. Glotica is a chatbot implemented on Telegram Messenger as an interactive language-learning tool. We communicate and discuss many aspects of the language by chatting over text and voice messages with Polly. Try now at t.me/PollyGloticaBot

MagnaLingua

ChatGPT, Whisper, Redis, GPT-3

Aura the GPT assistant

Using the ChatGPT API, we are creating a speech assistant. The idea is to record the user's speech and feed it into ChatGPT, which finds the answer; the answer is then converted back to audio for the user.

GptHelpLine

Cohere, Qdrant, Whisper, ChatGPT

CryptoCrypt

Our application encrypts speech input messages using OpenAI Whisper and multi-layer encryption codes generated by GPT-3. Customizable encryption algorithms and keys, a user-friendly interface, and easy code retrieval make it ideal for secure messaging.

team phoeniks

OpenAI gym, Whisper, GPT-3

Joan Holloway

The first enterprise assistant of the Holloway family, Joan is responsible for knowing everything in your company. https://joan.holloway.ai/

wiki search

Dripper News

Dripper News is an AI-powered personalized news feed.

Dripper News

Shinyonaika

Have you seen self-therapy apps that provide therapy in boring text with no interaction? Well, no more. Shinyonaika is a self-therapy app that gamifies therapy using game development and neural networks.

MAVERICKS

Cohere, Whisper

Vi chat

Vi-chat is an innovative AI assistant aimed at helping mothers connect with their autistic children by converting their voice into images easily understood by autistic children.

Clawcode

Whisper, DALL-E-2

AudioQuest

Creating Infinite Possibility Voice-Commanded Text-Based Adventures using Whisper and ChatGPT

Hackstreet Boys

Redis, Codex, Whisper, DALL-E-2, ChatGPT, Stable Diffusion, GPT-3

Miraa

Our app provides a fully digital package for our clients. We offer a range of services, including the creation of a logo, ads for social media platforms such as Facebook and Instagram, a website, and marketing videos.

DeepDream

AIYu

Supercharge your business operation by using AI technology.

AI.Yu: Supercharge your business operation

Reinforcement Learning, Redis, Whisper, GPT-3

Liquid LMS

Revolutionize your AI learning experience with Liquid Learning's dynamic LMS. Enjoy interactive courses and stay ahead of the game.

PlayFine

Whisper, DALL-E-2, GPT-3

TaskMate

A conversational chatbot designed to assist users in navigating and performing predefined tasks on various platforms.

TunisFeldberg

Whisper, Cohere Classify, Cohere Embed, Redis

LearnIt

Skip the tedious process of going through lengthy videos and papers: LearnIt saves time by helping users discard papers and videos that are less relevant to their field of interest, without having to go through them first.

Tetranator

ChatGPT, Codex, DALL-E-2, Cohere, AI21 Labs, Whisper, YOLOv7, GPT-3

I AM AI personalized ChatGPT

I-AM-AI is the intelligent personal assistant that's always there for you. Train it with your knowledge base, including Confluence, wiki, or Notion pages, and integrate it with Discord, Telegram, or your website.

Data Dreamers

MediFix

MediFix is an AI-powered assistant built on gpt-3.5-turbo (ChatGPT). It is designed to help people by suggesting preventive measures based on the symptoms they mention.

SeaSky

Whisper, ChatGPT, GPT-3

Spectra Mirror

Baby Boomers, the fastest growing demographic in the West, are experiencing the highest levels of social isolation among us. Spectra Mirror provides them with the much-needed benefits of AI voice assistance through something we all own, a mirror.

HackStreet Boys

Loqui

Loqui is an app where students can learn by having an interactive conversation with the historical figures they used to study in books.

delGrappa

ChatGPT, DALL-E-2, Whisper

FocusX

Hi, I'm FocusX, an intelligent bot helping people with attention deficit disorder to study in a more efficient way!

Inclusive Solutions

Whisper, DALL-E-2, GPT-3

WeatherVane

Use AI to build powerful presentations. A good presentation is critically important because it forms your audience's impression of your product. Making an impact means creating a sensory experience: presentations with videos and accompanying music will leave your audience mesmerised. Making good presentations takes time, and regardless of the context, time is always precious. Presenting ideas is always necessary, from starting small projects to guiding important decisions. Weather Vane uses AI to help produce maximum-impact presentations efficiently, saving vital time for professionals, students and collaborators everywhere. Time saved is value added.

Weather Vane

Project Infinite Gallery

Our idea was to use GPT-3 and DALLE together, feeding the results from one neural network into the prompt for the other. After some brainstorming we came up with what we call Project Infinite Gallery. It allows anyone to stroll through an infinite number of art pieces on a topic they like. GPT-3 generates painting descriptions and DALLE generates the corresponding pictures. Initially we planned to make the gallery literally infinite by adding new pieces as the user moves, but because of time limitations we stopped at one "corridor of a museum".

Three dimensional beings

Resume AI

When applying for jobs, one needs to create a perfect one-page resume with skills relevant to that specific job. Individuals (like the ones participating here) often have varied experience, so filtering it and creating a new resume for each job application can be tedious and time-consuming. Resume AI solves this in 3 simple steps: 1. Upload your master resume. 2. Paste the link to the job application. 3. Generate the perfect resume in less than 10 seconds.

Resume AI

Brainstorming

I had to learn how to use it first, so I followed a tutorial, but I couldn't fully learn how to do what I was trying to do just yet. I used a YouTube tutorial and the template that you all provide. I'll build the brainstorming assistant over time.

Beginners Luck

Voice to Entertainment Music

Voice to Entertainment - Music. Objective: To provide music based on voice command. Functionalities: The user goes to my website, clicks on a mic button and instructs what kind of music they want. Output is provided in mp3 form, which can be listened to for enjoyment and downloaded for use. Thanks: To the several Python APIs that I've leveraged for this, and, equally important, lablabai's very friendly staff and the developer tutorials. Concept, Programming and Integration: Muthukumaran Azhagesan, kumar.algate@gmail.com (http://www.autoshields.website)

Voice to Entertainment

EchoScape Ai

Why should only English speakers have all the fun? The AI revolution is something every human should have access to, so we built something for that. It's a voice-to-image generator that lets the user give audio input in their native language and generates an amazing image. Tech stack: React on the frontend and Flask on the backend. We used APIs from Whisper, Dalle and GPT-3, making the best use of each technology.

renesis

ScreenAIr

As an HR agency or recruitment professional, you know how time-consuming the recruitment process can be. ScreenAIr is here to help! This innovative tool can save up to 60% of the time typically spent on the recruitment process. With its advanced GPT-3 powered algorithms, ScreenAIr can quickly filter through resumes to find the most qualified applicants. By automating the initial screening process, ScreenAIr can significantly reduce the amount of time spent searching for new hires.

VulcAInts

NiFTy news

Newspapers are an outdated business model. NiFTy News (NN) gamifies the newspaper, adding freshness and interest. NN reads news through an API, computes the difference from the last run, generates images and then mints NFTs.

LocaLgHosT
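
The "difference from the last run" step in the NiFTy news entry can be sketched as below. This is a minimal illustration only: it assumes each news item is a dict carrying a unique `id` field, and the function name `new_items` is hypothetical rather than taken from the project.

```python
def new_items(current: list, seen_ids: set) -> list:
    """Return items whose 'id' was not seen in earlier runs, preserving order.

    `seen_ids` is updated in place so the next run only reports newer items.
    The 'id' field is an assumption about the shape of the news API response.
    """
    fresh = [item for item in current if item["id"] not in seen_ids]
    seen_ids.update(item["id"] for item in fresh)
    return fresh


seen = {"story-1"}
feed = [{"id": "story-1", "title": "Old"}, {"id": "story-2", "title": "New"}]
print(new_items(feed, seen))  # only story-2 is new on this run
```

Only the items returned here would need image generation and minting, which keeps repeated runs from producing duplicates.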

Distill AI Meeting Assistant

Distiller condenses information shared during meetings into bite-sized summaries and provides inspiration and actionable plans to drive projects forward productively. It transcribes long discussions into a searchable transcript, summarizes content into easily consumable forms, provides action items and follow-up questions to push the project forward, and generates metaphors and images to promote more brainstorming.

Headjackers

InterviewMe

Built with GPT-3, React, and Flask. As the job search becomes increasingly competitive and top companies are increasingly strict in their employee sourcing criteria, acing the behavioural interview is an often overlooked component of successfully rounding out your application. To provide a readily-available, continuously-improving, and convenient solution to preparing for behavioural interviews, we developed Interview.Me. With Interview.Me, users can generate behavioural interview questions pertaining to their companies and positions of interest with the click of a button. Users can simulate a real interview experience using the audio input feature, in addition to receiving feedback based on the questions and answers provided. Interview.Me makes the behavioural interview preparation more convenient than ever, so applicants can feel confident they're making the best impression.

BobaTalks

Youtube Video Summarizer

Some videos are too long but contain key information. There is no point in absorbing the entirety of them when you are looking for specific information. Our project takes in a YouTube URL, extracts the audio from the video, and generates a transcription. From the transcription, we build a results page that shows a summary, key points with clickable timestamps that jump to the matching place in an embedded YouTube player of the submitted video, and the full transcript at the bottom.

The Neural Networkers
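
The clickable timestamps described in the Youtube Video Summarizer entry can be illustrated with a short sketch. It assumes key points arrive as (seconds, text) pairs and that links use the `t` query parameter of a YouTube watch URL; the function names and the `VIDEO_ID` placeholder are hypothetical, and the project itself may seek inside its embedded player differently.

```python
from urllib.parse import urlencode


def format_timestamp(seconds: int) -> str:
    """Render a second offset as H:MM:SS, or M:SS when under an hour."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    if hours:
        return f"{hours}:{minutes:02d}:{secs:02d}"
    return f"{minutes}:{secs:02d}"


def timestamp_link(video_id: str, seconds: int) -> str:
    """Build a watch URL that starts playback at the given offset."""
    query = urlencode({"v": video_id, "t": f"{int(seconds)}s"})
    return f"https://www.youtube.com/watch?{query}"


# Hypothetical key points: (offset in seconds, summary text).
key_points = [(0, "Introduction"), (95, "Main argument"), (3725, "Summary")]
for offset, text in key_points:
    print(format_timestamp(offset), text, timestamp_link("VIDEO_ID", offset))
```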

Decked Out

Our solution is a web app that generates PowerPoint presentations given either a prompt about what the slides should cover or a summary of a specific topic. The purpose of this app is to make presentations quickly using the power of AI. Technology-wise, for the frontend client, we utilized Next.js, and for the backend server, we utilized Python's Flask. For the AI handling, we utilized ChatGPT3 to generate slide text and titles. Additionally, we used DALL-E 2 to create each image on the individual slides. Lastly, we used Vercel to host the frontend and Heroku to host the backend.

buildrs

kiwi video

Learn from videos with AI! Check out the live demo: https://kiwi.video

kiwi.video

IntelliDecor

There is a market niche for interior design services for empty rooms, as many people struggle to visualize multiple design options without physically placing furniture and decor in the space. Using AI and stable diffusion techniques, it is possible to create multiple design alternatives that can help clients better understand the potential of their empty room. This can save time and resources, as clients will not have to physically set up and take down multiple design configurations. By offering this service, interior design companies can meet a unique and unmet need in the market.

AI Mavericks

Lesson Plan Creator

To ignite creativity and learning with engaging experiences in the classroom, we designed an AI solution for creating content, activities, and ideas for dynamic lesson planning. A well-designed lesson plan helps students and teachers understand the goals of an instructional module. This allows the teacher to translate the curriculum into learning activities. Though lesson planning has its benefits, it is time-consuming. Our AI answer is to generate the content needed for a lesson plan without removing the flexibility for different teaching styles. Thank you for your time!

LA Hackers

TaleTeller

We built an interactive children's storybook generator using Whisper, DALL-E 2 and GPT-3.

TaleTellers

Creative Construct

Let your creative ideas grow with the smart AR app that helps you construct spatial mind maps. A system that inspires and fosters brainstorming. Through simple and intelligent text-based interactions, users' input and choices will SPROUT, ROOT and BLOOM the seeds of their imagination. The 3 main STEMS serve as smart tools to expand, extend and envision your ideas. SPROUT extracts keywords from the user's text input and lists relevant terms and concepts to expand on the user's ideation. ROOT extends the selected keyword and provides additional information. BLOOM generates imagery based on your idea. Creative Construct is an engine driven by AI (OpenAI's GPT-3 and DALL·E 2), empowering the flow and growth of creative thinking and brainstorming. It lets users puzzle over new inspirations or unfamiliar ideas in any physical setting through the lens of AR, and cumulatively construct a spatial mind map natural to each user's creative mind.

Textual Architecture

AI Enters the Kitchen

An AI-powered food buddy that takes in your food preferences and creates a bespoke recipe for you in real time. Not only that, it also creates delectable-looking images of how your final dish could look. The preferences it currently handles: list of ingredients, cuisine, flavor profile, allergies, time of meal, dietary restrictions, calories, and preparation time.

Pytorchbearers

SafeCall mobile app

The application is built on React Native + Python. It takes raw audio as input and performs speaker diarization using pyannote.audio. Then it uses Whisper to create a transcription of the call. The transcribed text is summarized by GPT-3 and analyzed by a blacklist algorithm that uses a list of words associated with popular scams. To improve algorithm performance we experimented with GPT, but only version 3.5 (ChatGPT) improved analysis quality. Since only GPT-3 was available through the API, we decided to wait before adding GPT to the algorithm. The summarized text, with a calculated scam probability, is sent to the user's relative.

Young Bulls
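
The blacklist step in the SafeCall entry can be sketched roughly as follows. The word list and the scoring rule (fraction of blacklist terms present) are illustrative assumptions only; the entry does not publish the project's actual list or how it calculates the scam probability.

```python
import re

# Hypothetical blacklist; the project's real word list is not given in the entry.
SCAM_TERMS = {"gift card", "wire transfer", "bank account", "urgent", "password"}


def scam_score(transcript: str, terms=SCAM_TERMS) -> float:
    """Return the fraction of blacklist terms found in the transcript (0.0 to 1.0).

    This is a crude stand-in for a probability, not a calibrated estimate.
    """
    text = re.sub(r"\s+", " ", transcript.lower())
    hits = sum(
        1 for term in terms if re.search(rf"\b{re.escape(term)}\b", text)
    )
    return hits / len(terms) if terms else 0.0


print(scam_score("This is URGENT: buy a gift card and read me the code"))
```

A score like this would be attached to the call summary before it is sent on; thresholds for alerting would need tuning on real calls.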

Ultimate Design

An OpenAI-powered tool that helps users generate interior designs effortlessly, with zero knowledge of prompt engineering. It eliminates the friction between discovery, consideration and purchase of interior design products, specifically furniture, as users can directly buy the products generated with Stable Diffusion from our partner manufacturers.

AIException

Prompt Profile Picture

This application is developed for social media, primarily for images. The idea behind this project is to offer a "Generate" option beside the usual "Upload". The project focuses on generating profile pictures and banners tailored to the social media platform the user wants them for. "Prompt Profile Picture" uses DALL-E 2, DALL-E mini and Streamlit.

OraOraOraOra....

Dalle and Whisper

Our application combines Streamlit with DALL-E's API and Gradio with Whisper. For DALL-E, the user gives the number of images they want and adds the corresponding image prompt or image description. When the user clicks the generate image button, the API generates the closest possible images to the given description. For the Whisper part, we used Gradio. It transcribes the speech or audio input from the user into text, which can then be used in other applications.

Geeks

Bibliotopia

Bibliotopia is a service that enables you to search books from your descriptions.

Impact

ezTutor App

With technological advancement and a growing pool of knowledge, it is getting difficult for learners to understand specific topics, videos, and audio at a fast pace, especially given the sea of information on Google. So we came up with the idea of ezTutor, which helps learners understand specific topics and evaluate themselves with AI-generated content and quizzes. In the ezTutor app, if you want to learn a topic, you can enter text and get results with examples, images and keywords. If you want to learn from a video, paste a YouTube video URL to get reading content as well as a summary of the topic, in case you are running out of time. Similarly, if you want to learn from audio recorded in a class, simply upload the audio; ezTutor will transcribe and summarize the content. You can also test your learning by attempting the quiz.

Deepfai

Book And I

An AI learning companion that answers questions and provides explanations and clarifications related to supplied literature for people with learning difficulties.

Book And I

Urecipy

Urecipy is a personal recipe notebook that strips food recipe YouTube videos down to just the recipe content, i.e. the ingredients used and the steps performed. It displays the result both as text on a recipe card and as audio (in case someone finds it hard to read the details). Users can add as many recipes from YouTube as they like and search them efficiently when the need arises.

Lone Warrior

Personal Elf

Personal Elf is an AI-driven application for recommending gifts. Have you ever struggled with choosing the most suitable gift for one of your close ones? Don't worry, from now on your Personal Elf has your back! This application is driven by OpenAI's models. We used the GPT-like, transformer-based DaVinci model to generate gift proposals and then visualized the proposed gifts with a diffusion model called DALL-E 2. We created a demo using Streamlit, which was also deployed using their service. We highly value inclusivity, so we designed our solution to be convenient for other occasions as well, such as birthdays, anniversaries or Hanukkah. Such flexibility makes our product more accessible to users, and it can be advertised accordingly throughout the year. While building our user base and staying afloat thanks to adverts, we want to enter into partnerships with various online shops. We believe we can provide them with great advertising by integrating their shops into our application and recommending that users buy on our partners' sites.

OpenGolem

SmartNotes

The goal of this project is to make organizing information simpler and to minimize the number of clicks/taps a user needs to save dates, to-dos and goals. It makes it easier for users to manage their life events and objectives by using AI to categorize tasks and building the UI based on tags.

GeekCafe

Forensic Sketch AIrtist

*Generate hyper-realistic forensic sketches* A SaaS that allows forensic sketch artists to improve the quality and speed of their work by creating a hyper-realistic baseline sketch of the criminal based on the witness description.

EagleAI

DocuSumm

DocuSumm: less words, more meaning. Our mission is to help people quickly and easily understand complex information by providing accurate and concise summaries of any type of document. We believe that by simplifying and organizing information, we can empower people and companies to make better decisions and achieve their goals. During the hackathon we created a website, a Chrome extension and an API to summarize YouTube videos, with the goal of adding more document formats in the future. Our tech stack includes OpenAI's Whisper and GPT-3, as well as Python, PHP, MySQL, Lighttpd, JavaScript, HTML and CSS.

DocuSumm

RavenAI GameDev Toolkit

Game developers often face the challenge of coming up with unique and engaging game ideas, as well as organizing their thoughts and design elements in a cohesive manner. We developed RavenAI GameDev Toolkit to tackle these challenges by harnessing the capabilities of OpenAI's GPT-3 and DALL-E 2. Brainstorming ideas for game design is now easy! Simply select what you want ideas for from the sidebar, and fill in the form with any ideas that you may have. Press the button and... done! Behind the scenes, your inputs are processed and then passed to the AI models to get the final results. The tools are designed to complement each other in order to create a cohesive vision in the end, but they can also be used individually.

Raven AI

OpenCode

Our goal at OpenCode is to provide users with solutions to their programming questions according to their needs. Opencode was developed to assist students in learning. OpenCode works with the OpenAI CodeX API. Our explainable AI collaborates with CodeX AI to provide systematic explanations of every line of code. To generate this code, we used the model code-cushman-001.

Data Smashers

Avocado

Avocado is a mobile application that guides gym beginners to train safely and get results. Avocado can help you in several ways: - Based on your exercises, it suggests what to add or remove to balance training across all muscle groups - Plans the best training plan for you, providing enough time to rest and recover - Tracks your progress and provides instant feedback on how to improve and prevent harm. To process video and audio information I use Whisper to get a transcript, and GPT to extract information and make recommendations. There are two tasks: - The first is converting video to exercises. We extract all exercises from the video along with all required information, including name, summary, steps, timecodes, involved muscles and movement types (pulling or pushing). - Then we use AI in a similar way to process audio recordings, which the user makes during the training session. We extract the number of repeats, weight, feel and harm. Then we use this to adjust the current session, prevent harm and make recommendations with the help of GPT.

TechDock

WebWizard

Our application allows users without coding experience to create webpages by using simple voice commands. We utilized the Whisper AI to convert voice to text, and Chat-GPT3 to generate HTML and render a webpage. Users can then iterate on their design by giving the AI follow-up commands until the desired webpage is developed. Users can then copy the HTML source code and use in their personal webpages. Our mission is to make webpage design more accessible for people without a technical background who are interested in creating their own webpages for personal or business needs.

AI on the Fly

Opinion Miner

The Opinion Miner crawls data from different videos on YouTube. I dedicate this project to informing enterprises of what their customers and reviewers say about their products on the Internet, with filters for video popularity and positive or negative opinion, so they can build better marketing strategies, improve products and improve customer satisfaction. Thanks to digitalization, customers now have more and more choices, as they can choose their favorite sellers regardless of location. Marketing strategies have shifted their focus from whether goods can be sold to customers' satisfaction and opinion; otherwise, customers will find other sellers.

tpn

Web Wizard

By offering a user-friendly and adaptable framework for website building, this project leverages text-davinci-003 and Whisper to create websites for people. This may be especially useful for individuals and small businesses that lack the technical skills or resources to create their own websites from scratch. To use the platform, a user would type or speak (via Whisper) through a natural language interface. The user would provide information about their website, such as the type of content they want to include and the design elements they prefer, and our application would use this information to generate a website based on pre-designed templates and customizations.

Code Jugadus

TrenchesAI

TrenchesAI is an AI-powered educational blog/forum. Users of the blog get access to quality AI-powered tools, articles and content related to tech and AI. It is a one-stop shop for newbies or anyone trying to break into the field of AI or harness the power of AI for themselves. The blog gives all users access to written content; premium users also get access to the AI-powered tools.

TRENCHES AI

HealerAI

A virtual AI psychotherapist that helps and consoles people who are depressed and mentally unstable. It speaks with them to draw them away from suicidal thoughts and gives them tasks to keep them engaged and joyful. It is a voice-to-voice AI.

OpenThunder

Logo Guessing Game

Our project is a guessing game in which the user attempts to guess brand names from the logos shown in a web app. The project will be built in Python using the Streamlit framework. OpenAI will be used to provide speech-to-text capabilities for our project through the Replicate API. We hope to target young kids with our project, teaching them proper pronunciation and speaking by having them play a guessing game with their voices. This project is therefore a small step towards what we hope to be able to do in the future: make learning fun and convenient, as games like this can be played by kids wherever and whenever.

Massive L

Customer Support GPT

Try it out: https://customer-support-gpt.vercel.app/ Founders and customer support representatives often get an influx of repetitive emails that could easily be automated using GPT-3. The challenge is that GPT-3's knowledge is general and lacks company-specific information. Enter Customer Support GPT: it uses the company's internal knowledge, such as frequently asked questions and articles, to give company-specific answers. If it doesn't know the answer, it flags the query to be answered by a human. How it works: To get company-specific responses, Customer Support GPT parses the company data and creates an index of it. When an incoming customer support query is presented, the index is searched for the most relevant result(s). If none are present, the query is flagged to a human reviewer. Otherwise the result(s) are passed into GPT-3 along with the incoming query to return a personalized, relevant response. Future work: integrate with Zendesk, Crisp, and other providers to pre-generate answers for real-world use, and improve the GPT-3 prompt to return results that are closer to how customer support agents communicate. Built by: Abdellatif - ex-Twitter engineer and founder of Tarteel.ai; Ahmed - 17-year-old hacker, creator of remail.ai. We're happy that you've read this far. If you'd like to use the tool or have any questions, please don't hesitate to reach out: abdellatif@remail.ai

Customer Support AI
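
The "search the index, else flag to a human" flow in the Customer Support GPT entry can be sketched as below. This is an illustration under stated assumptions: word-overlap (Jaccard) similarity stands in for whatever index the project actually uses, the 0.2 threshold is arbitrary, and `best_match` and `build_prompt` are hypothetical names. The call to GPT-3 itself is left out.

```python
import re


def tokenize(text: str) -> set:
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def best_match(query: str, documents: list, threshold: float = 0.2):
    """Return (document, score) for the closest document.

    Returns (None, score) when nothing clears the threshold, which is the
    signal to hand the query to a human reviewer.
    """
    q = tokenize(query)
    best_doc, best_score = None, 0.0
    for doc in documents:
        d = tokenize(doc)
        union = q | d
        score = len(q & d) / len(union) if union else 0.0
        if score > best_score:
            best_doc, best_score = doc, score
    if best_score < threshold:
        return None, best_score
    return best_doc, best_score


def build_prompt(query: str, context: str) -> str:
    """Combine the retrieved company text with the query for the language model."""
    return (
        "Answer using only this company information:\n"
        f"{context}\n\nCustomer question: {query}\nAnswer:"
    )


faq = [
    "How do I reset my password? Use the reset link on the login page.",
    "Refunds are issued within 14 days of purchase.",
]
doc, score = best_match("how do I reset my password", faq)
print(build_prompt("how do I reset my password", doc) if doc else "FLAG FOR HUMAN")
```

Returning the score alongside the match makes it easy to log borderline cases and tune the threshold later.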

AtYou

AtYou was created to extract transcripts from YouTube videos and summarize them. The summary helps people learn what a video is about in a short span of time, so they don't have to spend time watching it. It saves time, highlights the main points, and can also help with SEO by modelling main points to optimize recommendations. The app will also help people with hearing loss and non-native English speakers.

Sentient

Bookworm

Bookworm helps you generate an image of a person or place from a description of that person or place. It also helps you find the concept or key points of a book without reading the whole book, and extract key notes or concepts from audiobooks. You can listen to the result as audio or read it, and decide whether the book is the right one to start with without reading everything. This mainly helps children understand a story with visuals and reduces the time spent reading books.

The AI

AIShout

AIShout leverages Whisper and GPT-3 capabilities to complete your meeting experience. Indeed, during meetings and encounters, a reporter has the tedious job to write and summarize everything into an appropriate template. For meetings, we usually use a Minute Template provided by the company. Let AIShout be that reporter for you.

YodAI

The TimeSaving app

We built an app that listens to you speaking, transcribes it to text using Whisper, then generates a formal email based on what you said with the help of the GPT-3 model.

AIWiz

GistGen

Introducing GistGen - the ultimate study tool for students of all levels! Whether you're in middle school or college, GistGen has you covered with thousands of expertly-crafted questions tailored to your specific grade and subject. Simply choose your grade and subject, and GistGen will provide you with a wealth of practice questions to help you succeed. With GistGen, you can improve your understanding of the material, boost your test scores, and achieve your academic goals. GistGen also helps you understand the main points of an article quickly and easily. It uses advanced natural language processing techniques to automatically summarize the content of an article, giving you a concise and comprehensive overview of the material. Whether you're trying to keep up with the latest news or want to dive deeper into a particular subject, GistGen is the perfect tool for anyone looking to learn about new topics efficiently. With GistGen, you can stay informed and stay ahead of the curve, no matter how much information you have to process.

Code Crusaders

Image Generator

Image generation from text input using DALL-E 2. It can be used to generate art or pictures for printing, marketing, or even as inspiration for artists.

Non Zero

Summarize web service

This is a small web service project that allows you to upload an mp3 audio file or provide a YouTube link; the source audio then gets transcribed and summarized by OpenAI models. The project was realized as part of the OpenAI Whisper, GPT3, Codex & DALL-E 2 Hackathon together with colonelWalterKurtz and PioSikorski. The app was built using Python 3.10 with libraries such as Flask, openai, moviepy and pytube. The audio transcript is fed into the GPT-3 model in several pieces to ensure that the summary does not shorten and erase too much information. The prototype converts short videos efficiently; however, it takes significantly more time to process longer audio files because the requests to each model are slow. The project provides a proof of concept that could potentially be useful to many people who do not wish to spend much time listening to audio files such as podcasts and, if improved, could allow such a service to be delivered online.

Summar'z3 web service
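
Feeding a transcript to the model "in several pieces", as the Summarize web service entry describes, can be sketched like this. The word-count chunk size and the function names are assumptions for illustration, and `summarize` is a pluggable callable so that no particular OpenAI API call is implied.

```python
def split_into_chunks(text: str, max_words: int = 500) -> list:
    """Split text into consecutive pieces of at most `max_words` words."""
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


def summarize_in_pieces(text: str, summarize, max_words: int = 500) -> str:
    """Summarize each chunk separately and join the partial summaries.

    `summarize` is any callable taking a string and returning a string,
    for example a wrapper around a language-model request.
    """
    chunks = split_into_chunks(text, max_words)
    return " ".join(summarize(chunk) for chunk in chunks)


# Stand-in summarizer for demonstration: keep the first five words of each chunk.
def demo_summarizer(chunk: str) -> str:
    return " ".join(chunk.split()[:5]) + "..."


transcript = "word " * 1200
print(summarize_in_pieces(transcript, demo_summarizer))
```

Because each chunk triggers its own request, processing time grows with transcript length, which matches the slowdown on longer audio noted in the entry.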

PrintAi

Our idea for a Shopify Print on Demand app that uses DALL-E inpainting would be to create an app that allows users to add custom images to their products. The app would use DALL-E to personalize the images based on a description provided by the user. For example, a user might want to add a picture of their grandparent’s dog to a t-shirt. They could then add props like a “Santa hat” to the dog, and DALL-E would generate the image and add it to the t-shirt or any other POD product. This would allow users to easily customize their products with unique images without needing any design skills. Another use case for this could be using the DALL-E outpainting tool, where a user might upload a picture of their own dog and ask the app to put it in a certain environment, i.e. “space”. The app would significantly reduce the turnover time and processes of personalized POD (currently 1-3 days) and design costs (currently $1-$5/design). We would charge merchants just $1 for each actual purchase on their store that was made via PrintAi. Overall, the app would make it easy for personalized POD merchants to allow their customers to add custom, high-quality images to their products, making their online stores more unique and appealing to customers.

PrintAi

children of heaven

Children of heaven🌸🧒🏡 is a non-profit, education-oriented solution that uses AI to generate beautiful multilingual poems and relevant images. With its powerful GPT-3 language model, it can create unique and inspiring multilingual poems on a wide range of children's topics, and its DALL-E model creates images that perfectly complement each poem. We built audio input using Gradio and Whisper so that kids who have difficulty typing can give multilingual input. Give children of heaven a try and discover the magic of multilingual poetry and art. Whether you're a professional or a kid, this app is sure to spark your creativity and inspire you to create something beautiful.

Millennials

GPTBlogs

This open-source project was built to give bloggers flexible tooling for their content creation. It only takes 5-10 minutes to set up and is cheaper than using services that mark up the price of OpenAI. The tool is streamlined to create higher-quality content by guiding the user through a series of prompts.

Peaches

Talk to GPT Three

A Drag-and-Drop configurable solution for implementing conversational use of GPT-3 in Unity.

Dark Polarity AI

novela ink

Novela ink is your own personal AI assistant platform to create, modify and enhance your stories. With the power of OpenAI, it was possible to create a story writer that can really fit your needs. With minimal effort you can quickly create stories, books, and other creations. You don't need expensive graphic designers or copywriters. It also helps you find inspiration! And everything is tamperproof and immutable thanks to immudb, so you can't really lose your creations. Everything is in easy Markdown format and can be exported to PDF. Features: - Full books management - AI story completion with different setups - AI story completion in-place - Image generation for selected text - Image generation for summary - Inspirations - Time Travel - Immutability - All AI actions saved

ApproxTeam

MindMate

During the hackathon, we fine-tuned GPT-3 and built a self-analysis tool that helps one objectively assess their problem and develop new ideas for solving it. It can be used by people who can't access mental health care because of high prices and stigma. It is based on CBT and should be highly effective in the following cases: 1. A person has a problem and doesn't know how to solve it. For example, "I can't keep up with deadlines," or "My parents are overprotective." 2. A person can't make a decision. "Should I move?", "Should I accept an offer from a new company?" etc. 3. A person can't sort out their thoughts. "I can't understand why I'm so uncomfortable being a dad," "Why have I become so irritable?" etc. 4. A person wants to improve their relationship. "I'm so jealous," "We fight all the time," "I'm not happy with my wife. I cheated, and I feel guilty". In therapy, people who are objective about their situation and able to set specific goals tend to achieve better results. This tool does exactly that. A typical session consists of three parts: 1. Analysis. This part includes questions that make the person analyze various aspects of the situation and draw an objective picture. The essence of this part is the transition from an emotional to a rational perception of reality. 2. Empathy. It consists of a comprehensive generalizing statement aimed at supporting the client emotionally. 3. Decision. It consists of questions that allow the person to analyze the availability of resources and ways to solve the problem. Questions force the person to move from emotions to concrete steps toward the goal.

Elomia Health

Galaxy DO

Galaxy invites AI agents into collaboration on real-time whiteboards. It will provide an open marketplace where you can share a replica of your intelligence, trained on your messages and blog posts, and earn when it is invited to assist others in whiteboard collaboration. The very first Galaxy AI Agent is deployed and available for communication via my personal Telegram account: @galaxygur

Galaxy

Code Translation Demo

Hey everyone, this is a video of our OpenAI hackathon demo. The project combines the Whisper, GPT-3, and Codex APIs. The goal was to transcribe audio using Whisper, then return that text as a Python script, and lastly use Codex to translate that Python script into another programming language.

The Prompt Engineers

Summy Your AI Co Worker

Whether you're a student, a programmer, or someone who simply needs to make a summary or piece of code, Summy can help you! 1. Select a mode: text or code 2. Start recording 3. Stop recording 4. You will get a response depending on the mode you selected: - text: A summarization of the recording - code: A code snippet based on the recording It can help you in: - meetings - documentation - study notes - coding tool

Neurons

WordSense

People with hearing disabilities do not have the same autonomy as others. They are not able to interact to the same extent as those around them, and have limited freedom. WordSense is a hardware product that assists people with hearing disabilities in navigating daily life with tactile sensory feedback, specifically Haptic Touch. For a person with hearing disabilities, WordSense addresses the problems of not being able to passively interpret nearby conversations, having to face the speaker to read lip movement or sign language, not being able to multitask, and having tunnel vision due to the lack of sound as an indicator. WordSense eases the daily lives of people with hearing disabilities and gives them the power of autonomy.

WordSense

Navis

What I built: 1. YouTube-Sum. This tool gives you a short summary of any YouTube video in any language, so you do not have to spend time watching the whole video; you get the summary and the knowledge from the video in a matter of minutes. The summaries are clear and easy to understand. 2. TrendSum. This tool gives you a very short summary of the top trending news on any topic you enter in the search box, such as hacking, football matches, machine learning, or politics. The summary covers all the news on that topic. We provide personalized content so that users read facts, information, or knowledge matching their interests and absorb it in minutes, using ML models and personalized recommendations integrated into the Android application.

Navis

Chase The Language

Translation is necessary for spreading new information, knowledge, and ideas across the world, and for effective communication between different cultures. In spreading new information, translation can change history. So, we have used our expertise as computer engineers with different specialties to encourage more global communication among people of different cultural backgrounds. We used the pyttsx3, whisper, torch, os, streamlit, NumPy, sounddevice, scipy.io.wavfile, and wavio libraries to build our AI model and to handle all the requirements of our project. We also used IoT hardware, a Raspberry Pi, as the main handler for the project: it receives the user's voice, passes it on for processing, and then plays the translated voice through the speaker.

The Chasers

InvestogAId

Product Name: InvestogAId. Problem: With the rising cost of living and soaring inflation across the world, cash deposits are increasingly becoming worthless. Inflation is eating away at everyone's wealth, and people need to invest their money in something that will grow in value. However, the stock market is very volatile, and it is difficult for people to make informed decisions about where to invest their money. Solution: Using OpenAI Whisper and GPT-3, we are creating an automated transcription tool that will watch your favourite video about a stock trading strategy and implement it for you. This will allow you to make informed decisions about where to invest your money.

Blue

ChAI Food voice assistant

ChAI is a food voice assistant. ChAI receives an audio file with a description of what someone would like to eat and then uses Whisper, GPT-3, and a food API to create recommendations. These recommendations are divided into two categories. In the first category, the user receives a list of recipes that match their input. The second category is a list of restaurant dishes that fit their tastes. To achieve this, the front end is a web application made with Node.js, CSS, JavaScript, and HTML, in which we record audio describing what we would like to eat. We then use JavaScript to call the Whisper API to obtain the transcript. This transcript is passed to the back end, a Flask server written in Python, via an HTTP request. The request sends the transcript to the natural language processing server, which parses the text with GPT-3 and asks a series of questions about food-related items of interest. Finally, we use the answers provided by GPT-3 to call a food API that returns recipes, dishes, and restaurants related to the input queries.
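The last two steps of that flow (asking GPT-3 questions about the transcript, then turning the answers into a food API call) could look roughly like the sketch below. The question set, the parameter names, and the query-string format are all assumptions made for illustration; the project description does not say which questions or which food API were used.

```python
from urllib.parse import urlencode

# Hypothetical questions put to the language model about the transcript.
QUESTIONS = {
    "cuisine": "Which cuisine does the person want?",
    "ingredients": "Which ingredients are mentioned?",
    "diet": "Are any dietary restrictions mentioned?",
}

# Answers that carry no usable information are dropped from the query.
EMPTY_ANSWERS = {"", "none", "unknown"}


def build_prompt(transcript: str, question: str) -> str:
    """Build one question prompt about the transcript."""
    return f"Transcript: {transcript}\nQuestion: {question}\nAnswer:"


def build_food_query(answers: dict) -> str:
    """Turn the model's answers into a query string for a food API."""
    params = {}
    for name, answer in answers.items():
        cleaned = (answer or "").strip().lower()
        if cleaned not in EMPTY_ANSWERS:
            params[name] = cleaned
    # Sort so the same answers always give the same query string.
    return urlencode(sorted(params.items()))
```

Sorting the parameters keeps the query string deterministic, which makes responses easier to cache and compare.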

Uniandes

Guardians of Discord

Our project is a solution for videoconferencing platforms that counters threats to a healthy communicative environment, using Whisper as the main feature of the bot. We started with a modest Discord bot, but we believe this idea can scale and expand to many other platforms.

Sentient cookies

Butter

Butter is an AI-based integrated chatbot that uses specialized speech-to-text conversion to accurately output messages from live voice recordings for individuals who stutter, and to answer personal questions regarding stuttering. Butter uses the state-of-the-art Whisper model created by OpenAI to transcribe speech into written form and omit any unintended interruptions in the flow of speech. Our goal is to empower users with speech impediments and improve their access to communication.

Boss

SafeWord

SafeWord uses OpenAI's Whisper model and a CNN-based Speech Emotion Recognition (SER) model to determine whether to call the authorities based on sentiment.

Spaghetti

Phoenix Whisper

According to research by J. Birulés-Muntané and S. Soto-Faraco (10.1371/journal.pone.0158409), watching movies with subtitles can help us learn a new language more effectively. However, the traditional way of showing subtitles on YouTube or Netflix is not the best way to check the meaning of new vocabulary or to understand complex slang and abbreviations. We found that displaying dual subtitles (the original subtitle of the video and the translated one) improves the learning curve. In research conducted in Japan, the authors concluded that the participants who viewed the episode with dual subtitles did significantly better (http://callej.org/journal/22-3/Dizon-Thanyawatpokin2021.pdf). After understanding both the problem and the solution, we decided to create a platform for learning new languages with dual active transcripts. When you enter a YouTube URL or upload an MP4 file in our web application, the app produces a web page where you can view the video with a transcript running next to it in two different languages. We have accomplished this goal and successfully integrated OpenAI Whisper, GPT, and Facebook's language model in the backend of the app. At first we used Streamlit for the app, but it does not provide a transcript that automatically moves with the audio timeline, and it does not let us design the user interface, so we created our own full-stack application using Bootstrap, Flask, HTML, CSS, and JavaScript. Our business model is subscription-based and/or a one-time purchase based on usage. Our app isn't just for language learners. It can also be used by writers, singers, YouTubers, or anyone who would like their content to reach more people by adding different languages to their videos and audio.
Due to the limitations of the free hosting plan, we could not deploy the app in the cloud for now, but we have a simple website where you can take a quick look at what we are creating (https://phoenixwhisper.onrender.com/success/BzKtI9OfEpk/en).
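A dual active transcript of the kind described here needs two pieces: pairing each timed segment with its translation, and finding which row is active at the current playback time. The following is a minimal sketch under stated assumptions: segments arrive as `(start, end, text)` tuples sorted by start time, translations come as a parallel list, and the function names are hypothetical. It is not the app's actual implementation.

```python
from bisect import bisect_right


def pair_transcripts(segments, translations):
    """Combine timed segments with their translations into transcript rows.

    `segments` is a list of (start_seconds, end_seconds, text) tuples sorted
    by start time; `translations` holds one translated string per segment.
    """
    if len(segments) != len(translations):
        raise ValueError("each segment needs exactly one translation")
    return [
        {"start": start, "end": end, "original": text.strip(),
         "translated": translated.strip()}
        for (start, end, text), translated in zip(segments, translations)
    ]


def active_row(rows, playback_time):
    """Return the index of the row playing at `playback_time`, or None."""
    starts = [row["start"] for row in rows]
    # Last row whose start is at or before the playback time.
    index = bisect_right(starts, playback_time) - 1
    if index >= 0 and playback_time < rows[index]["end"]:
        return index
    return None
```

A front end could call the equivalent of `active_row` on every time update from the video player and highlight or scroll to the returned row.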

Phoenix

Luminous Decibels

Luminous Decibels: give a picture to your words. An easy way to generate a video for what you want to say, simple enough that someone who only knows how to fill in online forms can create an interesting video.

Akatsuki

Discord Voice Chat Bot

We have created a Discord bot with Python that listens to users in a voice call. When prompted by a command, it records the user's audio, transcribes the audio using OpenAI's Whisper, generates a response using GPT-3, converts the response to speech using the Uberduck API, and finally sends the response audio back into the Discord voice call. While we think there is room to improve our implementation, we think the project has quite a few uses, from voice call moderation to accessibility and more. We plan on continuing to develop the project to a more polished state, where it can be reliably used in other Discord servers.
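The record, transcribe, respond, and speak chain above can be sketched as a small pipeline with each stage injected as a function. This is an illustrative structure only: the class and parameter names are hypothetical, and the real Whisper, GPT-3, Uberduck, and Discord calls are deliberately left out so that each stage can be swapped or stubbed.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chain of stages; each callable stands in for an external service."""
    transcribe: Callable[[bytes], str]      # e.g. a speech-to-text call
    generate_reply: Callable[[str], str]    # e.g. a language model call
    synthesize: Callable[[str], bytes]      # e.g. a text-to-speech call

    def respond(self, recorded_audio: bytes) -> bytes:
        """Return reply audio for a recording, or b"" if nothing was said."""
        transcript = self.transcribe(recorded_audio).strip()
        if not transcript:
            # Skip the later, slower stages when the recording was silent.
            return b""
        reply = self.generate_reply(transcript)
        return self.synthesize(reply)
```

Keeping the stages as plain callables means the bot's command handler only needs to pass in the recorded bytes and play back whatever `respond` returns.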

The Picard Trio

Moriarty

According to research and statistics, hate speech has become a real issue in online communication, especially in online games and on live-streaming platforms where users are shielded by their anonymity. This phenomenon discourages a lot of people from using those platforms. With this project, our goal is to help existing voice communication platforms combat hate speech, harassment, and toxic behaviour. Our solution is to use each user's microphone to assess whether their speech is obscene, toxic, threatening, insulting, etc., using cutting-edge machine learning tools like Whisper and text-classification models. Our target audience is video game companies, live-streaming platforms, and social media. We believe our product can help them minimise hate speech in their communities and thus achieve a higher quality of service.

Biscoff

Web Bot

Our project uses the OpenAI Whisper and GPT-3 services, Flask, and React. Flask is used for the API and React for the front end. The user records a question with their microphone, which is then transcribed and answered by the GPT-3 module. We think that with further development this bot can reach real product level, with a high capacity for resolving user needs.

team team

RememberThis

The RememberThis app takes in an audio recording or voice note. The voice note is transcribed into text. A keyword is extracted from the text to categorise it. The keyword and text are uploaded to a Google Sheet.
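The categorisation step could be as simple as the sketch below, which picks the most frequent non-stopword in the transcribed note and builds the two-column row for the sheet. This is an assumption for illustration: the description does not say how the keyword is actually extracted, the stopword list and function names are hypothetical, and the Google Sheet upload itself is omitted.

```python
import re
from collections import Counter

# Small illustrative stopword list; a real one would be much longer.
STOPWORDS = {
    "the", "and", "for", "then", "more", "about", "with", "that", "this",
    "from", "have", "need", "will", "you", "are", "was", "but", "not",
}


def extract_keyword(text: str) -> str:
    """Pick the most frequent meaningful word as the note's category."""
    words = re.findall(r"[a-z']+", text.lower())
    candidates = [w for w in words if len(w) > 2 and w not in STOPWORDS]
    if not candidates:
        return "uncategorised"
    # most_common keeps first-seen order among words with equal counts.
    return Counter(candidates).most_common(1)[0][0]


def to_sheet_row(text: str) -> list:
    """Build the [keyword, text] row that would be appended to the sheet."""
    return [extract_keyword(text), text.strip()]
```

For example, `to_sheet_row("Buy milk and eggs, then more milk for the week")` yields a row whose keyword is `milk`.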

Whisper4lokal

BabelTube

YouTube has vast, high-quality educational content, but most of it is in English. This is a disadvantage for non-English speakers. BabelTube plans to democratize learning by enabling non-English speakers to generate subtitles for any video on demand and on the fly. It integrates directly with the YouTube web player as a Chrome extension and uses the same interface YouTube uses to display its subtitles, so the user experience is on par with YouTube's own subtitle display.

Autobot

HearO app

HearO is an app built to help people who experience some degree of hearing loss. HearO uses audio to generate ASL (American Sign Language) through various orders. The crucial component of the idea is the OpenAI Whisper API.

TATAR

Taleeq application

Our project “Taleeq” is a mobile phone application for children aged 6 to 9. The app helps children express their needs and feelings properly and fluently at the right time. Using speech recognition technology, the application converts the child's speech to text and compares it with a set of target words, which makes it easier for children to deal with people in different situations. All of this takes the form of an engaging game with multiple levels, where the child needs to collect points to unlock a new level.

Taleeq

voiceObot

Voice messages are becoming an increasingly common way to communicate. They are faster than typing, and they work when you can't talk in real time and a call isn't an option. But they also have downsides: you are often in a crowded place and unable to listen to voice messages, so what if you miss something important? Don't worry, we've got your back. During this hackathon, we developed a bot for the popular messenger Telegram that uses Whisper by OpenAI to transcribe voice messages. Just forward a voice message from a sender to the bot, and you will get a text transcription in seconds. It works for as many languages as Whisper supports. We hope that such a simple tool can help more people feel comfortable communicating.

UDL