Discover Whisper Apps and Concepts

Browse all the AI applications built on Whisper. Explore PoC and MVP applications created by our community and discover innovative use cases for Whisper technology.

Whisper Hackathon Winners

Applications using Whisper that have been in hackathon finals.

About Whisper

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification. Learn more about Whisper by browsing the Whisper technology pages, where you can find all the Whisper resources such as tutorials, articles, videos, libraries, boilerplates, and more.

New Whisper Applications

Applications using Whisper

CryptoCrypt

Introducing our innovative Streamlit application, which harnesses the power of OpenAI GPT-3 to generate multi-layer encryption and decryption codes for secure communication. This application is designed to help users easily encrypt and decrypt their messages using state-of-the-art encryption techniques, making it nearly impossible for unauthorized parties to access their sensitive information. To use this application, users can input their speech message through OpenAI Whisper, which transcribes the message accurately. The application then uses GPT-3 to generate a multi-layer encryption code, which can be customized by the user according to their specific requirements. Once the encryption code is generated, it is applied to the speech message, making it indecipherable to anyone without the decryption code. Users can choose from a variety of encryption algorithms and key lengths, and can also input their own unique encryption key for added security. The application also allows users to save and retrieve their encryption codes for future use, making it easy to communicate securely with their contacts. In addition to its powerful encryption capabilities, the application is also highly user-friendly, with a clean and intuitive interface that allows users to easily navigate and customize their encryption settings. With its cutting-edge technology and ease of use, this Streamlit application is the perfect solution for anyone looking to communicate securely and confidently in today's digital world.

team phoeniks
OpenAI gym, GPT-3, Whisper

Miraa

Our app provides a fully digital package for our clients. We offer a range of services, including the creation of a logo, ads for social media platforms such as Facebook and Instagram, a website, and marketing videos. To enhance the quality of our videos, we use deepfake technology, which generates faces that are then placed onto the video to create a more engaging advertisement. To create the ads, we use two technologies, DALL-E and GPT-3: DALL-E generates the images, while GPT-3 generates the text. The logo is also created using DALL-E for the image and GPT-3 for the text under the image. For the website, we will use DALL-E for images and GPT-3 to code the website itself. Additionally, we will be adding automation to our app to streamline the entire process. Impact: Our app offers a comprehensive range of services that could have a significant impact on the market. The fields in which our app can be used include branding, digital marketing, web development, and video production. One potential way to use client data and image requests for further work is to analyze them to identify trends and patterns in the types of images that clients request. This can help us tailor our services to the specific needs and preferences of our clients. For example, if we notice that clients frequently request certain types of images or logos, we could focus on developing more options in that style. Overall, our app has the potential to make a significant impact on the market and attract a wide range of clients.

DeepDream
Redis, Codex, Whisper, DALL-E-2, ChatGPT, GPT-3, Stable Diffusion

Liquid LMS

The Problem: Traditional education has not changed much in the last century, and it fails to meet the diverse needs of students. One-size-fits-all teaching methods, outdated curricula, and limited access to resources often result in disengaged students who are unprepared for the workforce of tomorrow. The Solution: We propose a revolutionary approach to education that integrates AI and new technology. By leveraging the power of AI, we can create personalized learning experiences that cater to each student's unique needs, interests, and abilities. The Implementation: Our approach is built on three pillars: a. Adaptive Learning: Our AI-powered algorithms will analyze each student's performance data to create a customized learning path. This will help students learn at their own pace and achieve better learning outcomes. b. Immersive Learning: We will use virtual and augmented reality to create immersive learning experiences. This will enable students to explore complex concepts in a more engaging and interactive way. c. Collaborative Learning: We will facilitate collaborative learning by leveraging AI-powered tools that enable students to work together on projects and assignments in real time. The Benefits: Our approach to education will offer several benefits, including: a. Improved Learning Outcomes: Personalized and engaging learning experiences will help students achieve better learning outcomes and prepare them for the workforce of tomorrow. b. Cost-Effective: Our AI-powered approach to education will be cost-effective, as it will reduce the need for physical classrooms and expensive resources. c. Accessible: Our approach will be accessible to all students regardless of their location, socioeconomic status, or learning abilities. Our approach to education will revolutionize the way we teach and learn. By leveraging the power of AI and new technology, we can create personalized, engaging, and cost-effective learning experiences that prepare students for tomorrow.

PlayFine
GPT-3, Whisper, DALL-E-2

MediFix

MediFix is an AI-powered assistant that utilizes the latest technologies such as GPT 3.5, Whisper, and gTTS to provide users with valuable healthcare information. With its advanced capabilities, MediFix is able to analyze symptoms mentioned by users and provide them with preventive measures to help them stay healthy. One of the key features of MediFix is its ability to support both voice and text input. This means that users can either speak to the assistant or type their symptoms, making it accessible to a wide range of users. When users input their symptoms, MediFix uses GPT 3.5 technology to analyze the information and provide relevant information on the causes of the symptoms and possible preventive measures. The assistant is trained on a vast amount of medical data, allowing it to provide users with accurate and reliable information. In addition, MediFix also utilizes Whisper technology to provide a personalized experience for each user. By understanding the user's context and history, MediFix is able to provide customized recommendations and preventive measures that are specific to their needs. Finally, gTTS technology is used to deliver the information to the user in a clear and easy-to-understand manner. This ensures that users are able to comprehend and follow the recommendations provided by MediFix. Overall, MediFix is a powerful healthcare assistant that leverages the latest AI technologies to provide users with accurate and personalized healthcare information. With its support for both voice and text input, MediFix is accessible to a wide range of users, making it an invaluable tool for anyone looking to take control of their health.
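The flow described above (voice or text in, a language model answering, speech out) can be sketched roughly as follows. This is a minimal illustration and not MediFix's actual code: `run_assistant`, `AssistantReply`, and `PROMPT_TEMPLATE` are hypothetical names, and the three stages are passed in as plain functions so the sketch stays independent of any particular Whisper, GPT, or gTTS client.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical prompt wording; the real application's prompt is not given in the text.
PROMPT_TEMPLATE = (
    "A user reports these symptoms: {symptoms}\n"
    "List possible causes and preventive measures."
)


@dataclass
class AssistantReply:
    question: str               # text the user typed, or text transcribed from speech
    answer: str                 # text returned by the language model
    audio_path: Optional[str]   # file the spoken answer was written to, if any


def run_assistant(
    transcribe: Callable[[str], str],   # audio file path -> text (e.g. a Whisper wrapper)
    complete: Callable[[str], str],     # prompt -> answer (e.g. a GPT wrapper)
    speak: Callable[[str, str], None],  # (text, output path) -> writes audio (e.g. gTTS)
    text: Optional[str] = None,
    audio_in: Optional[str] = None,
    audio_out: Optional[str] = None,
) -> AssistantReply:
    """Run one turn: accept text or audio, query the model, optionally speak the answer."""
    if text is None:
        if audio_in is None:
            raise ValueError("provide either text or audio_in")
        text = transcribe(audio_in)  # voice input goes through speech-to-text first

    question = text.strip()
    if not question:
        raise ValueError("empty input")

    answer = complete(PROMPT_TEMPLATE.format(symptoms=question)).strip()

    if audio_out is not None:
        speak(answer, audio_out)  # text-to-speech is optional

    return AssistantReply(question, answer, audio_out)
```

In a real deployment the three callables would wrap the actual speech-to-text, chat-completion, and text-to-speech clients; injecting them also makes the flow easy to exercise with stubs.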

SeaSky
Whisper, GPT-3, ChatGPT

FocusX

Our application is designed to help individuals, especially those with concentration or mental health issues, to learn effectively. Leveraging advanced technologies like GPT-3, Whisper, DALL-E-2, and Python, with React Native on the front end, our application is unmatched in its ability to provide personalized learning experiences tailored to specific dysfunctions. With our app, users can access a variety of learning resources such as a to-do list, interactive exercises, and personalized quizzes. The app's intelligent algorithm also tracks the user's progress and offers personalized recommendations to help them learn more effectively. This approach ensures that users are engaged and motivated throughout their learning journey. One of the most unique aspects of our app is its ability to adapt to the specific needs of individual users. For example, if a user has a learning disability, the app will adjust the pace and difficulty level of the content to suit their needs. Similarly, for users with concentration issues, the app will provide techniques and exercises to help them stay focused. Our app is available on a freemium model for private use, while we also offer it for sale to schools, learning centers, and care facilities for people with disabilities. With these revenue streams, we aim to make our app accessible to everyone who needs it, regardless of their financial situation. In summary, our app is a game-changer for personalized learning, offering a unique and adaptive approach that is unmatched by any other application. With its ability to help those with mental health and learning difficulties, we believe our app has the potential to make a significant positive impact on the lives of millions of people. *Right now the app offers a to-do plan and helps you find important information in text files; more features are coming soon.*

Inclusive Solutions
GPT-3, Whisper, DALL-E-2

MindMate

During the hackathon, we fine-tuned GPT-3 and built a self-analysis tool that helps one objectively assess their problem and develop new ideas for solving it. It can be used by people who can't access mental health care because of high prices and stigma. It is based on CBT and should be highly effective in the following cases: 1. A person has a problem and doesn't know how to solve it. For example, "I can't keep up with deadlines," or "My parents are overprotective." 2. A person can't make a decision. "Should I move?", "Should I accept an offer from a new company?" etc. 3. A person can't sort out their thoughts. "I can't understand why I'm so uncomfortable being a dad," "Why have I become so irritable?" etc. 4. A person wants to improve their relationship. "I'm so jealous," "We fight all the time," "I'm not happy with my wife. I cheated, and I feel guilty". In therapy, people who are objective about their situation and able to set specific goals tend to achieve better results. This tool does exactly that. A typical session consists of three parts: 1. Analysis. This part includes questions that make the person analyze various aspects of the situation and draw an objective picture. The essence of this part is the transition from an emotional to a rational perception of reality. 2. Empathy. It consists of a comprehensive generalizing statement aimed at supporting the client emotionally. 3. Decision. It consists of questions that allow the person to analyze the availability of resources and ways to solve the problem. Questions force the person to move from emotions to concrete steps toward the goal.

Elomia Health
Codex, GPT3, DALL-E-2, Whisper

Phoenix Whisper

According to research by J. Birulés-Muntané and S. Soto-Faraco (10.1371/journal.pone.0158409), watching movies with subtitles can help us learn a new language more effectively. However, the traditional way of showing subtitles on YouTube or Netflix does not give us the best way to check the meaning of new vocabulary or to understand complex slang and abbreviations. We found that displaying dual subtitles (the original subtitle of the video and the translated one) immediately improves the learning curve. In research conducted in Japan, the authors concluded that the participants who viewed the episode with dual subtitles did significantly better (http://callej.org/journal/22-3/Dizon-Thanyawatpokin2021.pdf). After understanding both the problem and the solution, we decided to create a platform for learning new languages with dual active transcripts. When you enter a YouTube URL or upload an MP4 file in our web application, the app produces a web page where you can view the video with a transcript running next to it in two different languages. We have accomplished this goal and successfully integrated OpenAI Whisper, GPT, and Facebook's language model for the backend of the app. At first, we used Streamlit for the app, but it does not provide a transcript that automatically moves with the audio timeline, and it does not give us the ability to design the user interface, so we created our own full-stack application using Bootstrap, Flask, HTML, CSS, and JavaScript. Our business model is subscription-based and/or a one-time purchase, depending on usage. Our app isn't just for language learners. It can also be used by writers, singers, YouTubers, or anyone who would like their content to reach more people by adding different languages to their videos or audio.
Due to the limitations of our free hosting plan, we could not deploy the app to the cloud for now, but we have a simple website where you can take a quick look at what we are creating (https://phoenixwhisper.onrender.com/success/BzKtI9OfEpk/en).
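The idea of a dual transcript that follows the audio timeline can be illustrated with a small sketch. It is not the team's implementation: `DualCue`, `build_dual_transcript`, and `active_cue` are hypothetical names, and the sketch assumes timed segments shaped like dictionaries with `start`, `end`, and `text` keys, paired one-to-one with already-translated strings and not overlapping in time.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Mapping, Optional, Sequence


@dataclass(frozen=True)
class DualCue:
    start: float      # seconds from the beginning of the media
    end: float        # seconds; the cue is active on [start, end)
    original: str     # transcript line in the source language
    translated: str   # the same line in the learner's language


def build_dual_transcript(
    segments: Sequence[Mapping[str, object]],
    translations: Sequence[str],
) -> List[DualCue]:
    """Pair each timed segment with its translation, sorted by start time."""
    if len(segments) != len(translations):
        raise ValueError("each segment needs exactly one translation")
    cues = [
        DualCue(float(s["start"]), float(s["end"]), str(s["text"]).strip(), t.strip())
        for s, t in zip(segments, translations)
    ]
    cues.sort(key=lambda c: c.start)
    return cues


def active_cue(cues: Sequence[DualCue], t: float) -> Optional[DualCue]:
    """Return the cue to highlight at playback time t, or None during silence."""
    starts = [c.start for c in cues]
    i = bisect_right(starts, t) - 1  # index of the last cue starting at or before t
    if i >= 0 and t < cues[i].end:
        return cues[i]
    return None
```

A front end could call something like `active_cue` on every playback time update to decide which pair of lines to highlight; for long transcripts the list of start times would be precomputed once rather than rebuilt on each call.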

Phoenix
GPT3, Codex, Whisper