OpenAI Whisper AI technology Top Builders

Explore the top contributors showcasing the highest number of OpenAI Whisper AI technology app submissions within our community.

OpenAI Whisper

The Whisper models are trained for speech recognition and translation tasks, capable of transcribing speech audio into the text in the language it is spoken (ASR) as well as translated into English (speech translation). Whisper has been trained on 680,000 hours of multilingual and multitask supervised data collected from the web. Whisper is Encoder-Decoder model. Input audio is split into 30-second chunks, converted into a log-Mel spectrogram, and then passed into an encoder. A decoder is trained to predict the corresponding text caption, intermixed with special tokens that direct the single model to perform tasks such as language identification, phrase-level timestamps, multilingual speech transcription, and to-English speech translation.

General
Relese dateSeptember, 2020
AuthorOpenAI
Repositoryhttps://github.com/openai/whisper
Typegeneral-purpose speech recognition model

Start building with Whisper

We have collected the best Whisper libraries and resources to help you get started to build with Whisper today. To see what others are building with Whisper, check out the community built Whisper Use Cases and Applications.

Tutorials

Boilerplates

Kickstart your development with a GPT-3 based boilerplate. Boilerplates is a great way to headstart when building your next project with GPT-3.


Libraries

Whisper API libraries and connectors.


OpenAI Whisper AI technology Hackathon projects

Discover innovative solutions crafted with OpenAI Whisper AI technology, developed by our community members during our engaging hackathons.

Sara Palliative Care for The New Age

Sara Palliative Care for The New Age

Sara aims to render a better rendition of what we know as a "Chat" bot or conversational agent. It provides a visual and auditory component to an LLM that allows it to be accessible to the terminally-ill, Alzheimer's, children with cancer, long term hospital residents. While AI might still not be ready to make vital life decision such as intervening with dosages, it excels at tasks where it has instructions to follow, writing, generating content, and performing trivial actions such as retrieving an image, calling someone and so on. AI chat bots also excel at being compassionate, they're trained in a way that makes them friendly to humans. They're trained to engage you in a conversation, they can remember everything said, and when aided by retrieval, can become truly unleashed. Recent articles have shown that many use ChatGPT for companion, something it was mainly aimed towards, we believe empowering it with telecommunication mechanisms and specific aims such as being compassionate and keeping a conversation going can be something powerful. We aim our solution towards the palliative care sector, quality of life for the terminally-ill. In essence, our system can be deployed on almost any device and requires a mic only at this point. Accessible technology that doesn't require any advanced knowledge of how it works for it to be used can allow us to aim it towards the elderly and children. We also believe equality in healthcare can be achieved through technologies such as AI. We deployed our system to be ready for production via stable APIs, interconnected together they allow you to hear the "AI" you're talking to and talk back, we also added a sequence of actions it can take, that we believe can be expanded on in the future. Our very minimal proof of concept can be customized and personalized, on a person basis it can help individuals out of the box, at enterprise scale, we believe we can collaborate with healthcare professionals to take our system to the next level