This is a small web service that lets you upload an MP3 audio file or provide a YouTube link; the source audio is then transcribed and summarized by OpenAI models. The project was built as part of the OpenAI Whisper, GPT3, Codex & DALL-E 2 Hackathon together with colonelWalterKurtz and PioSikorski. The app was written in Python 3.10 with libraries such as Flask, openai, moviepy and pytube. The audio transcript is fed into the GPT-3 model in several pieces so that the summary does not shorten the content too aggressively and drop too much information. The prototype converts short videos efficiently, but longer audio files take significantly more time to process because the requests to each model are slow. The project is a proof of concept that could potentially be useful to the many people who do not wish to spend much time listening to audio such as podcasts, and with improvements it could be offered as an online service.
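The piece-by-piece summarization described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the function names `split_transcript` and `summarize_in_pieces`, the word-based splitting, and the 500-word default are all assumptions. The model call is passed in as a plain callable so the sketch does not assume any particular openai API.

```python
from typing import Callable, List


def split_transcript(text: str, max_words: int = 500) -> List[str]:
    """Split a transcript into consecutive pieces of at most max_words words.

    Hypothetical helper: the original project may split differently
    (for example by tokens or by sentences).
    """
    if max_words < 1:
        raise ValueError("max_words must be at least 1")
    words = text.split()
    return [
        " ".join(words[i:i + max_words])
        for i in range(0, len(words), max_words)
    ]


def summarize_in_pieces(
    text: str,
    summarize: Callable[[str], str],
    max_words: int = 500,
) -> str:
    """Summarize each piece separately and join the partial summaries.

    `summarize` stands in for the request to the language model; it is
    an assumption of this sketch, not a real library function.
    """
    pieces = split_transcript(text, max_words)
    return "\n".join(summarize(piece) for piece in pieces)
```

In a real deployment `summarize` would wrap the request to the model. Since each piece costs one request, a longer transcript means more requests, which fits the slowdown on long audio files noted above.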