Our service/ application is an amalgamation of streamlit with DALL-E's API and gradio with Whisper. In the case of DALL-E the user needs to give the number of images he/ she wants and add the corresponding image prompt or the imager description. The API will then generate the closest possible images to the given image description when the use clicks on the generate image button. Talking about the Whisper part, we used gradio to implement it. Over here it is capable to translate the speech or the audio input by the user to the text. This text can further be used for several applications.