speech-addition

Created by team Creatting Unstability on August 24, 2023

As mentioned earlier, we added the automatic-speech-recognition model by zuu, which we load through the pipelines provided by the transformers library. We expect speech recognition to make the stablecode module noticeably more accessible and easier to use. To cover accessibility in both directions, we also integrated the suno/bark text-to-speech (TTS) model. It generates natural, coherent speech from the text the module produces, which improves the overall user experience and adds a further layer of accessibility.
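As a rough illustration of how the two models could be wired up through transformers pipelines, here is a minimal sketch. It is not our actual integration code: the helper names (`load_pipelines`, `transcribe`, `speak`) are hypothetical, and the speech-recognition model ID is left as a parameter because it is not spelled out here. It assumes a transformers release recent enough to include the `text-to-speech` pipeline task.

```python
# Illustrative sketch only; helper names are hypothetical.
TTS_MODEL_ID = "suno/bark"  # the TTS model named in the text


def load_pipelines(asr_model_id, tts_model_id=TTS_MODEL_ID):
    """Build the ASR and TTS pipelines (weights are downloaded on first use)."""
    # Imported lazily so the helpers below work without transformers installed.
    from transformers import pipeline

    # asr_model_id is an assumption: pass the ID of the ASR model you use.
    asr = pipeline("automatic-speech-recognition", model=asr_model_id)
    tts = pipeline("text-to-speech", model=tts_model_id)
    return asr, tts


def transcribe(asr, audio_path):
    """Return the recognized text for an audio file."""
    # ASR pipelines return a dict with a "text" field.
    return asr(audio_path)["text"].strip()


def speak(tts, text):
    """Return (waveform, sampling_rate) for the given text."""
    # TTS pipelines return a dict with "audio" and "sampling_rate".
    out = tts(text)
    return out["audio"], out["sampling_rate"]
```

With both pipelines loaded via `asr, tts = load_pipelines("<your-asr-model-id>")`, a spoken prompt can be turned into text with `transcribe(asr, "prompt.wav")`, and a generated answer can be voiced with `speak(tts, answer)`. Keeping the pipelines as arguments makes it easy to swap either model without touching the rest of the code.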
