VideoLangua: Bridging the Language Divide through AI-Powered Video Translation

English dominates online content: more than 60% of digital content is in English. This bias is also reflected in the training of large language models, where English remains the primary language. Yet 75% of the world's population does not speak English, which creates a significant information divide. For non-English speakers, the cost of translation is prohibitive. For instance, professional video subtitling can range from $0.18 to $0.25 per word, according to Marshub.

VideoLangua addresses this challenge by offering low-cost, accurate AI translation services that bridge language barriers and make information more accessible to non-English speakers.

Our solution uses a suite of large language models and specialized agents for efficient, culturally sensitive, and accurate video translation:

1. Speech-to-Text Conversion: We use Whisper to transcribe the audio, including timestamps.
2. Translation: We call Llama 3.1-70B-Chinese and Llama 3.1 70B to translate the transcript. To check translation accuracy, Llama 3.2 3B, which supports eight languages, then reviews the translation.
3. Subtitle Generation: The translated text is synced back to the video’s timeline as subtitles.
4. Optional Dubbing: For users who need a voiceover in their native language, we offer optional dubbing, which is well suited to video files.

With the global translation service market valued at $40.95 billion in 2023 and projected to reach $41.78 billion in 2024, VideoLangua is well-positioned to capture a share by offering a high-quality, affordable alternative to traditional video translation services. Unlike typical text-based translation tools, VideoLangua focuses on video and audio translation, combining affordability with quality. Our solution uses Llama models fine-tuned for different languages, allowing users to easily access translation and dubbing features in multiple languages.
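To make the subtitle step of the pipeline concrete, here is a minimal Python sketch of how timestamped transcript segments could be translated and written out as SRT cues. It is an illustration under stated assumptions, not VideoLangua's actual implementation: the segment fields `start`, `end`, and `text` are assumed to follow the shape the open-source Whisper package returns, and `translate` is a hypothetical stand-in for the Llama translation and review calls.

```python
from typing import Callable, Dict, List


def format_timestamp(seconds: float) -> str:
    """Render a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: List[Dict], translate: Callable[[str], str]) -> str:
    """Translate each timed segment and emit SRT cues that keep the original timing.

    `segments` is assumed to be a list of dicts with `start`, `end` (seconds)
    and `text` keys. `translate` is a hypothetical callable standing in for
    the model-backed translation step.
    """
    cues = []
    for index, seg in enumerate(segments, start=1):
        translated = translate(seg["text"].strip())
        cues.append(
            f"{index}\n"
            f"{format_timestamp(seg['start'])} --> {format_timestamp(seg['end'])}\n"
            f"{translated}\n"
        )
    # A blank line separates consecutive cues in an SRT file.
    return "\n".join(cues)


if __name__ == "__main__":
    # Hand-written sample segments; a real run would take them from the transcription step.
    demo_segments = [
        {"start": 0.0, "end": 2.5, "text": " Welcome to the course."},
        {"start": 2.5, "end": 5.0, "text": " Let's get started."},
    ]
    # Identity function used as a placeholder translator for the demo.
    print(segments_to_srt(demo_segments, translate=lambda text: text))
```

Passing the translator in as a callable keeps the timing logic independent of whichever model performs the translation, so the same function could serve each target language.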