Top Builders
Explore the top contributors showcasing the highest number of app submissions within our community.
Speechmatics
Founded in 2006 by Dr. Tony Robinson, a Cambridge University speech recognition pioneer, Speechmatics builds infrastructure to understand every voice globally. The company's core mission is inclusive, multilingual speech AI, covering transcription, real-time voice agents, and on-device deployment. Speechmatics serves enterprise clients across media, healthcare, financial services, and contact centers.
| General | |
|---|---|
| Company | Speechmatics |
| Founded | 2006 by Dr. Tony Robinson |
| CEO | Katy Wigdahl |
| Headquarters | Cambridge, United Kingdom |
| Website | speechmatics.com |
| Documentation | docs.speechmatics.com |
| GitHub | github.com/speechmatics |
| Type | Speech AI / B2B SaaS |
Core Products
Speechmatics API (Speech-to-Text)
The Speechmatics API provides batch and real-time transcription across 55+ languages, powered by the Ursa 2 model released in October 2024. It supports speaker diarization, custom dictionaries, automatic translation, and Voice Intelligence add-ons (summarization, sentiment analysis, entity recognition) with no retraining required.
Speechmatics Flow
Flow is a voice agent API that combines Speechmatics' speech-to-text with an LLM and text-to-speech into a single real-time pipeline. Developers connect through a single API call to build conversational AI agents with smart turn detection, interruption handling, and function calling support.
Developer Resources
Speechmatics provides SDKs for Python, JavaScript/TypeScript, and .NET, along with a developer portal and free tier to get started without a credit card.
Helpful Links
- Documentation — official API reference, quickstarts, and integration guides
- GitHub — open-source SDKs and client libraries
- Developer Portal — API key management and usage dashboard
- Pricing — free tier and pay-as-you-go rates
Key Features
Multilingual transcription across 55+ languages Speechmatics' Ursa 2 model leads accuracy benchmarks in 62% of supported languages on the FLEURS dataset, with 7.88% WER on Kincaid46 for English, surpassing human-level accuracy on that benchmark.
Flexible deployment Speechmatics runs on private SaaS cloud, on-premises, on-device, and via Docker or Kubernetes, making it suitable for data-sensitive industries like healthcare and finance.
Voice Intelligence add-ons Summarization, sentiment analysis, topic detection, chapter generation, and entity recognition layer on top of transcription without requiring additional integration work.
Use Cases
Contact center automation Real-time transcription and sentiment analysis during calls, combined with Flow for automated voice agent handling of common queries.
Clinical transcription Speechmatics' Medical Model (launched 2024) targets 93% real-time accuracy and 96% medical keyword recall for English, German, Danish, and Norwegian.
Media and broadcast Batch transcription of audio and video files for subtitling, archiving, and content search across multiple languages.
speechmatics AI Technologies Hackathon projects
Discover innovative solutions crafted with speechmatics AI Technologies, developed by our community members during our engaging hackathons.




.png&w=3840&q=75)
