Top Builders

Explore the top contributors showcasing the highest number of app submissions within our community.

Speechmatics API

The Speechmatics API is the company's core speech-to-text service, providing batch file transcription and real-time streaming transcription via WebSocket. Powered by the Ursa 2 model (released October 2024), it supports 55+ languages and dialects, speaker diarization, automatic translation into 30+ target languages, and a suite of Voice Intelligence add-ons. Transcription requires no model fine-tuning; custom dictionaries of up to 1,000 words take effect immediately.

General
Release dateGenerally available; Ursa 2 model released Oct 2024
DeveloperSpeechmatics
TypeCloud speech-to-text API (batch and real-time)
LicenseCommercial API
Documentationdocs.speechmatics.com/speech-to-text
GitHubspeechmatics/speechmatics-python-sdk

Core Features

  • 55+ languages and dialects: broad multilingual support including accent and dialect variants.
  • Two accuracy tiers: Enhanced (optimized for accuracy) and Standard (optimized for speed and cost).
  • Speaker diarization: multi-speaker detection included at no extra cost in all plans.
  • Custom dictionary: up to 1,000 domain-specific words added without retraining.
  • Automatic translation: transcripts translated into 30+ target languages via AI.
  • Voice Intelligence add-ons: summarization, sentiment analysis, topic detection, chapter generation, and entity recognition.
  • Audio events detection: identifies non-speech events in audio.
  • Smart formatting: formats numbers, dates, currencies, and capitalization automatically.
  • Sub-1-second real-time latency: streaming transcription via WebSocket.
  • Flexible deployment: cloud API, on-premises, on-device, Docker, and Kubernetes.

Accuracy Benchmarks (Ursa 2)

MetricResult
WER on Kincaid46 (English)7.88% (surpasses human-level on that test)
WER improvement vs. previous Ursa18% reduction across 50+ languages
FLEURS dataset leadershipLeads in 62% of supported languages
Head-to-head vs. other providersWins 88% of comparisons

Pricing

TierIncludedRate
Free480 minutes/monthNo credit card required
ProUp to 6,000 hours/monthFrom $0.24/hour (with discount)
EnterpriseUnlimited scale, no rate limitsCustom

Volume discounts apply automatically above 500 hours per month per transcription type.


Tools and Resources


Ecosystem and Integrations

  • Integrates with LiveKit, Pipecat, and Vapi for voice pipeline deployments.
  • Available on Microsoft Azure Marketplace.
  • Compatible with on-device and edge deployments via Docker or Kubernetes.
  • Medical Model variant targets clinical transcription in English, German, Danish, and Norwegian.

Start building with the free tier (no credit card required) and explore the full API via docs.speechmatics.com.

speechmatics Speechmatics api AI technology Hackathon projects

Discover innovative solutions crafted with speechmatics Speechmatics api AI technology, developed by our community members during our engaging hackathons.

Apohara Synthex

Apohara Synthex

AI agents now run on the live web, but prompt injection is the number-one risk on the OWASP LLM Top 10, and most teams cannot prove what their agents ingested, or that it was safe. Apohara Synthex fixes that. Synthex is the provenance and security layer for the web data an AI agent consumes. It fetches across the full Bright Data spectrum: Web Unlocker, the Web Scraper API, SERP API, Scraping Browser, and the MCP Server. We didn't just use Bright Data; we improved it, contributing PR #140 upstream. Every fetch runs a layered defense before anything reaches a model. A deterministic regex pass and Qwen3Guard on Featherless form a high-recall net; NVIDIA's NemoGuard, selected by a measured benchmark, is the low-false-positive block gate; and a reasoning model on the AI/ML API knows the difference between describing an attack and executing one. Clean content is classified across four lenses, then sealed into an enterprise Evidence Report. The seal is real and shipped: an Ed25519 signature, an RFC 3161 DigiCert timestamp, an offline-verifiable Sigstore Rekor transparency log, and C2PA Content Credentials. Anyone can verify it in seconds with openssl, the industry's own c2patool, and a public ledger. No trust required. Cognee adds memory across re-scrapes, TriggerWare turns it into an automated monitor, and Kiro runs our continuous test and QA hooks. Synthex spans all three tracks, Security & Compliance, Finance & Market Intelligence, and GTM Intelligence, built for the CISO, CFO, compliance lead, and underwriter who need evidence they can defend to a board or a regulator. The average data breach costs 4.44 million dollars; Synthex seals an evidence artifact for a fraction of a cent. Everything signed, nothing trusted, and every number ships with a command to reproduce it.

EROS - External Reality OS

EROS - External Reality OS

EROS (External Reality Operating System) is a next-generation enterprise intelligence platform designed to help organizations understand and navigate the constantly changing external world. While companies have ERP systems for internal operations, CRM systems for customer relationships, and BI platforms for internal analytics, they lack a unified system capable of continuously monitoring, interpreting, and reasoning about external reality. Critical business signals such as competitor movements, supplier risks, market shifts, regulatory changes, pricing updates, technology adoption, and emerging opportunities already exist across the web, but they remain fragmented, unstructured, and difficult to operationalize. EROS solves this challenge by leveraging Bright Data's web intelligence infrastructure to collect, structure, and analyze public information at scale. The platform creates a living External Reality Twin for every monitored entity, including customers, prospects, vendors, suppliers, competitors, technologies, industries, and markets. Using a layered intelligence architecture, EROS transforms raw web data into evidence, evidence into signals, signals into events, and events into actionable business intelligence. The platform combines knowledge graphs, organizational memory, causal reasoning, pattern detection, and future-ready multi-agent intelligence to help organizations answer critical questions: What changed? Why did it change? How confident are we? What evidence supports this conclusion? What is likely to happen next? What action should we take? By turning the internet into a continuously updated intelligence layer, EROS enables sales teams to identify buying signals earlier, procurement teams to reduce supplier risk, security teams to detect external threats faster, and executives to make strategic decisions with real-time context. EROS transforms the web from a source of information into a system of enterprise intelligence.

Pulse — Alt-Data Demand Terminal

Pulse — Alt-Data Demand Terminal

Pulse turns the open web into a real-time read on competitive demand. Sales numbers are private, but the signals that precede them aren't: marketplace prices, review velocity, product assortment, and hiring all move on public pages before revenue does. Pulse pulls that data live and unblocked via Bright Data, then turns it into one interpretable, fully-cited number. We use seven Bright Data surfaces: the Web Scraper API across three datasets (Amazon Products, Amazon Reviews, and LinkedIn Jobs), plus the datacenter proxy, Web Unlocker, and the SERP API for both shopping and news. Those streams become time-series signals — review velocity, price index, discount depth, hiring momentum, assortment growth — which roll up into a transparent, z-weighted demand-momentum nowcast. Interpretability is the hero. The gauge breaks the score into signed drivers, and every driver is click-to-expand to its exact inputs and the real source URLs behind it — no black box. When a signal inflects (say a competitor cuts a flagship price), Pulse fires an alert. A graph memory tracks what changed across runs, and a grounded "ask" panel answers "what changed and why" with clickable citations plus an unverified flag that trips if the answer ever contradicts the computed signal. It's built honest and demo-safe: every data point is labeled real or simulated, outputs are framed as competitive-intelligence signals (not investment advice), and the whole app runs end-to-end with zero API keys via graceful fallbacks — adding keys only upgrades the floor to live integrations. The default universe tracks Anker against Belkin and UGREEN in consumer electronics on Amazon, but the company set, nowcast weights, and alert thresholds are all configurable.