Spec6: The enterprise intelligence operator

Created by team Unknown wrinkle on May 31, 2026
Finance & Market IntelligenceGTM IntelligenceSecurity & Compliance

Spec6 uses Speechmatics in two distinct but connected ways: first as the live speech interface for the product, and second as a transcription engine for “spoken web” intelligence. That split is important, because this is not just “we added voice chat.” In the code, Speechmatics sits both at the front door of the system, where a user talks to Spec6 in real time, and deeper in the intelligence pipeline, where audio and video from the web can be converted into analyzable evidence. The live voice side starts in the frontend voice assistant at [/Users/nadhi/Desktop/win-win/src/frontend/react/chat/voice-assistant.tsx](/Users/nadhi/Desktop/win-win/src/frontend/react/chat/voice-assistant.tsx). The browser captures microphone input with `getUserMedia`, then converts the audio into the format Speechmatics expects: 16kHz mono PCM16. That matters because the browser does not just upload a blob after the user is done. It streams audio continuously in small chunks, which makes the experience feel like an actual assistant instead of a push-to-record form. The comment in that file describes the full loop clearly: mic input becomes PCM16, that stream goes to `/api/voice/transcribe/ws`, partial and final transcripts come back, and the final transcript is fed into the normal Spec6 agent loop. So the voice layer is not a separate AI product. It is an alternate input mode into the same reasoning engine that powers the rest of the app.

Category tags: