ChorusOps is a voice-native dealflow orchestrator built for investors and enterprise teams who run deal discussions inside Discord voice channels. Instead of requiring manual notes or a switch to a CRM mid-call, ChorusOps listens, understands, and acts autonomously.

The system captures Discord voice audio (Opus, stereo, 48 kHz), downmixes it to mono, and streams it in real time to the Speechmatics WebSocket API for transcription with multi-speaker diarization, so every spoken sentence is attributed to its speaker automatically.

Transcripts are routed to Gemini 2.5 Flash, which acts as a multi-step planning orchestrator. Through function calling, Gemini maintains a persistent deal state (deal name, stage, team notes, market context, funding ask) across the entire conversation. When sufficient context has been gathered, Gemini autonomously dispatches a DEEP_ANALYSIS job to a Featherless serverless inference worker running an open-source LLM. The worker produces an investment scorecard (investment score, recommendation, strengths, and risks), which is automatically posted back to the Discord text channel and spoken aloud via Kokoro TTS.

The bot supports barge-in interruption: if a user starts speaking while the bot is talking, TTS playback stops immediately. Multi-guild isolation lets the system run across multiple Discord servers simultaneously. Slash commands (/join, /say, /status, /tts, /voice) provide a full text fallback interface.

ChorusOps targets the Agentic Workflows track: the agent plans its own steps, calls external tools, manages async multi-step tasks over time, and posts results without any human intervention, from first spoken word to final scored deal.

Tech stack: Discord.js, Speechmatics RT API, Gemini 2.5 Flash, Featherless LLM inference, Kokoro TTS, Express, TypeScript.
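
As an illustration of the stereo-to-mono downmix step in the audio pipeline described above, here is a minimal sketch in TypeScript. It assumes the Opus packets have already been decoded to interleaved 16-bit PCM (left, right, left, right, ...). The function name `downmixStereoToMono` is hypothetical and is not taken from the ChorusOps codebase. Averaging the two channels is one common downmix approach, and the project may do this differently.

```typescript
// Hypothetical helper (not from the ChorusOps source): averages interleaved
// 16-bit stereo PCM samples (L, R, L, R, ...) into a single mono channel.
// Assumes the Opus audio has already been decoded to PCM.
function downmixStereoToMono(stereo: Int16Array): Int16Array {
  // Each stereo frame holds two samples; a trailing unpaired sample is ignored.
  const frames = Math.floor(stereo.length / 2);
  const mono = new Int16Array(frames);
  for (let i = 0; i < frames; i++) {
    const left = stereo[2 * i];
    const right = stereo[2 * i + 1];
    // The average of two int16 values always fits in the int16 range.
    mono[i] = Math.round((left + right) / 2);
  }
  return mono;
}

// Usage: two stereo frames become two mono samples.
const monoSamples = downmixStereoToMono(new Int16Array([1000, 3000, -200, 200]));
console.log(Array.from(monoSamples));
```

The mono buffer is half the size of the stereo input, which also halves the bandwidth needed when streaming audio to a transcription service.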