
Deriv Sentinel AI is a full-stack AI security platform (NestJS + Next.js + Llama 3.2 via Ollama) with three autonomous agents that form a self-improving defence loop. AI WAF — Multi-layered prompt firewall that classifies inputs as safe/suspicious/malicious, validates AI outputs for prompt leakage and PII, and adapts over time by learning from red team bypass patterns. AI Pentester — Runs 7 parallel security checks against any target: exposed .env files, outdated dependencies, Next.js CVE detection, missing security headers, CORS misconfiguration, information disclosure, and sensitive directory exposure. The LLM generates severity ratings and remediation steps for each finding. AI Red Team — Generates adversarial attacks across 7 categories (prompt injection, jailbreaks, encoding evasion, social engineering, etc.), fires them against the WAF, and records which ones bypass detection. The key innovation is the closed feedback loop: red team bypasses are automatically injected into the WAF's classification prompt, patching blind spots without human intervention. Every attack run makes the WAF stronger. All results persist to SQLite with a real-time dashboard featuring charts, activity feeds, and cross-module stats. An AI chatbot demonstrates the defence layers in a conversational context — every message passes through both input and output WAF validation.
7 Feb 2026