LeakGuard: Catch Leaks Before Google Does

Created by team Leakguard on May 29, 2026

Anthropic Claude Bright Data SERP API Bright Data Web Scraper API Bright Data Scraping Browser

Security & Compliance

Secrets leak to paste sites every minute — API keys, database URLs, customer PII — and most of them sit there until Google indexes them and the abuse begins. LeakGuard is an autonomous agent that closes that window. How it works. A LangGraph pipeline runs six nodes end-to-end: Discovery uses Bright Data's SERP API (with the brd_json=1 parsing flag) to issue Google dorks against a hot-reloading watchlist; Extraction pulls raw paste content through Bright Data's Web Unlocker, bypassing the bot walls that block naive scrapers; a local regex Triage drops the obvious noise cheaply; an Analyst (Claude Sonnet, recall-tuned) flags anything that could be a leak; a Judge (Claude Sonnet, temperature 0, three-axis rubric) only escalates with a score ≥ 8; and Alert ships a redacted Slack notification with the audit reasoning attached. Why Bright Data is load-bearing. Paste-site discovery without SERP access is guessing; Web Unlocker is what makes xtraction actually work past Cloudflare and rate limits. Both zones are real and validated end-to-end. Safety. Credentials are redacted in two layers before any alert leaves the box. LangSmith tracing is off by default — it would otherwise ship the exact secrets the agent exists to catch to a third-party log store. A pre-commit detect-secrets hook guards the repo itself. What's built. Real pipeline (not stubs), per-node tests + smoke test, a Day-3 eval set of seeded pastes for regex tuning, a Streamlit dashboard reading the JSONL audit log, ADRs for the load-bearing decisions, and a synthetic mock server so demos don't burn the $250 SERP credit cap.

Category tags:

Security, Developer Tools

Github Presentation Demo

Explore more applications

Thymus

Thymus is a lightweight hybrid token-efficient router designed to maximize accuracy while minimizing token costs in multi‑task LLM pipelines. It dynamically routes user queries across local and remote models on LLM providers.

The Disappointer

HuggingFace HubLLaMAAMD Developer Cloud

AI Classroom Edge Intelligence

A privacy-first classroom AI platform that routes sensitive work to local edge systems and eligible anonymized analysis to Fireworks AI, helping teachers make faster, safer instructional decisions even with unreliable internet.

AI Classroom Edge

AMD Developer CloudQwen3rest apiGithub CopilotCodexChatGPT

Taskly: Smart Multi-Model Task Router

Taskly classifies incoming tasks into 8 categories (QA, math, code, NLP) and routes each to the optimal Fireworks AI model with a tuned prompt — maximizing accuracy while minimizing token usage in a fully Dockerized pipeline.

RuntimeTerror

AMD Developer CloudAMD ROCm

router_007_v3

router_007_v2 is a Track 1 agent that records **zero billable tokens**: every answer is computed inside the container by a Qwen2.5-7B-Instruct model bundled in the Docker image and served in-process with llama-cpp-python

roc_auc_half

Claude CodeChatGPTAMD ROCmAMD Developer CloudCodexGemmaGPT-5NVIDIAQwen3

Lexyprep

Agent knows how to make thorough legal research based on my experience of winning legal cases. It understands your issue and advises on procedure

Lexyprep - Do you have a case

AMD Developer CloudAMD ROCmCodexGemma

Janelle Tamayo

Upcoming AI Hackathons
For Innovators & Creators

Explore more applications

Thymus

Thymus is a lightweight hybrid token-efficient router designed to maximize accuracy while minimizing token costs in multi‑task LLM pipelines. It dynamically routes user queries across local and remote models on LLM providers.

The Disappointer

HuggingFace HubLLaMAAMD Developer Cloud

AI Classroom Edge Intelligence

A privacy-first classroom AI platform that routes sensitive work to local edge systems and eligible anonymized analysis to Fireworks AI, helping teachers make faster, safer instructional decisions even with unreliable internet.

AI Classroom Edge

AMD Developer CloudQwen3rest apiGithub CopilotCodexChatGPT

Taskly: Smart Multi-Model Task Router

Taskly classifies incoming tasks into 8 categories (QA, math, code, NLP) and routes each to the optimal Fireworks AI model with a tuned prompt — maximizing accuracy while minimizing token usage in a fully Dockerized pipeline.

RuntimeTerror

AMD Developer CloudAMD ROCm

router_007_v3

router_007_v2 is a Track 1 agent that records **zero billable tokens**: every answer is computed inside the container by a Qwen2.5-7B-Instruct model bundled in the Docker image and served in-process with llama-cpp-python

roc_auc_half

Claude CodeChatGPTAMD ROCmAMD Developer CloudCodexGemmaGPT-5NVIDIAQwen3

Lexyprep

Agent knows how to make thorough legal research based on my experience of winning legal cases. It understands your issue and advises on procedure

Lexyprep - Do you have a case

AMD Developer CloudAMD ROCmCodexGemma