Footer navigation

Unlocking state-of-the-art artificial intelligence and building with the world's talent

LinkedIn
Twitter/X
Instagram
Discord
YouTube
Twitch

Other group brands:

Links

AI Tech
AI Hackathons
AI Tutorials
AI Applications
NativelyAI
AI Articles
Leaderboard
Writers

lablab

About
Brand
Hackathon Guidelines
Terms of Use
Code of Conduct
Privacy Policy

Get in touch

Discord
Sponsor
Cooperation
Contribute
community@lablab.ai

© 2026 NativelyAI Inc. All rights reserved.

3.36.4

Help CenterBrowse FAQs and ask our AI.

Discord CommunityChat with mentors and the team.

Claim FREE $100

creditsAI Hackathons AI Apps AI Tech AI Tutorials AI Articles NativelyAI Sponsor

Home
App Discovery
Agent Surface

Agent Surface

Streamlit

Created by team ValIt on May 18, 2026

AgentOps AutoGen Codex

Agent Security & AI Governance - Veea

AgentSurface is a tool for testing the security of real AI agents that are exposed through HTTP JSON APIs. Instead of relying on mock agents or generic jailbreak examples, AgentSurface connects to an actual agent endpoint, injects adversarial prompts into a configurable JSON request field, sends real HTTP requests, and records the full evidence trail: masked request, raw response, extracted answer, finding type, risk score, and recommendations. The project focuses on practical risks in AI-powered products: prompt injection, system prompt or secret disclosure, private data exposure, unsafe tool/action compliance, BOLA/IDOR-style cross-user access, and authorization gaps in support, finance, trading, CRM, and marketplace agents. AgentSurface includes a Streamlit UI with three main workspaces: Attack Sets for creating reusable adversarial prompt sets, Run for configuring the real API target and launching scans, and History for reviewing previous runs, findings, raw evidence, JSON exports, and policy drafts. It can also generate a Lobster Trap YAML policy draft, helping teams turn some detected risks into proxy-layer mitigations when applicable. The main idea behind AgentSurface is to treat an AI agent as an attack surface, not just as a chatbot. It helps teams test whether their agent follows security and business rules under adversarial input, while keeping concrete evidence that developers can use to debug and fix the issue.

Category tags:

Github Presentation Demo

Explore more applications

BudgetBrain Track 1 Champion Agent

A compact AMD Hackathon Track 1 reasoning agent that reads task prompts, routes them by category, solves many cases locally, and uses Fireworks models only when needed to maximize accuracy while minimizing tokens.

Silver linings

Zero-Token Hybrid Routing Agent

A two-stage, hybrid routing architecture that intercepts queries using an optimized regex engine and a localized quantized GGUF model, escalating only complex logic and math tasks to the Fireworks API to maximize accuracy while minimizing token costs.

CuriousVJ

AntigravityDeepSeek R1Gemma

Ai Healthcare Data Analyst assistant

Ai pulse

LogiSecure AI-Autonomous On-Prem Logistics Copilot

Autonomous AI-powered logistics platform that monitors global supply chains in real-time, detects disruptions, and executes proactive responses while keeping 100% of confidential shipment data on-premise using AMD ROCm hardware.

NextGen Minds

Nexus: The Autonomous AMD FinOps Agent

AI agent that scans your CUDA infrastructure, maps it to ROCm using a verified ruleset, and generates a cost-grounded AMD migration plan — with a confidence score on every dependency, not a guess.

AMDXX

AMD Developer CloudAMD ROCmGemma

Valentin Gorbachev

Upcoming AI Hackathons
For Innovators & Creators

Explore more applications

BudgetBrain Track 1 Champion Agent

A compact AMD Hackathon Track 1 reasoning agent that reads task prompts, routes them by category, solves many cases locally, and uses Fireworks models only when needed to maximize accuracy while minimizing tokens.

Silver linings

Zero-Token Hybrid Routing Agent

A two-stage, hybrid routing architecture that intercepts queries using an optimized regex engine and a localized quantized GGUF model, escalating only complex logic and math tasks to the Fireworks API to maximize accuracy while minimizing token costs.

CuriousVJ

AntigravityDeepSeek R1Gemma

Ai Healthcare Data Analyst assistant

Ai pulse

LogiSecure AI-Autonomous On-Prem Logistics Copilot

Autonomous AI-powered logistics platform that monitors global supply chains in real-time, detects disruptions, and executes proactive responses while keeping 100% of confidential shipment data on-premise using AMD ROCm hardware.

NextGen Minds

Nexus: The Autonomous AMD FinOps Agent

AI agent that scans your CUDA infrastructure, maps it to ROCm using a verified ruleset, and generates a cost-grounded AMD migration plan — with a confidence score on every dependency, not a guess.

AMDXX

AMD Developer CloudAMD ROCmGemma

AI app: Agent Surface for Transforming Enterprise Through AI