
EthiHack — AI Security Red-Teaming Agent

Created by team ETHIHACK AI on May 16, 2026

LangChain, REST API, Anthropic Claude

Intelligent Reasoning, Enterprise Utility, Agentic Workflows

EthiHack is an autonomous AI security red-teaming platform that tests any LLM or AI agent for critical vulnerabilities before deployment. The system runs 20 adaptive attack chains covering the full OWASP LLM Top 10 and MITRE ATLAS frameworks, including Direct Prompt Injection, Indirect Prompt Injection, Tool Injection, Jailbreaks, Excessive Agency, Data Exfiltration, Remote Code Execution, Memory Poisoning, Privilege Escalation, and Role Confusion.

EthiHack is built as an autonomous agentic system powered by Anthropic Claude. It first fingerprints the target AI, then adapts every attack payload to the specific model and deployment context. Attacks run in real time and are streamed over Server-Sent Events (SSE); each result carries a CVSS 3.1 score, a business impact analysis, and auto-generated remediation code that an engineering team can deploy immediately.

In a live demo against MedBot AI, a medical chatbot with database tool access, EthiHack found 8 critical vulnerabilities in under 3 minutes. These included a CVSS 10.0 Remote Code Execution (the agent executed root shell commands), a Tool Injection that sent an unauthorized email to 47,832 users, and full exfiltration of database credentials. Final security score: 0/100 (CRITICAL).

The platform is fully production-ready: a FastAPI backend with async SSE streaming, a dark-mode dashboard UI, and a live demo hosted on Railway. It targets enterprise teams that need to validate AI safety before shipping agents into production workflows, turning what used to be a weeks-long manual audit into a 3-minute automated scan.
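To make the scoring and streaming steps concrete, here is a minimal sketch of how a finding could be given a CVSS 3.1 base score and serialized as one SSE frame. It is not EthiHack's actual code. The base-score formula and metric weights follow the public CVSS v3.1 specification, but the function names (`base_score`, `severity`, `sse_event`), the `finding` event name, and the payload fields are hypothetical. The vector used for the Remote Code Execution finding is also an assumption: the description reports only the 10.0 score, and this is simply one vector that produces it.

```python
import json
import math

# Base metric weights from the CVSS v3.1 specification.
AV = {"N": 0.85, "A": 0.62, "L": 0.55, "P": 0.2}
AC = {"L": 0.77, "H": 0.44}
UI = {"N": 0.85, "R": 0.62}
CIA = {"H": 0.56, "L": 0.22, "N": 0.0}
# Privileges Required weights depend on whether Scope is changed (True) or not.
PR = {False: {"N": 0.85, "L": 0.62, "H": 0.27},
      True: {"N": 0.85, "L": 0.68, "H": 0.5}}


def roundup(value):
    """Round up to one decimal place, using the integer method from the spec."""
    scaled = round(value * 100000)
    if scaled % 10000 == 0:
        return scaled / 100000.0
    return (math.floor(scaled / 10000) + 1) / 10.0


def base_score(vector):
    """Compute the CVSS 3.1 base score for a vector such as 'CVSS:3.1/AV:N/...'."""
    parts = dict(item.split(":") for item in vector.split("/"))
    changed = parts["S"] == "C"
    iss = 1 - (1 - CIA[parts["C"]]) * (1 - CIA[parts["I"]]) * (1 - CIA[parts["A"]])
    if changed:
        impact = 7.52 * (iss - 0.029) - 3.25 * (iss * 0.9731 - 0.02) ** 13
    else:
        impact = 6.42 * iss
    exploitability = (8.22 * AV[parts["AV"]] * AC[parts["AC"]]
                      * PR[changed][parts["PR"]] * UI[parts["UI"]])
    if impact <= 0:
        return 0.0
    total = impact + exploitability
    if changed:
        total *= 1.08
    return roundup(min(total, 10))


def severity(score):
    """Map a base score to the qualitative rating scale in the CVSS 3.1 spec."""
    if score == 0:
        return "None"
    if score < 4.0:
        return "Low"
    if score < 7.0:
        return "Medium"
    if score < 9.0:
        return "High"
    return "Critical"


def sse_event(event, payload):
    """Format one Server-Sent Events frame: event name, JSON data, blank line."""
    return f"event: {event}\ndata: {json.dumps(payload)}\n\n"


# Hypothetical finding; the vector is an assumed example that scores 10.0.
rce_vector = "CVSS:3.1/AV:N/AC:L/PR:N/UI:N/S:C/C:H/I:H/A:H"
score = base_score(rce_vector)
frame = sse_event("finding", {
    "attack": "Remote Code Execution",
    "vector": rce_vector,
    "cvss": score,
    "severity": severity(score),
})
print(frame, end="")
```

In a FastAPI backend, frames like this would typically be yielded from an async generator wrapped in a streaming response with the `text/event-stream` media type, so a dashboard can render each finding as it arrives.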

Category tags:

Developer Tools, Security


AI app: EthiHack — AI Security Red-Teaming Agent for AI Agent Olympics Hackathon