AI app: CyberSecQwen-4B: CTI Specialist Fine-tuned on AMD for AMD Developer Hackathon

CyberSecQwen-4B: CTI Specialist Fine-tuned on AMD

Created by team athena19 on May 04, 2026

AMD ROCm, AMD Developer Cloud, Qwen3, HuggingFace Spaces, HuggingFace Hub

Fine-Tuning on AMD GPUs (Advanced / GPU-Intensive)

CyberSecQwen-4B is a 4B-parameter cybersecurity language model fine-tuned from Qwen3-4B-Instruct-2507 and trained end-to-end on a single AMD Instinct MI300X 192 GB instance. The entire pipeline (corpus assembly, LoRA fine-tuning, adapter merging, and evaluation) was completed on a single GPU.

Under the published evaluation protocol for Cisco Foundation-Sec-8B (arXiv:2504.21039), CyberSecQwen-4B scores 0.5868 on CTI-MCQ and 0.6664 on CTI-RCM, each a 5-trial mean at temperature 0.3. It exceeds Foundation-Sec-Instruct-8B on CTI-MCQ by 8.7 points at half the parameter count while staying within 1.9 points on CTI-RCM. All metrics were measured with our own harness under the same protocol.

The AMD MI300X stack performed well throughout. FlashAttention-2 was enabled during training because Qwen3-4B's 128-dimensional attention heads fit within the gfx942 LDS budget, delivering a 1.6× step-time speedup over sdpa. The pipeline runs inside the official vllm/vllm-openai-rocm Docker image with AITER kernels and hipBLASLt enabled. Uploads were also fast: the 8 GB merged model reaches Hugging Face in approximately 36 seconds at ~240 MB/s via the AMD Developer Cloud link.

Key methodological highlights include:

1. Decontaminated training data: An earlier internal run showed 72% test-set overlap in undeduplicated CTI corpora, so the released model trains exclusively on the 2021 CVE→CWE cohort with CTI-Bench overlap removed.
2. Direct SFT: Direct supervised fine-tuning outperformed knowledge distillation from a 20B teacher at the current corpus scale.
3. Multi-trial reporting: Results include standard deviations rather than single-trial numbers.

Recipe portability was validated by applying the same corpus and hyperparameters to a second model family, Gemma-4-E2B-it. The two models land within 0.9 points of each other on CTI-RCM (0.6664 Qwen vs 0.6754 Gemma), providing strong evidence that the result is recipe-driven rather than substrate-specific.
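The decontamination and multi-trial reporting steps described above can be sketched roughly as follows. This is a minimal illustration, not the team's actual pipeline: the description does not say how overlap with CTI-Bench was detected, so exact matching on normalized text is an assumption, as is the use of the sample standard deviation. The function names (`decontaminate`, `summarize_trials`) and all sample data are hypothetical.

```python
import hashlib
import re
import statistics


def normalize(text: str) -> str:
    """Lowercase and collapse every non-alphanumeric run to a single space."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def fingerprint(text: str) -> str:
    """Stable hash of the normalized text, used for set membership checks."""
    return hashlib.sha256(normalize(text).encode("utf-8")).hexdigest()


def decontaminate(train_items, benchmark_items):
    """Drop training items whose normalized text also appears in the benchmark.

    Assumption: overlap means an exact match after normalization. A real
    pipeline might use n-gram or fuzzy matching instead.
    Returns (kept_items, fraction_of_training_items_removed).
    """
    blocked = {fingerprint(item) for item in benchmark_items}
    kept = [item for item in train_items if fingerprint(item) not in blocked]
    removed = 1.0 - len(kept) / len(train_items) if train_items else 0.0
    return kept, removed


def summarize_trials(scores):
    """Return (mean, sample standard deviation) over repeated evaluation trials."""
    return statistics.mean(scores), statistics.stdev(scores)


if __name__ == "__main__":
    # Hypothetical records, for illustration only.
    train = [
        "CVE-0000-0001: buffer overflow in parser",
        "CVE-0000-0002:  SQL injection in login form",
        "CVE-0000-0003: path traversal in file upload",
    ]
    bench = ["cve-0000-0002: sql injection in login form"]
    kept, removed = decontaminate(train, bench)
    print(f"kept {len(kept)} of {len(train)} items, removed {removed:.1%}")

    # Illustrative per-trial accuracies, not the reported results.
    mean, std = summarize_trials([0.58, 0.59, 0.60, 0.58, 0.59])
    print(f"accuracy {mean:.4f} ± {std:.4f} over 5 trials")
```

Reporting the spread alongside the mean shows whether a gap between two models is larger than the trial-to-trial noise that sampling at temperature 0.3 introduces.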

Explore more applications

Thymus

Thymus is a lightweight, token-efficient hybrid router designed to maximize accuracy while minimizing token costs in multi‑task LLM pipelines. It dynamically routes user queries across local models and remote models hosted by LLM providers.

The Disappointer

HuggingFace Hub, LLaMA, AMD Developer Cloud

AI Classroom Edge Intelligence

A privacy-first classroom AI platform that routes sensitive work to local edge systems and eligible anonymized analysis to Fireworks AI, helping teachers make faster, safer instructional decisions even with unreliable internet.

AI Classroom Edge

AMD Developer Cloud, Qwen3, REST API, GitHub Copilot, Codex, ChatGPT

Taskly: Smart Multi-Model Task Router

Taskly classifies incoming tasks into 8 categories (including QA, math, code, and NLP) and routes each to the optimal Fireworks AI model with a tuned prompt, maximizing accuracy while minimizing token usage in a fully Dockerized pipeline.

RuntimeTerror

AMD Developer CloudAMD ROCm

router_007_v3

router_007_v2 is a Track 1 agent that records zero billable tokens: every answer is computed inside the container by a Qwen2.5-7B-Instruct model bundled in the Docker image and served in-process with llama-cpp-python.

roc_auc_half

Claude Code, ChatGPT, AMD ROCm, AMD Developer Cloud, Codex, Gemma, GPT-5, NVIDIA, Qwen3

Lexyprep

The agent conducts thorough legal research, drawing on my experience of winning legal cases. It understands your issue and advises on procedure.

Lexyprep - Do you have a case

AMD Developer Cloud, AMD ROCm, Codex, Gemma
