AI agent that benchmarks PyTorch/LLM workloads on AMD GPUs and recommends faster ROCm deployment settings.