AI app: Benchmarking Robustness in Agentic RAG Systems for the AI Agent Olympics Hackathon