Autonomous Research Agent for Multi-Hop Reasoning
AI agent built on Gemma-4-E4B, trained via GRPO Reinforcement Learning for advanced multi-hop reasoning. It dynamically searches the web, executes code, and synthesizes complex data into comprehensive reports while preserving vision capabilities.