gowthamsource976 | Hackathon Competitor Profile

🤓 Latest Submissions

Visual Agents — Autonomous AI Computer Control

Visual Agents is an autonomous AI system that goes beyond chatbots: an agent that sees your screen, understands your intent, and completes complex tasks across any application on your OS, not just browsers.

WHAT IT DOES

Unlike browser-only or sandboxed tools like Comet or Claude Computer Use, Visual Agents operates across your entire operating system: Chrome, Excel, Photoshop, Slack, Terminal, SAP, desktop apps, internal enterprise tools, anything visible on screen. Give it a real-world instruction like "Pull last quarter's sales data from our ERP system, cross-reference it in Excel, build a summary chart, then email the final report via Outlook to the leadership team" and it plans every step, switches between apps, reads live UI state, handles errors mid-task, and delivers the result. No APIs. No plugins. No scripts.

THE ARCHITECTURE: SEE, THINK, ACT, REMEMBER

• SEE: Gemini Live API streams real-time screen capture. OmniParser and SOM visual grounding identify interactive elements with pixel-level precision across any UI, any app, any OS state.
• THINK: A Task Planner powered by Gemini breaks goals into executable steps using state-aware planning (OSCAR-inspired), detecting failures and replanning autonomously without human input.
• ACT: The Action Executor performs clicks, typing, scrolling, app-switching, and keyboard shortcuts, verifying each step with a screenshot taken after the action.
• REMEMBER: A hierarchical memory system stores successful action trajectories, so the agent can reuse what worked on earlier tasks.

KEY HIGHLIGHTS

• Full OS control, not just browser automation
• V4 Mode: SOM grounding, trajectory memory, adaptive replanning, Gemini Live voice
• Real-Time Voice: Speak your task, no typing required
• Privacy-Aware: Never stores credentials or sensitive data

TECH STACK

Gemini Live API, Gemini 3 Pro, OmniParser, PyAutoGUI, MSS, PyAudio, Python 3.11

Open-source under MIT license. The age of manual computing is ending.
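A minimal sketch of how the SEE, THINK, ACT, REMEMBER loop described above might fit together. Everything here is an assumption for illustration: the names `Step`, `TrajectoryMemory`, and `run_task`, the callback signatures, and the replanning limit are hypothetical and are not taken from the Visual Agents codebase. Screen capture, visual grounding, and the Gemini planner are replaced by plain callables so that the control flow can run on its own.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Step:
    """One planned action, e.g. Step("click", "Send button"). Hypothetical type."""
    action: str
    target: str


@dataclass
class TrajectoryMemory:
    """REMEMBER: keeps the step sequence of each goal that completed successfully."""
    trajectories: Dict[str, List[Step]] = field(default_factory=dict)

    def recall(self, goal: str) -> Optional[List[Step]]:
        stored = self.trajectories.get(goal)
        # Return a copy so the caller can consume it without changing memory.
        return list(stored) if stored is not None else None

    def store(self, goal: str, steps: List[Step]) -> None:
        self.trajectories[goal] = list(steps)


def run_task(
    goal: str,
    see: Callable[[], object],                  # SEE: returns the current screen state
    plan: Callable[[str, object], List[Step]],  # THINK: remaining steps for the goal
    act: Callable[[Step], None],                # ACT: performs one step
    verify: Callable[[Step, object], bool],     # checks the screen after the step
    memory: TrajectoryMemory,
    max_replans: int = 3,                       # assumed limit, not from the source
) -> Optional[List[Step]]:
    """Run one goal through the loop; return executed steps, or None on failure."""
    # Reuse a remembered trajectory if one exists, otherwise plan from scratch.
    steps = memory.recall(goal) or plan(goal, see())
    done: List[Step] = []
    replans = 0
    while steps:
        step = steps.pop(0)
        act(step)
        if verify(step, see()):
            done.append(step)
            continue
        # The step did not take effect: replan from the current screen state.
        replans += 1
        if replans > max_replans:
            return None
        steps = plan(goal, see())
    memory.store(goal, done)
    return done
```

In this sketch `plan` is assumed to return only the steps that are still outstanding for the current screen state, which is what lets a failed step be retried without repeating the ones that already succeeded.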

19 May 2026

Code2Paper for IBM Bob + DocSync MCP

Modern AI-assisted development is rapidly shifting toward coding agents and autonomous workflows, but current AI systems still suffer from a major structural limitation: their knowledge becomes outdated faster than the ecosystem evolves. During development, I repeatedly observed coding agents generating deprecated SDK integrations, obsolete model references, and outdated API patterns even after explicit instructions were provided. For example, when instructed to use the latest Gemini SDK patterns and models such as gemini-3.1-flash-lite, many coding assistants still reverted to older implementations like gemini-1.5 or deprecated SDK syntax. The issue was not reasoning capability; it was the static nature of LLM training data versus the rapidly evolving AI ecosystem.

To solve this, I built DocSync MCP, a real-time documentation intelligence system for IBM Bob. DocSync continuously scrapes official SDK documentation, indexes it into a vector database, retrieves live implementation patterns, and exposes them through MCP tools directly inside Bob’s reasoning loop. Before generating SDK-specific code, Bob can search live docs, retrieve current APIs, and query live model catalogs from providers such as Google, OpenAI, and Anthropic. This grounds code generation on real-time ecosystem intelligence instead of outdated training memory.

Alongside DocSync, I also built Code2Paper, a custom orchestration mode for IBM Bob that transforms a working research repository into a publication-ready research paper. Code2Paper analyzes repositories, identifies novelty, performs federated literature search, generates architecture diagrams, plots, and comparison tables, drafts sections using venue-specific Typst templates, and compiles complete papers for conferences such as NeurIPS, CVPR, and IEEE.

Together, these systems solve two connected problems: keeping AI coding agents aligned with rapidly evolving technologies, and automating scientific communication directly from codebases.
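The DocSync retrieval step described above (index documentation passages, then look up the most relevant ones before generating code) could be sketched roughly as follows. This is not DocSync's implementation: the class `DocIndex`, the functions `embed` and `cosine`, and the sample passages are all hypothetical, and a bag-of-words count vector with cosine similarity stands in for the real embedding model and vector database. Scraping and the MCP tool wrapper are left out.

```python
import math
import re
from collections import Counter
from typing import List, Tuple


def embed(text: str) -> Counter:
    """Toy embedding: lowercase word counts (stand-in for a real embedding model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class DocIndex:
    """In-memory stand-in for the vector database (hypothetical)."""

    def __init__(self) -> None:
        self._entries: List[Tuple[str, str, Counter]] = []

    def add(self, source: str, text: str) -> None:
        """Index one documentation passage under a source label."""
        self._entries.append((source, text, embed(text)))

    def search(self, query: str, k: int = 2) -> List[Tuple[str, str]]:
        """Return up to k (source, text) pairs, most similar first."""
        q = embed(query)
        scored = [(cosine(q, vec), source, text) for source, text, vec in self._entries]
        # Drop passages that share no terms with the query.
        scored = [item for item in scored if item[0] > 0.0]
        scored.sort(key=lambda item: item[0], reverse=True)
        return [(source, text) for _, source, text in scored[:k]]


# Placeholder passages; a real index would hold scraped SDK documentation.
index = DocIndex()
index.add("sdk-a/quickstart", "create a client and call generate with a model name")
index.add("sdk-a/streaming", "stream tokens from the server with an async iterator")
index.add("sdk-b/auth", "set the api key in an environment variable")

hits = index.search("how do I stream tokens")
```

A tool exposed to the coding agent would then wrap something like `index.search`, so that the retrieved passages are available in the agent's context before it writes SDK-specific code.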

17 May 2026

👉 Upcoming Hackathons

AI GENESIS

AI Genesis is a global hybrid hackathon that brings AI builders together from around the world, starting online and culminating live at /function1 AI Conference & Exhibition 2026 in Dubai.

📅 Oct 26 – Nov 3, 2026
• Oct 26 – Nov 2: Online build & collaboration. All projects must be submitted by end of day on November 2.
• Nov 2 (Online + Dubai): An exclusive on-site build day in Dubai for selected participants.
• Nov 3 (Dubai): Live on-stage pitching & winners announcement at the /function1 Conference.

🌟 Get support from expert mentors throughout.
👥 Go solo or team up.
📍 Please note: If approved for the on-site experience, you’ll also have the opportunity to showcase your project live. Travel and accommodation will not be covered.
🧑🏻‍💻 Secure your spot now: sign up before the Kick-Off Stream!

26 Oct 2026

👌 Attended Hackathons

AMD Developer Hackathon: ACT II

To participate in the hackathon, simply click Sign up with AMD. If you are not already a member of the AMD AI Developer Program (ADP), you will need to create an account. ADP members can access AI tutorials, experts, and community support. In addition, new members can claim a free month of DeepLearning.ai pro, $100 in AMD GPU cloud credits, and $50 in Fireworks AI API credits. At hackathon launch, all participants will receive additional compute and API credits for their projects. Ready to build what's next on AMD? Sign up with AMD below to secure your spot.

IBM Bob Hackathon

Build solutions that improve how software is built. Work with an AI that understands your codebase, helping you reduce repetitive work and build with real context.

⏳ Build your project in 48 hours.
🚀 Get hands-on with IBM Bob and explore how AI fits into real development workflows before it becomes standard.
💡 Build tools and workflows that developers would actually use.
⭐ Learn directly from experts throughout the event.
🤝 Join a team or build solo.
💰 Prize pool: $10,000
🧑🏻‍💻 Register before the Kick-Off Stream and start building with Bob.

AI Agent Olympics Hackathon

⏱️ Build the next generation of Autonomous Agents in the heart of Europe’s AI Revolution.

📅 May 13 – 20, 2026
• May 13 – 19: Online Build Phase.
• May 19: An exclusive on-site build day at Milan AI Week for selected participants.
• May 20: Awards Ceremony.

📍 On-site Venue (May 19–20): Fiera Milano (Rho), Milan, Italy
🤝 Join solo or with a team.
🎟 FREE Milan AI Week conference ticket included for participants.
💰 $32,000+ prize pool
🧑‍💻 Apply now to turn your autonomous agent into a market-ready enterprise solution.

📝 Certificates

AI Agent Olympics Hackathon | Certificate

IBM Bob Hackathon | Certificate