Top Builders

Explore the top contributors showcasing the highest number of app submissions within our community.

Reinforcement Learning

Reinforcement learning (RL) is an area of machine learning concerned with how intelligent agents ought to take actions in an environment in order to maximize the notion of cumulative reward. Reinforcement learning is one of three basic machine learning paradigms, alongside supervised learning and unsupervised learning.

General
Relese date1960

About Reinforcement Learning


Reinforcement Learning AI technology page Hackathon projects

Discover innovative solutions crafted with Reinforcement Learning AI technology page, developed by our community members during our engaging hackathons.

Anvaya EnterpriseIQ

Anvaya EnterpriseIQ

Anvaya EnterpriseIQ: Next-Generation RAG & Data Intelligence Suite EnterpriseIQ is a state-of-the-art AI platform designed to transform fragmented enterprise data into unified, actionable intelligence. Built on a modular microservice architecture, the platform bridges the gap between unstructured document retrieval and structured data analytics. At its core, EnterpriseIQ features a high-performance RAG (Retrieval-Augmented Generation) pipeline that allows users to query thousands of documents using semantic search, powered by Vertex AI and a high-scale vector database. For structured enterprise data, the platform integrates a sophisticated Natural Language-to-SQL Analytics Agent, enabling non-technical users to perform complex mathematical queries and visualizations on massive datasets exceeding 250,000 rows. Beyond simple querying, EnterpriseIQ provides proactive intelligence through dedicated Anomaly Detection and Forecasting services, which utilize machine learning to identify system-critical patterns and predict future trends in real-time. A standout feature is the Interactive Knowledge Explorer, which automatically synthesizes relationships between disparate data sources to build a visual knowledge graph. Whether ingesting raw CSVs or complex PDF contracts, the system builds a cohesive map of entities and relationships, providing a "big picture" view of corporate knowledge. With a premium, glassmorphic UI built in Next.js and a robust GCP-integrated backend, EnterpriseIQ is engineered for the modern enterprise that demands speed, accuracy, and deep insight at scale.

ARCHON: AI for Engineering Intelligence

ARCHON: AI for Engineering Intelligence

ARCHON is an AI-powered engineering intelligence platform built with IBM Bob to help developers understand, analyze, and manage complex software systems faster and more efficiently. Unlike traditional coding assistants that mainly focus on autocomplete or code generation, ARCHON acts as an AI engineering teammate that understands the entire codebase, architecture, dependencies, and business logic of a project through repository-wide AI reasoning. Modern software systems are increasingly difficult to maintain due to large repositories, incomplete documentation, technical debt, and complex service dependencies. Developers often spend more time understanding code, debugging issues, and tracing dependencies than actually writing code. ARCHON solves this problem by transforming repositories into an intelligent and interactive engineering knowledge system powered by IBM Bob and IBM Granite. IBM Bob enables ARCHON to understand software systems beyond individual files by analyzing repository structures, dependency relationships, architecture flows, and engineering workflows across the entire project. ARCHON automatically generates architecture maps, dependency graphs, and code flow explanations while allowing developers to interact with the system using natural language queries. The platform also provides AI-powered impact analysis, root cause investigation, technical debt detection, and engineering knowledge recovery from historical commits and undocumented logic. By combining repository-wide reasoning, multi-agent AI workflows, and visual system intelligence, ARCHON helps developers debug faster, reduce onboarding time, minimize deployment risks, and significantly improve engineering productivity. ARCHON’s vision is to transform software engineering from simple code assistance into true engineering intelligence.

Trident - Redefining AI security

Trident - Redefining AI security

Trident — Brief Summary Trident is an AI response monitoring system that acts as an independent watchdog between large language models and end users. The core problem it solves is simple — when a company deploys an AI chatbot, there is no guarantee that every response the AI generates is safe, accurate, or appropriate for the user to see. Trident solves this by intercepting every single AI response before it reaches the user and running it through a 12 layer classification pipeline built using libraries like VADER, Sci-kit, and better-profanity. Each layer checks for a different type of risk — from negative sentiment and toxic language to jailbreak attempts and sensitive data leaks. If any layer flags the response as unsafe, it is immediately blocked, deleted, and the company is alerted in real time. Only responses that pass all 12 layers are delivered to the user. What makes Trident unique is its sector awareness — the same response is judged differently depending on whether it comes from a healthcare, finance, legal, or general AI system, making the monitoring contextually intelligent rather than one size fits all. It also uses a completely independent model as the auditor, meaning the AI being monitored can never influence its own judgment. The system is presented through a clean professional dashboard built in Streamlit, where judges and companies can paste any AI response, watch it pass or fail through each layer in real time, and view full analytics including risk heatmaps, block frequency charts, and scan history — all exportable as CSV. In one line — Trident is the safety layer every AI deployment needs, because every AI needs a watchdog.