Hero Banner

Top Builders

Explore the top contributors showcasing the highest number of app submissions within our community.

Cloudflare Workers AI

Run machine learning models, powered by serverless GPUs, on Cloudflare's global network. Workers AI allows you to run AI models in a serverless way, without having to worry about scaling, maintaining, or paying for unused infrastructure. You can invoke models running on GPUs on Cloudflare's network from your own code — from Workers, Pages, or anywhere via the Cloudflare API.

General
AuthorCloudflare, Inc.
WebsiteCloudflare Workers AI
Documentationhttps://developers.cloudflare.com/workers-ai/
TypeServerless AI Inference Platform
Launch Year2023
GPU Network180+ cities globally

Features

50+ Open-Source Models

  • Text generation (Llama, Mistral, and more)
  • Text embeddings and classification
  • Image generation and classification
  • Automatic speech recognition
  • Translation models
  • Object detection capabilities

Serverless Infrastructure

  • Pay-for-what-you-use pricing model
  • Automatic scaling with demand
  • No infrastructure management required
  • Fast cold start times with V8 isolates

Global Edge Network

  • AI inference close to users for low latency
  • Models available in 180+ cities worldwide
  • Reduced network bottlenecks
  • Consistent performance globally

Developer Platform Integration

  • Seamless integration with Cloudflare Workers
  • Works with Pages for full-stack AI applications
  • REST API for platform-agnostic access
  • Integration with Vectorize (vector database)
  • AI Gateway for monitoring and control

Key Capabilities

  • Edge AI Computing: Run AI models at the network edge for minimal latency
  • Serverless GPU Access: Access powerful GPU infrastructure without provisioning
  • Model Catalog: Curated selection of popular open-source AI models
  • Real-time Inference: Low-latency AI processing for interactive applications
  • Global Deployment: Deploy once, run everywhere on Cloudflare's network
  • Integrated Ecosystem: Works with R2 storage, D1 database, and other Cloudflare services

Use Cases

  • Building AI-powered chatbots and conversational interfaces
  • Real-time content moderation and classification
  • Image and video processing at scale
  • Personalization and recommendation engines
  • Automated translation and localization
  • Voice recognition and text-to-speech applications
  • RAG (Retrieval-Augmented Generation) systems
  • AI-powered API endpoints and microservices

Supported Model Categories

  • Large Language Models: For text generation and chat applications
  • Embedding Models: For semantic search and similarity matching
  • Image Models: For generation, classification, and analysis
  • Speech Models: For transcription and synthesis
  • Vision Models: For object detection and recognition
  • Translation Models: For multilingual content processing

Cloudflare Cloudflare Workers AI AI technology Hackathon projects

Discover innovative solutions crafted with Cloudflare Cloudflare Workers AI AI technology, developed by our community members during our engaging hackathons.

Lunar Graph - AI-Powered Fraud Ring Detection

Lunar Graph - AI-Powered Fraud Ring Detection

Deriv's compliance teams face 2,100+ fraud alerts weekly, 95% false positives. Each investigation takes 25+ minutes, creating weeks-long backlogs. Meanwhile, fraud rings go undetected because every individual account passes KYC, AML, and risk checks perfectly. The insight: fraud is invisible individually — it's obvious at the network level. LunarGraph maps every entity (partners, sub-affiliates, clients, trades, payments) into a knowledge graph, then deploys three AI agents in parallel: Graph Anomaly Agent — Detects structural patterns like one partner controlling 20+ accounts through layered sub-affiliates sharing IP addresses and device fingerprints. Temporal Intelligence Agent — Identifies coordinated opposite BUY/SELL trades placed within 30-second windows across linked accounts (mirror trading for commission extraction). Behavioral Trajectory Agent — Compares partner behavior against known fraud signatures to predict emerging schemes 2-3 weeks before activation. The platform connects to the real Deriv WebSocket API. Partners invite affiliates via unique referral links, each generating a tracked trading account with a TradingView-style interface. Every trade feeds into the graph engine for real-time correlation analysis. An AI Copilot synthesizes findings into case reports in 28 seconds via natural language queries. Results: 99.86% alert reduction, 28-second case generation (vs 25min manual), 14-day predictive lead time, $178K+ fraud exposure detected across 3 rings and 73 entities.