Discover AI Technologies

This index is your gateway to discovering AI technologies and related topics. On each technology page, you'll find tutorials, upcoming hackathons, frameworks, and boilerplates to help you get started.


BabyAGI is a Python project that demonstrates an AI-powered task management system that uses OpenAI and Pinecone APIs to create, prioritize and execute tasks.


OpenAI is an AI research and deployment company


Grounded-Segment-Anything is a framework that combines Grounding DINO and Segment Anything to detect and segment objects in images using text prompts. The project also incorporates other models like Stable-Diffusion, Tag2Text, and BLIP for various tasks like image generation and automatic labeling.


GPT-3.5 stands for Generative Pre-trained Transformer and is fine-tuned version of GPT-3. It is an autoregressive language model that uses deep learning to produce human-like text. There are many models in GPT-3.5 family as gpt-3.5-turbo or gpt-3.5-turbo.


Ask questions to your documents without an internet connection, using the power of LLMs. 100% private, no data leaves your execution environment at any point.

OpenAI gym

OpenAI gym is a toolkit for developing and comparing reinforcement learning algorithms. It supports teaching agents everything from walking to playing games like Pong or Pinball.


GPT-3 stands for Generative Pre-trained Transformer 3 and it is an autoregressive language model that uses deep learning to produce human-like text. It is the third-generation language prediction model in the GPT-n series created by OpenAI. GPT-3 is currently in open beta.


OpenAI Codex is an artificial intelligence platform that enables developers to translate natural language into code.


YOLOv7 is the new state-of-the-art object detector in the YOLO family. According to the paper, it is the most accurate and fastest real-time object detection to date.


BERT stands for Bidirectional Encoder Representations from Transformers. You can fine-tune it and get state-of-the-art results in a wide variety of natural language processing tasks


YOLOv5 is family of object detections models pretrained on the COCO Dataset. It has been created by Ultralytics in 2020. This architecture contains 10 different models, each one with a different size and speed. YOLOv5 is also a part of the YOLO family of object detection models.


GAN models contain two neural network models - a generator and a discriminator. They play together to be better in every epoch. One generates images, second verifies them.

Dall-e Mini

Dall-e Mini is a version of Openai's Dall-e algorithm that is specifically designed to generate small, high-quality images.


One of the best AI labs in the world. By using the Cohere platform we are able to use one of the best and most powerful language models for our own! There are many tutorials, including our own one, that show how to use Cohere.

Reinforcement Learning

Reinforcement learning is a computational approach to machine learning where agents take actions in an environment to maximize some notion of cumulative reward.


YOLOv6 is a single-stage object detection framework dedicated to industrial applications, with hardware-friendly efficient design and high performance.

AWS SageMaker

AWS SageMaker is a fully managed machine learning service that enables developers to quickly and easily build, train, and deploy machine learning models at scale. SageMaker removes the heavy lifting from each step of the machine learning process to make it easier to develop high-quality models.


EasyOCR is Python package for Optical Character Recognition. It is a general OCR that can read both natural scene text and dense text in document. It supports more than 80 languages!


CAMEL stands for Communicative Agents for "Mind" Exploration of Large Scale Language Model Society. The purpose of this framework is to enhance the collaboration among AI chat agents to achieve tasks with minimal human involvement.

Generative Agents

Generative Agents are computational software that simulate human behavior in interactive applications.

Text Generation Web UI

A Gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA. It provides a user-friendly interface to interact with these models and generate text, with features such as model switching, notebook mode, chat mode, and more.


Chroma is an AI-native open-source embedding database. The fastest way to build Python or JavaScript LLM apps with memory


OpenAI Shap-E is a text to 3D generation model.


Qdrant is a search engine and database designed for vector similarity. With its user-friendly API, it offers a production-ready service for storing, managing, and searching vectors, including an additional payload. Qdrant is particularly well-suited for neural-network or semantic-based matching, faceted search, and other applications that require extended filtering support.


Redis is an open-source, in-memory data store that provides access to mutable data structures via a set of commands. It offers high performance and scalability, making it a popular choice for use in web applications and other use cases that require fast data access. Redis supports various advanced features such as persistence options, replication, clustering, and pub/sub messaging.


DALL·E 2 is a new AI system that can create realistic images and art from a description in natural language.

AI21 Labs

AI21 Labs is a platform for building AI applications. It provides a suite of tools and guides for building AI applications and training models. AI21 Labs helps developers and businesses use cutting-edge language models, with no NLP expertise required. Access powerful language models using an API that makes you smile

Cohere Embed

Turn text into numerical representations of language for deeper insights at scale. Embed makes it possible to algorithmically categorize and score text quickly to extract meaning.


Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.

Stable Diffusion

Stable Diffusion is a latent text-to-image diffusion model capable of generating stylized and photo-realistic images. It is pre-trained on a subset of of the LAION-5B dataset and the model can be run at home on a consumer grade graphics card so everyone can create stunning art within seconds.


Large language models (LLMs) have transformed the way we approach machine learning and artificial intelligence. These models are incredibly powerful, but their true potential lies in their ability to work in harmony with other sources of data and computation. This library has been developed to assist developers in building applications that combine LLMs with other technologies, allowing them to create truly intelligent and innovative products.

Cohere Generate

Generate is powered by a large language model that has read billions of words, learning the patterns and idiosyncrasies of sentences. Using this knowledge, it writes content, predicts outcomes or answers questions at your command.

Cohere Classify

Access large language models that can understand text and take appropriate action — like highlight a post that violates your community guidelines, or trigger accurate chatbot responses


GPT-4 is OpenAI's 4th generation Generative Pre-trained Transformer. It is a multimodal large language model that uses deep learning to produce human-like text, accepting image and text inputs. GPT-4 is OpenAI's most advanced system, producing safer and more useful responses


ChatGPT is a large language model trained by OpenAI to generate human-like text in a conversational style. It is a variant of the GPT-3 model, which was specifically designed to be used to generate text in response to user input. The ChatGPT model has been trained on a vast amount of text data, including conversations and other types of human-generated text, which allows it to generate text that is similar in style and content to human conversation. ChatGPT can be used to generate responses to questions, code, make suggestions, or provide information in a conversational manner, and it is able to do so in a way that is often indistinguishable from human-generated text. The initial model has been trained using Reinforcement Learning from Human Feedback (RLHF), using methods similar to InstructGPT, but with slight differences in the data collection setup. The model is trained using supervised fine-tuning, where human AI trainers provided conversations in which they played both sides—the user and an AI assistant. The trainers would have had access to model-written suggestions to help them compose their responses.


Stanford Alpaca is an open-source project that demonstrates the capabilities of an instruction-following LLaMA model. Developed by a team of researchers at Stanford University, Alpaca is designed to understand and execute tasks based on user instructions.

Cohere Neural Search

Cohere Neural Search provides powerful semantic search capabilities that find text, documents, and articles based on meaning, not just keywords.


Vercel is a platform designed for developers, providing speed, reliability, and scalability to create and deploy web applications. With built-in CI/CD, zero configuration, and deep integrations with popular Git providers such as GitHub, GitLab, and Bitbucket, Vercel streamlines the development process, making it easy for teams to collaborate and iterate on their projects.


LLaMA (Large Language Model Meta AI) is a state-of-the-art foundational large language model designed to help researchers advance their work in the subfield of AI. It is available in multiple sizes (7B, 13B, 33B, and 65B parameters) and aims to democratize access to large language models by requiring less computing power and resources for training and deployment.


Auto-GPT is an experimental open-source application that showcases the capabilities of the GPT-4 language model. This program, driven by GPT-4, chains together LLM 'thoughts' to autonomously achieve whatever goal you set.

Anthropic Claude

Claude is your friendly and versatile AI language model that can assist you as a company representative, research assistant, creative partner, or task automator.


YOLOv8 is the new state-of-the-art object detector in the YOLO family. According to the paper, it is the most accurate and fastest real-time object detection to date.


GPT4All is an open-source ecosystem of on-edge large language models that run locally on consumer-grade CPUs, providing a powerful and customizable AI assistant.


Falcon is a powerful and versatile large language model (LLM) developed by the Technology Innovation Institute (TII) in the United Arab Emirates