Event ended

Semantic Search AI Hackathon Summary

🗓️ This will be a 7-day virtual hackathon from 16-23 December 💻 Build AI application with the latest large language model-powered technology by Cohere 💡 Get the chance to work with the best AI professionals in the industry and learn from them ✔️Entry level = 0. You’ve just started with AI? Are you an experienced Data Scientist? Or maybe you are a Designer or Business Developer? Join us! We need your domain knowledge! 🐱‍💻 Register now and let's get started! It’s free!

Semantic Search AI Hackathon image

Hackathon Overview

Our AI hackathon brought together a diverse group of participants, who collaborated to develop a variety of impressive projects based on:

1819

Participants

145

Teams

34

AI Applications

Winners and Finalists

Found In Translation

Our solution is a smart Slack bot that implements multilingual semantic search and sentiment analysis to facilitate multi-language messaging platforms by improving the chat search function. The bot allows users to search using any language for messages in any language. It also contains a sentiment analysis that gives moderators a sentiment report on the messages of specific users or the chat as a whole.

RatLabs

Summit

Summit helps you search and summarise research papers to quickly get an overview of what the research says, saving you time and effort in your quest for knowledge.

Grounded AI

SemantoTube

Our application aims to make it easier for users to find and access the information they need within the specified video. By using the Cohere API, we are able to perform advanced semantic searches that go beyond simple keyword matching, allowing users to find content relevant to their query in the specified video. Our application utilizes the Cohere API to locate and highlight the specific transcript within the video that matches their search. This makes it easier for users to quickly find and access the information they need, without having to manually search through the entire video. Overall, our application is a game-changer for anyone looking to find and access specific information within YouTube videos. Whether you're a student looking for a specific lecture, a researcher trying to locate a specific quote, or just someone looking for information on a particular topic, our application makes it easier than ever to find what you need.

Giga Chads

This event has now ended, but you can still register for upcoming events on lablab.ai. We look forward to seeing you at the next one!

Checkout Upcoming Events →

Submitted Concepts, Prototypes and Pitches

Submissions from the teams participating in the Semantic Search AI Hackathon event and making it to the end 👊

Help to spread the word and share these amazing projects!

Flash Learning

Flash Learning is a flashcard application that helps the user to retain any material, like terms, definitions, laws, etc, through active recall. Unlike the classic flashcards, where the process of evaluation is ambiguous because the users say their answers verbally, our application offers the opportunity to get AI-based self-assessment scores that quantify their level of knowledge of the studied card. These scores can serve as indicators on how well the users know the material and if they are ready to learn something else or end their learning session. It can be used for both formal and informal education, where the instructor can prepare public flashcards with the curriculum needed for tests and exams. The Leitner System - a widely used method of efficiently using flashcards - is now implemented digitally in our application. It is a simple implementation of the principle of spaced repetition, where cards are reviewed at increasing intervals to enhance the process of learning and store the information in the long-term memory. The users can choose to know the card at the GOOD (>80%) or EXCELLENT (>90%) level based on their necessity. If their score is not excellent, they will even get explanations with GenerativeAI to help them improve their score.

FlashLearners

News Media Monitoring

News monitoring news allow observing and collecting key customer data directly from mentions and discussions about your company (or your client’s company). These discussions take place in social media comments, product review websites, blogs, podcasts, discussion forums, and media websites.

Embeddings

Search Light

A semantic search system that enables you to search your codebase using natural language. Project repo https://github.com/silvererudite/code_search

Amadeus

Podsearch

Podcasts are an excellent source of knowledge. But they can be too long and hard to pay attention to it the entire time. What if there is a more intuitive way to search for podcasts and also for info within podcasts? This is where our product comes into play. Key highlights 1. Searching for podcasts suited to your taste 2. Searching for answers within a podcast itself by asking it queries and without listening 3. Marking exactly where the answer is and summarising it. 4. Telling user what queries this podcast answers Major Uplifts: 1. Generating queries for dialogues in transcript using the prompt - "Generate 5 questions for the following passage {passage}" 2. Training a classifier using cohere api using the generated queries and dialogues 3. Highly scalable architecture 4. Podcast is just an example. Most documentation (python libraries, eth doc) have only keyword search. It is possible to scrape the documentation and build an index for a search engine using our architecture easily.

Info Insighters

Project Eval

Eval aims to address the problem of subjectively evaluating test answers. Traditionally, this task has been carried out manually by human graders, which can be time-consuming and prone to bias. To address this issue, the project utilizes Cohere powered APIs to automate the evaluation process. The use of Cohere APIs allows for the integration of advanced natural language processing techniques, enabling the system to accurately understand and analyze the content of test answers. The custom model built upon these APIs then scores the answers based on suitable metrics, which can be tailored to the specific requirements of the test or assessment. One potential application of this technology is in the field of education, where it could be used to grade assignments or exams in a more efficient and unbiased manner. It could also be utilized in professional settings for evaluating job applications or performance evaluations. In addition to increasing efficiency and reducing bias, the use of automated evaluation techniques has the potential to provide more consistent and reliable scoring. This can help to ensure that test-takers receive fair and accurate assessments of their knowledge and skills. The model for the same was evaluated based on 4 major metrics: - Semantic Search: this is the primary scoring strategy of Eval. It is used to semantically understand the answer given and evaluate based on content rather than simply scoring based on textual similarities. Cohere Embed was used to generate embeddings for 5 suggested answers for the question and the answer to be checked. Then we find the distance from the nearest neighbor out of the 5 suggestions and the answer. This distance is used to grade the answer. - Duplication Check: partially correct answers with duplication of text tended to get higher similarity scores compared to the ones without duplication. To stop students from using this exploit to gain extra marks, a duplication checker was implemented based on Jaccard-Similarity between sentences within the answer. - Grammar Check: this strategy aims to check the grammar of the answer and assign a score based on the number of grammatical errors. We used Cohere Generate endpoint to generate a grammatically correct version of the answer, then check for cosine similarity of the generated version with original version to check if the original version was grammatically correct. - Toxicity Check: this aims to detect for toxic content in the answer and penalize an answer if it is toxic. We trained a custom classification model on Cohere using the Social Media Toxicity Dataset by SurgeAI which gave a 98% precision on the test split. We also implemented a Custom Checks which allows users to give different weights to each of the three different metrics based on how important they are for the evaluation of the answer. This allows for a more personalized evaluation of the answer. We built our custom model into a Flask-based REST API server deployed on Replit to streamline usage and allow people to access the full-functionality of the model. We also built a highly interactive UI that allows for users to easily interact with the API and evaluate their answers as well as submit questions.

chAI

YTBrief

AI solution that will make your YouTube experience easier by getting answers from videos in seconds.

TopGeez

Genesis

It is an search app to look up meaning, hope, faith, etc in the Bible based on the user situation (happiness, sadness, depression, motivation, ...)

Genesis

Libris

Libris is a tool for making knowledge accessible to all. You just describe an important topic that you want to investigate and Libris will bring to light the relevant excerpts from the most brilliant minds of all time. You can also filter by specific authors and titles to search in a specific subset of the bibliography that is specially relevant to your research. The magical thing about Libris is that it doesn't matter the language you speak or the language the text was written on, the experience is seamless: you write the concept in your own language and get results in your own language. You can even access the original work for further reference!

Going Solo

ezInvest

ezInvest is an application, that helps user in making a decision to get a Stock based on News headlines Sentiments. Investment in Stocks is very Risky. Many Cases like Political, Inflation, economics, investors behavior Impacts Stock Prices. Identifying Stocks to Invest is Challenging. Existing Applications are very Expensive. ezInvest will address all these issues.

Team Deepfai

CoHotel

Recommend and book the best matching hotel for travelers to the US based on Semantically Searching Hotel Reviews in the city they are traveling to. Integrates with MyCroft AI to provide text and voice conversational interfaces with feedback loops that increase the accuracy of choice.

CoHotel

Citation needed

Citation Needed is a fact verification assistant that allows you to find relevant sources and citations to a given claim. Specifically, we begin with a information database, where each claim or fact is substantiated with a reference. From there, a query claim can be used with semantic search to find relevant sentences, and hence the citations of them. This application can be useful to a variety of audiences, from researchers looking to more efficiently browse through papers, teachers looking to better fact check essays, to even students looking to find more reading material. We believe that Citation Needed can be a powerful tool if scaled up fully. The current demo has a "Narrow" mode, which only contains topics related to neural networks, and a "General" mode, which scrapes for additional information on the fly, after extracting important subjects through the use of the cohere text generation API.

Citation Needed

Found In Translation

Our solution is a smart Slack bot that implements multilingual semantic search and sentiment analysis to facilitate multi-language messaging platforms by improving the chat search function. The bot allows users to search using any language for messages in any language. It also contains a sentiment analysis that gives moderators a sentiment report on the messages of specific users or the chat as a whole.

RatLabs

AlphaBot

Around 42% of start-ups fail due to a lack of market-related knowledge. Our tool helps start-ups or small businesses or even people with great ideas, to understand the market, the market need, and the financial aspect of the industry that they want to venture in.

AlphaBot

Semantic search models

Semantic search has been used in daily life applications, especially for the elderly people in speech-recognition, because they are not able to use touch device due to their bad eyesight. Speech recognition is one of the applications of semantic search, and for this there have been used NLP and Preprocessing. As a result, the information is gathered as a cluster, which then embedded into a system. Moreover, the politics has also been analyzed through preprocessing which is branch of NLP that is being used in semantic search.

AI isnt AI

SemantoTube

Our application aims to make it easier for users to find and access the information they need within the specified video. By using the Cohere API, we are able to perform advanced semantic searches that go beyond simple keyword matching, allowing users to find content relevant to their query in the specified video. Our application utilizes the Cohere API to locate and highlight the specific transcript within the video that matches their search. This makes it easier for users to quickly find and access the information they need, without having to manually search through the entire video. Overall, our application is a game-changer for anyone looking to find and access specific information within YouTube videos. Whether you're a student looking for a specific lecture, a researcher trying to locate a specific quote, or just someone looking for information on a particular topic, our application makes it easier than ever to find what you need.

Giga Chads

sundarEduAI

sundarEduAI is an advanced AI learning tool which helps you to find answers of questions of given topics/chapters within few seconds. It also increase accessibility for users who want find solutions quickly. It's quite fast, easy-to-use & accurate. It'll be next-generation of edtech.

sundarAI

Resume AI II

Generate personalized resume customized to the job description from a master resume.

Resume AI

whereDoYouMean

When students are revising their work, they face a few hurdles - (1) finding content from a big repository of school materials is time-consuming, especially when the student does not know specifically where to look for the information; (2) results searched online might not be exactly the same as what the school is teaching in terms of scope. Hence, we decided to create an application where teachers can transcribe their videos and generate the embeddings. Users can then ask a question and the top few links to the relevant videos will be given to the student (with the time-stamp attached). The context is also provided to the student so that they can make a judgement as to which video is worth exploring. After which, students can also vote on the relevance of the videos, which will be used to affect future training. We hope that this project can help students learn better with their school's resources.

whereDoYouMean

LinguaScope

LinguaScope helps refugees query official immigation documentation in their own language for urgent questions when they seek asylum. Users can query for childcare, getting heathcare etc. Often it is not possible to get translators and this app will help them navigate resources for critical information.

LinguaScope

Coherent Speech

Multilingual voice content generator, powered by co;her(💜+🤖) Generate model. Using the power of whiper, co;her(💜+🤖) and google to create speech content generator. This Application helps in generating audio content using speech as a prompt.

OraOraOra

Quick Learners

Our education platform aims to revolutionize the way students learn by providing a unique and personalized learning experience. Through the use of semantic search, we enable users to easily find specific moments in a video that discuss a particular concept, making it easier for them to focus on and understand the material. Additionally, we recommend people with similar interests to connect with each other, fostering a sense of community and collaboration among learners. Lastly, we recommend posts to users based on their interests, ensuring that they are constantly exposed to relevant and engaging content of the user with same interests.

Breaking AI

AI Generation

In my Project you can get blogpost content for posting you blogs and you can have the trending hashtags and you can also generate mails

Alpha

Cohire

Cohire is an AI powered job portal that allows top employers to find the best talents to hire. Contrary to AI taking over our jobs, this project tries to take a different narrative where AI will help us find jobs, not replace us.

Pandora

Cofinder

Built by the Community, for the Community A semantic search tool enabling Cohere users to find relevant content in one place based on their personal goal. The aim is to breakdown barriers for users such as developers, entrepreneurs and Data Scientists, making it easier to find the information they need, to remove the effort of obtaining information and focus on building amazing applications using Cohere's language models.

Cofinder

ZenWork

ZenWork is an MVP semantic search engine for finding your dream company that shares your vision.

Sepik

Summit

Summit helps you search and summarise research papers to quickly get an overview of what the research says, saving you time and effort in your quest for knowledge.

Grounded AI

Profile Picker

For startups and new businesses, it's hard to get budget, manpower and time to put on resume sorting kind of works. When large number of applicants apply for a job role then it's really difficult to choose the best ones by going through each resume. Our product "Profile Picker" provides businesses ability to sort best resumes based on the job description. Our ML model understands the meaning of the words that are in the resumes and in the job descriptions and it also gives multilingual support which means job description and resumes can be written in different languages.

VigyaAI

FilmyCode

We made an app to search movies. We have used cohere and pinecone. Pinecone is being used for vector database and cohere is for embeddings. Our app gets movies based on user inputted query. So if a user search "Alien invasion movie", the app outputs "Edge of tomorrow, etc". It is mostly google like search but for movies and also we are using NLP (cohere-large) model.

Sentient

Patent Search and Generation

This is a service assists patent attorneys/professionals/applicants/to trawl through patent databases and return the most semantically relevant patents instead of simply returning patents that correspond to lexical/keywords matching AND generate a draft patent application for the inventor. As such, we are able to reduce the time taken for the patent application process,resulting in greater revenues for patent attorneys/professionals while extending the useful patent life ofthe inventor's patent, as well as expedite his patent application process

Love AP

CoDoctor

CoDoctor is a healthcare assistance tool that helps users find answers to their questions about different health conditions. It uses a multilingual model called Co:here, which uses a semantic search algorithm to search through a dataset and provide accurate and relevant information based on users query. CoDoctor is a useful resource for anyone looking for reliable information about various health conditions and how to manage them. Whether you have a specific question or just want to learn more about different health conditions in general, CoDoctor can help you find the information you need.

OGNG

Emoji Character Voice Recog

A voice recognition way of creating emojis and special characters. 10 emojis and 13 special characters are initialized. When we speak with initialized key(there names in code), they will be displayed.

Sky High Tech

Product Description Generator

This web app let the user to fill in the name of product and its properties as keywords. Then, uses cohere-ai (via Node) to generate the description of the product related to the given keywords. Can be used for advertisement, Digital Marketing,& SEO related purposes.

Non Zero

Indic News search engine

🌍 The Solution uses a multilingual semantic model from COhere to 🚀 revolutionize the media and news industry in multilingual markets like India(We have used telugu , hindi ,bengali,English news dataset for this use case), allowing anyone to track 📰 regional news in real-time without the need for translation or understanding of other regional languages. 🙌

team phoeniks

Embetter support for Cohere

By supporting Cohere via embetter, we gain scikit-learn compatibility. That means it's easy to: 1. Build active learning pipelines via partial_fit 2. Use it for bulk labelling via bulk 3. Use it to find bad labels via doubtlab 4. Run quick retreival projects via simsity And more!

calmcode