LLM-based hierarchical data extraction

Created by team NoLimits on October 26, 2023

LLM's have reached a point now where it's possible to query their billions of parameters and uproot information in a hierarchical structure, which is the structure of information itself, human knowledge and the physical world. Basically objects are made of objects etc But the features that humans use to describe our world, whether with language or drawing, are not the features that ML & AI algorithms use, which are merely data points, pixels on an image, or a vague assembly of such pixels. It is a fight against the infinite complexity of life and Nature itself. Humans use objects, (real) features, and in a hierarchical way: a cat is a head and a body. A head is eyes, whiskers, ears, An eye is a cornea and an iris... etc. Today algorithms process trillions of data points and 'only' produce a statistical result, not real features, and will always do so no matter how much data is used (unless they incorporate our approach). Humans, on the other side, can classify with certainty billions of different (visually) cats by verifying they tick a few boxes, ie the 'real life' features we all know make a cat (pointy ears, whiskers, fur, etc). Our approach deems to create a knowledge graph of such real life features (what we usually refer to more generally as 'objects'), which in turn, can be used to improve current algorithms' performance. For instance, it is easy to imagine how it allows to verify if all the proper features of an object are present in an image because it tells us exactly what we should be looking for, what matters. Therefore, it will improve, say, object recognition, with direct applications in robotics, AV, guided systems of all sorts, medical diagnostics and even real language understanding. There are other applications but one of them is to improve LLM's themselves, by reducing the training time and their size by incorporating our concept into new architectures to avoid having to re-learn the whole human knowledge every time.

Category tags:

Healthcare, Automotive, Web Scraping & Data Extraction

Github Presentation Demo

Explore more applications

MEDVAULT

MedVault AI ensures secure, AI-powered health records with offline access, SMS support & blockchain security, bridging healthcare gaps in remote areas.

streamlit

Level-4 Autonomous Connectivity Network

A fully autonomous, AI-driven connectivity solution that leverages "Giga Nodes"—smart, low-power, solar-powered devices—to establish resilient internet networks in underserved and remote regions.

Creativity with AI

AI/ML API

Maa-connect

Our project is a simple network management chatbot. It will help teachers, health facility network managers and individuals do basic diagnostic analysis of their network.

Maa Connect

TinyLlama

Smarti

This project uses machine learning to optimize TVWS base stations, predicting interference, failures, and providing AI recommendations. It features a dashboard, a simulation map, and procurement tracking.

VIBOT

Gemini AIGenerative AI Studio

AI for Connectivity Global Green Guard

AI platform to optimize school connectivity using geospatial data and LLMS for underserved regions

Global Green Guard

GPT-4 Vision

"it is an outstanding idea. new different approach to fill a gap. good implmentation of technology. excellent work"

Walaa Nasr Elghitany

Lablab Head Judge

"This is an impressive and original use of Falcon LLMs for feature extraction. The technical execution is spot on! Your project has business potential, although a deeper dive into market analysis could enhance its value proposition. In the future consider expanding the use case scenarios and include performance benchmarks to solidify the project's market readiness and appeal to potential investors or adopters."

Donald Nwokoro

Backend Developer

"Wow! Simply astonishing. The presentation, the architecture (combined with Falcon), the demo all seems to be flawless. The hierarchical approach is a gamechanger. This idea definitely has a lot of market value and great work team. Couldn't have done any better. "

Muhammad Inaamullah

ML Engineer

Events @ lablab
For Innovators & Creators