LLM-based hierarchical data extraction

medal
Streamlit
application badge
Created by team NoLimits on October 26, 2023

LLM's have reached a point now where it's possible to query their billions of parameters and uproot information in a hierarchical structure, which is the structure of information itself, human knowledge and the physical world. Basically objects are made of objects etc But the features that humans use to describe our world, whether with language or drawing, are not the features that ML & AI algorithms use, which are merely data points, pixels on an image, or a vague assembly of such pixels. It is a fight against the infinite complexity of life and Nature itself. Humans use objects, (real) features, and in a hierarchical way: a cat is a head and a body. A head is eyes, whiskers, ears, An eye is a cornea and an iris... etc. Today algorithms process trillions of data points and 'only' produce a statistical result, not real features, and will always do so no matter how much data is used (unless they incorporate our approach). Humans, on the other side, can classify with certainty billions of different (visually) cats by verifying they tick a few boxes, ie the 'real life' features we all know make a cat (pointy ears, whiskers, fur, etc). Our approach deems to create a knowledge graph of such real life features (what we usually refer to more generally as 'objects'), which in turn, can be used to improve current algorithms' performance. For instance, it is easy to imagine how it allows to verify if all the proper features of an object are present in an image because it tells us exactly what we should be looking for, what matters. Therefore, it will improve, say, object recognition, with direct applications in robotics, AV, guided systems of all sorts, medical diagnostics and even real language understanding. There are other applications but one of them is to improve LLM's themselves, by reducing the training time and their size by incorporating our concept into new architectures to avoid having to re-learn the whole human knowledge every time.

Category tags:

"it is an outstanding idea. new different approach to fill a gap. good implmentation of technology. excellent work"

avatar

Walaa Nasr Elghitany

Data scientist and doctor

"This is an impressive and original use of Falcon LLMs for feature extraction. The technical execution is spot on! Your project has business potential, although a deeper dive into market analysis could enhance its value proposition. In the future consider expanding the use case scenarios and include performance benchmarks to solidify the project's market readiness and appeal to potential investors or adopters."

avatar

Donald Nwokoro

Backend Developer

"Wow! Simply astonishing. The presentation, the architecture (combined with Falcon), the demo all seems to be flawless. The hierarchical approach is a gamechanger. This idea definitely has a lot of market value and great work team. Couldn't have done any better. "

avatar

Muhammad Inaamullah

Machine Learning Engineer