Falcon RVD

Streamlit
application badge
Created by team Falcon RVD on August 04, 2024

Recursive Vector Disambiguation (RVD) is a technique to improve semantic vector search process by using LLMs like Falcon-180B to prioritize terms in the query and then matching for similar vectors in the order of prioritized terms. The terms may be from the query or generated by the LLM (Falcon-180B) to better encapsulate the intent of the query. The embedding model used is Falcon2. These terms will be weighed and pairwise similarity will be considered to obtain the final scores and ranking. This ensures high quality results/top-matches. For Ex: When not using RVD, the query 'go break a leg' might match sentences related to literal leg injuries instead of the correct interpretation of wishing good luck. Similarly, the query Italian food serving restaurant having outdoor seating might wrongly match sentences that contain entities related to Italy or restaurant while the user has specifically constrained the search to the mentioned terms. A lot depends on what the training objective of the embedding model was. RVD tries to improve this search experience. You can try the demo below.

Category tags: