6 projects
lateness
Modern ColBERT for Late Interaction with native multi-vector support
Route0x
Low-latency, high-accuracy custom query routers.
FlashRank
Ultra-lite and super-fast SoTA cross-encoder-based re-ranking for your search & retrieval pipelines.
route-360
Low-latency, high-accuracy custom query routers for humans and agents.
SPLADERunner
Ultralight and fast wrapper for the independent implementation of SPLADE++ models for your search & retrieval pipelines. Models and library created by Prithivi Da. For PRs and collaboration, check out the README.
flashembed
Lightweight and fast Python library for adding low-footprint (all-MiniLM-* equivalent) multilingual retrievers to your RAG and search & retrieval pipelines.