6 projects
yasbd-lib
A high-accuracy, rule-based Sentence Boundary Detector (SBD) with a drop-in adapter for pysbd, delivering faster and more accurate segmentation.
yasbd-auxlang
Constructed language support for yasbd-lib — Esperanto, Interlingua, and more.
yasbd-xx
Experimental multilingual aggregate for yasbd-lib — best-effort sentence splitting over all installed language profiles.
chunklet-py
High-fidelity context-aware chunking and interactive visualization for RAG. Advanced segmentation for code and documents, because your LLM is only as smart as the fragments you feed it.
vinkra
A lightweight vector database with incremental inserts, automatic exact-to-ANN switching, and explicit storage management. Add vectors anytime without rebuilding the index
chunklet
A smart multilingual text chunker for LLMs, RAG, and beyond.