Observability and diagnostic framework for embedding spaces and vector stores
Project description
VecHealth
Observability and diagnostic framework for embedding spaces and vector stores.
Like Prometheus monitors infrastructure and MLflow monitors models, VecHealth monitors the health of your embedding space.
The Problem
When RAG retrieval quality drops, teams typically try to fix it by:
- Changing the embedding model
- Rebuilding the vector store
- Modifying chunking strategy
This is expensive and time-consuming — and often doesn't answer: Where is the actual root cause?
What VecHealth Does
VecHealth answers why retrieval quality degrades by analyzing the geometry and topology of your embedding space directly.
import vechealth as vh
report = vh.analyze("path/to/your/vectorstore")
print(report.health_score) # 0.73
print(report.pathologies) # ["hubness", "void_regions"]
print(report.recommendations) # ["Consider re-indexing with higher M parameter"]
Detected Pathologies
| Pathology | What it means | Impact on retrieval |
|---|---|---|
| Hubness | Few vectors dominate all k-NN results | Low diversity |
| Void regions | Dead semantic zones in embedding space | Coverage failures |
| Anisotropy | Vectors clustered in narrow cone | Cosine similarity breaks down |
| Embedding collapse | Low intrinsic dimensionality | ANN recall degrades |
| Near-duplicate flood | Redundant vectors dominate retrieval | Low diversity |
Status
🔬 Active research project — Paper #1 in preparation. Star the repo to follow progress.
Research
This project is developed as part of research into geometric and topological analysis of embedding spaces. Papers coming soon.
License
Apache 2.0
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vechealth-0.1.0.tar.gz.
File metadata
- Download URL: vechealth-0.1.0.tar.gz
- Upload date:
- Size: 7.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
91d84b3455b5972ccb897f9c2af1000f3683a933880c4b78493628ca04221c07
|
|
| MD5 |
54bbec14510733159f215c5294d571e9
|
|
| BLAKE2b-256 |
f33d8985084b8cfb99d8175508d1257de51c085883d73de923327bd381f6efd6
|
File details
Details for the file vechealth-0.1.0-py3-none-any.whl.
File metadata
- Download URL: vechealth-0.1.0-py3-none-any.whl
- Upload date:
- Size: 6.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5c8792717d1bd71a2e2c37ef8ffaddf1251a6e71ec125cd71e74339df010f230
|
|
| MD5 |
fb5ffc1c83c28bb7048208c76a844c0c
|
|
| BLAKE2b-256 |
d28067ae70d67f2626c0fd7b9c9b78a45f7176d7773cadece1d7103289dae516
|