Skip to main content

Observability and diagnostic framework for embedding spaces and vector stores

Project description

VecHealth

Observability and diagnostic framework for embedding spaces and vector stores.

Like Prometheus monitors infrastructure and MLflow monitors models, VecHealth monitors the health of your embedding space.

The Problem

When RAG retrieval quality drops, teams typically try to fix it by:

  • Changing the embedding model
  • Rebuilding the vector store
  • Modifying chunking strategy

This is expensive and time-consuming — and often doesn't answer: Where is the actual root cause?

What VecHealth Does

VecHealth answers why retrieval quality degrades by analyzing the geometry and topology of your embedding space directly.

import vechealth as vh

report = vh.analyze("path/to/your/vectorstore")
print(report.health_score)      # 0.73
print(report.pathologies)       # ["hubness", "void_regions"]
print(report.recommendations)   # ["Consider re-indexing with higher M parameter"]

Detected Pathologies

Pathology What it means Impact on retrieval
Hubness Few vectors dominate all k-NN results Low diversity
Void regions Dead semantic zones in embedding space Coverage failures
Anisotropy Vectors clustered in narrow cone Cosine similarity breaks down
Embedding collapse Low intrinsic dimensionality ANN recall degrades
Near-duplicate flood Redundant vectors dominate retrieval Low diversity

Status

🔬 Active research project — Paper #1 in preparation. Star the repo to follow progress.

Research

This project is developed as part of research into geometric and topological analysis of embedding spaces. Papers coming soon.

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vechealth-0.1.0.tar.gz (7.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vechealth-0.1.0-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file vechealth-0.1.0.tar.gz.

File metadata

  • Download URL: vechealth-0.1.0.tar.gz
  • Upload date:
  • Size: 7.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for vechealth-0.1.0.tar.gz
Algorithm Hash digest
SHA256 91d84b3455b5972ccb897f9c2af1000f3683a933880c4b78493628ca04221c07
MD5 54bbec14510733159f215c5294d571e9
BLAKE2b-256 f33d8985084b8cfb99d8175508d1257de51c085883d73de923327bd381f6efd6

See more details on using hashes here.

File details

Details for the file vechealth-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: vechealth-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 6.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for vechealth-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5c8792717d1bd71a2e2c37ef8ffaddf1251a6e71ec125cd71e74339df010f230
MD5 fb5ffc1c83c28bb7048208c76a844c0c
BLAKE2b-256 d28067ae70d67f2626c0fd7b9c9b78a45f7176d7773cadece1d7103289dae516

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page