Skip to main content

A diagnostic toolkit for evaluating and selecting language-model embedding spaces.

Project description

embedprobe

A diagnostic toolkit for evaluating and selecting language-model embedding spaces.

Status: early development. This is a placeholder release to reserve the name. The first working version is coming soon.

What it will do

embedprobe helps you choose the right embedding model for your downstream task by diagnosing why models succeed or fail on your data and language pairs, going beyond aggregate leaderboard scores. It probes an embedding space across four levels:

  • Signal-to-noise separability — how cleanly true pairs separate from noise.
  • Retrieval performance — Recall@K, MRR, and cumulative-match analysis.
  • Topic-level structure — UMAP projections and topic-cohesion heatmaps.
  • Error categorisation — a taxonomy of retrieval misses (lexical / semantic / topic-boundary).

Install

pip install embedprobe

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

embedprobe-0.0.0.tar.gz (1.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

embedprobe-0.0.0-py3-none-any.whl (1.8 kB view details)

Uploaded Python 3

File details

Details for the file embedprobe-0.0.0.tar.gz.

File metadata

  • Download URL: embedprobe-0.0.0.tar.gz
  • Upload date:
  • Size: 1.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for embedprobe-0.0.0.tar.gz
Algorithm Hash digest
SHA256 bb5dde42c22dbc250f24687fe16ff58930b0eb8a2eb377d619eed4b5867f0484
MD5 a7a2efbf6f4fe01df4ffaf3829b01e0b
BLAKE2b-256 ce3953efac69ab5e959f1fc59c8f386aca032061a0eda2bbd78896e422eeb788

See more details on using hashes here.

File details

Details for the file embedprobe-0.0.0-py3-none-any.whl.

File metadata

  • Download URL: embedprobe-0.0.0-py3-none-any.whl
  • Upload date:
  • Size: 1.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.9

File hashes

Hashes for embedprobe-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d2b9969186bccd405d03c984574df1d10c95da94fc4379f11092355cfcae30c1
MD5 bf7e7b167a7204aa9cb73907a1c21c6f
BLAKE2b-256 bca8ec285ef981c00f82bdcd2c51a2881b3d1e520575f8c0664dfb6f3421ebcd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page