
Embedding of whole slide images with Foundation Models


slide2vec


slide2vec is a Python package for efficient encoding of whole-slide images using publicly available foundation models. It builds on hs2p for fast preprocessing and exposes a focused surface around Model, Pipeline, and ExecutionOptions.

Documentation site: https://clemsgrs.github.io/slide2vec/

Installation

pip install slide2vec
pip install "slide2vec[fm]"

slide2vec keeps the base install minimal. Use the slide2vec[fm] extra when you want the foundation-model dependencies that are hosted on PyPI.

Some model backends depend on upstream Git repositories, which PyPI does not allow as dependencies of a published package. Install those separately when needed:

pip install git+https://github.com/lilab-stanford/MUSK.git
pip install git+https://github.com/Mahmoodlab/CONCH.git
pip install git+https://github.com/prov-gigapath/prov-gigapath.git

AtlasPatch-backed tissue segmentation is available through hs2p's sam2 path in the bundled install.

Python API

from slide2vec import Model
from slide2vec.utils.config import hf_login

# Authenticate with Hugging Face to access gated model weights.
hf_login()

model = Model.from_preset("virchow2")
embedded = model.embed_slide("/path/to/slide.svs")

tile_embeddings = embedded.tile_embeddings  # one feature vector per tile
x = embedded.x  # tile x-coordinates
y = embedded.y  # tile y-coordinates
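Once you have tile embeddings and coordinates, a common downstream step is to pool them into a single slide-level vector. The sketch below uses plain mean pooling over a random stand-in array; it is generic post-processing, not a slide2vec API, and the shapes are illustrative:

```python
import numpy as np

# Stand-in for embedded.tile_embeddings: 12 tiles, 8-dim features.
tile_embeddings = np.random.rand(12, 8).astype(np.float32)

# Mean pooling collapses the per-tile features into one slide-level vector.
slide_vector = tile_embeddings.mean(axis=0)

print(slide_vector.shape)  # (8,)
```

Mean pooling is only the simplest aggregator; attention-based slide encoders (such as the slide-level presets below) learn this aggregation instead.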

Use list_models() when you want to inspect the shipped presets programmatically:

from slide2vec import list_models

all_models = list_models()
tile_models = list_models("tile")
slide_models = list_models("slide")
patient_models = list_models("patient")

Use Pipeline(...) for manifest-driven batch processing when you want artifacts written to disk instead of only in-memory outputs:

from slide2vec import ExecutionOptions, Pipeline, PreprocessingConfig

pipeline = Pipeline(
    model=model,
    preprocessing=PreprocessingConfig(
        requested_spacing_um=0.5,
        requested_tile_size_px=224,
        tissue_threshold=0.1,
    ),
    execution=ExecutionOptions(output_dir="outputs/demo"),
)
result = pipeline.run(manifest_path="/path/to/slides.csv")

By default, ExecutionOptions() uses all available GPUs. Set ExecutionOptions(num_gpus=4) when you want to cap the sharding explicitly.

Hierarchical Feature Extraction

Tile embeddings can be spatially grouped into regions for downstream models that consume region-level structure. Enable it by setting region_tile_multiple on PreprocessingConfig:

preprocessing = PreprocessingConfig(
    requested_spacing_um=0.5,
    requested_tile_size_px=224,
    region_tile_multiple=6,  # 6x6 tiles per region
)
embedded = model.embed_slide("/path/to/slide.svs", preprocessing=preprocessing)

Hierarchical outputs have shape (num_regions, tiles_per_region, feature_dim) and are written to hierarchical_embeddings/ when persisted.
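To make the (num_regions, tiles_per_region, feature_dim) shape concrete, here is a generic sketch of how a regular tile grid folds into non-overlapping regions. This illustrates the output layout only; it is not slide2vec's internal implementation, and the grid sizes are placeholders:

```python
import numpy as np

# Toy tile grid: 12x12 tiles with 4-dim features (placeholder values).
grid_h, grid_w, feature_dim = 12, 12, 4
region_tile_multiple = 6  # 6x6 tiles per region, as in the config above
tiles = np.random.rand(grid_h, grid_w, feature_dim)

# Cut the grid into non-overlapping 6x6 blocks, then flatten each block
# into a row of tiles_per_region = 36 tile vectors.
m = region_tile_multiple
regions = (
    tiles.reshape(grid_h // m, m, grid_w // m, m, feature_dim)
    .transpose(0, 2, 1, 3, 4)
    .reshape(-1, m * m, feature_dim)
)

print(regions.shape)  # (4, 36, 4): (num_regions, tiles_per_region, feature_dim)
```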

See docs/python-api.md for details.

Input Manifest

Manifest-driven runs use the schema below. mask_path and spacing_at_level_0 are optional.

sample_id,image_path,mask_path,spacing_at_level_0
slide-1,/path/to/slide-1.svs,/path/to/mask-1.png,0.25
slide-2,/path/to/slide-2.svs,,
...

Use spacing_at_level_0 when the slide file reports a missing or incorrect level-0 spacing and you want to override it.
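Because the manifest is a plain CSV, it is easy to generate programmatically. The snippet below writes a minimal manifest matching the schema above using only the standard library; the paths are placeholders:

```python
import csv

# Rows matching the manifest schema; mask_path and spacing_at_level_0
# are optional and may be left empty.
rows = [
    {"sample_id": "slide-1", "image_path": "/path/to/slide-1.svs",
     "mask_path": "/path/to/mask-1.png", "spacing_at_level_0": "0.25"},
    {"sample_id": "slide-2", "image_path": "/path/to/slide-2.svs",
     "mask_path": "", "spacing_at_level_0": ""},
]

with open("slides.csv", "w", newline="") as f:
    writer = csv.DictWriter(
        f, fieldnames=["sample_id", "image_path", "mask_path", "spacing_at_level_0"]
    )
    writer.writeheader()
    writer.writerows(rows)
```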

Outputs

The package writes explicit artifact directories:

  • tile_embeddings/<sample_id>.pt or .npz
  • tile_embeddings/<sample_id>.meta.json
  • hierarchical_embeddings/<sample_id>.pt or .npz (when region_tile_multiple is set)
  • hierarchical_embeddings/<sample_id>.meta.json
  • slide_embeddings/<sample_id>.pt or .npz
  • slide_embeddings/<sample_id>.meta.json
  • optional slide_latents/<sample_id>.pt or .npz

.pt remains the default format. .npz is available through ExecutionOptions(output_format="npz").
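.npz artifacts load with NumPy alone, no torch required. The round-trip below shows the mechanics; the key names and shapes inside the archive are illustrative assumptions here, not a documented slide2vec schema (check the matching .meta.json for the real layout):

```python
import numpy as np

# Simulate an .npz artifact; key names are illustrative placeholders.
np.savez(
    "sample.npz",
    tile_embeddings=np.zeros((100, 768), dtype=np.float32),
    x=np.arange(100),
    y=np.arange(100),
)

# np.load on an .npz returns a lazy archive; use it as a context manager.
with np.load("sample.npz") as data:
    print(sorted(data.files))             # ['tile_embeddings', 'x', 'y']
    print(data["tile_embeddings"].shape)  # (100, 768)
```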

Supported Models

slide2vec currently ships preset configs for 17 tile-level models and 3 slide-level models.
For the full catalog and preset names, see docs/models.md.

CLI

The CLI is a thin wrapper over the package API.
Bundled configs live under slide2vec/configs/preprocessing/ and slide2vec/configs/models/.

slide2vec /path/to/config.yaml

By default, manifest-driven CLI runs use all available GPUs. Set speed.num_gpus=4 when you want to cap the sharding explicitly.
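As a rough orientation, a config might combine preprocessing fields with the speed.num_gpus cap mentioned above. This is a hypothetical sketch only: the key names mirror the Python API fields shown earlier, and the authoritative schema is in the bundled configs under slide2vec/configs/:

```yaml
# Hypothetical sketch; see slide2vec/configs/ for the real schema.
preprocessing:
  requested_spacing_um: 0.5
  requested_tile_size_px: 224
speed:
  num_gpus: 4
```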

New to the CLI or doing batch runs to disk? Start with docs/cli.md for the config-driven workflow, overrides, and common run patterns.

Docker


Docker remains available when you prefer a containerized runtime:

docker pull waticlems/slide2vec:latest
docker run --rm -it \
    -v /path/to/your/data:/data \
    -e HF_TOKEN=<your-huggingface-api-token> \
    waticlems/slide2vec:latest
