Evaluation metrics for single-cell perturbation predictions

Project description

cell-eval

Description

This package provides a comprehensive suite of metrics for evaluating the performance of models that predict cellular responses to perturbations at the single-cell level. It can be used either as a command-line tool or as a Python module.

Installation

Distribution with uv

# install from pypi
uv pip install -U cell-eval

# install from github directly
uv pip install -U git+https://github.com/arcinstitute/cell-eval

# install cli with uv tool
uv tool install -U git+https://github.com/arcinstitute/cell-eval

# Check installation
cell-eval --help

Usage

To get started you'll need to have two anndata files.

a predicted anndata (adata_pred).
a real anndata to compare against (adata_real).

Prep (VCC)

To prepare an anndata for VCC evaluation you can use the cell-eval prep command. This will strip the anndata to bare essentials, compress it, adjust naming conventions, and ensure compatibility with the evaluation framework.

This step is optional for downstream usage, but recommended for optimal performance and compatibility.

Run this on your predicted anndata:

cell-eval prep \
    -i <your/path/to>.h5ad \
    -g <expected_genelist>

Run

To run an evaluation between two anndatas you can use the cell-eval run command.

This will run differential expression for each anndata and then run a suite of evaluation metrics to compare the two (select your suite of metrics with the --profile flag).

To save time you can submit precomputed differential expression results, see the cell-eval run --help menu for more information.

cell-eval run \
    -ap <your/path/to/pred>.h5ad \
    -ar <your/path/to/real>.h5ad \
    --num-threads 64 \
    --profile full

To run this as a python module you will need to use the MetricsEvaluator class.

from cell_eval import MetricsEvaluator
from cell_eval.data import build_random_anndata, downsample_cells

adata_real = build_random_anndata()
adata_pred = downsample_cells(adata_real, fraction=0.5)
evaluator = MetricsEvaluator(
    adata_pred=adata_pred,
    adata_real=adata_real,
    control_pert="control",
    pert_col="perturbation",
    num_threads=64,
)
(results, agg_results) = evaluator.compute()

This will give you metric evaluations for each perturbation individually (results) and aggregated results over all perturbations (agg_results).

Data ceiling

To estimate the maximum achievable score on each metric given the noise inherent in the real data, pass --ceiling. This is computed from the real data only: per perturbation (and the control), its cells are bootstrapped to twice their count and split into two equal halves; one half plays "real" and the other "prediction", and the full metric suite is run on that self-split. The result is, per metric, an upper bound on how well any model could score on this dataset.

cell-eval run \
    -ap <your/path/to/pred>.h5ad \
    -ar <your/path/to/real>.h5ad \
    --num-threads 64 \
    --profile full \
    --ceiling

This is additive: it writes the normal results.csv / agg_results.csv and ceiling_results.csv / agg_ceiling_results.csv. The bootstrap is reproducible via --ceiling-seed (default 0). From python, call compute_ceiling on the evaluator:

ceiling, ceiling_agg = evaluator.compute_ceiling(seed=0)

Score

To normalize your scores against a baseline you can run the cell-eval score command.

This accepts two agg_results.csv (or agg_results objects in python) as input.

cell-eval score \
    --user-input <your/path/to/user>/agg_results.csv \
    --base-input <your/path/to/base>/agg_results.csv

Or from python:

from cell_eval import score_agg_metrics

user_input = "./cell-eval-user/agg_results.csv"
base_input = "./cell-eval-base/agg_results.csv"
output_path = "./score.csv"

score_agg_metrics(
    results_user=user_input,
    results_base=base_input,
    output=output_path,
)

Library Design

The metrics are built using the python registry pattern. This allows for easy extension for new metrics with a well-typed interface.

Take a look at existing metrics in cell_eval.metrics to get started.

Development

This work is open-source and welcomes contributions. Feel free to submit a pull request or open an issue.

Citation

Any publication that uses this source code should cite the State paper.

Project details

Release history Release notifications | RSS feed

0.8.1

Jun 29, 2026

0.8.0

Jun 29, 2026

This version

0.7.4

Jun 29, 2026

0.7.2

May 15, 2026

0.7.1

May 4, 2026

0.7.0

Mar 24, 2026

0.6.8

Jan 16, 2026

0.6.7

Jan 15, 2026

0.6.6

Nov 11, 2025

0.6.5

Nov 8, 2025

0.6.4

Nov 3, 2025

0.6.3

Oct 27, 2025

0.6.2

Oct 16, 2025

0.6.1

Oct 8, 2025

0.6.0

Oct 1, 2025

0.5.45

Sep 24, 2025

0.5.43

Aug 6, 2025

0.5.42

Jul 28, 2025

0.5.41

Jul 8, 2025

0.5.40

Jun 30, 2025

0.5.39

Jun 27, 2025

0.5.38

Jun 25, 2025

0.5.36

Jun 23, 2025

0.5.34

Jun 23, 2025

0.5.33

Jun 18, 2025

0.1.0

May 22, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cell_eval-0.7.4.tar.gz (38.9 kB view details)

Uploaded Jun 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cell_eval-0.7.4-py3-none-any.whl (41.1 kB view details)

Uploaded Jun 29, 2026 Python 3

File details

Details for the file cell_eval-0.7.4.tar.gz.

File metadata

Download URL: cell_eval-0.7.4.tar.gz
Upload date: Jun 29, 2026
Size: 38.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.25 {"installer":{"name":"uv","version":"0.11.25","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for cell_eval-0.7.4.tar.gz
Algorithm	Hash digest
SHA256	`e45178395101d23c99c8ea7e56a32cae85b1ab52af7c4404393aa30cce2e61f6`
MD5	`221e9e645f1547a95214f03103addaf4`
BLAKE2b-256	`4208444b7a0d7f8d2092844ab7b5c533605917a8bd9004d97a87b1fe218eac4c`

See more details on using hashes here.

File details

Details for the file cell_eval-0.7.4-py3-none-any.whl.

File metadata

Download URL: cell_eval-0.7.4-py3-none-any.whl
Upload date: Jun 29, 2026
Size: 41.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.25 {"installer":{"name":"uv","version":"0.11.25","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for cell_eval-0.7.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2ef10f6fb20875a3af8627dae9a13abe49d9328593a667cb260fa23578df50b0`
MD5	`2f1dbc9c7d11cfaac5afe6c7f0bd7e96`
BLAKE2b-256	`b18e6b48dc92db64dd9f56cbaa3779d322b908335611b5f6e5a9b2e80b6166c1`

See more details on using hashes here.

cell-eval 0.7.4

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

cell-eval

Description

Installation

Usage

Prep (VCC)

Run

Data ceiling

Score

Library Design

Development

Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes