Opinionated, report-first CLI for single-cell and multi-omics analysis. Sane defaults baked in, defensible deliverables out.

These details have not been verified by PyPI

Project links

Project description

scellrun

Your LLM agent uses this. scellrun is a CLI an agent (Claude Code, Hermes, Codex, …) drives end-to-end on a researcher's behalf — convert cellranger output, QC, integrate, cluster, annotate, render an HTML report — and quotes the decision log back when the researcher asks "why mt% 20?" or "why res 0.5?". The deliverable is a defensible analysis the user can read in five minutes, not a pipeline they have to learn.

The agent-driven story is the canonical one. See docs/agent-demo.md for a verbatim transcript on real OA cartilage scRNA data. The agent's operational guide is skills/scellrun/SKILL.md.

You can also drive scellrun yourself if you prefer:

# 1. one-time: a clean Python environment so scellrun's deps don't collide with anything else on your machine
conda create -n scellrun python=3.11 -y
conda activate scellrun

# 2. install
pip install scellrun

# 3. run the full pipeline in one shot — qc → integrate → markers → annotate → report
scellrun analyze data.h5ad --tissue "OA cartilage"
# → opens with a single index.html link you can drop into a browser

Don't have a .h5ad? Cellranger output works directly:

scellrun scrna convert path/to/cellranger_outs -o data.h5ad
scellrun analyze data.h5ad --tissue "OA cartilage"

Want a Chinese report? Add --lang zh. Want a clinician walkthrough? See docs/quickstart.md. Want to add a profile or a stage? See docs/contributing.md.

Status: v1.0.0. The CLI surface is frozen for the v1.x series; new stages and profiles land additively.

Who this is for

A clinician or rotating student looking at scRNA-seq data for the first time
A postdoc on a deadline who doesn't want to write QC boilerplate
A bioinformatics core that needs every project to look the same in a report
An LLM coding agent that should reach for a real tool instead of re-deriving thresholds

If you already have your own pipeline, you don't need scellrun. If you don't, or you're tired of re-litigating the same decisions every project, this is for you.

Why this exists

Tools like scanpy, pyDESeq2, decoupler, SCENIC and nf-core are excellent and intentionally unopinionated. Every new project re-litigates the same decisions:

Which mt% threshold? Doublet filter before or after batch correction?
The composite score correlates with disease — Spearman ρ on each of 18 clinical features, or one well-chosen dichotomy?
What does the QC report actually need to look like for a reviewer to stop asking?

scellrun answers these once, in code, by encoding the working practice of a clinician + bioinformatics team. Defaults aren't "neutral" — they're a real position, made for real reasons, with the rationale rendered into every report.

You can override anything. But if you don't override, you get a defensible analysis on day one.

Where this fits

Tool	What it is
Anthropic scientific skills	Prompt scaffolds that teach an LLM how to call `scanpy` etc.
Bioconda recipes	Packaging
`nf-core` pipelines	General-purpose, infrastructure-heavy, vendor-neutral workflows
`scellrun`	An opinionated, report-first CLI optimized for low barrier to entry

What v1.0 ships

scellrun analyze <h5ad> — one-shot pipeline (qc → integrate → markers → annotate → report). Writes a deterministic decision log (00_decisions.jsonl) and a top-level 05_report/index.html with an At-a-glance block + per-stage decision tables.
scellrun scrna {qc,integrate,markers,annotate} — per-stage commands for deep customization.
scellrun scrna convert <cellranger_dir> -o data.h5ad — convert 10x cellranger / Seurat-mtx output.
Self-check + --auto-fix — each stage flags actionable issues (low QC pass-rate, panel-tissue mismatch, fragmented clustering) and the orchestrator can apply the cheapest fix once.
Profiles: default and joint-disease (Fan 2024 chondrocyte 11-subtype + 15-group celltype_broad panel).
Optional --ai: Anthropic-API LLM second opinion on annotation + resolution recommendation.
Distribution: PyPI wheel, Dockerfile, skills/scellrun/SKILL.md for agent harnesses.

See ROADMAP.md for the post-v1.0 plan (conda-forge feedstock, registry-pushed Docker image, bulk RNA-seq, metabolomics composite scoring, proteomics integration).

Install

For users — clean conda env (recommended)

conda create -n scellrun python=3.11 -y
conda activate scellrun
pip install scellrun

If you don't have conda, miniconda takes about 2 minutes to install. Or use uv for a faster venv:

uv venv .scellrun-env && source .scellrun-env/bin/activate
uv pip install scellrun

Either way, scellrun ends up in its own environment — your other Python projects (an old scanpy, Seurat-via-rpy2, etc.) won't be touched.

For contributors — editable from source

git clone https://github.com/drstrangerujn/scellrun.git
cd scellrun
conda create -n scellrun-dev python=3.11 -y && conda activate scellrun-dev
pip install -e ".[dev]"
pytest -q          # full suite should be green

Profiles

Different tissue domains have different defaults. v1.0 ships two profiles:

default — fresh-tissue 10x v3 chemistry, joint-tissue-aware mt% ceiling
joint-disease — tighter hb% for avascular cartilage; ships the Fan 2024 chondrocyte 11-subtype panel + 15-group broad celltype panel used by scrna annotate

scellrun profiles list
scellrun profiles show joint-disease   # prints thresholds + panels

Adding a profile = one Python file under src/scellrun/profiles/. If your community has working practice, contribute a profile.

For LLM agents

skills/scellrun/SKILL.md is a portable instruction document that teaches Claude Code, Hermes, Codex, or any markdown-aware agent harness when and how to invoke scellrun. Install it by symlinking into your agent's skills directory — see skills/README.md for one-liner install commands per agent.

License

MIT — see LICENSE.

Acknowledgements

Defaults reflect the working practice of a clinician + bioinformatics team that has shipped these analyses for the OARSI / MSK community. The R "AIO" pipeline that prefigured scellrun's design is documented in ROADMAP.md.

Built with assistance from Claude (Anthropic).

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.3.0

May 2, 2026

1.2.0

May 1, 2026

1.1.2

May 1, 2026

1.1.1

May 1, 2026

1.1.0

Apr 30, 2026

1.0.2

Apr 30, 2026

1.0.1

Apr 30, 2026

This version

1.0.0

Apr 30, 2026

0.9.1

Apr 30, 2026

0.9.0

Apr 30, 2026

0.8.0

Apr 30, 2026

0.7.0

Apr 30, 2026

0.6.0

Apr 30, 2026

0.5.0

Apr 30, 2026

0.4.0

Apr 30, 2026

0.3.0

Apr 30, 2026

0.2.0

Apr 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scellrun-1.0.0.tar.gz (133.9 kB view details)

Uploaded Apr 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

scellrun-1.0.0-py3-none-any.whl (96.2 kB view details)

Uploaded Apr 30, 2026 Python 3

File details

Details for the file scellrun-1.0.0.tar.gz.

File metadata

Download URL: scellrun-1.0.0.tar.gz
Upload date: Apr 30, 2026
Size: 133.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for scellrun-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`d32eb14b8f8b160fe5e11832ae1dc09ed8f69870603cdedc260a8efff4b14a93`
MD5	`1d183d1754409d145bfc558cb64df1f5`
BLAKE2b-256	`52d3d4c7ca1cfb58679da6b530fade08507267710416132b7f11188f909ea19d`

See more details on using hashes here.

File details

Details for the file scellrun-1.0.0-py3-none-any.whl.

File metadata

Download URL: scellrun-1.0.0-py3-none-any.whl
Upload date: Apr 30, 2026
Size: 96.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for scellrun-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`442d877b593414269b613e98731a6412151f03a54b8b8564260fb48cd168f574`
MD5	`a4fbdb8892381b4151f2ff3bfa317a3d`
BLAKE2b-256	`e20841a93f3eda731dcb59ac97c4c16b416ea529a23b31657e126608a71c08bc`

See more details on using hashes here.

scellrun 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

scellrun

Who this is for

Why this exists

Where this fits

What v1.0 ships

Install

For users — clean conda env (recommended)

For contributors — editable from source

Profiles

For LLM agents

License

Acknowledgements

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes