One Codex API client and Python library

These details have not been verified by PyPI

Project links

Project description

One Codex API - Python Client Library and CLI

Command line interface (CLI) and Python client library for interacting with the One Codex v1 API.

Documentation: API | Python

MAINTAINERS: @clausmith, @boydgreenfield

Installation

This package provides 3 major pieces of functionality: (1) a core Python client library; (2) a simple CLI for interacting with the One Codex platform that uses that core library; and (3) optional extensions to the client library, which offers many features aimed at advanced users and provides functionality for use in interactive notebook environments (e.g., IPython notebooks).

Python 3.10 or later is required. Python 2 is no longer supported.

Basic installation

The CLI (and core Python library) may be simply installed using pip. To download a minimal installation (#1 and #2), simply run:

pip install onecodex

Installation with optional extensions

To also download the optional extensions to the client library, and all of their dependencies, run:

pip install 'onecodex[all]'

Using the CLI

Logging in

The CLI supports authentication using either your One Codex API key or your One Codex username and password. To log in using your username and password:

onecodex login

This command will save a credentials file at ~/.onecodex, which will then automatically be used for authentication the next time the CLI or Python client library are used (OS X/Linux only). You can clear this file and remove your API key from your machine with onecodex logout.

In a shared environment, we recommend directly using your One Codex API key, rather than logging in and storing it in a credentials file. To use API key authentication, simply pass your key as an argument to the onecodex command:

onecodex --api-key=YOUR_API_KEY samples

Your API key can be found on the One Codex settings page and should be 32 character string. You may also generate a new API key on the settings page in the web application. Note: Because your API key provides access to all of the samples and metadata in your account, you should immediately reset your key on the website if it is ever accidentally revealed or saved (e.g., checked into a GitHub repository).

Uploading files

The CLI supports uploading FASTA or FASTQ files (optionally gzip compressed) via the upload command.

onecodex upload bacterial_reads_file.fq.gz

Multiple files can be uploaded in a single command as well:

onecodex upload file1.fq.gz file2.fq.gz ...

You can also upload files using the Python client library:

uploaded_sample1 = ocx.Samples.upload("/path/to/file.fastq")

# Or upload a tuple of paired end files
uploaded_sample2 = ocx.Samples.upload(("/path/to/R1.fastq", "/path/to/R2.fastq"))

Which returns a Samples resource (as of 0.5.0). Samples can be associated with tags, metadata, and projects at upload timing using those respective keyword arguments:

# Note format must match the schema defined for our API, with arbitrary
# metadata allowed as a single-level dictionary in the `custom` field.
# See https://developer.onecodex.com/api-reference/metadata-resource for details.
metadata = {
    "platform": "Illumina NovaSeq 6000",
    "date_collected": "2019-04-14T00:51:54.832048+00:00",
    "external_sample_id": "my-lims-ID-or-similar",
    "custom": {
        "my-string-field": "A most interesting sample...",
        "my-boolean-field": True,
        "my-number-field-1": 1,
        "my-number-field-2": 2.0,
    }
}

Uploads can be made in parallel using Python threads (or multiple processes), e.g.:

import concurrent.futures
uploaded_samples = []

with concurrent.futures.ThreadPoolExecutor(max_workers=4) as executor:
    futures = {executor.submit(ocx.Samples.upload, file) for file in LIST_OF_FILES}
    for future in concurrent.futures.as_completed(futures):
        try:
            uploaded_samples.append(future.result())
        except Exception as e:
            print("An execption occurred during your upload: {}".format(e))

Resources (CLI)

The CLI supports retrieving One Codex samples, analyses and other resources. For a complete list, see the documentation.

Your samples (Samples)
Sample metadata (Metadata)
Analyses, which include several subtypes with additional functionality and fields:
- Classifications, which are basic metagenomic classification results for your samples
- Panels, which are in silico panels for particular genes or other functional markers (example on One Codex)
Jobs, which provide information on the name, version, and type of analysis which was performed for a given Analyses

Simply invoke the onecodex command, using one of the above resource names as a subcommand (all lowercase). For example:

# fetch all your samples
onecodex samples

# fetch a list of panels based on their ids
onecodex panels 0123456789abcdef 0987654321fdecba

Using the Python client library

Initialization

To load the API, use the following import:

from onecodex.api import Api

Instantiate an API client either by passing your API key or automatically fetching your credentials from ~/.onecodex if you've previously called onecodex login.

from onecodex.api import Api

# Instantiate a One Codex API object, will attempt to get credentials from ~/.onecodex
ocx = Api()

# Instantiate an API object, manually specifying an API key
ocx = Api(api_key="YOUR_API_KEY_HERE")

Resources

Resources are exposed as attributes on the API object. You can fetch a resource directly by its ID or you can fetch it using the query interface. Currently you can access resources using either get() or where(). If you need help finding the ID for a sample, its identifier is part of its url on our webpage: e.g. for an analysis at https://app.onecodex.com/analysis/public/1d9491c5c31345b6, the ID is 1d9491c5c31345b6. IDs are all short unique identifiers, consisting of 16 hexadecimal characters (0-9a-f). For a complete list of resource models, see the documentation.

sample_analysis = ocx.Classifications.get("1d9491c5c31345b6")   # Fetch an individual classification
sample_analysis.results()  # Returns classification results as JSON object
sample_analysis.table()    # Returns a pandas dataframe

Custom Workflows

One Codex supports creating, running, and retrieving results from custom bioinformatics workflows, including Nextflow pipelines, from the web app as well as programmatically through the API and client library. See "Your Authored Workflows on One Codex" for more information.

The relevant models are:

ocx.Jobs — a runnable workflow definition (script, image, resource requirements, dependencies).
ocx.Assets — files (e.g. references, databases) that can be attached to a job and made available at run time.
ocx.Analyses — the result of running a job against a sample. Subtypes (Classifications, FunctionalProfiles, Panels, Mlsts, Alignments, Workflows) expose result-specific accessors.

Running a workflow

Workflows are defined by the ocx.Jobs model, and can be run on a sample to generate a new analysis (ocx.Analyses). Jobs can also take arguments, passed in via the job_args keyword argument.

You can launch a workflow against an uploaded sample directly from the Python client. Jobs.run() returns the freshly-created Analyses instance, which you can then poll for completion:

job = ocx.Jobs.get("0123456789abcdef")
# or look up by name:
job = ocx.Jobs.where(name="my-custom-job")[0]
sample = ocx.Samples.get("fedcba9876543210")

analysis = job.run(sample, job_args={"min_quality": 30})
analysis.await_completion()                # block until the job finishes

if analysis.success:
    results = analysis.results()
else:
    print(f"Job failed: {analysis.error_msg}")

await_completion() returns the (refreshed) analysis once it reaches a terminal state — including failure. It does not raise on a failed job; check analysis.success and analysis.error_msg to distinguish success from failure. The only exception it raises is TimeoutError, when a timeout= is set and exceeded.

The CLI exposes the same functionality:

onecodex jobs run <job_id> <sample_id> -a min_quality=30
onecodex analyses await <analysis_id>

# Or block in a single step:
onecodex jobs run <job_id> <sample_id> --arg min_quality=30 --await

# Reuse a prior analysis as a dependency, optionally under a relative path:
onecodex jobs run <job_id> <sample_id> -d <analysis_id>
onecodex jobs run <job_id> <sample_id> -d <analysis_id>=parent_out

Passing arguments

onecodex jobs run <job_id> <sample_id> --args-json '{"min_quality": 30, "trim": true}'

# To read arguments from a file, use shell substitution:
onecodex jobs run <job_id> <sample_id> --args-json "$(cat args.json)"

-a/--arg only supports string values — every key=value is sent to the server as a string. If a job argument expects another type (integer, float, boolean, array, object), use --args-json to pass the full argument set as a JSON object, which preserves types:

--args-json is mutually exclusive with -a/--arg.

Creating and updating jobs

You can create and update custom jobs from the client.

asset = ocx.Assets.upload("reference.fa.gz")
parent = ocx.Jobs.get("0123456789abcdef")

job = ocx.Jobs.create(
    name="my-custom-job",
    script=open("run.sh").read(),
    image_uri="docker.io/library/python:3.12",
    job_type="shell_script",  # or "nextflow"
    cpu=1, ram_gb=1, storage_gb=1,
    assets=[asset],
    dependencies=[{"job": parent, "output_dir": "parent_out"}],
)

job.update(name="renamed", description="now with a description")

The CLI mirrors this:

onecodex jobs create \
    --name my-custom-job \
    --script ./run.sh \
    --image-uri docker.io/library/python:3.12 \
    --cpu 1 --ram-gb 1 --storage-gb 1 \
    --asset-id <asset_id> \
    -d <parent_job_id>=parent_out

onecodex jobs update <job_id> --name renamed

For long-running analyses, await_completion() polls until the analysis reaches a terminal state (complete=True). The cadence backs off over time, so failures surface in seconds while longer jobs poll on the order of minutes:

analysis = ocx.Analyses.get("0123456789abcdef")
analysis.await_completion()                # block indefinitely
analysis.await_completion(timeout=600)     # raise TimeoutError after 10 minutes

For custom workflow runs, .logs() returns the job run logs as a string:

analysis = ocx.Analyses.get("0123456789abcdef")
print(analysis.logs())                     # full log
print(analysis.logs(tail=200))             # last 200 lines

The method refreshes analysis in place and returns it; check analysis.success to see whether it finished cleanly. analysis.refresh() is also available if you just need to re-fetch the current state without blocking.

In addition to methods on individual instances of a given resource (e.g., a Sample or an Analysis), the library also provides methods for aggregating sets of samples or analyses:

all_completed_analyses = ocx.Classifications.where(complete=True)
all_completed_analyses.to_otu()   # Returns a BIOM v1 OTU table as an OrderedDict (JSON-serializable)
all_completed_analyses.to_df()    # Returns a pandas dataframe

Awaiting an analysis

To block until an analysis reaches a terminal state, use the analyses await subcommand. Polling starts at a few seconds and backs off, so failures surface quickly while long-running jobs don't get hammered:

onecodex analyses await 0123456789abcdef
onecodex analyses await 0123456789abcdef --timeout 600

The command exits non-zero if the analysis finishes unsuccessfully or times out.

Dependencies

To re-use the output of a previous run as an input to a new one, pass dependency_overrides:

from onecodex.models.misc import DependencyOverride

prior = ocx.Analyses.get("abcdef0123456789")
analysis = job.run(
    sample,
    job_args={"k": 31},
    dependency_overrides=[DependencyOverride(analysis=prior)],
)

Fetching analysis logs

To view the job run logs for a custom workflow analysis, use analyses logs:

onecodex analyses logs 0123456789abcdef
onecodex analyses logs 0123456789abcdef --tail 200

--tail defaults to the last 1000 lines. Logs are only available for custom workflow runs.

Fetching results (files) from an analysis

analysis = ocx.Analyses.get("0123456789abcdef")

output_files = analysis.get_files()

for file in output_files:
    analysis.download_file(file, progressbar=True)

Upgrading from 0.19.x to 1.0

In 1.0, SampleCollection no longer takes metric, normalize, or rank at construction time. These are now passed directly to .to_df() and the .plot_*() functions instead, so you can switch metrics or ranks without rebuilding the collection. normalize=True has been removed; use the explicit normalized_* metric value instead (e.g. normalized_readcount_w_children). In alpha- and beta-diversity functions, the old metric argument was also renamed to diversity_metric / distance_metric, with metric now referring to the underlying abundance metric.

Major changes

SampleCollection(...) no longer accepts metric, normalize, or rank. Pass these to .to_df() and .plot_*() instead.
normalize=True is gone. Use the matching normalized_* metric (e.g. normalized_readcount_w_children).
In alpha-/beta-diversity functions, metric= was renamed to diversity_metric= / distance_metric=; metric= now refers to the abundance metric.
ClassificationDataframe.ocx has been removed.

Examples

# Before:

phylum = SampleCollection(samples, metric='readcount_w_children', normalize=True, rank='phylum')
phylum.plot_bargraph()
phylum_df = phylum.to_df()

species = SampleCollection(samples, metric='readcount_w_children', normalize=True, rank='species')
species.plot_heatmap()

abundance = SampleCollection(samples, metric='abundance_w_children', rank='genus')
abundance.plot_pca()

In 1.0.x, build the collection once and pick the metric and rank per call. normalize=True becomes the corresponding normalized_* metric, and the diversity-vs-abundance ambiguity is gone. Diversity functions now take diversity_metric or distance_metric alongside an abundance metric:

# After:

samples = SampleCollection(samples)

samples.plot_bargraph(metric='normalized_readcount_w_children', rank='phylum')
samples.plot_bargraph(metric='normalized_readcount_w_children', rank='species')
phylum_df = samples.to_df(metric='normalized_readcount_w_children', rank='phylum')

samples.plot_heatmap(metric='normalized_readcount_w_children', rank='species')
samples.plot_pca(metric='abundance_w_children', rank='genus')

Mapping `normalize` to a metric

Prior to 1.0, metric and normalize were separate arguments. Now, they've been merged into a single metric argument with values corresponding to normalized and un-normalized variants:

0.19.x	1.0.x
`metric='readcount', normalize=True`	`metric='normalized_readcount'`
`metric='readcount_w_children', normalize=True`	`metric='normalized_readcount_w_children'`
`metric='readcount_w_children', normalize=False`	`metric='readcount_w_children'`
`metric='abundance_w_children'`	`metric='abundance_w_children'`

For a full list of supported metrics and their definitions, see the documentation.

Development

Environment Setup

Before developing, git and python version >=3.10 are needed. We recommend using uv for Python version management and dependency installation.

To download the client library from GitHub:

git clone https://github.com/onecodex/onecodex.git
cd onecodex/

To set up the project, install dependencies using uv:

# If you are on a M1 Macbook, run the line below, adjusting the version as needed
export HDF5_DIR=/opt/homebrew/Cellar/hdf5/1.12.1_1/

uv sync --all-extras --dev --locked

To activate the virtual environment:

source .venv/bin/activate

Tests are run via pytest while code formatting and linting is done using ruff:

make lint
make test

We use pre-commit for automated linting using ruff and various whitespace and newline formatters during development.

Writing Unit Tests

We use pytest as our unit testing framework. Tests should be able to run without an internet connection, and One Codex API calls must be mocked. We use responses to mock API responses.

Tip: Any API calls that do not have a matching mock will raise an error. You can figure out which API calls need to be mocked by writing a test, running it, and inspecting the error message to see which route(s) are missing.

Warning: Mocked URLs without a query string will ignore query strings in any matching requests. If the mocked URL includes a query string, it will be used when matching requests.

Fixtures

These pytest fixtures may be helpful when writing unit tests:

ocx: this is a mocked Api object that uses the One Codex v1 API schema.
api_data: this mocks some v1 API data.

Mocking API Data

API data are stored in tests/data/api/:

tests/data/api
└── v1  # the API version
    ├── ...
    ├── analyses
    │   └── index.json  # payload for accessing GET::api/v1/analyses. Will also be used to mock each resource instance, e.g. GET::api/v1/analyses/<uuid>
    ├── classifications
    │   ├── 0f4ee4ecb3a3412f
    │   │   └── results
    │   │       └── index.json  # payload for accessing GET::api/v1/classifications/0f4ee4ecb3a3412f/results
    │   └── index.json  # payload for accessing GET::api/v1/classifications. Instance routes are also auto-mocked
    └── ...

The directory structure mirrors the One Codex API. For example:

The payload for API route api/v1/classifications is stored at tests/data/api/v1/classifications/index.json.
API route api/v1/classifications/0f4ee4ecb3a3412f/results has its payload stored at tests/data/api/v1/classifications/0f4ee4ecb3a3412f/results/index.json.

This idea can be extended to arbitrary nesting/depths within the API.

Note: If the payload is large, you can gzip it and name it index.json.gz.

A resource's instance list payload (e.g. api/v1/analyses gives you a list of analyses) is used to auto-mock each resource instance (e.g. api/v1/analyses/<uuid>). You don't need to create an index.json for each instance.

conftest.py

API data is loaded in tests/conftest.py. If you need to mock API calls in a way that's not supported by this framework, you can add custom mocked calls in conftest.py.

Things that are not supported by mocking in tests/data/api/:

Non-GET requests (e.g. DELETE)
Query parameters

Jupyter Notebook Custom Exporters

We also package custom Jupyter notebook nbconvert exporters. These can be tested with the following snippets and the provided example.ipynb report.

Our OneCodexHTMLExporter:

ONE_CODEX_REPORT_FILENAME=example.html jupyter nbconvert --execute --to onecodex_html --ExecutePreprocessor.timeout=-1 --output="$ONE_CODEX_REPORT_FILENAME" --output-dir="." notebook_examples/example.ipynb && open example.html

And using the OneCodexPDFExporter:

ONE_CODEX_REPORT_FILENAME=example.pdf jupyter nbconvert --execute --to onecodex_pdf --ExecutePreprocessor.timeout=-1 --output="$ONE_CODEX_REPORT_FILENAME" --output-dir="." notebook_examples/example.ipynb && open example.pdf

Note that OneCodexPDFExporter requires the vl-convert-python package to be installed.

Docker

Docker images are built against the master branch automatically, and available at:

ghcr.io/onecodex/onecodex:latest

For a complete list of version-tagged images, see Packages

The image contains only the base + all dependencies (equivalent of pip install onecodex[all])

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.1.0

Jun 18, 2026

1.0.2

May 12, 2026

1.0.1

Mar 5, 2026

1.0.0

Feb 9, 2026

0.19.5

Feb 6, 2026

0.19.4

Jan 21, 2026

0.19.3

Jan 9, 2026

0.19.2

Jan 8, 2026

0.19.1

Oct 14, 2025

0.19.0

Oct 6, 2025

0.18.0

Apr 18, 2025

0.17.0

Dec 3, 2024

0.16.0

Aug 19, 2024

0.15.1

Jan 12, 2024

0.15.0

Jan 11, 2024

0.14.0

Jan 11, 2024

0.13.0

Sep 25, 2023

0.12.0

Sep 5, 2023

0.11.0

Nov 29, 2022

0.10.0

Apr 5, 2022

0.9.6

Nov 15, 2021

0.9.5

Mar 16, 2021

0.9.4

Oct 27, 2020

0.9.3

Aug 7, 2020

0.9.2

Jul 23, 2020

0.9.1

Jul 10, 2020

0.9.0

Jul 10, 2020

0.8.2

Jun 11, 2020

0.8.1

Jun 8, 2020

0.8.0

Jun 3, 2020

0.7.2

Oct 15, 2019

0.7.1

Oct 7, 2019

0.7.0

Oct 4, 2019

0.6.5

Aug 29, 2019

0.6.4

Aug 29, 2019

0.6.3

Aug 12, 2019

0.6.2

Jul 26, 2019

0.6.1

Jul 26, 2019

0.6.0

Jul 19, 2019

0.5.6

Jul 2, 2019

0.5.5

Jul 1, 2019

0.5.4

Jun 6, 2019

0.5.3

May 1, 2019

0.5.2

Apr 30, 2019

0.5.1

Apr 24, 2019

0.5.0

Apr 19, 2019

0.4.5

Apr 2, 2019

0.4.4

Mar 25, 2019

0.4.3

Mar 25, 2019

0.4.2

Mar 18, 2019

0.4.1

Mar 8, 2019

0.4.0

Feb 21, 2019

0.3.1

Jan 9, 2019

0.3.0

Jan 2, 2019

0.2.14

Nov 17, 2018

0.2.13

Sep 24, 2018

0.2.12

May 16, 2018

0.2.11

Mar 2, 2018

0.2.10

Jan 9, 2018

0.2.9

Oct 27, 2017

0.2.8

Sep 14, 2017

0.2.7

Sep 5, 2017

0.2.6

Aug 25, 2017

0.2.5

Aug 21, 2017

0.2.4

Aug 21, 2017

0.2.3

Jan 21, 2017

0.2.2

Jan 21, 2017

0.2.1

Jan 17, 2017

0.2.0

Jan 6, 2017

0.2.0a1 pre-release

Nov 7, 2016

0.2.0a0 pre-release

Nov 4, 2016

0.1.4

Sep 15, 2016

0.1.3

Aug 23, 2016

0.1.2

Oct 21, 2015

0.1.1

Sep 9, 2015

0.1.0

Aug 6, 2015

0.0.10

Jul 22, 2015

0.0.9

Jul 15, 2015

0.0.8

May 19, 2015

0.0.7

Feb 13, 2015

0.0.6

Oct 30, 2014

0.0.5

Oct 29, 2014

0.0.4

Oct 29, 2014

0.0.3

Oct 28, 2014

0.0.2

Oct 21, 2014

0.0.1

Oct 20, 2014

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

onecodex-1.1.0.tar.gz (4.0 MB view details)

Uploaded Jun 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

onecodex-1.1.0-py3-none-any.whl (3.0 MB view details)

Uploaded Jun 18, 2026 Python 3

File details

Details for the file onecodex-1.1.0.tar.gz.

File metadata

Download URL: onecodex-1.1.0.tar.gz
Upload date: Jun 18, 2026
Size: 4.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for onecodex-1.1.0.tar.gz
Algorithm	Hash digest
SHA256	`5fd3dbe34fa59318db3828196b667b2e14ff7de4bd658b44a162d8b9c35ab7ac`
MD5	`917e21636450fba6a7c7fc8eb66e9ee3`
BLAKE2b-256	`c99931de5c66654487eb58faed15a137e4d4a90ed2d73d89dea0697b22a2d07a`

See more details on using hashes here.

File details

Details for the file onecodex-1.1.0-py3-none-any.whl.

File metadata

Download URL: onecodex-1.1.0-py3-none-any.whl
Upload date: Jun 18, 2026
Size: 3.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.21 {"installer":{"name":"uv","version":"0.11.21","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for onecodex-1.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b98824493812990923759aea41c2cb7b4443f11a47b9144ef32adc1b65e8cab9`
MD5	`9be0e7135cfbd26b2b9e0cd9ca568f55`
BLAKE2b-256	`1f4478dad8dc1dd99d224c30e7b5cad719661c8570598703985a0aa3a20a8eb0`

See more details on using hashes here.

onecodex 1.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

One Codex API - Python Client Library and CLI

Installation

Basic installation

Installation with optional extensions

Using the CLI

Logging in

Uploading files

Resources (CLI)

Using the Python client library

Initialization

Resources

Custom Workflows

Running a workflow

Passing arguments

Creating and updating jobs

Awaiting an analysis

Dependencies

Fetching analysis logs

Fetching results (files) from an analysis

Upgrading from 0.19.x to 1.0

Major changes

Examples

Mapping normalize to a metric

Development

Environment Setup

Writing Unit Tests

Fixtures

Mocking API Data

conftest.py

Jupyter Notebook Custom Exporters

Docker

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Mapping `normalize` to a metric