MGni.py: A Python Wrapper for the MGnify API

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

anglup

These details have not been verified by PyPI

Project links

Project description

MGni.py

MGni.py ('mæɡ-ni-paɪ') is a lightweight python client and toolkit for the MGnify API.

cicd.yml Python 3.11 to 3.13
GitHub issues GitHub license GitHub last commit GitHub stars

mgnipy schematic

Features
Available API Endpoints
Installation
Quick Start
Additional Documentation
Development
License
Citation

Features

FAIR: More findable MGnify analyses and metadata, returned in familiar metagenomics data formats (e.g., GFF, Darwin Core, Dataframes[pandas, polars, anndata])
Simplifies API interactions: Let MGni.Py handle the complexity of building, executing, and parsing API calls so you can focus on the data!
Fast: MGni.Py uses caching to speed up API expolation, as well as supports both sync and async API calls

Available API Endpoints

Studies: MGnify studies are based on ENA studies/projects, and are collections of samples, runs, assemblies, and analyses associated with a certain set of experiments.
Samples: MGnify samples are based on ENA/BioSamples samples, and represent individual biological samples.
Runs: Sequencing runs (ENA run accessions; individual sequencing runs of a sample).
Assemblies: Metagenome assemblies (equivalent to ENA assemblies for one or more runs).
Analyses: MGnify analyses are runs of a standard pipeline on an individual sequencing run or assembly. They can include collections of taxonomic and functional annotations.
Publications: Publications (e.g. journal articles) may describe or analyse the content of MGnify Studies or their corresponding datasets in ENA.
Genomes: MGnify Genomes are annotated draft genomes based on either isolates, or metagenome-assembled genomes (MAGs). They are arranged in biome-specific catalogues.
Biomes: The hierarchical GOLD ecosystem classifications biomes represented in MGnify.

Note: Private Data

To access your private data in any of these API endpoints you just need your MGnify user and password to obtain a valid sliding auth token via the MGnify Authentication endpoints.
for example you can put your login credentials in a .env file in your working directory (see .env.example) and
mgnipy.MGnipyConfig takes care of getting and caching the auth token so that you can easily access your private data using MGni.py 🎉
for example you can put your login credentials in a .env file in your working directory (see .env.example) and
mgnipy.MGnipyConfig takes care of getting and caching the auth token so that you can easily access your private data using MGni.py 🎉

Installation

From PyPI

pip install mgnipy

Development installation

git clone https://github.com/EBI-Metagenomics/mgnipy.git
cd mgnipy
uv sync --all-groups  # or: pip install -e ".[dev,docs]"

Quick Start

🚀 1. Initialize `mgnipy.MGnipy`

from mgnipy import MGnipy

# Create the main client, with default configuration
mg = MGnipy()

# See available endpoints
mg.list_resources()

🔎 2. Search resources with a `mgnipy.MGnifier`

Building the query set

# Search for studies keyword
studies = mg.studies(
    search="disease"
)

# Can preview requests before fetching
studies.explain()

Executing the queries

# get page by page via .get(), getting 3 pages
for _ in range(3)
    studies.get()

# or via .page(), getting another 3 pages
for i in range(4,7):
    studies.page(i)

# OR potentially all at once in large batches (also async option .abulk_fetch())
studies.bulk_fetch()

# then can enrich with detailed metadata
studies.enrich_details()

Viewing the metadata

# as pandas
pd_metadata = studies.to_df()

# As polars DataFrame
pl_metadata = studies.to_polars()
pl_metadata = studies.to_polars()

# as json
json_metadata = studies.to_json()

# with all details
detailed_metadata = studies.details_df()

🗃️ 3. Explore a `mgnipy.MGzine` of datasets

# accessing the mgazine of datasets
mgazine = studies.datasets

# preview
print(mgazine)

Downloading the data

# download file by file 
mgazine.download(to_dir="downloads_folder", alias="mgnify_file_alias.fasta.gz")

# or download all 
mgazine.download_all(to_dir="downloads_folder")

Reading in the data

# support for tsv, csv, txt, jsonl
taxa_table = mgazine.stream(alias="mgnify_file_alias.tsv", df_engine="polars")

# support for fasta, gff, biom via skbio
skbio_fasta = mgazine.stream(alias="mgnify_file_alias.fasta.gz")

Additional Documentation

Development

see Contributing.md

License

TODO

Citation

TODO

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

anglup

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.3

Jun 4, 2026

0.1.2

May 31, 2026

0.1.1

May 19, 2026

0.1.0

May 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mgnipy-0.1.3.tar.gz (38.7 MB view details)

Uploaded Jun 4, 2026 Source

File details

Details for the file mgnipy-0.1.3.tar.gz.

File metadata

Download URL: mgnipy-0.1.3.tar.gz
Upload date: Jun 4, 2026
Size: 38.7 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mgnipy-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`3ae381af7b4b340d6839f548710e3f5d01c49ac35e614e99e19abd7038889202`
MD5	`aaac17bdde816acbff54fdbd2c8f6d55`
BLAKE2b-256	`a0bdf11c2b3b62d6e11cf05bb80c22c348672ee63bae2457554ae613223691e5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mgnipy-0.1.3.tar.gz:

Publisher: cicd.yml on EBI-Metagenomics/mgnipy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mgnipy-0.1.3.tar.gz
- Subject digest: 3ae381af7b4b340d6839f548710e3f5d01c49ac35e614e99e19abd7038889202
- Sigstore transparency entry: 1713930137
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: EBI-Metagenomics/mgnipy@bd23e10602848d9422c372288aeffb1347aa1bf6
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/EBI-Metagenomics
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: cicd.yml@bd23e10602848d9422c372288aeffb1347aa1bf6
- Trigger Event: push

mgnipy 0.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MGni.py

Contents

Features

Available API Endpoints

Note: Private Data

Installation

From PyPI

From PyPI

Development installation

Quick Start

🚀 1. Initialize mgnipy.MGnipy

🔎 2. Search resources with a mgnipy.MGnifier

Building the query set

Executing the queries

Viewing the metadata

🗃️ 3. Explore a mgnipy.MGzine of datasets

Downloading the data

Reading in the data

Additional Documentation

Development

License

Citation

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes

Provenance

🚀 1. Initialize `mgnipy.MGnipy`

🔎 2. Search resources with a `mgnipy.MGnifier`

🗃️ 3. Explore a `mgnipy.MGzine` of datasets