mgnify-extract
This library extracts Darwin Core datasets from MGnify. API documentation is available at https://iobis.github.io/mgnify-extract.
Usage
Find studies:
from mgnifyextract.studies import find_studies
filters = {
    "lineage": "root:Environmental:Aquatic:Marine",
    "search": "Tara"
}
studies = find_studies(filters, max_results=1)
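The filter keys resemble query parameters of the MGnify API. As a rough illustration only, the sketch below builds such a query URL without sending a request. That `find_studies` forwards the dictionary this way is an assumption, `build_studies_url` is a hypothetical helper that is not part of mgnifyextract, and the `studies` path segment is assumed from the API base URL that appears in the download links of this README.

```python
from urllib.parse import urlencode

# Base URL as it appears in the download links of this README; the "studies"
# path segment is an assumption about the API layout.
API_BASE = "https://www.ebi.ac.uk/metagenomics/api/v1"


def build_studies_url(filters):
    """Hypothetical helper: encode a filters dict as a studies query URL."""
    return f"{API_BASE}/studies?{urlencode(filters)}"


filters = {
    "lineage": "root:Environmental:Aquatic:Marine",
    "search": "Tara",
}
url = build_studies_url(filters)
print(url)
```

Note that `urlencode` percent-encodes the colons in the lineage value, so the printed URL contains `%3A` where the dictionary has `:`.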
Fetching samples, runs, analyses, and downloads:
from mgnifyextract.studies import get_study
study = get_study("MGYS00000462")
# <Study https://www.ebi.ac.uk/metagenomics/studies/MGYS00000462>
samples = study.get_samples()
# [<Sample https://www.ebi.ac.uk/metagenomics/samples/ERS667567>, <Sample https://www.ebi.ac.uk/metagenomics/samples/ERS667569>, <Sample https://www.ebi.ac.uk/metagenomics/samples/ERS667570>, ...]
runs = samples[0].get_runs()
# [<Run https://www.ebi.ac.uk/metagenomics/runs/ERR867642>, <Run https://www.ebi.ac.uk/metagenomics/runs/ERR867641>, <Run https://www.ebi.ac.uk/metagenomics/runs/ERR770958>]
analyses = runs[0].get_analyses()
# [<Analysis https://www.ebi.ac.uk/metagenomics/analyses/MGYA00593805>, <Analysis https://www.ebi.ac.uk/metagenomics/analyses/MGYA00135741>]
downloads = analyses[0].get_downloads()
# [
#   <FastaDownload Sequence data https://www.ebi.ac.uk/metagenomics/api/v1/analyses/MGYA00593805/file/ERR867642_MERGED_FASTQ.fasta.gz?format=json>,
#   <FastaDownload Taxonomic analysis SSU rRNA https://www.ebi.ac.uk/metagenomics/api/v1/analyses/MGYA00593805/file/ERR867642_MERGED_FASTQ_SSU.fasta.gz?format=json>,
#   <MseqDownload Taxonomic analysis SSU rRNA https://www.ebi.ac.uk/metagenomics/api/v1/analyses/MGYA00593805/file/ERR867642_MERGED_FASTQ_SSU_MAPSeq.mseq.gz?format=json>,
#   <TsvDownload Taxonomic analysis SSU rRNA https://www.ebi.ac.uk/metagenomics/api/v1/analyses/MGYA00593805/file/ERR867642_MERGED_FASTQ_SSU_OTU.tsv?format=json>,
#   <Hdf5BiomDownload Taxonomic analysis SSU rRNA https://www.ebi.ac.uk/metagenomics/api/v1/analyses/MGYA00593805/file/ERR867642_MERGED_FASTQ_SSU_OTU_TABLE_HDF5.biom?format=json>,
#   <JsonBiomDownload Taxonomic analysis SSU rRNA https://www.ebi.ac.uk/metagenomics/api/v1/analyses/MGYA00593805/file/ERR867642_MERGED_FASTQ_SSU_OTU_TABLE_JSON.biom?format=json>
# ]
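The calls above descend one level at a time and only follow the first item at each level. A minimal sketch of walking the whole hierarchy is given below. It relies only on the `get_samples`, `get_runs`, `get_analyses`, and `get_downloads` methods shown above; `iter_downloads` and `downloads_of_type` are hypothetical helpers, not part of mgnifyextract, and matching on the class name (such as `TsvDownload`) is an assumption based on the printed representations.

```python
def iter_downloads(study):
    """Yield (sample, run, analysis, download) for every download in a study."""
    for sample in study.get_samples():
        for run in sample.get_runs():
            for analysis in run.get_analyses():
                for download in analysis.get_downloads():
                    yield sample, run, analysis, download


def downloads_of_type(study, class_name):
    """Return the downloads whose class name equals class_name."""
    return [
        download
        for _, _, _, download in iter_downloads(study)
        if type(download).__name__ == class_name
    ]


# Usage with the library (needs network access):
# from mgnifyextract.studies import get_study
# tsv_downloads = downloads_of_type(get_study("MGYS00000462"), "TsvDownload")
```

Each `get_*` call presumably triggers requests to the MGnify API, so a full walk over a large study may be slow. The generator form makes it possible to stop early.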
Generating Darwin Core tables:
from mgnifyextract.studies import get_superstudy_studies
from mgnifyextract.dwc import study_to_dwc
study = get_superstudy_studies("atlanteco")[0]
# <Study https://www.ebi.ac.uk/metagenomics/studies/MGYS00005780>
occ, dna = study_to_dwc(study)
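This README does not state what type `occ` and `dna` have. Assuming both are pandas DataFrames (an unconfirmed assumption), they could be saved as tab-separated files along the lines of the sketch below. `write_dwc_tables` and the output file names are hypothetical and not part of mgnifyextract.

```python
from pathlib import Path


def write_dwc_tables(occ, dna, out_dir):
    """Write both tables as tab-separated files and return their paths.

    Assumes occ and dna expose a pandas-style to_csv(path, sep=..., index=...).
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    occ_path = out / "occurrence.txt"      # hypothetical file name
    dna_path = out / "dnaderiveddata.txt"  # hypothetical file name
    occ.to_csv(occ_path, sep="\t", index=False)
    dna.to_csv(dna_path, sep="\t", index=False)
    return occ_path, dna_path


# Usage, continuing from the snippet above:
# occ_path, dna_path = write_dwc_tables(occ, dna, "output")
```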
Run scripts
PYTHONPATH=. python scripts/develop.py
Run module
python -m mgnifyextract
Download files
Source Distribution
mgnifyextract-0.0.2.tar.gz (7.4 kB)
Built Distribution
mgnifyextract-0.0.2-py3-none-any.whl (8.7 kB)
File details
Details for the file mgnifyextract-0.0.2.tar.gz.
File metadata
- Download URL: mgnifyextract-0.0.2.tar.gz
- Upload date:
- Size: 7.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | d35d5011651078f34c6e2c2ccaeca77b7d4f93439685bc83aa9a9a7fa80d862b
MD5 | f92cbc87ed224b51c23131b16fe01e9f
BLAKE2b-256 | cecbe16a7350f67811dfa3d6b053b8569aba58089b9be36c4b7b01316d6ed575
File details
Details for the file mgnifyextract-0.0.2-py3-none-any.whl.
File metadata
- Download URL: mgnifyextract-0.0.2-py3-none-any.whl
- Upload date:
- Size: 8.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 4c3b10b0f3abc0df006d574b2ea8cf602f3ed4704829458bbb1f9e8a8964dcde
MD5 | c2b04566040febab2b43ebff88233dc8
BLAKE2b-256 | 37bea4d6e088ec261c42ac7490a0d1b60f29dfe8dc057fc7099657f54d10b581