Skip to main content

Build mitogenomes from WGS metagenomes

Project description

License: MIT

GitHub last commit (branch)

MitoBee logo

MitoBee

Snakemake workflow to get mitogenomes from metagenomic data

Still under development! Stable release out as a version, but only if there is a closely related mitogenome available.

Documentation: Wiki

Install

Source install Run the below commands:

git clone https://github.com/npbhavya/MitoBee.git
cd MitBee
mamba create -y -n mitobee python=3.13
conda activate mitobee
pip install -e . 

Once I have a stable version release, I will upload them to conda and pip as well

Running the code

Note: This code works only on paired end metagenomes for now.

This workflow is made modular;

  1. mitobee run If there is a representative closely related genome mitogenome, provide that as the host seq and get started
    #Running mitobee with test files available in the repo
    mitobee run --input test-files/metagenomes --extn fastq.gz \
         --pattern_r1 _R1 --pattern_r2 _R2 \
         --host_seq test-files/am-dh4.fasta \
         --output output
  1. mitobee tree Once the mitochondrial genomes are built from each metagenome sample, run this module to build a tree with these mitogenomes and other references
    #After the mitogenomes are made from the mitobee run results. Add other references to build a tree
    #Once again example with test files
    mitobee tree --input test-files/mitogenomes --extn fasta --output output -k all

  1. mitobee search If there is a no closely related mitogenome available, then this step can be run first to search against a set of mitogenomes or mito genes This module will provide an overview of which reference to use
    #If a closely related mitochondrial genome is not available, but a gene is, like cox  or rRNA genes
    #Download the reference genes you would like to use of the closely related genomes

    #to search against mitogenomes refernece set
    mitobee search --input test-files/mitogenomes --extn fastq.gz \
        --pattern_r1 _R1 --pattern_r2 _R2 \
        --ref_seq  test-files/ref-set-genome --output output \
        -k all --mode mitogenome

    #to search against mitogenomes refernece gene set 
    mitobee search --input test-files/mitogenomes --extn fastq.gz \
        --pattern_r1 _R1 --pattern_r2 _R2 \
        --ref_seq  test-files/ref-set-genes --output output \
        -k all --mode genes 

Input files

Input files:

  • Input directory with metagenomes
  • Reference directory
    • If running run or tree module, provide a (one) reference genome.
    • If running gene module, provide a reference gene set

Output files

Output files: Provide the output folder, contains subdirectories

  • PROCESSING: Folder containing intermediate files
  • REPORTS: Final results including the mitogenome fasta files from (hopefully) each metagenome sample
    Also inlcudes the QC reports, to include stats on how many reads were processed, and not

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mitobee-1.0.post0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mitobee-1.0.post0-py3-none-any.whl (7.2 kB view details)

Uploaded Python 3

File details

Details for the file mitobee-1.0.post0.tar.gz.

File metadata

  • Download URL: mitobee-1.0.post0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mitobee-1.0.post0.tar.gz
Algorithm Hash digest
SHA256 4406c2df0ca6a32fcc96ba5b5531b1e513899e4ff2916700d928ecdfbc019e9c
MD5 b718eaac38aaef1d486772c20e726831
BLAKE2b-256 f4c23c91100dc3a90a676d7a063e8d240f174eaf60e9e76061c129dee2eee281

See more details on using hashes here.

File details

Details for the file mitobee-1.0.post0-py3-none-any.whl.

File metadata

  • Download URL: mitobee-1.0.post0-py3-none-any.whl
  • Upload date:
  • Size: 7.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for mitobee-1.0.post0-py3-none-any.whl
Algorithm Hash digest
SHA256 3ca66da416fa290f70cded959e6a4a64a6259bcaa2e31aafc78da132dfe47e5d
MD5 074fe13bf93c1e7faef24a1a7d466ff9
BLAKE2b-256 1c64566c1ddfc3121b675496ef9050fecf18ff2d9ce78616861c7fc5c2d66e3e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page