Skip to main content

MirMachine

Project description

MirMachine

Logo

Build Status Documentation Status

Project Status: Active – The project has reached a stable, usable state and is being actively developed. PyPI version Anaconda-Server Badge Anaconda-Server Badge
Anaconda-Server Badge Docker Pulls
A command line tool to detect miRNA homologs in genome sequences.

Installation

To install this package with conda run:

conda install mirmachine -c bioconda -c conda-forge

Please add conda-forge as a channel. Installing via mamba is also strongly recommended for a faster installation. You can install mamba and later MirMachine like this:

conda install mamba -c conda-forge
mamba install mirmachine -c bioconda -c conda-forge

Check if the installation works by calling the main script.

MirMachine.py --help

Note: You have to install dependencies if you prefer GitHub or PyPi installation.

Quick start example

Create a new directory and run MirMachine there after the installation. MirMachine will create the required directories while running.

MirMachine.py -n Caenorhabditis -s Caenorhabditis_elegans --genome sample/genomes/ce11.fa --cpu 20 --model proto

See our documentation for detailed explanations: https://mirmachine.readthedocs.io/

Options and Arguments

Usage:
    MirMachine.py --node <text> --species <text> --genome <text> [--model <text>] [--evalue <float>] [--cpu <integer>] [--add-all-nodes|--single-node-only] [--unlock|--remove] [--touch] [--dry] [--long]
    MirMachine.py --species <text> --genome <text> --family <text> [--model <text>] [--evalue <float>] [--cpu <integer>] [--unlock|--remove] [--touch] [--dry] [--long]
    MirMachine.py --node <text> [--add-all-nodes]
    MirMachine.py --print-all-nodes
    MirMachine.py --print-all-families
    MirMachine.py --print-ascii-tree
    MirMachine.py (-h | --help)
    MirMachine.py --version

Arguments:
    -n <text>, --node <text>              Node name. (e.g. Caenorhabditis)
    -s <text>, --species <text>           Species name. (e.g. Caenorhabditis_elegans)
    -g <text>, --genome <text>            Genome fasta file location (e.g. data/genome/example.fasta)
    -m <text>, --model <text>             Model type: deutero, proto, combined [default: combined]
    -f <text>, --family <text>            Run only a single microRNA family (e.g. Let-7).
    -e <text>, --evalue <float>           Inclusion E-value. May inflate low quality hits. [default: 0.2] Default 5 if --long is used.
    -c <integer>, --cpu <integer>         CPUs. [default: 2]

Options:
    -a, --add-all-nodes                 Move on the tree both ways. NOT required most of the time.
    -o, --single-node-only              Run only on the given node for microRNA families.
    --long                              Use long microRNA covariance models rather than standard models (Experimental).
    -p, --print-all-nodes               Print all available node options and exit.
    -l, --print-all-families            Print all available families in this version and exit.
    -t, --print-ascii-tree              Print ascii tree of the tree file.
    -u, --unlock                        Rescue stalled jobs (Try this if the previous job ended prematurely).
    -r, --remove                        Clear all output files (this won't remove input files).
    -d, --dry                           Dry run.
    -h, --help                          Show this screen.
    --touch                             Touch output files (mark them up to date without really changing them).
    --version                           Show version.

Output

The MirMachine main executable will generate GFF annotations (filtered and unfiltered) and some other files. You will see results/predictions/ directory which contains:

gff/ All predicted microRNA families.
filtered_gff/ High confidence microRNA family predictions after bitscore filtering. (This file is what you need in most cases)
fasta/ Both high and low confidence predictions in FASTA format.

MirMachine's other repos

Supplementary files repo: https://github.com/sinanugur/MirMachine-supplementary

Citation

Our Cell Genomics paper is here: https://doi.org/10.1016/j.xgen.2023.100348 Please cite if you find our tool useful.

MirMachine website: https://mirmachine.org

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mirmachine-0.3.0.4.tar.gz (251.2 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mirmachine-0.3.0.4-py3-none-any.whl (263.2 MB view details)

Uploaded Python 3

File details

Details for the file mirmachine-0.3.0.4.tar.gz.

File metadata

  • Download URL: mirmachine-0.3.0.4.tar.gz
  • Upload date:
  • Size: 251.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.7

File hashes

Hashes for mirmachine-0.3.0.4.tar.gz
Algorithm Hash digest
SHA256 7cb6d7da7c31a06e7720b73d9b9bf97e8edf2e4c6675efb4edc1ecedfe87dbe4
MD5 d0e3f4cb9e6c910d04c5af3fee4acdf3
BLAKE2b-256 860af3a494ee12559ed62f33e5d0b591bf6d2a97e0736cb8f1add05d4f8f4f1f

See more details on using hashes here.

File details

Details for the file mirmachine-0.3.0.4-py3-none-any.whl.

File metadata

  • Download URL: mirmachine-0.3.0.4-py3-none-any.whl
  • Upload date:
  • Size: 263.2 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.7

File hashes

Hashes for mirmachine-0.3.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 543a682198f5ecefc9c8f330fc3c8fe490bf41212b36f64ad77634f6ae7db232
MD5 72793d6a063ef405f1921b72a938c2fb
BLAKE2b-256 1d9fedcf96e6c20d2aac4ad6110fffb1ebe4fc78971b71a9315214ddaf46cd05

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page