Skip to main content

AMRfior: A toolkit that uses BLAST, BWA, Bowtie2, DIAMOND, and Minimap2 to search DNA and protein sequences against AMR databases (DNA and AA) such as CARD/RGI and ResFinder.

Project description

AMRfíor (pronounced AMR "feer", sounds like beer)

This toolkit utilises a combined approach that uses BLAST, BWA, Bowtie2, DIAMOND, and Minimap2 to search DNA and protein sequences against AMR databases (DNA and AA) such as CARD/RGI and ResFinder.

Requirements:

- python >=3.10
- samtools >=1.19.2
- blast >=2.17.0
- diamond >=2.1.13
- bowtie2 >=2.5.4
- bwa >=0.7.19
- minimap2 >=2.30
- seqtk >=1.4

Menu:

AMRfíor v0.3.0 - The Multi-Tool AMR Gene Detection Toolkit.

options:
  -h, --help            show this help message and exit

Required selection:
  -i INPUT, --input INPUT
                        Input FASTA/FASTAQ file(s) with sequences to analyse -
                        Separate FASTQ R1 and R2 with a comma for Paired-FASTQ
                        or single file path for Single-FASTA - .gz files
                        accepted
  -st {Single-FASTA,Paired-FASTQ}, --sequence-type {Single-FASTA,Paired-FASTQ}
                        Specify the input Sequence Type: Single-FASTA or
                        Paired-FASTQ (R1+R2) - Will convert Paired-FASTQ to
                        single combined FASTA for BLAST and DIAMOND analyses
                        (SLOW)
  -o OUTPUT, --output OUTPUT
                        Output directory for results

Output selection:
  --report_fasta {None,all,detected,detected-all}
                        Specify whether to output sequences that "mapped" to
                        genes."all" should only be used for deep
                        investigation/debugging."detected" will report the
                        reads that passed detection thresholds for each
                        detected gene."detected-all" will report all reads for
                        each detected gene. (default: None)

Tool selection:
  --tools {blastn,blastx,diamond,bowtie2,bwa,minimap2,all} [{blastn,blastx,diamond,bowtie2,bwa,minimap2,all} ...]
                        Specify which tools to run - "all" will run all tools
                        (default: all except blastx/n as it is very slow!!)

Database selection:
  --databases {resfinder,card,ncbi,user-provided} [{resfinder,card,ncbi,user-provided} ...]
                        Specify which AMR gene databases to use (default:
                        resfinder and card) -If "user-provided" is selected,
                        please ensure the path contains the appropriate
                        databases set up as per the documentation and specify
                        the path with --user-db-path.
  --user-db-path USER_DB_PATH
                        Path to the directory containing user-provided
                        databases (required if --databases includes "user-
                        provided")

Query threshold Parameters:
  --q-min-cov QUERY_MIN_COVERAGE, --query-min-coverage QUERY_MIN_COVERAGE
                        Minimum coverage threshold in percent (default: 40.0)

Gene Detection Parameters:
  --d-min-cov DETECTION_MIN_COVERAGE, --detection-min-coverage DETECTION_MIN_COVERAGE
                        Minimum coverage threshold in percent (default: 80.0)
  --d-min-id DETECTION_MIN_IDENTITY, --detection-min-identity DETECTION_MIN_IDENTITY
                        Minimum identity threshold in percent (default: 80.0)
  --d-min-base-cov DETECTION_MIN_BASE_COVERAGE, --detection-min-base-coverage DETECTION_MIN_BASE_COVERAGE
                        Minimum average base coverage (depth) for detection -
                        calculated against regions of the detected gene with
                        at least one read hit (default: 1.0)
  --d-min-reads DETECTION_MIN_NUM_READS, --detection-min-num-reads DETECTION_MIN_NUM_READS
                        Minimum number of reads required for detection
                        (default: 1)

Mode Selection:
  --dna-only            Run only DNA-based tools
  --protein-only        Run only protein-based tools
  --sensitivity {default,conservative,sensitive,very-sensitive}
                        Preset sensitivity levels - default means each tool
                        uses its own default settings and very-sensitive
                        applies DIAMONDs --ultra-sensitive and Bowtie2s
                        --very-sensitive-local presets

Tool-Specific Parameters:
  --minimap2-preset {sr,map-ont,map-pb,map-hifi}
                        Minimap2 preset: sr=short reads, map-ont=Oxford
                        Nanopore, map-pb=PacBio, map-hifi=PacBio HiFi
                        (default: sr)

Runtime Parameters:
  -t THREADS, --threads THREADS
                        Number of threads to use (default: 4)
  --no_cleanup
  -v, --verbose

Examples:
  # Basic usage with default tools (runs DNA & protein tools)
  AMRfior -i reads.fasta -st Single-FASTA -o results/

  # Select specific tools and output detected FASTA sequences
  AMRfior -i reads.fasta -st Single-FASTA -o results/     --tools diamond bowtie2     --report_fasta detected

  # Custom thresholds, paired-fastq input, threads and dna-only mode
  AMRfior -i reads_R1.fastq,reads_R2.fastq -st Paired-FASTQ -o results/     -t 16 --d-min-cov 90 --d-min-id 85     --dna-only

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

amrfior-0.3.0.tar.gz (54.1 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

amrfior-0.3.0-py3-none-any.whl (54.6 MB view details)

Uploaded Python 3

File details

Details for the file amrfior-0.3.0.tar.gz.

File metadata

  • Download URL: amrfior-0.3.0.tar.gz
  • Upload date:
  • Size: 54.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for amrfior-0.3.0.tar.gz
Algorithm Hash digest
SHA256 b15c67741e12a7c208fe34ffcc2ea44a47bffebdd7843c7647de7c1b2e597c76
MD5 37383b44d9e46de77e363fc98c4ebe1a
BLAKE2b-256 cbad437fabfff16ede80e540f06261c2f33b2832f041ba459c2f6ab42c4972c8

See more details on using hashes here.

File details

Details for the file amrfior-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: amrfior-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 54.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for amrfior-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 08c6284793df3eef2420e3fec50f719bc8ccd05b32434d41f04b04cdf385219a
MD5 08d73a761aacb821eb95af514836f261
BLAKE2b-256 0bbeb94f6b8c8b6e8262bff23663f673c36165e42364f681af9a51fced03e479

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page