Skip to main content

IRIS: Detection and Validation Of Chimeric Reads

Project description

IRIS: Detection and Validation Of Chimeric Reads.

PyPI version GitHub Downloads License

Introduction

IRIS is a method designed to detect and validate chimeric junction from multi-genome alignments. The method constructs a DP alignment matrix from two separate alignments to infer precise breakpoint. The two-pass algorithm is implemented to refine consistency of breakpoint inference. The method is designed to take advantage of anntoations of either or, ideally, both genomes involved in the chimeric event by penalizing and prioritizing events at known junctions.

Publications

Coming soon...

Documentation

Installation

Via PyPI

The easiest way to install IRIS is through PyPI:

$ pip install iris-av
$ iris --help

To uninstall SNAPPER:

$ pip uninstall iris-av

Building from source

To build from source, clone the git repository:

$ git clone https://github.com/alevar/iris.git --recursive
$ cd iris
$ pip install -r requirements.txt
$ pip install .

Requirements

Requirement Details
Language support Python ≥ 3.6
Dependencies -

Getting started

IRIS expects BLAST alignments to be provided in the following format:

blastn \
  -db blast_database \
  -query query_fasta \
  -out output.blastn6 \
  -outfmt "6 qseqid qlen sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore"

Usage

iris [-h] -i1 INPUT1 -i2 INPUT2 -a1 ANNOTATION1 -a2 ANNOTATION2 -o OUTPUT [--two_pass] [-g] [--chim-genome]
                   [-g1 GENOME1] [-g2 GENOME2] [-max_dist MAX_DIST] [-max_weight MAX_WEIGHT]
                   [-full_weight FULL_WEIGHT] [-half_weight HALF_WEIGHT] [--overhang OVERHANG]

Options

Option Description
-i1, --input1 Path to the file containing BLAST mapping of reads to genome 1. Alignment is expected to have the following format: -outfmt "6 qseqid qlen sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore"
-i2, --input2 Path to the file containing BLAST mapping of reads to genome 2. Alignment is expected to have the following format: -outfmt "6 qseqid qlen sseqid pident length mismatch gapopen qstart qend sstart send evalue bitscore"
-a1, --annotation1 Path to the file containing GTF/GFF annotation for genome 1.
-a2, --annotation2 Path to the file containing GTF/GFF annotation for genome 2.
-g1, --genome1 Path to the file containing genome 1 FASTA sequence.
-g2, --genome2 Path to the file containing genome 2 FASTA sequence.
--two-pass Flag enables the 2-pass mode. Breakpoints from the first pass will be used to bias DP trace towards consensus sites.
--group If enabled, will output a file with breakpoints groupped by position.
-o, --output Path to the output file.
--chim-genome (Requires -group). If enabled, will generate a fasta file with chimeric genome sequences, stitching together the two genomes at the breakpoints.
--max-dist Maximum distance between breakpoints of the two segments. Default: 5.
---max-weight Maximum weight of a breakpoint when biasing the 2nd pass. Default: 5.
--full-weight Weight of a breakpoint that matches donor and acceptor. Default: 5.
--half-weight Weight of a breakpoint that matches either donor or acceptor. Default: 3.
--overhang Number of bases to include in the chimeric genome overhang. Default: 1000.

Help Options

Option Description
-h, --help Prints help message.

Example Data

Sample datasets are provided in the "example" directory to test and get familiar with SNAPPER.

The included example can be run with the following command from the root directory of the repository:

iris --input1 ./examples/AY69_E4p5_LTA/host.blastn.6 --input2 ./examples/AY69_E4p5_LTA/path.blastn.6 --annotation1 ./examples/AY69_E4p5_LTA/host.gtf --annotation2 ./examples/csess.1.0.0.known.gtf --output ./examples/AY69_E4p5_LTA/ris --genome1 ./examples/AY69_E4p5_LTA/host.fa --genome2 ./examples/SIV239.fa --chim-genome --two-pass --group

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iris_av-1.0.0.tar.gz (44.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

iris_av-1.0.0-py3-none-any.whl (45.9 kB view details)

Uploaded Python 3

File details

Details for the file iris_av-1.0.0.tar.gz.

File metadata

  • Download URL: iris_av-1.0.0.tar.gz
  • Upload date:
  • Size: 44.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for iris_av-1.0.0.tar.gz
Algorithm Hash digest
SHA256 71765f0c73a913b9ba08710e859ec0c87c5b351f9c7c48a2a3bec265cea3fbec
MD5 62a8d563089c71e640d06df9ed70d011
BLAKE2b-256 452e03b91518138df4704d0294d6e57eeed0fc4ad81af71502ee23396de57ac6

See more details on using hashes here.

File details

Details for the file iris_av-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: iris_av-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 45.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for iris_av-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e969a6547478f5551a5f233402bda38db1d81cd65932ebf805d89e111cb9edf7
MD5 7ae5bded990b94711a462bfe65088bcd
BLAKE2b-256 a10a2737aa6c7bdca1cbaeb4b9776caf8e3daaf3351383677c000cd2cc7949ea

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page