
A structural variant caller for long reads.


## SVIM - Structural variant identification using long reads

SVIM (pronounced SWIM) is a structural variant caller for long reads. It detects and classifies six different classes of structural variants. Unlike existing methods, SVIM integrates information from across the genome to precisely distinguish similar events, such as duplications and cut&paste insertions. In our experiments on simulated and real PacBio data, SVIM achieved consistently better results than competing methods, particularly on low-coverage datasets. Furthermore, it is unique in its ability to extract both the genomic origin and the destination of insertions and duplications.

### Background on Structural Variants and Long Reads

<img src="https://raw.githubusercontent.com/eldariont/svim/master/docs/SVclasses.png" align="right" width="400px"> Structural variants (SVs) are typically defined as genomic variants larger than 50 bp (e.g. deletions, duplications, inversions). Studies have shown that they affect more bases in any given genome than SNPs and small indels taken together. Consequently, they have a large impact on genes and regulatory regions. This is reflected in the large number of genetic diseases that are caused by SVs.

Common sequencing technologies from providers such as Illumina generate short reads with high accuracy. However, they exhibit weaknesses in repetitive and low-complexity regions. This negatively affects SV detection because SVs are associated with such regions. Single-molecule long-read sequencing technologies from Pacific Biosciences and Oxford Nanopore produce reads with error rates of up to 15% but with lengths of several kb. These long reads can span entire repeats and SVs, which facilitates SV detection.

### Input

SVIM analyzes long reads contained in a FASTA file. It was tested on PacBio data only but might work with Nanopore reads as well. Alternatively, SVIM can analyze an alignment file in BAM format. It works best with alignment files produced by [NGM-LR](https://github.com/philres/ngmlr "NGM-LR repository").
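Before handing a FASTA file to a long-read tool, it can help to check that the reads really are in the expected length range. The following is a minimal sketch in plain Python, not part of SVIM; the function name `fasta_read_lengths` and the example records are made up for illustration.

```python
def fasta_read_lengths(lines):
    """Return a dict mapping each read name to its sequence length.

    `lines` is any iterable of FASTA text lines (an open file works).
    The read name is taken as the first whitespace-delimited token
    after '>'. Sequences may span multiple lines.
    """
    lengths = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            parts = line[1:].split()
            name = parts[0] if parts else ""
            lengths[name] = 0
        elif name is not None:
            lengths[name] += len(line)
    return lengths


# Hypothetical example records, for illustration only.
example = [">read1 some description", "ACGTACGT", "ACGT", ">read2", "GGCC"]
lengths = fasta_read_lengths(example)
```

For a real file, `fasta_read_lengths(open("reads.fa"))` would work the same way, where `reads.fa` is a placeholder path.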

### Output

SVIM distinguishes six different SV classes (see the schema above): deletions, inversions, cut&paste insertions, novel insertions, interspersed duplications and tandem duplications. For each of these SV classes, it produces a BED file with the SV coordinates. Additionally, a VCF file is produced containing all SVs found.
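As a rough illustration of working with the per-class BED output, the sketch below reads BED lines and computes the length of each interval. It relies only on the general BED convention (tab-separated chromosome, 0-based start, exclusive end); which additional columns SVIM writes is not stated here, so any further fields are simply kept as-is. The names `BedInterval` and `parse_bed` and the example lines are assumptions made for this example.

```python
from typing import Iterable, Iterator, NamedTuple, Tuple


class BedInterval(NamedTuple):
    chrom: str
    start: int               # 0-based, inclusive (BED convention)
    end: int                 # 0-based, exclusive (BED convention)
    extra: Tuple[str, ...]   # any further columns, kept verbatim


def parse_bed(lines: Iterable[str]) -> Iterator[BedInterval]:
    """Yield one BedInterval per data line, skipping headers and blanks."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith(("#", "track", "browser")):
            continue
        fields = line.split("\t")
        yield BedInterval(fields[0], int(fields[1]), int(fields[2]),
                          tuple(fields[3:]))


def interval_length(interval: BedInterval) -> int:
    """Length in bases of a half-open BED interval."""
    return interval.end - interval.start


# Hypothetical example lines, not real SVIM output.
example = ["chr1\t1000\t1500\tdeletion\n", "chr2\t200\t260\n"]
intervals = list(parse_bed(example))
lengths = [interval_length(iv) for iv in intervals]
```

Passing an open file handle instead of the `example` list processes a BED file on disk in the same way.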

### Installation and Usage

Please see our wiki (https://github.com/eldariont/svim/wiki).

### Contact

If you experience problems or have suggestions, please create an issue or a pull request, or contact heller_d@molgen.mpg.de.

### License

The project is licensed under the GNU General Public License.
