Skip to main content

Tasmanian tool to analyze mismatches at read and position in high throughput sequencing data

Project description

Bioconda License: AGPL v3 Language

Image of Tasmanian Devil

Tasmanian

A tool for the analysis of reference mismatches in high throughput sequencing data from DNA samples. Unlike other tools, it is able to evalutate the portions of reads that overlap with specified regions (e.g. Repeats)

Install

conda: conda install -c bioconda tasmanian-mismatch
pip: pip install tasmanian-mismatch

Goals

The main goal is to identify systematic missmatches that might confound SNPs or other variations that should or should not be associated to biological outcomes. Since we noticed a set of regions, which might not necessarily be missplaced in the reference genome, have dramatic effects in this analysis, we provide a way of spliting these reads and incorporate the information in different tables, so that intersecting/non intersecting reads are not filtered out. Also, the researcher has a more accurate picture of the influence of these regions in the observed artifacts.

Overview of Tasmanian use:

samtools view bam | run_intersections [OPTIONS] | run_tasmanian [OPTIONS]
  1. Classification of each base of the read into overlapping (in which case could be contained or boundary - see figure below) or Non-overlapping with regions of interest included in a bed/bedgraph file.
  2. Positional analysis of artifacts splitted by read 1 and read 2.
---

The output includes tables to manupulate and plot the data and a built in report for fast access the data (see figure below).

  • Easy to use command-line and nextflow implementation.
  • Includes a Galaxy wrapper

Contributing

Contributions are welcome and encouraged.

Running Tests

To run the test suite for Tasmanian, follow these steps:

  1. Create a virtual environment:
python3.10 -m venv tasmanian_venv
source tasmanian_venv/bin/activate
  1. Install dependencies:
pip install -r requirements.txt
  1. Run the tests:
bash tasmanian/tests/basic_tests.sh

Authors

License

tasmanian artifact metrics tool is open source software released under the GNU License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tasmanian_mismatch-1.1.1.tar.gz (110.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tasmanian_mismatch-1.1.1-py3-none-any.whl (124.7 kB view details)

Uploaded Python 3

File details

Details for the file tasmanian_mismatch-1.1.1.tar.gz.

File metadata

  • Download URL: tasmanian_mismatch-1.1.1.tar.gz
  • Upload date:
  • Size: 110.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for tasmanian_mismatch-1.1.1.tar.gz
Algorithm Hash digest
SHA256 8ab95e0fb0d893cd453afdd6b7ff8c596c1e48f4e5aec60c1f62cee7a4d22eec
MD5 224ab29b019c83bbb0bb11025cc30aea
BLAKE2b-256 1b9708f38ee2092488032170c18fbc6e073d2c371cd0a89258b92e8eba7f9920

See more details on using hashes here.

File details

Details for the file tasmanian_mismatch-1.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for tasmanian_mismatch-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 05b236628c82e0bb1af6499f8fc0d7c12875a9dc03698c5e88e58b406b9a11ad
MD5 6e9a69a65e38171f350b36da765b88b8
BLAKE2b-256 e900a8d901fb0ff5a2991c0670ff6b427afd7d15a83d0bc36741c752c9436ace

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page