Skip to main content

Extract and analyze satellite DNA from raw sequences.

Project description

extracTR

Introduction

extracTR is a tool for extracting and analyzing tandem repeats from raw DNA sequences. It utilizes advanced algorithms to identify repetitive patterns in genomic data, providing valuable insights for researchers in genomics and bioinformatics.

Features

  • Efficient tandem repeat detection from raw sequencing data
  • Support for paired-end FASTQ files
  • Customizable parameters for fine-tuning repeat detection
  • Output in easy-to-analyze CSV format
  • Multi-threaded processing for improved performance

Requirements

  • Python 3.7 or later
  • Jellyfish 2.3.0 or later

To install Jellyfish:

conda install -c bioconda jellyfish

Installation

Install extracTR using pip:

pip install extracTR

Usage

Before running extracTR, ensure that you have removed adapters from your sequencing reads. This is crucial to prevent contamination of predicted repeats with adapter sequences.

Basic usage:

extracTR -1 reads_1.fastq -2 reads_2.fastq -o output_prefix

Advanced usage with custom parameters:

extracTR -1 reads_1.fastq -2 reads_2.fastq -o output_prefix -t 64 -c 30 -k 25

Options:

  • -1, --fastq1: Input file with forward DNA sequences in FASTQ format
  • -2, --fastq2: Input file with reverse DNA sequences in FASTQ format
  • -o, --output: Prefix for output files
  • -t, --threads: Number of threads to use (default: 32)
  • -c, --coverage: Coverage to use for indexing (default: 1)
  • -k, --k: K-mer size to use for indexing (default: 23)

Output

extracTR generates the following output files:

  • {output_prefix}.csv: Main output file containing detected tandem repeats
  • {output_prefix}.sdat: Intermediate file with k-mer frequency data
  • Additional files for detailed analysis and debugging

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extracTR-0.1.0.tar.gz (11.7 kB view details)

Uploaded Source

File details

Details for the file extracTR-0.1.0.tar.gz.

File metadata

  • Download URL: extracTR-0.1.0.tar.gz
  • Upload date:
  • Size: 11.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.19

File hashes

Hashes for extracTR-0.1.0.tar.gz
Algorithm Hash digest
SHA256 f15664c936ccc0d8d9c3130147e8cdac7c30a945757b95b2910f27fecee239a6
MD5 c1c09623e44fadd3b955430929aeef67
BLAKE2b-256 5a179b5d0de724ee28b611c5af48557a87cf5396bb732b482407653ce38194ef

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page