Skip to main content

delfies is a tool for the detection of DNA Elimination breakpoints

Project description

PyPI codecov License: MIT JOSS paper status

delfies is a tool for the detection of DNA breakpoints with de-novo telomere addition.

It identifies genomic locations where double-strand breaks have occurred followed by telomere addition. It was initially designed and validated for studying the process of Programmed DNA Elimination in nematodes, but should work for other clades and applications too.

Getting started

delfies takes as input a genome fasta (gzipped supported) and an indexed SAM/BAM of sequencing reads aligned to the genome.

delfies --help
samtools index <aligned_reads>.bam
delfies <genome>.fa.gz <aligned_reads>.bam <output_dir>
cat <output_dir>/breakpoint_locations.bed

Table of Contents

Installation

Using pip (or equivalent - poetry, etc.):

# Install latest release from PyPI
pip install delfies

# Or install a specific release from PyPI:
pip install delfies==0.7.0

# Or clone and install tip of main
git clone https://github.com/bricoletc/delfies/
pip install ./delfies

User Manual

CLI options

delfies --help
  • Do use the --threads option if you have multiple cores/CPUs available.
  • [Breakpoints]
    • There are two types of breakpoints: see detailed docs.
    • Nearby breakpoints can be clustered together to account for variability in breakpoint location (--clustering_threshold).
  • [Region selection]: You can select a specific region to focus on, specified as a string or as a BED file.
  • [Telomeres]
    • Specify the telomere sequence for your organism using --telo_forward_seq. If you're unsure, I recommend the tool telomeric-identifier for finding out.
  • [Aligned reads]
    • To analyse confidently-aligned reads only, you can filter reads by MAPQ (--min_mapq) and by bitwise flag (--read_filter_flag).
    • You can tolerate more or less mutations in the assembly telomeres (and in the sequencing reads) using --telo_max_edit_distance and --telo_array_size.

Outputs

The two main outputs of delfies are:

  • breakpoint_locations.bed: a BED-formatted file containing the location of identified elimination breakpoints.
  • breakpoint_sequences.fasta: a FASTA-formatted file containing the sequences of identified elimination breakpoints

I highly recommend visualising your results! E.g., by loading your input fasta and BAM and output delfies' output breakpoint_locations.bed in IGV.

Applications

  • The fasta output enables looking for sequence motifs that occur at breakpoints, e.g. using MEME.
  • The BED output enables classifying a genome into retained and eliminated regions. The 'strand' of breakpoints is especially useful for this: see detailed docs.
  • The BED output also enables assembling past somatic telomeres: for how to do this, see detailed docs.

Detailed documentation

For more details on delfies, including outputs and applications, see detailed_docs.

Contributing

Contributions always welcome!

Please see CONTRIBUTING.md for how (reporting issues, requesting features, contributing code).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

delfies-0.7.0.tar.gz (13.6 kB view details)

Uploaded Source

Built Distribution

delfies-0.7.0-py3-none-any.whl (14.9 kB view details)

Uploaded Python 3

File details

Details for the file delfies-0.7.0.tar.gz.

File metadata

  • Download URL: delfies-0.7.0.tar.gz
  • Upload date:
  • Size: 13.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for delfies-0.7.0.tar.gz
Algorithm Hash digest
SHA256 bcd27c457e7ce2c34b441829f0243c5f5023ebb28918d77e01a1d63c293c636e
MD5 7521076ec1f193008a6531870410aa95
BLAKE2b-256 d6733752fe118ef452890a57e19dd4fecfada354aecda661cdde9673e3da07f6

See more details on using hashes here.

Provenance

The following attestation bundles were made for delfies-0.7.0.tar.gz:

Publisher: release_pypi.yml on bricoletc/delfies

Attestations:

File details

Details for the file delfies-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: delfies-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 14.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for delfies-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85ee0a4c83b736604f65d98e8fde9c0b60decdc6afac72fbfffb7ab9774d29a2
MD5 777d78d69aacd361b6a1cfb06e65bbbd
BLAKE2b-256 1badd0346292f2ed2a8e2ccbca8d39f707591418cd78a4dd373dfd7fe85c68f0

See more details on using hashes here.

Provenance

The following attestation bundles were made for delfies-0.7.0-py3-none-any.whl:

Publisher: release_pypi.yml on bricoletc/delfies

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page