delfies is a tool for the detection of DNA Elimination breakpoints
Project description
delfies
is a tool for the detection of DNA breakpoints with de-novo telomere addition.
It identifies genomic locations where double-strand breaks have occurred followed by telomere addition. It was initially designed and validated for studying the process of Programmed DNA Elimination in nematodes, but should work for other clades and applications too.
Getting started
delfies
takes as input a genome fasta (gzipped supported) and an indexed SAM/BAM of
sequencing reads aligned to the genome.
delfies --help
samtools index <aligned_reads>.bam
delfies <genome>.fa.gz <aligned_reads>.bam <output_dir>
cat <output_dir>/breakpoint_locations.bed
Table of Contents
Installation
Using pip
(or equivalent - poetry, etc.):
# Install latest release from PyPI
pip install delfies
# Or install a specific release from PyPI:
pip install delfies==0.7.0
# Or clone and install tip of main
git clone https://github.com/bricoletc/delfies/
pip install ./delfies
User Manual
CLI options
delfies --help
- Do use the
--threads
option if you have multiple cores/CPUs available. - [Breakpoints]
- There are two types of breakpoints: see detailed docs.
- Nearby breakpoints can be clustered together to account for variability in breakpoint location (
--clustering_threshold
).
- [Region selection]: You can select a specific region to focus on, specified as a string or as a BED file.
- [Telomeres]
- Specify the telomere sequence for your organism using
--telo_forward_seq
. If you're unsure, I recommend the tool telomeric-identifier for finding out.
- Specify the telomere sequence for your organism using
- [Aligned reads]
- To analyse confidently-aligned reads only, you can filter reads by MAPQ (
--min_mapq
) and by bitwise flag (--read_filter_flag
). - You can tolerate more or less mutations in the assembly telomeres (and in the sequencing reads) using
--telo_max_edit_distance
and--telo_array_size
.
- To analyse confidently-aligned reads only, you can filter reads by MAPQ (
Outputs
The two main outputs of delfies
are:
breakpoint_locations.bed
: a BED-formatted file containing the location of identified elimination breakpoints.breakpoint_sequences.fasta
: a FASTA-formatted file containing the sequences of identified elimination breakpoints
I highly recommend visualising your results! E.g., by loading your input
fasta and BAM and output delfies
' output breakpoint_locations.bed
in
IGV.
Applications
- The fasta output enables looking for sequence motifs that occur at breakpoints, e.g. using MEME.
- The BED output enables classifying a genome into retained and eliminated regions. The 'strand' of breakpoints is especially useful for this: see detailed docs.
- The BED output also enables assembling past somatic telomeres: for how to do this, see detailed docs.
Detailed documentation
For more details on delfies
, including outputs and applications, see detailed_docs.
Contributing
Contributions always welcome!
Please see CONTRIBUTING.md for how (reporting issues, requesting features, contributing code).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file delfies-0.7.0.tar.gz
.
File metadata
- Download URL: delfies-0.7.0.tar.gz
- Upload date:
- Size: 13.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bcd27c457e7ce2c34b441829f0243c5f5023ebb28918d77e01a1d63c293c636e |
|
MD5 | 7521076ec1f193008a6531870410aa95 |
|
BLAKE2b-256 | d6733752fe118ef452890a57e19dd4fecfada354aecda661cdde9673e3da07f6 |
Provenance
The following attestation bundles were made for delfies-0.7.0.tar.gz
:
Publisher:
release_pypi.yml
on bricoletc/delfies
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
delfies-0.7.0.tar.gz
- Subject digest:
bcd27c457e7ce2c34b441829f0243c5f5023ebb28918d77e01a1d63c293c636e
- Sigstore transparency entry: 148615475
- Sigstore integration time:
- Predicate type:
File details
Details for the file delfies-0.7.0-py3-none-any.whl
.
File metadata
- Download URL: delfies-0.7.0-py3-none-any.whl
- Upload date:
- Size: 14.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 85ee0a4c83b736604f65d98e8fde9c0b60decdc6afac72fbfffb7ab9774d29a2 |
|
MD5 | 777d78d69aacd361b6a1cfb06e65bbbd |
|
BLAKE2b-256 | 1badd0346292f2ed2a8e2ccbca8d39f707591418cd78a4dd373dfd7fe85c68f0 |
Provenance
The following attestation bundles were made for delfies-0.7.0-py3-none-any.whl
:
Publisher:
release_pypi.yml
on bricoletc/delfies
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
delfies-0.7.0-py3-none-any.whl
- Subject digest:
85ee0a4c83b736604f65d98e8fde9c0b60decdc6afac72fbfffb7ab9774d29a2
- Sigstore transparency entry: 148615480
- Sigstore integration time:
- Predicate type: