Skip to main content

No project description provided

Project description

Travis-CI Build Status Commits since latest release freedom

abeona v0.45.0

A simple transcriptome assembler based on kallisto and Cortex graphs.

Abeona consists of the following stages:

  1. Assembly of reads into a De Bruijn graph

  2. Pruning of tips and low-coverage unitigs

  3. Partitioning of the De Bruijn graph into subgraphs

  4. Generation of candidate transcripts by simple path traversal

  5. Filtering of candidate transcripts by kallisto

Installation

The easiest way to install abeona is into a conda environment.

After activating the conda environment, run:

conda install abeona -c conda-forge -c bioconda

Usage

The principal command is abeona assemble. This command assembles transcripts from cleaned short-read RNA-seq reads in FASTA or FASTQ format. A description of command arguments is available with the command:

abeona assemble --help

Specifying input read data

Abeona is designed to be run on reads from one biological sample at a time. Abeona uses sequencing reads in two stages: for De Bruijn-graph construction, and for candidate transcript filtering with kallisto. The first stage accepts paired-end, single-end, or both types of reads through the --fastx-* arguments. The reads for the second stage are specified with the --kallisto-fastx-* arguments. Kallisto only accepts single-end or paired-end reads, so input to this stage is also restricted in that manner.

Toy Example

# Let's create a FASTA consisting of sub-reads from two transcripts: AAAAACCC and AAAAAGGG
$ for s in AAAAACC AAAAAGG AAAACCC AAAAGGG; do for i in $(seq 1 3); do echo -e ">_\n$s" >> input.fa; done; done

# Now feed the fasta to the graph assembly step with --fastx-single and to the kallisto filtering
# step with --kallisto-fastx-single.
$ abeona assemble -k 5 -m 4 --fastx-single input.fa --kallisto-fastx-single \
    input.fa --kallisto-fragment-length 7 --kallisto-sd 1 -o test --no-links
N E X T F L O W  ~  version 0.31.1
Launching `assemble.nf` [determined_allen] - revision: 11c20ed355
[bootstrap_samples:100, fastx_forward:null, fastx_reverse:null, fastx_single:/Users/winni/tmp/input.fa, initial_contigs:null, jobs:2, kallisto_fastx_forward:null, kallisto_fastx_reverse:null, kallisto_fastx_single:/Users/winni/tmp/input.fa, kallisto_fragment_length:7.0, kallisto_sd:1.0, kmer_size:5, max_paths_per_subgraph:0, memory:4, merge_candidates_before_kallisto:false, min_tip_length:0, min_unitig_coverage:4, out_dir:test, quiet:false, resume:false, mccortex:mccortex 5, mccortex_args:--sort --force -m 4G]
[warm up] executor > local
[26/119d41] Submitted process > fullCortexGraph
[fc/585605] Submitted process > cleanCortexGraph
[dd/40b5fc] Submitted process > pruneCortexGraphOfTips
[36/f63343] Submitted process > traverseCortexSubgraphs
[23/6d9033] Submitted process > candidateTranscripts (1)
[d5/05d417] Submitted process > buildKallistoIndices (1)
[ac/e36d53] Submitted process > kallistoQuant (1)
[ec/2b258d] Submitted process > filter_transcripts (1)
[49/d4c7e3] Submitted process > concatTranscripts

# View the resulting assembled transcripts
$ zcat test/all_transcripts/transcripts.fa.gz
>g0_p0 prop_bs_est_counts_ge_1=0.98
AAAAAGGG
>g0_p1 prop_bs_est_counts_ge_1=1.0
AAAAACCC

Development

conda env create -f environment.yml my-dev-env
conda activate my-dev-env
make test

License

Abeona is distributed under the terms of the Apache License, Version 2.0.

Citing

If you use abeona in your research, please cite:

Akhter S, Kretzschmar WW, Nordal V, Delhomme N, Street NR, Nilsson O, Emanuelsson O, Sundström JF. Integrative Analysis of Three RNA Sequencing Methods Identifies Mutually Exclusive Exons of MADS-Box Isoforms During Early Bud Development in Picea abies. Front. Plant Sci. 9, 1–18 (2018).

Changelog

Version 0.45.0

Date:

XXX

New features

abeona assemble

  • Mccortex is now used for pruning by default

  • The command line argument --prune-tips-with-mccortex is now deprecated. Instead use --no-prune-tips-with-mccortex.

  • New iterative pruning strategy --prune-tips-iteratively.

Version 0.44.0

Date:

2019-03-26

This version skips commits made for the 0.43.0 tag.

New features

  • Reads that share kmers with subgraphs that are skipped are now reported in the unassembled_reads directory.

Version 0.42.0

Date:

2018-12-17

Interface Changes

  • Cleanup now deletes all directories in output dir except for all_transcripts/transcripts.fa.gz

  • Cleanup is now on by default

  • Cleanup can be turned off with --no-cleanup flag

  • all_transcripts/transcripts.fa.gz is unzipped and stored as transcripts.fa to conform to the convention set by Trinity and Oases for output file names

Version 0.41.0

Date:

2018-12-13

Interface changes

  • Remove --kallisto-fastx-* arguments. Being able to separately specify reads to graph building and kallisto has not been all that useful, and it increases the complexity of the code.

  • Add default value of --kmer-size for --min-tip-length.

Fixes

  • There are several ways in which kallisto can fail due to no reads pseudoaligning to a subgraph’s candidate transcripts. When this happens, abeona now catches the error and silently ignores the subgraph.

Version 0.40.0

Date:

2018-11-17

New features

  • Add --no-links argument to turn off link use in candidate transcript creation

  • Add --max-junctions argument to allow fast skipping of subgraphs with too many junctions

Fixes

  • Properly assign reads to all subgraphs to which they are assignable

  • Solve high-mem use problem by creating links only on assigned reads

Version 0.36.0

Date:

2018-10-25

New features

  • Graph traversal now uses links

Fixes

  • Lots of improvements to abeona reads to improve memory and filehandle use

Version 0.33.0

Date:

2018-10-17

New features

  • Use kmer mapping (abeona reads) to assign reads to subgraphs before quantification of candidate transcripts with kallisto

Fixes

  • Add missing conda dependency seqtk to environment.yml for travis CI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

abeona-0.45.0.tar.gz (23.2 kB view details)

Uploaded Source

Built Distribution

abeona-0.45.0-py3-none-any.whl (35.7 kB view details)

Uploaded Python 3

File details

Details for the file abeona-0.45.0.tar.gz.

File metadata

  • Download URL: abeona-0.45.0.tar.gz
  • Upload date:
  • Size: 23.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.7

File hashes

Hashes for abeona-0.45.0.tar.gz
Algorithm Hash digest
SHA256 bc7512f2eef785b037d836f4cc6faded457ac277f75c6e34eccd12da7c85258f
MD5 be891d0d847cc4f93bc1c7e1581b8ad8
BLAKE2b-256 99988e2d68c9e8560ea22b3791fd150efb9b5bdf139195144d75309c9c8de7e9

See more details on using hashes here.

File details

Details for the file abeona-0.45.0-py3-none-any.whl.

File metadata

  • Download URL: abeona-0.45.0-py3-none-any.whl
  • Upload date:
  • Size: 35.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.7

File hashes

Hashes for abeona-0.45.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9c367c6c067172de3298caa93494dd11b212729130f85c0672b714ebcc0bb4e7
MD5 c278551b0eccc418fba75e3c1720c20b
BLAKE2b-256 efbd435d933436353503f9699788ff702da758988bb1f57a6e5c52465c6ea657

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page