Skip to main content

Count read alignments on transposable elements subfamilies, families and classes.

Project description

GitHub Workflow Status PyPI install with bioconda

TEcount

A package to count reads mapping on transposable elements (TEs) subfamilies, families and classes in bulk RNA-seq.

Features

TEcount counts high-throughput sequencing reads aligned on TEs on a reference genome. It reports TE counts by subfamily, family and class on separate outputs. In case of reads aligning on multiple TE loci, it counts only one alignment occurrence for each feature (i.e. subfamily, family or class). It can use single or paired end BAM files, count in strand-specific manner and filter by a minimum read-TE overlap, among other features that can be discovered by running TEcount --help.

Install

Using conda (recommended)

We recommend using conda, as it will install all the required packages along TEcount.

conda create -n tecount -c conda-forge -c bioconda tecount

Using pip

If for any reason it's not possible or desiderable to use conda, it can be installed with pip and the following requirements must be installed manually: python>=3.7, samtools>=1.14 and bedtools>=2.30.0.

pip install tecount

Usage

Requires a BAM file sorted by coordinates and a bed6+3 file with subfamily, family and class in columns 7, 8 and 9.

TEcount -b <sorted_by_coord.bam> -r <rmsk.bed>

Example using test dataset:

TEcount -b ./test/Aligned.sortedByCoord.out.bam  -r ./test/rmsk.GRCm38.chr19.bed.gz

See all available parameters with:

TEcount --help

Acknowledgements

Based on the transposable elements RNA-seq quantification strategy implemented in 1.

References

1. Marasca, F., Sinha, S., Vadalà, R. et al. LINE1 are spliced in non-canonical transcript variants to regulate T cell quiescence and exhaustion. Nat Genet 54, 180–193 (2022). https://doi.org/10.1038/s41588-021-00989-7

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tecount-1.0.1.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

tecount-1.0.1-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file tecount-1.0.1.tar.gz.

File metadata

  • Download URL: tecount-1.0.1.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for tecount-1.0.1.tar.gz
Algorithm Hash digest
SHA256 828fe6d5bc0e85bc12890d2447bfa70c6c25f91cd06bbeb7aa44809ad62e1ea5
MD5 aa795c9dc21afa1285bbe0cb1fc2647e
BLAKE2b-256 674f4129d53b4218c3180964314b04678229634974d070185af747940a0c4927

See more details on using hashes here.

File details

Details for the file tecount-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: tecount-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 7.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for tecount-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c0ed843344b58e7f7cbae3352317f84be7f743d0a8ee4e399757cd241fafbca7
MD5 f37957bb31cc7d1b7eb02e25b6ae748d
BLAKE2b-256 7a51a054efc139b66fa87aef22b49567b73866d79648356907f243358b2c1aa5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page