Perform k-mer abundance analysis in DNA sequences
Project description
eme_selex
eme_selex (Every Motif Ever for SELEX Analysis) is a Python package to perform k-mer abundance analysis in DNA sequences. eme_selex is developed to perform fast and efficient analysis of short k-mers (tested with k-mers up to length 10).
While eme_selex can be used for general purpose k-mer analysis, motivation to develop eme_selex is to perform Systemic Evolution of Ligands by EXponential enrichment coupled with High Throughput sequencing (HT-SELEX) analysis in a Pythonic way. By default, for every k-mer, eme_selex quantifies the fraction of reads containing that k-mer in a non-redundant manner. After the quantification, a basic position frequency matrix (PFM) for the top 50 k-mers is generated. If the user wants to generate more PFMs, they can change the top keyword argument to a desired number.
Installation
pip install eme_selex
Tutorial for HT-SELEX analysis
Jupyter notebooks detailing the usage of eme_selex and extensive analysis for HT-SELEX are hosted here https://eme_selex.readthedocs.io
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for eme_selex-0.3-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 004be513f4385271b1db057c8f6e315321a9544f4bd8e6c7f1ae6dc612d3e2fe |
|
MD5 | 9302e02baff71f6bd64fb5ff69ebc231 |
|
BLAKE2b-256 | 15680f4ccc4c085a2c525ec5f5fb0062b4c8a6d00e46d1cad35c4a13777fc826 |