A package dedicated to diPWM motif search in a text
Package diPWM_search
Description
A package dedicated to diPWM search in a text. It provides several approaches for finding motifs (diPWM) in a text (for example, a sequence):
- sliding window (search_semi_naive)
- enumeration of valid words, then Aho-Corasick search of that set of words in the text (search_aho)
- super alphabet search (search_super)
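The sliding-window approach can be illustrated with a minimal, self-contained sketch. It does not use the package itself: the dictionary-based diPWM representation, the default score of -1.0 for unlisted dinucleotides, and the names `window_score` and `sliding_window_search` are assumptions made for illustration only. A diPWM for a motif of length m is modelled here as m - 1 columns, each giving a score to the dinucleotide found at two adjacent positions.

```python
from typing import Dict, Iterator, List, Tuple

# Hypothetical toy representation (not the package's internal format):
# one dict per pair of adjacent positions, mapping a dinucleotide to its score.
DiPWM = List[Dict[str, float]]

# Assumed score for any dinucleotide missing from a column.
DEFAULT_SCORE = -1.0


def window_score(dipwm: DiPWM, word: str) -> float:
    """Sum the scores of the overlapping dinucleotides of `word`."""
    return sum(
        column.get(word[i:i + 2], DEFAULT_SCORE)
        for i, column in enumerate(dipwm)
    )


def sliding_window_search(
    dipwm: DiPWM, text: str, threshold: float
) -> Iterator[Tuple[int, str, float]]:
    """Yield (start_position, word, score) for every window scoring >= threshold."""
    motif_length = len(dipwm) + 1  # m - 1 columns describe a motif of length m
    for start in range(len(text) - motif_length + 1):
        word = text[start:start + motif_length]
        score = window_score(dipwm, word)
        if score >= threshold:
            yield start, word, score


# Toy diPWM for a motif of length 3 (two dinucleotide columns).
toy_dipwm = [
    {"AC": 2.0, "AA": 1.0},
    {"CG": 3.0, "AG": 0.5},
]
hits = list(sliding_window_search(toy_dipwm, "TACGAAG", threshold=1.0))
for start_position, word, score in hits:
    print(f"{start_position}\t{word}\t{score}")
```

Every window is rescored from scratch, so the cost grows with the text length times the motif length. The other two approaches listed above aim to avoid part of that repeated work.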
Install
Local installation using git
- clone the git repository
git clone git@gite.lirmm.fr:rivals/dipwmsearch.git
- go to the root of the folder
cd dipwmsearch
- use the Makefile to install
make install
Installation using pip
....
Getting started
- To import the package
import dipwm_search as ds
- To parse a diPWM file and create a diPWM object
diP = ds.create_diPwm(diPwm_path_file)
- To use the enumeration and Aho-Corasick search
for start_position, word, score in ds.search_aho_ratio(diP, text, ratio):
    print(f'{start_position}\t{word}\t{score}')
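To give an idea of what the enumeration step involves, here is a minimal sketch that lists every word whose score reaches a threshold, pruning prefixes that can no longer reach it. It is not the package's implementation: the toy matrix, the helper names (`make_column`, `enumerate_valid_words`, `search_valid_words`) and the use of an absolute threshold instead of a ratio are all assumptions. The final lookup uses a plain dictionary as a stand-in for an Aho-Corasick automaton, which the package builds with pyahocorasick.

```python
from itertools import product
from typing import Dict, Iterator, List, Tuple

ALPHABET = "ACGT"
DiPWM = List[Dict[str, float]]  # hypothetical toy representation


def make_column(favoured: Dict[str, float]) -> Dict[str, float]:
    """Build a full 16-entry column; unlisted dinucleotides get -1.0 (assumed)."""
    column = {a + b: -1.0 for a, b in product(ALPHABET, repeat=2)}
    column.update(favoured)
    return column


def enumerate_valid_words(
    dipwm: DiPWM, threshold: float
) -> Iterator[Tuple[str, float]]:
    """Yield (word, score) for every word scoring >= threshold."""
    n = len(dipwm)
    # best_rest[i]: upper bound on the score obtainable from columns i..n-1.
    best_rest = [0.0] * (n + 1)
    for i in range(n - 1, -1, -1):
        best_rest[i] = best_rest[i + 1] + max(dipwm[i].values())

    def extend(prefix: str, score: float) -> Iterator[Tuple[str, float]]:
        i = len(prefix) - 1  # index of the next column to apply
        if i == n:
            yield prefix, score
            return
        for letter in ALPHABET:
            new_score = score + dipwm[i][prefix[-1] + letter]
            # Prune: even the best possible suffix cannot reach the threshold.
            if new_score + best_rest[i + 1] >= threshold:
                yield from extend(prefix + letter, new_score)

    for first_letter in ALPHABET:
        yield from extend(first_letter, 0.0)


def search_valid_words(
    dipwm: DiPWM, text: str, threshold: float
) -> Iterator[Tuple[int, str, float]]:
    """Dictionary lookup per window; a stand-in for the Aho-Corasick search."""
    valid = dict(enumerate_valid_words(dipwm, threshold))
    m = len(dipwm) + 1
    for start in range(len(text) - m + 1):
        word = text[start:start + m]
        if word in valid:
            yield start, word, valid[word]


toy_dipwm = [
    make_column({"AC": 2.0, "AA": 1.0}),
    make_column({"CG": 3.0, "AG": 0.5}),
]
valid_words = dict(enumerate_valid_words(toy_dipwm, threshold=1.5))
occurrences = list(search_valid_words(toy_dipwm, "TACGAAG", threshold=1.5))
```

The number of valid words can grow quickly as the threshold is lowered, so this approach suits fairly selective thresholds.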
Documentation
More info in the documentation
License
License type: CeCILL-B More info
Authors
- Marie Mille (main contributor)
- Bastien Cazaux
- Julie Ripoll
- Eric Rivals
Dependencies
Required for installation
pyahocorasick
For tests
pytest
pandas