A package dedicated to diPWM motif search in a text
Package dipwmsearch
Description
A package dedicated to searching for diPWM motifs in a text. It provides several approaches to searching for motifs (diPWM) in a text (for example, a sequence):
- sliding window (search_semi_naive)
- enumeration of valid words, then Aho-Corasick search of that set of words in the text (search_aho)
- super-alphabet search (search_super)
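To make the sliding-window idea concrete, here is a minimal sketch in plain Python. It does not use the package: the matrix representation (one dictionary per position, mapping a dinucleotide to a score), the function name sliding_window_search, and the toy values are all assumptions made for illustration, and the actual search_semi_naive may work differently.

```python
from typing import Dict, Iterator, List, Tuple

# Hypothetical representation (not the package's own): one dict per
# position, mapping a dinucleotide to its score at that position.
DiPWM = List[Dict[str, float]]


def sliding_window_search(
    dipwm: DiPWM, text: str, threshold: float
) -> Iterator[Tuple[int, str, float]]:
    """Yield (start, word, score) for every window scoring >= threshold.

    Assumes the text only contains letters covered by the matrix.
    """
    word_len = len(dipwm) + 1  # k dinucleotide columns cover k + 1 letters
    for start in range(len(text) - word_len + 1):
        word = text[start:start + word_len]
        # Sum the score of each overlapping dinucleotide of the window.
        score = sum(column[word[i:i + 2]] for i, column in enumerate(dipwm))
        if score >= threshold:
            yield start, word, score


# Toy matrix over the alphabet {A, C}; the values are made up.
toy = [
    {"AA": 1.0, "AC": 2.0, "CA": -1.0, "CC": 0.0},
    {"AA": 0.5, "AC": -2.0, "CA": 3.0, "CC": 0.0},
]
hits = list(sliding_window_search(toy, "AACACC", 4.0))
print(hits)  # [(1, 'ACA', 5.0)]
```

Each window is rescored from scratch, so the cost grows with the text length times the motif length; the enumeration and super-alphabet approaches listed above are alternatives to this baseline.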
Install
Local installation using git
- clone the git repository
git clone git@gite.lirmm.fr:rivals/dipwmsearch.git
- go to the root of the folder
cd dipwmsearch
- use the Makefile to install
make install
Installation using pip
pip install dipwmsearch
Getting started
- To import the package
import dipwmsearch as ds
- To parse a diPWM file and create a diPWM object
diP = ds.create_diPwm(diPwm_path_file)
- To use the enumeration and Aho-Corasick search
for start_position, word, score in ds.search_aho_ratio(diP, text, ratio):
    print(f'{start_position}\t{word}\t{score}')
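The ratio argument is not defined in this description. One plausible reading, which is an assumption here and not confirmed by the package, is that it sets the score threshold as a fraction of the way between the lowest and highest scores the diPWM can produce. The sketch below computes that range by dynamic programming over the last letter of the word; the dictionary-based matrix and the names score_range and threshold_from_ratio are hypothetical.

```python
from typing import Dict, List, Tuple

# Hypothetical representation (not the package's own): one dict per
# position, mapping a dinucleotide to its score at that position.
DiPWM = List[Dict[str, float]]


def score_range(dipwm: DiPWM, alphabet: str = "AC") -> Tuple[float, float]:
    """Return (min, max) score over all words the matrix can score."""
    # best[c] / worst[c]: highest / lowest score of a prefix ending in c.
    best = {c: 0.0 for c in alphabet}
    worst = {c: 0.0 for c in alphabet}
    for column in dipwm:
        best = {b: max(best[a] + column[a + b] for a in alphabet)
                for b in alphabet}
        worst = {b: min(worst[a] + column[a + b] for a in alphabet)
                 for b in alphabet}
    return min(worst.values()), max(best.values())


def threshold_from_ratio(dipwm: DiPWM, ratio: float,
                         alphabet: str = "AC") -> float:
    """Assumed rule: threshold = min + ratio * (max - min)."""
    low, high = score_range(dipwm, alphabet)
    return low + ratio * (high - low)


# Toy matrix over the alphabet {A, C}; the values are made up.
toy = [
    {"AA": 1.0, "AC": 2.0, "CA": -1.0, "CC": 0.0},
    {"AA": 0.5, "AC": -2.0, "CA": 3.0, "CC": 0.0},
]
print(score_range(toy))                # (-3.0, 5.0)
print(threshold_from_ratio(toy, 0.5))  # 1.0
```

Under this reading, a ratio of 1 keeps only the best-scoring words and a ratio of 0 accepts every word; check the package documentation for the definition it actually uses.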
Documentation
More info in the documentation
License
License type: CeCILL-B
Authors
- Marie Mille (main contributor)
- Bastien Cazaux
- Julie Ripoll
- Eric Rivals
Dependencies
Required for installation
pyahocorasick
For tests
pytest
pandas