Leverages the Burrows-Wheeler compression used by BZip to estimate the complexity (information density) of a sequence.
Project description
bzComplexityAnalyzer
Getting Started
import bzComplexityAnalyzer
Preparing an analyzer object
analyzer = bzComplexityAnalyzer.Analyzer(alphabet="dna", ignoreCase=True, iterations=1000)
- alphabet: string or list of characters. Specifies the alphabet to be compared against. Default: DNA.
- ignoreCase: boolean. If true, letter case is normalized before analysis so that casing does not affect the complexity estimate.
- iterations: int. Number of random sequences used to estimate the distribution of compressed lengths.
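The role of `iterations` can be illustrated with a minimal sketch that uses only Python's standard `bz2` and `random` modules. This is not the package's actual implementation: the helper names `compressed_length` and `null_lengths`, the `seed` parameter, and the choice to draw random sequences of the same length as the query are all assumptions made for illustration.

```python
import bz2
import random


def compressed_length(sequence: str) -> int:
    """Return the size in bytes of the bzip2-compressed sequence."""
    return len(bz2.compress(sequence.encode("ascii")))


def null_lengths(length: int, alphabet: str = "ACGT",
                 iterations: int = 1000, seed: int = 0) -> list:
    """Compressed sizes of `iterations` random sequences of the given length.

    Hypothetical helper: each sequence is drawn uniformly from `alphabet`.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    return [
        compressed_length("".join(rng.choices(alphabet, k=length)))
        for _ in range(iterations)
    ]


# A highly repetitive sequence should compress to fewer bytes than
# random sequences of the same length over the same alphabet.
repetitive = "GAATTC" * 200
background = null_lengths(len(repetitive), iterations=20)
print(compressed_length(repetitive), min(background))
```

More iterations give a smoother estimate of the background distribution at the cost of more compression calls.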
Estimating the complexity of a sequence
analyzer.getCompressionZScore("GAATTCGAATTCGAATTC") # Returns a Z-score
analyzer.getCompressionPercentile("GAATTCGAATTCGAATTC") # Returns a percentile
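As a rough illustration of what a compression Z-score and percentile could mean, the sketch below compares the compressed size of a query against the compressed sizes of random sequences of the same length. The function `compression_stats`, its sign convention (negative Z-score means more compressible than random), and its percentile definition (share of random sequences that compress to the same size or smaller) are assumptions; the package's own conventions may differ.

```python
import bz2
import random
import statistics


def compressed_length(sequence: str) -> int:
    """Return the size in bytes of the bzip2-compressed sequence."""
    return len(bz2.compress(sequence.encode("ascii")))


def compression_stats(sequence: str, alphabet: str = "ACGT",
                      iterations: int = 200, seed: int = 0):
    """Return (z_score, percentile) of the query against random sequences.

    Hypothetical helper, not part of bzComplexityAnalyzer.
    """
    rng = random.Random(seed)
    observed = compressed_length(sequence)
    null = [
        compressed_length("".join(rng.choices(alphabet, k=len(sequence))))
        for _ in range(iterations)
    ]
    mean = statistics.mean(null)
    stdev = statistics.pstdev(null)
    # Guard against a degenerate background with no spread.
    z_score = (observed - mean) / stdev if stdev > 0 else 0.0
    # Percentage of random sequences whose compressed size is <= the query's.
    percentile = 100.0 * sum(n <= observed for n in null) / len(null)
    return z_score, percentile


z, pct = compression_stats("GAATTC" * 200)
print(z, pct)
```

Very short inputs, such as the 18-character example above, are dominated by bzip2's fixed header overhead, so a sketch like this is only informative for longer sequences.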
Download files
Source Distribution
bzcomplexityanalyzer-0.0.1.tar.gz (30.8 kB)

Hashes for bzcomplexityanalyzer-0.0.1.tar.gz

Algorithm | Hash digest
---|---
SHA256 | c4909bcbe820b9845e4354f5ba9a770f1dec8ce7d244da0cb5cc856442837807
MD5 | 620571038bbce6fb74e43ae3e2df48a9
BLAKE2b-256 | 4511419eabec29a47cc436ce2d128332a23e54d8cd54dc181cf80fb04ffc15f0

Built Distribution

Hashes for bzcomplexityanalyzer-0.0.1-py3-none-any.whl

Algorithm | Hash digest
---|---
SHA256 | 35d03d607aeea6418823d9e9b5e8852cf6e6407febb3a09430af765360c199e8
MD5 | 56d936a316a61a53d866747164e68350
BLAKE2b-256 | e1cfca75eb6de28fef578461666e1844ac2904119bbccdb48f9071980c465180