Skip to main content

Leverages the Burrows-Wheeler compression of BZip to estimate the complexity/information density of a sequence

Project description

bzComplexityAnalyzer

Getting Started

import bzComplexityAnalyzer

Preparing an analyzer object

analyzer = bzComplexityAnalyzer.Analyzer(alphabet="dna", ignoreCase=True, iterations=1000)
  • alphabet: string or list of characters Specifies the alphabet to be compared against. Default: DNA
  • ignoreCase: boolean If true, casing of letters will be homogenized before analysis to neutralize their effects on complexity
  • iterations int Number of random sequences to use for finding the distribution of compressed product lengths

Estimating the complexity of a sequence

analyzer.getCompressionZScore("GAATTCGAATTCGAATTC") # Returns a Z-score
analyzer.getCompressionPercentile("GAATTCGAATTCGAATTC") # Returns a percentile

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bzcomplexityanalyzer-0.0.1.tar.gz (30.8 kB view hashes)

Uploaded Source

Built Distribution

bzcomplexityanalyzer-0.0.1-py3-none-any.whl (29.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page