cobilib

Optimizing Codon Usage with a Quasispecies Model

Development Status
- 3 - Alpha
Environment
- Console
Intended Audience
- Science/Research
License
- OSI Approved :: GNU General Public License v3 or later (GPLv3+)
Programming Language
- Python
Topic
- Scientific/Engineering :: Bio-Informatics

Project description

Summary

cobilib is a library for selecting a set of reference genes whose codon usage serves as the optimization target. It accepts a variable number of fitness factors, such as the translation speed of codons or tRNA abundance. Given these factors, the library reports the strength of each factor that makes the simulated codon usage most closely resemble the reference codon usage. In a next step, the strengths can be tuned to generate a codon usage, which can then be used to adapt a gene sequence with a classic codon optimization tool such as OPTIMIZER.

Example

In an example workflow, you first select a FASTA file containing the genes you want to use. You can load it either from a local file or from a URL. In both cases, a histogram of codon usage and amino acid usage is generated.
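The counting behind such a histogram can be sketched in plain Python. This is a minimal illustration, not cobilib's own API: the function names `parse_fasta` and `codon_counts` and the two toy genes are assumptions made for this example only.

```python
from collections import Counter


def parse_fasta(text):
    """Return {header: sequence} from FASTA-formatted text (hypothetical helper)."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def codon_counts(sequence):
    """Count in-frame codons, ignoring a trailing partial codon."""
    usable = len(sequence) - len(sequence) % 3
    return Counter(sequence[i:i + 3] for i in range(0, usable, 3))


# Toy input invented for illustration; real input would come from a file or URL.
example = ">gene1\nATGGCTGCTTAA\n>gene2\nATGGCCTAA\n"
genes = parse_fasta(example)

total = Counter()
for seq in genes.values():
    total.update(codon_counts(seq))
```

The resulting `total` maps each codon to its absolute count across all genes and could be passed to any plotting library to draw the histogram.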

You can then optionally load a list of highly expressed genes; the format of the HEG database is supported. To check, for example, whether the codon usage bias (CUB) looks as you expect, you can plot it using various dimensionality reduction methods.

If you do not want to use all the genes, you can enter a number n; only the first n genes will then be analysed.

Next, you have to select a fitness matrix, which gives the probability of one amino acid being represented by another.

Additionally, you can select a number of fitness functions, each of which assigns a fitness to every codon. These functions will be normalized. To perform a test run, you have to enter the parameters alpha, beta, selection and t_i for every test function. alpha and beta are parameters of the <todo> model of codon substitution and are related to transition/transversion bias. Input values may be separated by commas, whitespace or tabs, or any combination of these.
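As a rough sketch of the two mechanics described here, the snippet below splits a mixed comma/whitespace/tab-separated input line into numbers and rescales a codon fitness table. The names `parse_parameters` and `normalize` are hypothetical, and normalizing so that the values sum to one is an assumption; the description does not say which normalization cobilib applies.

```python
import re


def parse_parameters(text):
    """Split on commas and/or whitespace (including tabs) and convert to floats."""
    tokens = [t for t in re.split(r"[,\s]+", text.strip()) if t]
    return [float(t) for t in tokens]


def normalize(fitness):
    """Rescale codon fitness values so they sum to one (assumed normalization)."""
    total = sum(fitness.values())
    if total <= 0:
        raise ValueError("fitness values must have a positive sum")
    return {codon: value / total for codon, value in fitness.items()}


# Four values in the order alpha, beta, selection, t_i (numbers invented for illustration).
alpha, beta, selection, t_i = parse_parameters("0.5, 1.2\t3 4")

# Toy fitness function covering two codons only.
scaled = normalize({"GCT": 1.0, "GCC": 3.0})
```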

You can compare the absolute codon usage and the relative codon usage (normalized for each amino acid) in a comparison plot. To test the distance optimization, you can first optimize on the first gene only and look at the comparison again to see whether the algorithm works at all.
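To make "normalized for each amino acid" concrete, the sketch below computes relative synonymous codon usage (RSCU) in its standard form: a codon's count divided by the mean count of all codons for the same amino acid. The function name `rscu`, the two-amino-acid codon table and the counts are assumptions for illustration; a real analysis would use the full genetic code, and cobilib may compute the value differently.

```python
# Partial codon table for illustration only (alanine and lysine).
SYNONYMOUS = {
    "Ala": ["GCT", "GCC", "GCA", "GCG"],
    "Lys": ["AAA", "AAG"],
}


def rscu(counts):
    """Return RSCU per codon: count * (number of synonymous codons) / (amino acid total)."""
    result = {}
    for amino_acid, codons in SYNONYMOUS.items():
        total = sum(counts.get(c, 0) for c in codons)
        if total == 0:
            continue  # amino acid absent from the data; leave its codons undefined
        for c in codons:
            result[c] = counts.get(c, 0) * len(codons) / total
    return result


# Absolute codon counts invented for this example.
counts = {"GCT": 6, "GCC": 2, "GCA": 0, "GCG": 0, "AAA": 3, "AAG": 1}
values = rscu(counts)
```

A value of 1 means the codon is used exactly as often as expected under uniform synonymous usage; values above 1 indicate a preferred codon.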

In a last step, you can optimize over all the genes you have read in. The optimal parameters, a goodness of fit and the RSCU (relative synonymous codon usage) are returned; you can use the RSCU for codon optimization with a tool such as OPTIMIZER.

Authors and License

GPLv3 Jan-Hendrik Trösemeier, Susanne Lipp, Christel Kamp

Contact: name.lastname at pei.de

This version

1.0.0

Feb 25, 2013

Source Distribution

cobilib-1.0.0.tar.gz (74.1 kB)

Uploaded Feb 25, 2013 Source

File details

Details for the file cobilib-1.0.0.tar.gz.

File metadata

Download URL: cobilib-1.0.0.tar.gz
Upload date: Feb 25, 2013
Size: 74.1 kB
Tags: Source
Uploaded using Trusted Publishing? No

File hashes

Hashes for cobilib-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`fd75868faa384b69296f96702c62379a1604a035f85344da282a8f2afb5b18c5`
MD5	`cc20c55eca61c2e887684750b233eccd`
BLAKE2b-256	`8a87d6b61da0fd1e871b50245da42e97f0c39c53eaec97cc4de70cf5a98ad896`
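A downloaded archive can be checked against the SHA256 digest listed above with Python's standard `hashlib` module. This is a minimal sketch; the helper names `sha256_of_file` and `matches_expected` are made up for this example, and the path passed in is whatever location you saved the file to.

```python
import hashlib

# SHA256 digest copied from the table above.
EXPECTED_SHA256 = "fd75868faa384b69296f96702c62379a1604a035f85344da282a8f2afb5b18c5"


def sha256_of_file(path, chunk_size=65536):
    """Hash a file in chunks so large downloads need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected("cobilib-1.0.0.tar.gz")` on the downloaded archive should return `True` if the file is intact.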
