Optimize DNA sequences under constraints.
Project description
DNA Chisel is a Python library to optimize the nucleotides of DNA sequences with respect to a set of constraints and optimization objectives. It can be used for codon-optimizing the genes of a sequence for a particular organism, modifying a sequence to meet the constraints of a DNA provider while preserving genes, and much more.
DNA Chisel comes with more than 15 types of optimizations and constraints and allows users to define new specifications in Python, making the library suitable for a large range of automated sequence design applications, or complex custom design projects.
Example of use
from dnachisel import *
# DEFINE THE OPTIMIZATION PROBLEM
random_sequence = random_dna_sequence(10000)
problem = DnaOptimizationProblem(
sequence=random_sequence,
constraints=[AvoidPattern("BsaI_site"),
EnforceGCContent(mini=0.3, maxi=0.7, window=50)],
objectives=[CodonOptimize(species='e_coli', location=(500, 1300))]
)
# SOLVE THE CONSTRAINTS, OPTIMIZE WITH RESPECT TO THE OBJECTIVE
problem.resolve_constraints()
problem.optimize()
# PRINT SUMMARIES TO CHECK THAT CONSTRAINTS PASS
print(problem.constraints_text_summary())
print(problem.objectives_text_summary())
See this page for an overview of available specifications.
Infos
PIP installation:
pip install dnachisel[reports]
(you can omit the [reports] suffix if you intend to use dnachisel only for sequence optimization, without generating figures or PDF reports)
Web documentation:
https://edinburgh-genome-foundry.github.io/DnaChisel/
Github Page
https://github.com/Edinburgh-Genome-Foundry/DnaChisel
Live demo
http://cuba.genomefoundry.org/sculpt_a_sequence
License: MIT, Copyright Edinburgh Genome Foundry
More biology software
DNA Chisel is part of the EGF Codons synthetic biology software suite for DNA design, manufacturing and validation.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.