Skip to main content

Contains classes and code for representing double stranded DNA and functions for simulating homologous recombination between DNA molecules.

Project description

Documentation Status GitHub starsGitHub issuesResearch software impactCode ClimateIssue CountcodecovCircleCIicon1icon2PyPI versionAnaconda-Server Badge0Anaconda-Server Badge1Anaconda-Server Badge2Anaconda-Server Badge3

Planning genetic constructs with many parts and assembly steps, such as recombinant metabolic pathways are often difficult to properly document.

The Pydna python package provide a human-readable formal description of cloning and assembly strategies in Python which also allows for simulation and verification of cloning strategies. Pydna can be though of as executable documentation for molecular biology.

Pydna provides simulation of:

  • Restriction digestion

  • Ligation

  • PCR

  • Primer design

  • Gibson assembly

  • Golden gate assembly

  • Homologous recombination

  • Gel electrophoresis of DNA with generation of gel images

Virtually any sub-cloning experiment can be described in pydna, and its execution yield the sequence of the of intermediate and final resulting DNA molecule(s). A cloning strategy expressed in pydna is complete, unambiguous and can be made stable. Pydna has been designed to be understandable for biologists with only some basic understanding of Python.

Pydna can formalize planning and sharing of cloning strategies and is especially useful for complex or combinatorial DNA molecule constructions.

Look at some assembly strategies made in the Jupyter notebook format here.

There is an open access paper in BMC Bioinformatics describing pydna:

abstr

Please reference the above paper when using pydna.

Usage

Most pydna functionality is implemented as methods for the double stranded DNA sequence record classes Dseq and Dseqrecord, which are subclasses of the Biopython Seq and SeqRecord classes.

These classes make cut and paste cloning and PCR very simple:

>>> from pydna.dseq import Dseq
>>> seq = Dseq("GGATCCAAA","TTTGGATCC",ovhg=0)
>>> seq
Dseq(-9)
GGATCCAAA
CCTAGGTTT
>>> from Bio.Restriction import BamHI
>>> a,b = seq.cut(BamHI)
>>> a
Dseq(-5)
G
CCTAG
>>> b
Dseq(-8)
GATCCAAA
    GTTT
>>> a+b
Dseq(-9)
GGATCCAAA
CCTAGGTTT
>>> b+a
Dseq(-13)
GATCCAAAG
    GTTTCCTAG
>>> b+a+b
Dseq(-17)
GATCCAAAGGATCCAAA
    GTTTCCTAGGTTT
>>> b+a+a
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/local/lib/python2.7/dist-packages/pydna/dsdna.py", line 217, in __add__
    raise TypeError("sticky ends not compatible!")
TypeError: sticky ends not compatible!
>>>

As the example above shows, pydna keeps track of sticky ends.

Notably, homologous recombination and Gibson assembly between linear DNA fragments can be easily simulated without any additional information besides the primary sequence of the fragments.

Gel electrophoresis of DNA fragments can be simulated using the included gel module by Bruno Silva:

simulated agarose gel

alt text

Look at an example notebook with a gel simulation here.

Pydna can be very compact. The nine lines of Python below, simulates the construction of a recombinant plasmid. DNA sequences are downloaded from Genbank by accession numbers that are guaranteed to be stable over time.

from pydna.genbank import Genbank
gb = Genbank("myself@email.com") # Tell Genbank who you are!
gene = gb.nucleotide("X06997") # Kluyveromyces lactis LAC12 gene for lactose permease.
from pydna.parsers import parse_primers
primer_f,primer_r = parse_primers(''' >760_KlLAC12_rv (20-mer)
                                      ttaaacagattctgcctctg

                                      >759_KlLAC12_fw (19-mer)
                                      aaatggcagatcattcgag ''')
from pydna.amplify import pcr
pcr_prod = pcr(primer_f,primer_r, gene)
vector = gb.nucleotide("AJ001614") # pCAPs cloning vector
from Bio.Restriction import EcoRV
lin_vector = vector.linearize(EcoRV)
rec_vec =  ( lin_vector + pcr_prod ).looped()

Pydna can automate the simulation of sub cloning experiments using python. This is helpful to generate examples for teaching purposes.

Read the documentation or the cookbook with example files for further information.

Please post a message in the google group for pydna if you have problems, questions or comments.

Feedback is very welcome! ## Who is using pydna?

An Automated Protein Synthesis Pipeline with Transcriptic and Snakemake

Pyviko: an automated Python tool to design gene knockouts in complex viruses with overlapping genes

Documentation

Documentation is built using Sphinx from docstrings in the code and displayed at readthedocs Documentation Status

The numpy docstring format is used.

Installation using conda on Anaconda

The absolutely best way of installing and using pydna is to use the free Anaconda or Miniconda python distributions.

Anaconda is a large download (about 400 Mb) while Miniconda is about 40-50 Mb.

Once Anaconda (or Miniconda) is installed, the conda package manager can be used to install pydna. Pydna and its dependencies are available from the BjornFJohansson package channel ast Anaconda.org.

The first step is to add the channel by typing the command below followed by return:

conda config --append channels BjornFJohansson

Then pydna can be installed by typing the command below followed by return:

conda install pydna

This works on Windows, MacOSX and Linux, and installs all necessary and optional dependencies automatically (see below).

Installation using pip

The second best way of installing pydna is with pip, the officially recommended tool.

Pip is included in recent Python versions.

Pip installs the minimal installation requirements automatically, but not the optional requirements (see below). This will probably not work directly on windows, as biopython is not directly installable.

Linux:

bjorn@bjorn-UL30A:~/pydna$ sudo pip install pydna

Windows:

Installing biopython on Windows can be tricky. The biopython site has executable installers. Read here on how to install biopython requirements such as Numpy. Christoph Gohlke at University of California, Irvine has compiled many binary installers for Windows wich include most requirements.

When the requrements are installed you can pip install pydna from the Windows terminal:

C:\> pip install pydna

Installation from Source

If you install from source, you need to install all dependencies separately (listed above). Download one of the source installers from the pypi site or from Github and extract the file. Open the pydna source code directory (containing the setup.py file) in terminal and type:

python setup.py install

Source Code

Pydna is developed on Github.

Minimal installation requirements

Pydna is currently developed on and for Python 3.5 or 3.6. Pydna versions before 1.0.0 were compatible with python 2.7 only. The list below is the minimal requirements for installing pydna. Biopython has c-extensions, but the other modules are pure python.

Optional Requirements

Pydna has been designed to be used from the Jupyter notebook. If IPython and Jupyter are installed, importing ipython notebooks as modules among are supported among other things.

If the modules listed below are installed, gel simulation functionality is available.

The pydna conda package installs the optional requirements listed above as well as:

Requirements for running tests

Requirements for analyzing code coverage

Automatic testing

The test suit is run automatically after each commit on:

  • Ubuntu 14.04 using CircleCI

  • OSX-64 using TravisCI

  • Windows using AppveyorCI

See the badges at the top of this page.

Automatic builds

Conda packages are built on CircleCI(Linux), TravisCI(MacOS) and AppveyorCI(Windows). Source setuptools packages and wheels are built on Linux for all systems. Binary setuptools packages are built for Windows and MacOSX.

  • Conda packages Anaconda-Server Badge0

  • Setuptools packages

Builds are controlled by Git tags. Tags like 1.0.2a4 are considered test builds and are uploaded to testpypi and to Anaconda.org with a “test” label. These are only meant to test the finished packages and are not meant to be used.

Tags like 1.0.3 are considered final builds and are built and uploaded to Anaconda.org under the “main” label and to the regular pypi server.

Changelog

See the change log for recent changes.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

pydna-2.0.1-py3.6.egg (126.8 kB view details)

Uploaded Source

pydna-2.0.1-py3.5.egg (126.8 kB view details)

Uploaded Source

pydna-2.0.1-py3-none-any.whl (132.7 kB view details)

Uploaded Python 3

File details

Details for the file pydna-2.0.1-py3.6.egg.

File metadata

  • Download URL: pydna-2.0.1-py3.6.egg
  • Upload date:
  • Size: 126.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pydna-2.0.1-py3.6.egg
Algorithm Hash digest
SHA256 ce5f6a8f54d3cc5978ddbe9d38bc1fe0d6555c7651401bcf46f6acdbfdd83f28
MD5 469a6e364f9bd68428b178a7a9adf05e
BLAKE2b-256 2ad110a84f905e1a5f0746d65dfa882f4d5e8589c2f55305db025232453f9511

See more details on using hashes here.

Provenance

File details

Details for the file pydna-2.0.1-py3.5.egg.

File metadata

  • Download URL: pydna-2.0.1-py3.5.egg
  • Upload date:
  • Size: 126.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pydna-2.0.1-py3.5.egg
Algorithm Hash digest
SHA256 3f2a5c99131cd27e5a9740fe158a370d111df465dc858424f06065af46d141b8
MD5 ff641960b01e6837f6efaceb42ddecc0
BLAKE2b-256 f41f271e860a7a77e1ae90363dcc618f1f29b89b6a9949e285471b4de1151c98

See more details on using hashes here.

Provenance

File details

Details for the file pydna-2.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for pydna-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 24ede67cdd3a96ea3e424bd13bceeafe58f2b6b0efdea813dc561bdf2ff1fb4a
MD5 4b5b5e8a6f6961edc3e3825bd4da6a2a
BLAKE2b-256 8c6efd93a93312b657c1827fd32cf938235e726b0b2654386f1a8356ec29b97b

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page