Skip to main content

A Python package for constructing microbial strains

Project description

teemi logo

teemi: a python package designed to make HT strain construction reproducible and FAIR

https://badge.fury.io/py/teemi.svg https://github.com/hiyama341/teemi/actions/workflows/main.yml/badge.svg Documentation Status https://img.shields.io/github/license/hiyama341/teemi Supported Python Versions https://codecov.io/gh/hiyama341/teemi/branch/main/graph/badge.svg?token=P4457QACUY https://img.shields.io/badge/code%20style-black-black https://img.shields.io/github/last-commit/hiyama341/teemi

What is teemi?

teemi, named after the Greek goddess of fairness, is a python package designed to make microbial strain construction reproducible and FAIR (Findable, Accessible, Interoperable, and Reusable). With teemi, you can simulate all steps of a strain construction cycle, from generating genetic parts to designing a combinatorial library and keeping track of samples through a commercial Benchling API and a low-level CSV file database. This tool can be used in literate programming to increase efficiency and speed in metabolic engineering tasks. To try teemi, visit our Google Colab notebooks.

teemi not only simplifies the strain construction process but also offers the flexibility to adapt to different experimental workflows through its open-source Python platform. This allows for efficient automation of repetitive tasks and a faster pace in metabolic engineering.

Our demonstration of teemi in a complex machine learning-guided metabolic engineering task showcases its efficiency and speed by debottlenecking a crucial step in the strictosidine pathway. This highlights the versatility and usefulness of this tool in various biological applications.

Curious about how you can build strains easier and faster with teemi? Head over to our Google Colab notebooks and give it a try.

Please cite our paper (in preparation - link tba) if you’ve used teemi in a scientific publication.

Features

  • Combinatorial library generation

  • HT cloning and transformation workflows

  • Flowbot One instructions

  • CSV-based LIMS system as well as integration to Benchling

  • Genotyping of microbial strains

  • Advanced Machine Learning of biological datasets with the AutoML H2O

  • Workflows for selecting enzyme homologs

  • Promoter selection workflows from RNA-seq datasets

  • Data analysis of large LC-MS datasets along with workflows for analysis

Getting started

To get started with making microbial strains in an HT manner please follow the steps below:

  1. Install teemi. You will find the necessary information below for installation.

  2. Check out our notebooks for inspiration to make HT strain construction with teemi.

  3. You can start making your own workflows by importing teemi into either Google colab or Jupyter lab/notebooks.

Colab notebooks

As a proof of concept we show how teemi and literate programming can be used to streamline bioengineering workflows. These workflows should serve as a guide or a help to build your own workflows and thereby harnessing the power of literate programming with teemi.

Specifically, in this study we present how teemi and literate programming to build simulation-guided, iterative, and evolution-guided laboratory workflows for optimizing strictosidine production in yeast.

Below you can find all the notebooks developed in this work. Just click the Google colab badge to start the workflows.

Strictosidine case : First DBTL cycle

DESIGN:

  1. Automatically fetch homologs from NCBI from a query in a standardizable and repeatable way

Notebook 00

  1. Promoters can be selected from RNAseq data and fetched from online database with various quality measurements implemented

Notebook 01

  1. Combinatorial libraries can be generated with the DesignAssembly class along with robot executable intructions

Notebook 02

BUILD:

  1. Assembly of a CRISPR plasmid with USER cloning

Notebook 03

  1. Construction of the background strain by K/O of G8H and CPR

Notebook 04

  1. First combinatorial library was generated for 1280 possible combinations

Notebook 05

TEST:

  1. Data processing of LC-MS data and genotyping of the generated strains

Notebook 06

LEARN:

  1. Use AutoML to predict the best combinations for a targeted second round of library construction

Notebook 07

Strictosidine case : Second DBTL cycle

DESIGN:

  1. Results from the ML can be translated into making a targeted library of strains

Notebook 08

BUILD:

  1. Shows the construction of a targeted library of strains

Notebook 09

TEST:

  1. Data processing of LC-MS data like in notebook 6

Notebook 10

LEARN:

  1. Second ML cycle of ML showing how the model increased performance and saturation of best performing strains

Notebook 11

Installation

Use pip to install teemi from PyPI.

$ pip install teemi

If you want to develop or if you cloned the repository from our GitHub you can install teemi in the following way.

$ pip install -e <path-to-teemi-repo>

You might need to run these commands with administrative privileges if you’re not using a virtual environment (using sudo for example). Please check the documentation for further details.

Documentation and Examples

Documentation is available on through numerous Google Colab notebooks with examples on how to use teemi and how we use these notebooks for strain construnction. The Colab notebooks can be found here teemi.notebooks.

Contributions

Contributions are very welcome! Check our guidelines for instructions how to contribute.

License

  • Free software: MIT license

Credits

  • teemis logo was made by Jonas Krogh Fischer. Check out his website.

History

0.1.0 (2023-01-02)

  • First release on PyPI.

0.2.0 (2023-31-05)

  • New submodules: CRISPRsequencecutter, sequence_finder.

CRISPRSequenceCutter is a dataclass that is used to cut DNA through CRISPR-cas9 double-stranded break. SequenceFinder is a dataclass that finds upstream and downstream sequences from a sequence input, annotates them and saves them.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

teemi-0.2.0.tar.gz (79.7 kB view details)

Uploaded Source

Built Distribution

teemi-0.2.0-py2.py3-none-any.whl (70.2 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file teemi-0.2.0.tar.gz.

File metadata

  • Download URL: teemi-0.2.0.tar.gz
  • Upload date:
  • Size: 79.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for teemi-0.2.0.tar.gz
Algorithm Hash digest
SHA256 cea33a4bb5464e4407d79b4f3f58182b0edb3e4dee31164c2c3047cc13e8e8c9
MD5 256966af312126fcc1cc74ffde10b12b
BLAKE2b-256 7a9b20392f8d68a377ccb41f6a3b218d594db05b29eb2a852115715fbbf8cdd8

See more details on using hashes here.

File details

Details for the file teemi-0.2.0-py2.py3-none-any.whl.

File metadata

  • Download URL: teemi-0.2.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 70.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for teemi-0.2.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 7013b8830317f1793cd787068e1719c0952b0c7c30be548c9f449933304475aa
MD5 c0be52d8b01946df9435762b1f3dca23
BLAKE2b-256 d8015970972d745f8e03023e02602d07bed419e18fcc08ebc32586435f3ea475

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page