A framework of tools to structure, configure and drive deep learning projects
Project description
serotiny
While going about the work of building deep learning projects, several simultaneous problems seemed to emerge:
- How do we reuse as much work from previous projects as possible, and focus on building the part of the project that makes it distinct?
- How can we automate the generation of new models that are based on existing models, but vary in a crucial yet non-trivial way?
- When generating a multiplicity of related models, how can we keep all of the results, predictions, and analyses straight?
- How can the results from any number of trainings and predictions be compared and integrated in an insightful yet generally applicable way?
Serotiny arose from the need to address these issues and convert the complexity of deep learning projects into something simple, reproducible, configurable, and automatable at scale.
Serotiny is still a work-in-progress, but as we go along the solutions to these problems become more clear. Maybe you've run into similar situations? We'd love to hear from you.
Overview
serotiny
is a framework and set of tools to structure, configure and drive deep
learning projects, developed with the intention of streamlining the lifecycle of
deep learning projects at Allen Institute for Cell Science.
It achieves this goal by:
- Standardizing the structure of DL projects
- Relying on the modularity afforded by this standard structure to make DL projects highly configurable, using hydra as the framework for configuration
- Making it easy to adopt best-practices and latest-developments in DL infrastructure
by tightly integrating with
- Pytorch Lightning for neural net training/testing/prediction
- MLFlow for experiment tracking and artifact management
In doing so, DL projects become reproducible, easy to collaborate on and can benefit from general and powerful tooling.
Getting started
For more information, check our documentation, or jump straight into our getting started page, and learn how training a DL model can be as simple as:
$ serotiny train data=my_dataset model=my_model
Authors
- Guilherme Pires @colobas
- Ryan Spangler @prismofeverything
- Ritvik Vasan @ritvikvasan
- Caleb Chan @calebium
- Theo Knijnenburg @tknijnen
- Nick Gomez @gomeznick86
Citing
If you find serotiny useful, please cite this repository as:
Serotiny Authors (2022). Serotiny: a framework of tools to structure, configure and drive deep learning projects [Computer software]. GitHub. https://github.com/AllenCellModeling/serotiny
Free software: BSD-3-Clause
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file serotiny-0.0.4.dev202210122257.tar.gz
.
File metadata
- Download URL: serotiny-0.0.4.dev202210122257.tar.gz
- Upload date:
- Size: 48.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bce594b69793b123027720e07ca9c52af0b229e857e708ab67b79bd6fa57b103 |
|
MD5 | ddc7f3a7ad355c6e3749f4d5c9970d52 |
|
BLAKE2b-256 | f075e0ac727aad2b740bcbabe330675218a0c32e941d9d902d1b59d7cd24a9b4 |
File details
Details for the file serotiny-0.0.4.dev202210122257-py3-none-any.whl
.
File metadata
- Download URL: serotiny-0.0.4.dev202210122257-py3-none-any.whl
- Upload date:
- Size: 73.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.14
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1d4830580dfb9d967e7f91beec29ba1f6290ec623e109e47f6d45edfe3ff27fd |
|
MD5 | 5b701b08f93fab4f97c506b269698bc1 |
|
BLAKE2b-256 | 182c90e713e68f6a570da28305b090d0e9acad8640bd5cdaab1a4a543defe297 |