Temporal action detection for biology
Project description
DeepEthogram
- Written by Jim Bohnslav, except where as noted
- JBohnslav@gmail.com
DeepEthogram is an open-source package for automatically classifying each frame of a video into a set of pre-defined behaviors. Designed for neuroscience research, it could be used in any scenario where you need to detect actions from each frame of a video.
Example use cases:
- Measuring itching or scratching behaviors to assess the differences between wild-type and mutant animals
- Measuring the amount of time animals spend courting, and comparing between experimental conditions
- Counting licks from video for appetite measurement
- Measuring reach onset times for alignment with neural activity
DeepEthogram uses state-of-the-art algorithms for temporal action detection. We build on the following previous machine learning research into action detection:
- Hidden Two-Stream Convolutional Networks for Action Recognition
- Temporal Gaussian Mixture Layer for Videos
Installation
DeepEthogram recently migrated from Miniconda/pip to uv. Current releases are uv-first,
recommend Python 3.11, support Python >=3.9,<3.12, and use PySide6 for the GUI.
Install uv first:
curl -LsSf https://astral.sh/uv/install.sh | sh
# or
brew install uv
For development from this repository:
git clone https://github.com/jbohnslav/deepethogram.git
cd deepethogram
uv sync
uv run deepethogram
For a user install:
uv pip install deepethogram
Detailed setup, troubleshooting, and legacy conda/pip instructions for older releases are in docs/installation.md.
Data
NEW! All datasets collected and annotated by the DeepEthogram authors are now available from this DropBox link: https://www.dropbox.com/sh/3lilfob0sz21och/AABv8o8KhhRQhYCMNu0ilR8wa?dl=0
If you have issues downloading the data, please raise an issue on Github.
COLAB
I've written a Colab notebook that shows how to upload your data and train models. You can also use this if you don't have access to a decent GPU.
To use it, please click this link to the Colab notebook.
Then, click copy to Drive at the top. You won't be able to save your changes to the notebook as-is.
News
We now support docker! Docker is a way to run deepethogram in completely reproducible environments, without interacting
with other system dependencies. See docs/Docker for more information
Pretrained models
Rather than start from scratch, we will start with model weights pretrained on the Kinetics700 dataset. Go to
To download the pretrained weights, please use this Google Drive link.
Unzip the files in your project/models directory. Make sure that you don't add an extra directory when unzipping! The path should be
your_project/models/pretrained_models/{models 1:6}, not your_project/models/pretrained_models/pretrained_models/{models1:6}.
Licensing
Copyright (c) 2020 - President and Fellows of Harvard College. All rights reserved.
This software is free for academic use. For commercial use, please contact the Harvard Office of Technology Development (hms_otd@harvard.edu) with cc to Dr. Chris Harvey. For details, see license.txt.
Usage
To use the GUI, click
To use the command line interface, click
Dependencies
The major dependencies for DeepEthogram are as follows:
- pytorch, torchvision: all the neural networks, training, and inference pipelines were written in PyTorch
- pytorch-lightning: for nice model training base classes
- kornia: for GPU-based image augmentations
- PySide6: for the GUI
- opencv: for video and image reading and writing
- opencv_transforms: for fast image augmentation
- scikit-learn, scipy: for binary classification metrics
- matplotlib: plotting metrics and neural network outputs
- pandas: reading and writing CSVs
- h5py: saving inference outputs as HDF5 files
- omegaconf: for smoothly integrating configuration files and command line inputs
- tqdm: for nice progress bars
Hardware requirements
For GUI usage, we expect that the users will be working on a local workstation with a good NVIDIA graphics card. For training via a cluster, you can use the command line interface.
- CPU: 4 cores or more for parallel data loading
- Hard Drive: SSD at minimum, NVMe drive is better.
- GPU: DeepEthogram speed is directly related to GPU performance. An NVIDIA GPU is absolutely required, as PyTorch uses
CUDA, while AMD does not.
The more VRAM you have, the more data you can fit in one batch, which generally increases performance. a
I'd recommend 6GB VRAM at absolute minimum. 8GB is better, with 10+ GB preferred.
Recommended GPUs:
RTX 3090,RTX 3080,Titan RTX,2080 Ti,2080 super,2080,1080 Ti,2070 super,2070Some older ones might also be fine, like a1080or even1070 Ti/1070.
testing
Test coverage is still low, but in the future we will be expanding our unit tests.
First, download a copy of testing_deepethogram_archive.zip
Make a directory in tests called DATA. Unzip this and move it to the deepethogram/tests/DATA
directory, so that the path is deepethogram/tests/DATA/testing_deepethogram_archive/{DATA,models,project_config.yaml}.
To run tests:
# Run all tests except GPU tests (default)
uv run pytest tests/
# Run only GPU tests (requires NVIDIA GPU)
uv run pytest -m gpu
# Run all tests including GPU tests
uv run pytest -m ""
GPU tests are skipped by default as they require significant computational resources and time to complete. These tests perform end-to-end model training and inference.
Developer Guide
Code Style and Pre-commit Hooks
We use pre-commit hooks to maintain code quality and consistency. The hooks include:
- Ruff for Python linting and formatting
- Various file checks (trailing whitespace, YAML validation, etc.)
To set up the development environment:
- Install the development dependencies:
uv sync --dev
- Install pre-commit hooks:
uvx pre-commit install
The hooks will run automatically on every commit. You can also run them manually on all files:
uvx pre-commit run --all-files
Changelog
- 0.1.4: bugfixes for dependencies; added docker
- 0.1.2/3: fixes for multiclass (not multilabel) training
- 0.1.1.post1/2: batch prediction
- 0.1.1.post0: flow generator metric bug fix
- 0.1.1: bug fixes
- 0.1: deepethogram beta! See above for details.
- 0.0.1.post1: bug fixes and video conversion scripts added
- 0.0.1: initial version
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file deepethogram-0.3.0.tar.gz.
File metadata
- Download URL: deepethogram-0.3.0.tar.gz
- Upload date:
- Size: 271.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fae9daeafb7929cad60bcf84f96660811c5995c2e2a3dfc27f90e57443679827
|
|
| MD5 |
072ad12a0e02ce0be2e965a0e1eadc4c
|
|
| BLAKE2b-256 |
abdd514ce79014273e94d538f2da2b24458e3371fe516810c3ab49fcfaa98412
|
Provenance
The following attestation bundles were made for deepethogram-0.3.0.tar.gz:
Publisher:
release.yml on jbohnslav/deepethogram
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
deepethogram-0.3.0.tar.gz -
Subject digest:
fae9daeafb7929cad60bcf84f96660811c5995c2e2a3dfc27f90e57443679827 - Sigstore transparency entry: 1199722200
- Sigstore integration time:
-
Permalink:
jbohnslav/deepethogram@81d5415bd19c39882176dc62f53a55db6212d84b -
Branch / Tag:
refs/heads/master - Owner: https://github.com/jbohnslav
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@81d5415bd19c39882176dc62f53a55db6212d84b -
Trigger Event:
push
-
Statement type:
File details
Details for the file deepethogram-0.3.0-py3-none-any.whl.
File metadata
- Download URL: deepethogram-0.3.0-py3-none-any.whl
- Upload date:
- Size: 308.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
00d710ed2292d1c7579a2173c5893a653220095b9c3d7dedf113f27be2004eb5
|
|
| MD5 |
7d3c9621f47fa28d047a197a657a5cb8
|
|
| BLAKE2b-256 |
1d01a49f98a2f5578d8d06504a277fc6062dc35c31f78be7c822a25b29f07581
|
Provenance
The following attestation bundles were made for deepethogram-0.3.0-py3-none-any.whl:
Publisher:
release.yml on jbohnslav/deepethogram
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
deepethogram-0.3.0-py3-none-any.whl -
Subject digest:
00d710ed2292d1c7579a2173c5893a653220095b9c3d7dedf113f27be2004eb5 - Sigstore transparency entry: 1199722224
- Sigstore integration time:
-
Permalink:
jbohnslav/deepethogram@81d5415bd19c39882176dc62f53a55db6212d84b -
Branch / Tag:
refs/heads/master - Owner: https://github.com/jbohnslav
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@81d5415bd19c39882176dc62f53a55db6212d84b -
Trigger Event:
push
-
Statement type: