SEAM: Meta-explanations for interpreting sequence-based deep learning models
Project description
SEAM: systematic explanation of attribution-based mechanisms for regulatory genomics
SEAM (Systematic Explanation of Attribution-based for Mechanisms) is a Python suite to use meta-explanations to interpret sequence-based deep learning models for regulatory genomics data. For installation instructions, tutorials, and documentation, please refer to the SEAM website, https://seam-nn.readthedocs.io/. For an extended discussion of this approach and its applications, please refer to our paper:
- Seitz, E.E., McCandlish, D.M., Kinney, J.B., and Koo P.K. Deciphering the determinants of mechanistic variation in regulatory sequences. bioRxiv (2025). (unpublished)
Installation:
With Anaconda sourced, create a new environment via the command line:
conda create --name seam
Next, activate this environment via conda activate seam
, and install the following packages:
pip install seam-nn
Finally, when you are done using the environment, always exit via conda deactivate
.
Notes
SEAM has been tested on Mac and Linux operating systems. Typical installation time on a normal computer is less than 1 minute.
If you have any issues installing SEAM, please see:
- https://seam-nn.readthedocs.io/en/latest/installation.html
- https://github.com/evanseitz/seam-nn/issues
For issues installing SQUID, the package used for sequence generation and inference, please see:
- https://squid-nn.readthedocs.io/en/latest/installation.html
- https://github.com/evanseitz/squid-nn/issues
Older DNNs that require inference via Tensorflow 1.x or related packages may be in conflict with SEAM defaults. Users will need to run SEAM piecewise within separate environments:
- Tensorflow 1.x environment for generating in silico sequence-function-mechanism dataset
- Tensorflow 2.x environment for applying SEAM to explain in silico sequence-function-mechanism dataset
Usage:
SEAM provides a simple interface that takes as input a sequence-based oracle (e.g., a genomic DNN), which is used to generate an in silico sequence-function-mechanism dataset representing a localized region of sequence space. SEAM uses a meta-explanation framework to interpret the in silico sequence-function-mechanism dataset, deciphering the determinants of mechanistic variation in regulatory sequences.
Examples
Google Colab examples for applying SEAM on previously-published deep learning models are available at the following links:
- Local library to annotate all TFBSs and biophysical states
- DeepSTARR: Enhancer 20647 (Fig.2a)
- Local library with 30k sequences and 10% mutation rate | Integrated gradients; hierarchical clustering
- Expected run time: ~3.2 minutes on Colab A100 GPU
- Local library augmented with SNV and DNV libraries to map evolvability of mechanisms
- DeepSTARR: Enhancer 22612 (Fig.2b)
- Local library with 30,000 sequences and 10% mutation rate
- SNV and DNV library with 77,029 sequences
- Integrated gradients; PCA with K-Means clustering
- Expected run time: ~TODO minutes
- Local library to reveal low-affinity motifs using background separation
- DeepSTARR: Enhancer 4071 (Fig.TBD)
- Local library with 60k sequences and 10% mutation rate | Integrated gradients; hierarchical clustering
- Expected run time: ~9.3 minutes on Colab A100 GPU
- Combinatorial-complete library with empirical mutagenesis maps
- PBM: Zfp187 (Fig.TBD)
- Combinatorial-complete library with 65,536 sequences | ISM; Hierarchical clustering
- Expected run time: ~12 minutes on Colab A100 GPU
- Combinatorial-complete library with interactive mechanism space viewer
- PBM: Hnf4a (Fig.TBD)
- Combinatorial-complete library with 65,536 sequences | ISM; UMAP with K-Means clustering
- Expected run time: ~4.9 minutes on Colab A100 GPU
- Global library to compare mechanistic heterogeneity of an enhancer TFBS
- DeepSTARR: CREB/ATF (Fig.TBD)
- Global library with 100k sequences | Saliency maps: UMAP with K-Means clustering
- Expected run time: ~3.2 minutes on Colab A100 GPU
- Global library to compare mechanisms across different developmental programs
- DeepSTARR: DRE (Fig.TBD)
- Global library with 100k sequences | Saliency maps; UMAP with K-Means clustering
- Expected run time: ~2.7 minutes on Colab A100 GPU
- Global library to compare mechanisms associated with genomic and synthetic TFBSs
- DeepSTARR: AP-1 (Fig.TBD)
- Global library with 100k sequences | Integrated gradients; UMAP with K-Means clustering
- Expected run time: ~3.9 minutes on Colab A100 GPU
Python script examples are provided in the examples/
folder for locally running SEAM and exporting outputs to file. Additional dependencies for these examples may be required and outlined at the top of each script.
Citation:
If this code is useful in your work, please cite our paper.
bibtex TODO
License:
Copyright (C) 2023–2025 Evan Seitz, David McCandlish, Justin Kinney, Peter Koo
The software, code sample and their documentation made available on this website could include technical or other mistakes, inaccuracies or typographical errors. We may make changes to the software or documentation made available on its web site at any time without prior notice. We assume no responsibility for errors or omissions in the software or documentation available from its web site. For further details, please see the LICENSE file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file seam_nn-0.5.2.tar.gz
.
File metadata
- Download URL: seam_nn-0.5.2.tar.gz
- Upload date:
- Size: 129.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ff7373f56b18b449f07ae2749b5387c9bb3ee3e7024ea7dfd49599cb6a65942d |
|
MD5 | 70549412393f8b53c754fe9703d3d211 |
|
BLAKE2b-256 | 812b147b5342c6ebcf5f414950cdbd16627efa03b2a4f9dc44014047578af461 |
File details
Details for the file seam_nn-0.5.2-py3-none-any.whl
.
File metadata
- Download URL: seam_nn-0.5.2-py3-none-any.whl
- Upload date:
- Size: 151.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 01ae78e429ce07c7e81da5a81800711749d58803ec451a2c7960000e71ae6e95 |
|
MD5 | ee7757e8d83e47e27f7c7e84122335b7 |
|
BLAKE2b-256 | bea09c80a11abff6004584eb2ad6676ef85bfdfd8f629f34b53d585305d2f352 |