Skip to main content

nterpreting sequence-to-function machine learning models

Project description

PyPI version PyPI - Downloads

SeqExplainer Logo

SeqExplainer -- Interpreting sequence-to-function machine learning models

A huge goal of applying machine learniing to genomics data is to obtain novel geneetic insights [1]. This can be challenging when models are complex (such as with neural networks). There are many interpretability methods specifically designed for such complex sequence-to-function preditctors, but they can be difficult to use and often are not interoperable.

The goal of SeqExplainer is to bring all these methods under one roof. We have designed a workflow that can take in any PyTorch model trained to predict labels from DNA sequence and expose it to many of the most popular explainability methods available in the field. We also offer some wrappers for explaining "shallow" sklearn models.

What is the scope of SeqExplainer?

Most of the core functionality is for post-hoc analysis of a trained model.

Common workflows

Feature attribution analysis (coming soon)

Identifying motifs in attributions (coming soon)

Testing feature dependencies (coming soon)

Tutorials

Extracting motif syntax rules from a CNN (coming soon)

Explaining predictions for shallow models on synthetic MPRAs using SHAP (coming soon)

Requirements

The main dependencies of SeqExplainer are:

python
torch
captum
numpy
matplotlib
logomaker
sklearn
shap

References

  1. Novakovsky, G., Dexter, N., Libbrecht, M. W., Wasserman, W. W. & Mostafavi, S. Obtaining genetics insights from deep learning via explainable artificial intelligence. Nat. Rev. Genet. 1–13 (2022) doi:10.1038/s41576-022-00532-2

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

seqexplainer-0.1.0.tar.gz (31.8 kB view details)

Uploaded Source

Built Distribution

seqexplainer-0.1.0-py3-none-any.whl (38.9 kB view details)

Uploaded Python 3

File details

Details for the file seqexplainer-0.1.0.tar.gz.

File metadata

  • Download URL: seqexplainer-0.1.0.tar.gz
  • Upload date:
  • Size: 31.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.9.16 Linux/4.18.0-425.3.1.el8.x86_64

File hashes

Hashes for seqexplainer-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8a97888076dc3c20427b5749bb4a86f870bc0870b88174715f7ecf2af6701e1d
MD5 5f2b5f09f9a6423548dfe230ef62dd67
BLAKE2b-256 225d1d4d38a7893be7aefeb91d3c15fbdde97ff0515b64b1529084862f9e57fd

See more details on using hashes here.

Provenance

File details

Details for the file seqexplainer-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: seqexplainer-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 38.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.9.16 Linux/4.18.0-425.3.1.el8.x86_64

File hashes

Hashes for seqexplainer-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c04cb609b06a84ee578c7dde33c29faf28e577a69a0e13499e5002cea9ab199b
MD5 eb377faeb99bf7c9de3203e0ce6ff9a1
BLAKE2b-256 13e0ecc6182559ee3534dd939a9a0633c2ea89d8ebc434cff2e503df2f70428b

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page