Skip to main content

No project description provided

Project description

SHEPHARD

Sequence-based Hierarchical and Extendable Platform for High-throughput Analysis of Region of Disorder

Current major version: 0.1.16 (September 2022)

Travis Build Status codecov

About

SHEPHARD is a Python toolkit for integrative proteome-wide analysis. It was written by Garrett Ginell and Alex Holehouse.

SHEPHARD enables you to read in protein sequence data and annotate it with different types of sequence annotations (Sites, Domains, and Tracks). As an example

Installation

Copy and paste into your terminal:

pip install shephard

This installs the current stable release candidate from PyPi.

Installation from GitHub

Copy and paste into your terminal:

pip install shephard@git+git://github.com/holehouse-lab/shephard.git

This installs the current bleeding-edge version directly from GitHub.

Documentation

Online documentation for SHEPHARD can be found here:

https://shephard.readthedocs.io/en/latest/

Tutorial Examples

Examples and Google Colab tutorials can be found here:

https://github.com/holehouse-lab/shephard-colab

Status

SHEPHARD is fully released, and the SHEPHARD preprint is forthcoming.

Roadmap

SHEPHARD is the base code for a large body of sequence-based bioinformatic tools developed by the Holehouse lab. These include:

  • metapredict - high-performance disorder predictor
  • parrot - a general tool for deep learning of sequence features
  • pipit - A simple tool for sequential sequence shuffling, as implemented in Langstein et al. preprint here.
  • sparrow - a high-throughput tool for sequence analysis (in development)
  • goose - a general purpose tool for the rational design of disordered protein sequences (in development)

These tools together form the backbone of our informatics infrastructure, and SHEPHARD will contain direct or indirect API access to each of them (and various other tools).

Change log

As we approach final release and versions of SHEPHARD are available for distribution, a change log is updated and changes that break backwards compatibility or introduce new features are tagged as minor/major increments. Bug fixes/docs/tests are simply tagged by their git hash.

Version 0.1.16 (September 2022)

  • Update for PyPI update
  • Improved documentation ahead of final release (including tools docs).
  • Added ability to return sites as lists for all site acquisition functions in proteins and domains.
  • Added much more detailed tests for site acquisition functions

Version 0.1.15 (September 2022)

Version 0.1.10 (September 2022)

  • Major update
  • Lots of new tests
  • Enable sites to read/write if values = None without throwing an exception
  • Fixed bug in writing sites from list
  • BREAKING CHANGE: Changed shephard.protein.get_residue() to shephard.protein.residue(), inkeeping with style for other getter functions

Version 0.1.9 (September 2022)

  • Major update
  • Lots of new tests
  • Added ability to write lists of sites and tracks (as we can with domains)
  • Refactoring of interface writing code
  • Added explicitly checks for domain, site, and track types when writing from lists of these objects
  • Added Track.symbol() and Track.value() functions to extract a single symbol or value at a specific position.
  • Updated documentation to include these new functions
  • Updated tests to encompass new features
  • Fixed bugs in exception handling
  • BREAKING CHANGE: Changed shephard.interfaces.si_tracks.write_track() to shephard.interfaces.si_tracks.write_tracks() (i.e. plural) to match names from other functions

Version 0.1.8 (August 2022)

  • Bug fix in domain_tools.py for identifying overlap between two domains
  • Fixed inconsistencies in writing domains that led to trailing whitespace
  • Fixed bugs in exception throwing code
  • More tests

Version 0.1.7 (April 2022)

  • Improved documentation
  • Added domain_to_track() function in tools.track_tools

Version 0.1.5 (April 2022)

  • First version released to PyPI

Version 0.1.4 (Feb 2022)

  • Added ability to remove Tracks, Sites and Domains from a Protein objects
  • Track number of unique domains, sites, and tracks rather than just their presence/absence
  • Updated Track writing
  • Added Tracks MUST be either symbolic or values-based but cannot be both

Version 0.1.3.1 (May 2021)

  • Various bug fixes
  • Improved performance
  • Updated interfaces for reading/writing different types of files
  • Major updates to internal docs
  • This release should be considered largely stable, although docs are lacking
  • Expanded the test suite

Version 0.1.2.1 (August 2020)

WARNING: This version breaks backwards compatibility with prior versions!

  • protein.get_domains_by_type() now returns a list of domains instead of a dictionary. This helps bring consistency to how domains are retrieved and moves us away from dictionary returning.
  • Various internal updates

Copyright

Copyright (c) 2019-2022, Garrett M. Ginell and Alex S. Holehouse - Holehouse lab

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shephard-0.1.16.tar.gz (185.8 kB view details)

Uploaded Source

File details

Details for the file shephard-0.1.16.tar.gz.

File metadata

  • Download URL: shephard-0.1.16.tar.gz
  • Upload date:
  • Size: 185.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.12

File hashes

Hashes for shephard-0.1.16.tar.gz
Algorithm Hash digest
SHA256 5de16f3b45c62dcd9997b0b31fe1551a442a4ea0b3c0efb04c615b6e987f179e
MD5 7000b6ebfb96ff31f0105ea4b58dd8e6
BLAKE2b-256 c69c5920ae7fe8ea8af0aa79d7d2554018f6408462d68ac30b501933c9870bc9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page