Skip to main content

PROtein Feature Aggregation and Variants.

Project description

Protein Feature Aggregation and Variants

Pypi Build Status Documentation Status Python: versions License

ProteoFAV is a Python module that address the challenge of cross-mapping protein structures and protein sequences, allowing for protein structures to be annotated with sequence features. It implements methods for working with protein structures (via mmCIF, PDB, PDB Validation, DSSP and SIFTS files), sequence Features (via UniProt GFF annotations) and genetic variants (via UniProt/EBI Proteins API and Ensembl REST API). Cross-mapping of structure and sequence is performed with the aid of SIFTS.

ProteFAV relies heavily in the Pandas library to quickly load data into DataFrames for fast data exploration and analysis. Structure and sequence data are parsed/fetched onto Pandas DataFrames that are then merged-together (collapsed) onto a single DataFrame.

Getting Started

Dependencies

ProteoFAV was developed to support Python 3.5+ and Pandas 0.20+.

Check requirements for specific requirements.

Installation

To install the stable release, run this command in your terminal:

$ pip install proteofav

If you don’t have pip installed, this Python installation guide can guide you through the process.

Installing from source in a virtual environment

Getting ProteoFAV:

$ wget https://github.com/bartongroup/ProteoFAV/archive/master.zip -O ProteoFAV.zip
$ unzip ProteoFAV.zip

# alternatively, cloning the git repository
$ git clone https://github.com/bartongroup/ProteoFAV.git

Installing With Conda:

$ conda-env create -n proteofav -f path/to/ProteoFAV/requirements.txt
$ source activate proteofav
$ cd path/to/ProteoFAV
$ pip install .

Installing with Virtualenv:

$ virtualenv --python `which python` env
$ source env/bin/activate
$ pip install -r requirements.txt
$ python path/to/ProteoFAV/setup.py install

Testing the installation

Test dependencies should be resolved with:

$ python path/to/ProteoFAV/setup.py develop --user

Run the Tests with:

$ python path/to/ProteoFAV/setup.py test
# or
$ cd path/to/ProteoFAV/tests
$ python -m unittest discover

ProteoFAV Configuration

ProteoFAV uses a configuration file config.ini where the user can specify the directory paths, as well as urls for commonly used data sources.

After installing run:

$ proteofav-setup

Example Usage

Example usage is currently provided as a Jupyter Notebook, which can be viewed with the GitHub’s file viewer or with the Jupyter nbviewer.

You can download the Jupyter notebook from GitHub and test it with your ProteoFAV’s installation.

Contributing and Bug tracking

Feel free to fork, clone, share and distribute. If you find any bugs or issues please log them in the issue tracker.

Before you submit your Pull-requests read the Contributing Guide.

Credits

See the Credits

Changelog

See the Changelog

Licensing

The MIT License (MIT). See license for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

proteofav-0.2.2.tar.gz (88.8 kB view details)

Uploaded Source

File details

Details for the file proteofav-0.2.2.tar.gz.

File metadata

  • Download URL: proteofav-0.2.2.tar.gz
  • Upload date:
  • Size: 88.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for proteofav-0.2.2.tar.gz
Algorithm Hash digest
SHA256 c2b2f64d9732f6de7b3511c55d3d8fa96e7539d1aa92ebd348d44b48ef30ac50
MD5 5070c06aa8cf007eb2b6a4e871ac013e
BLAKE2b-256 a36567da2b4e8e9edce508394bd3d31789d809ea657a0abbfc1674612355bfba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page