Skip to main content

Python packaging for CPTAC data

Project description

cptac

This project is intended to facilitate accessing and interacting with cancer data from the National Cancer Institute CPTAC consortium, which characterizes and studies the proteogenomic landscape of tumors. Currently, the datasets available are: endometrial cancer, ovarian cancer and colon cancer. These cancer studies are downloadable via our Python package as native dataframe objects and can therefore be integrated very quickly and easily with other Python-based data analysis tools. Follow our walkthrough tutorials for a basic cookbook of ways to use our system.

Setup instructions can be found in doc/setup.md

Tutorials

Tutorials for this package describe how to use the package functions for research with the provided data. All the tutorials are written in Python using the interactive Jupyter notebooks. If you are unfamiliar with Jupyter, follow the instructions given at jupyter.org/install. You will then be able to run our tutorials as interactive, exploratory data analyses.

  • Use Case 0: Exploring the data
  • Use Case 1: Comparing transcriptomics and proteomics for a single gene
  • Use Case 2: Looking for correlation between clinical factors
  • Use Case 3: Find genes significantly correlated with a clinical attribute
  • Use Case 4: Investigating how genetic mutation affects protein abundance
  • Use Case 5: Running gene set enrichment analysis
  • Use Case 6: Comparing derived molecular features with protein abundance

Requirements

This package is intended to run on Python 3.6 with pandas 0.23.4. In the tutorials, we use seaborn 0.9.0 for data visualization.

License

This package contains LICENSE.md document which describes the license for use. Please note the difference between the license as it applies to code versus data.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

cptac-0.4.3-py3-none-any.whl (36.9 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page