Skip to main content

This is a python client for the cBioPortal API

Project description

cbio_py - a Python wrapper for the cbioportal Cancer API

This is a python package for accessing the Cancer Genomics Portal API. The purpose of this package is to provide a simple wrapper for the API and return data in a format that is easy to work with.

Quickstart

The module is available as a pip package and can be installed with: pip install cbio_py.

Then run the following command to get the list of available studies:

from cbio_py import cbio_mod as cb
cb.get_studies()

Available Functions

The data retrieval functions are broken up over several key data types. The following sections provide a list of the available functions for each data type.

A quick note about the return data types - this API, by default, return the data in a list of dictionaries.This is because the data that is natively return by cbio is in a list of custom classes such as Study, Case, Sample, etc. This is a bit cumbersome to work with and is not the most efficient way to work with the data if you're using it for data analysis.

If you would like for the data to be returned in the native format - all of the functions should support a return_type parameter that can be set to native to return the data in the native format.

Studies

getAllStudies(return_type = 'dict') - this returns a list of all available studies. getSpecificStudy(studyId, return_type = 'dict') - this returns a list of studies with the given study id. getStudyTags(studyId,return_type = 'dict') - this returns a list of study tags given a study id.

Cancers

getAllCancerTypes(return_type = 'dict') - this returns a list of all available cancer types. getCancerByID(cancerTypeId, return_type = 'dict') - this returns a list of cancer types with the given cancertypeid.

Clinical Attributes

getAllClinicalAttributes(return_type = 'dict') - this returns a list of all available clinical attributes. getClinicalAttributesByStudyId(studyId, return_type = 'dict') - this returns a list of clinical attributes for the given study id. getClinicalAttributeInStudy(studyId, clinicalAttributeId, return_type = 'dict') - this returns a list of clinical attributes for the given study id and clinical attribute id.

Clinical Data

getAllClinicalDataInStudy(studyId, return_type = 'dict'): - this returns a list of clinical data for the given study id. getAllClinicalDataOfPatientInStudy(studyId, patientId, return_type = 'dict') - this returns a list of clinical data for the given study id and patient id. getAllClinicalDataOfSampleInStudy(studyId, sampleId, return_type = 'dict') - this returns a list of clinical data for the given study id and sample id.

Copy Number Segments

getCopyNumberSegmentsInSampleInStudy(studyId, sampleId, return_type = 'dict') - this returns a list of copy number segments for the given study id and sample id.

Gene Panels

getAllGenePanels(return_type = 'dict') - this returns a list of all available gene panels. getSpecificGenePanel(genePanelId, return_type = 'dict') - this returns a list of gene panels with the given gene panel id.

Genes

getAllGenes(return_type = 'dict') - this returns a list of all available genes. getAliasForGene(geneId, return_type = 'dict') - this returns a list of gene aliases for the given gene id. getGene(geneId, return_type = 'dict') - this returns a list of genes with the given gene id.

Molecular Profiles

getAllMolecularProfiles(return_type = 'dict') - this returns a list of all available molecular profiles. getMolecularProfile(molecularProfileId, return_type = 'dict') - this returns a list of molecular profiles with the given molecular profile id. getMolecularProfileInStudy(studyId, molecularProfileId, return_type = 'dict') - this returns a list of molecular profiles for the given study id and molecular profile id.

getMutationsInMolecularProfile(molecularProfileId, sampleListId, projection='DETAILED', return_type = 'dict', append='yes') - this returns a list of mutations for the given molecular profile id and sample list id. The projection parameter can be used to return more detailed information about the mutation. The append parameter implies that you would like to get ALL of the mutations for the given molecular profile id and sample list id - if you wish to get a customer molecular profile you will need to set the append parameter to 'no' and then set the molecular profile id and sample list id with the appropriate values in the fuction call.

Patients

getAllPatients(return_type = 'dict') - this returns a list of all available patients. getAllPatientsInStudy(studyId, return_type = 'dict') - this returns a list of patients for the given study id. getPatientInStudy(studyId, patientId, return_type = 'dict') - this returns a list of patients for the given study id and patient id.

Samples

getAllSamplesList(return_type = 'dict') - this returns a list of all available samples. getSpecificSampleList(sampleListId, return_type = 'dict') - this returns a list of samples with the given sample list id. getSampleListInStudy(studyId, sampleListId, return_type = 'dict') - this returns a list of samples for the given study id and sample list id. getAllSamplesInStudy(studyId, return_type = 'dict') - this returns a list of samples for the given study id. getAllSamplesOfPatientInStudy(studyId, patientId, return_type = 'dict') - this returns a list of samples for the given study id and patient id. getSampleInStudy(studyId, sampleId, return_type = 'dict') - this returns a list of samples for the given study id and sample id. getSamplesByKeyword(keyword, return_type = 'dict') - this returns a list of samples for the given keyword.

Potential Issues / Future Development

The Treatement api endpoint is not configured. I will hopefully be able to update the api to include this functionality in the future.

POST endpoints are not included in the api. All of the requests are GET requests.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cbio_py-0.1.0.tar.gz (7.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cbio_py-0.1.0-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file cbio_py-0.1.0.tar.gz.

File metadata

  • Download URL: cbio_py-0.1.0.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for cbio_py-0.1.0.tar.gz
Algorithm Hash digest
SHA256 b10cfd60184bd64a7e45620d3cdc924fefc0163340fbea7f1304c05c1d7b2f71
MD5 8f0789e24141ddfbfff02f129c009572
BLAKE2b-256 4e288f820eb8c2e9c74e21aa904cbc13e430f38b5015a295453465a803378d6f

See more details on using hashes here.

File details

Details for the file cbio_py-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: cbio_py-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 5.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for cbio_py-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7dfe02ae06d6ec580fbb2ebce0fc722f365e9c2be89140f79738f277cc894fb0
MD5 0069f77b5e9c577d359aa955c7e82881
BLAKE2b-256 137e9c7233c505da6955bec9f31137098764afc1d152f0ed40c7ee1f4556999b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page