Skip to main content

Package for creating database from cBioPortal and annotating VCF with information from this database

Project description

ISEQcBioPortal

Scripts that allow you to:

  1. Create/Update database from cBioPortal
  2. Annotate VCF with information such as:
  • number of samples with one or more mutations
  • percentage of samples with one or more mutations

Install iseqcbioportal library

Optional steps (create virtual environment):

python3 -m venv venv
source venv/bin/activate

Obligatory steps:

python3 -m pip install --upgrade pip
pip3 install iseqcbioportal

Requirements

  • python >=3.6
  • pandas >= 1.4.2
  • requests >= 2.28.1
  • SQLAlchemy >= 1.4.0
  • loguru >= 0.6.0
  • pysam >= 0.21.0

Create databases

create_cbioportal_database

Annotate VCF

vcf_annotate_cbioportal --input-vcf "/path/to/input.vcf.gz" \
                        --studies "studyId" \
                        --output-vcf "/path/to/output.vcf.gz"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iseqcbioportal-0.0.6.tar.gz (6.7 kB view details)

Uploaded Source

File details

Details for the file iseqcbioportal-0.0.6.tar.gz.

File metadata

  • Download URL: iseqcbioportal-0.0.6.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.10

File hashes

Hashes for iseqcbioportal-0.0.6.tar.gz
Algorithm Hash digest
SHA256 317e783bd604c5f3f594a1adee19d81fabf99b7858902303e9cad75094417b2b
MD5 9a8d11f6a5e88f78de4a4fc133bd788a
BLAKE2b-256 defb8b1e966ec6da7e953c79c7b9ef8635d9bc0724ab13df7739205e0935772a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page