Skip to main content

Encrypted filtering of germline calls from a somatic vcf.

Project description

Germline Filter is a Python program used in the ICGC-TCGA DREAM Mutation Calling Challenge meant to contribute to the real data security measures. It takes as input a preprocessed and encrypted germline calls file, as generated by GATK, and a somatic SNV vcf file, and it returns the number of germline calls found in the somatic vcf.

Features:

  • The most important feature of the GermlineFilter is that the program runs in an encrypted fashion, making it safe for running on any server. All the filtering steps described in the flowchart are done at runtime, and at no point data is written on the disk. It has three options:
    • encrypt_germline_vcf - Encrypt a truth germline vcf (preprocessing steps in the workflow above)

    • filter - Filter a somatic vcf against an encrypted truth germline vcf. This step is done in an encrypted fashion.

    • get_germline_positions - Get the actual germline positions called in a somatic vcf. This step is done in an unencrypted fashion, against the original truth germline vcf. It should only be run locally or on an encrypted server. The output is written to a tab delimited file.

  • Multiple germline vcf’s can be preprocessed at the same time, with a common salt file and key file.

  • Multiple somatic vcf’s corresponding to the same encrypted truth germline file can be filtered simultaneously. This considerably increases the speed versus individual runs.

  • The user can choose the encryption protocol (AES, Blowfish); default AES

  • The user can choose the hashing protocol (md5 or sha512); default sha512

  • Get the actual germline position in a vcf, for plotting and further analysis.

Usage:

After installation, to find out how to use the Germline filter, run:

germline_filter --help

For more examples, please take a look at the user manual, located in <path-to-dir>/GermlineFilter-1.2/doc

Project details


Release history Release notifications | RSS feed

This version

1.2

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

GermlineFilter-1.2.tar.gz (102.3 kB view details)

Uploaded Source

File details

Details for the file GermlineFilter-1.2.tar.gz.

File metadata

  • Download URL: GermlineFilter-1.2.tar.gz
  • Upload date:
  • Size: 102.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for GermlineFilter-1.2.tar.gz
Algorithm Hash digest
SHA256 e4982353c544d94b34f02c5690ab3d3ebc93480d5b62fe6f3317f23c515acc05
MD5 34e9e56c901c87fe584c8da5fbe4803d
BLAKE2b-256 3e240eef52136367143111f2cdcc3f93a80fc19c19820c0e361cc3061bff3aaf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page