No project description provided
Project description
Exomiser Runner for PhEval
This is the Exomiser plugin for PhEval. With this plugin, you can leverage the variant prioritisation tool, Exomiser, to run the PhEval pipeline seamlessly. Instructions for setting up the appropriate directory layout, including the input directory and test data directory for a single PhEval run, can be found here
Contents
- Quick start
- Installation
- Running with
pheval run - Input directory structure
- Test data directory structure
- Post-processing pre-generated Exomiser results
- Generating Exomiser batch files
- Outputs
Quick start: running Exomiser with PhEval
pheval run \
--input-dir /path/to/input_dir \
--testdata-dir /path/to/testdata_dir \
--runner exomiserphevalrunner \
--output-dir /path/to/output_dir \
--version 15.0.0
This command will:
- Prepare Exomiser inputs
- Execute Exomiser
- Post-process raw Exomiser outputs into PhEval-standardised Parquet results ready for benchmarking
Installation
Install from PyPI (recommended):
pip install pheval.exomiser
Or from source:
git clone https://github.com/monarch-initiative/pheval.exomiser.git
cd pheval.exomiser
poetry install
poetry shell
Running with pheval run
The pheval run command manages the full lifecycle:
prepare → run → post-process.
pheval run \
--input-dir /path/to/input_dir \
--testdata-dir /path/to/testdata_dir \
--runner exomiserphevalrunner \
--output-dir /path/to/output_dir \
--version 15.0.0
Input directory structure
The input directory contains all Exomiser configuration, databases, and software.
Required files
input_dir/
├── config.yaml
├── exomiser-cli-15.0.0/
│ └── exomiser-cli-15.0.0.jar
├── preset-exome-analysis.yml
├── 2512_phenotype/
├── 2512_hg19/
└── 2512_hg38/
config.yaml
tool: exomiser
tool_version: 15.0.0
# NOTE gene-only preset analysis should only be run with Exomiser versions >= 13.2.0
variant_analysis: true
gene_analysis: true
disease_analysis: false
tool_specific_configuration_options:
environment: local
exomiser_software_directory: exomiser-cli-15.0.0
analysis_configuration_file: preset-exome-analysis.yml # can be blank if running without VCF, alternatively specify your own analysis configuration file for phenotype only
max_jobs: 0
application_properties:
remm_version:
cadd_version:
hg19_data_version: "2512"
hg19_local_frequency_path:
hg19_whitelist_path:
hg38_data_version: "2512"
hg38_local_frequency_path:
hg38_whitelist_path:
phenotype_data_version: "2512"
# either none, simple, or caffeine
cache_type: none
cache_caffeine_spec:
output_formats: [PARQUET] # options include HTML, JSON, PARQUET (v15.0.0 onwards), TSV_VARIANT, TSV_GENE, VCF
post_process:
# For Exomiser < 15.0.0, valid ranking methods include combinedScore, priorityScore, variantScore or pValue
# For Exomiser >= 15.0.0, valid ranking methods include geneCombinedScore, geneVariantScore or pValue
score_name: geneCombinedScore
# sort order should be specified to either ASCENDING or DESCENDING
# ASCENDING orders results with the lowest values ranked first
# DESCENDING orders results with the highest values ranked first
# NOTE when changing the score_name ensure the sort_order is also correct
sort_order: DESCENDING
Optional databases
input_dir/
├── cadd/
│ └── {{CADD-VERSION}}/
│ ├── hg19/
│ └── hg38/
├── local/
│ ├── local_frequency_test_hg19.tsv.gz
│ └── local_frequency_test_hg38.tsv.gz
└── remm/
├── ReMM.v{{REMM-VERSION}}.hg19.tsv.gz
└── ReMM.v{{REMM-VERSION}}.hg38.tsv.gz
Testdata directory structure
testdata_dir/
├── phenopackets/
└── vcf/ # optional
Phenotype-only mode
Set:
variant_analysis: false
Only phenopackets/ is required.
Important
Ifvariant_analysis: trueand novcf/directory exists, the VCF path will be taken from the Phenopacket.
Post-processing pre-generated Exomiser results
If Exomiser was run outside PhEval:
pheval-exomiser post-process-exomiser-results \
--results-dir /path/to/exomiser_results \
--phenopacket-dir /path/to/phenopackets \
--output-dir /path/to/write/output \
--score-name geneCombinedScore \
--gene-analysis \
--variant-analysis \
--version 15.0.0
Use pheval-exomiser post-process-exomiser-results --help for more options.
⚠️ Critical file naming rule (stem matching)
PhEval matches results to cases using file stem equality.
Phenopacket: patient_001.json
Result file: patient_001.json ✅
Result file: patient_001-exomiser.json ✅ (auto-stripped)
Result file: patient_001_run1.json ❌
Rule: the result filename stem must exactly match the phenopacket stem.
Generating Exomiser batch files
Generate batch scripts without running Exomiser:
pheval-exomiser prepare-exomiser-batch \
--phenopacket-dir /phenopackets \
--vcf-dir /vcf \
--variant-analysis \
--analysis-yaml preset-exome-analysis.yml \
--output-dir /batch_files \
--results-dir /exomiser_results
--exomiser-version 15.0.0
--output-formats PARQUET
--output-formats HTML
Use pheval-exomiser prepare-exomiser-batch --help for more options.
This writes batch files under tool_input_commands/.
Outputs
output_dir/
├── pheval_gene_results/
├── pheval_variant_results/
├── pheval_disease_results/
├── raw_results/
└── results.yml
These outputs are directly consumable by PhEval benchmarking utilities.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pheval_exomiser-0.4.12.tar.gz.
File metadata
- Download URL: pheval_exomiser-0.4.12.tar.gz
- Upload date:
- Size: 15.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
90eadcdd1f649ccd48b46e2a4130fe1fd58b77bdd2605e58c883eda2251d9106
|
|
| MD5 |
88985e01ae26a1d9733a1a7a069c24c3
|
|
| BLAKE2b-256 |
7d4b60b1e8e0bce105b7fa5d805e9dd1b9a79725203c6179d22a748d5d03c6ea
|
File details
Details for the file pheval_exomiser-0.4.12-py3-none-any.whl.
File metadata
- Download URL: pheval_exomiser-0.4.12-py3-none-any.whl
- Upload date:
- Size: 17.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cab1215c3865f11c028e26e3b4d85b2d3e70a01c24574aaa874ff56ed0433dfe
|
|
| MD5 |
ca8e76decb0aa7cb091977eac32c3b5b
|
|
| BLAKE2b-256 |
276002fc36ce8f1b00cac093aa7976a7053eb313ea8c4f18ee7e6390fb07b1cf
|