A Python toolkit for auditable harmonization, liftover, intersection, and projection of genetic variant tables and payloads.
Project description
Variant Table Toolkit
This toolkit harmonizes genetic variant data across common research formats and reference assemblies. It supports GRCh37, GRCh38, and T2T-CHM13v2.0, chromosomes 1-22, X, Y, and MT, common contig naming modes (ncbi, ucsc, plink), and biallelic variants, including SNPs and supported indels.
The common workflow rewrites chr / bp / a1 / a2 / snp fields in BFILE, PFILE, VCF, or summary-statistics inputs into a standardized variant key while adjusting the attached data, such as genotypes or summary-statistic columns, accordingly. Users request the target build, contig naming, and optional filtering or normalization flags; the pipeline handles build guessing in the source data, liftover between builds (if needed), allele swaps, reference-anchored allele ordering, sorting, and duplicate removal. The workflow is split into preparation and projection phases, so users can save and reuse a prepared variant set, or project only a user-defined subset of source variants.
Start here
- Install the runtime using one of the supported paths in docs/install.md.
- Download the reference FASTA/chain assets and configure
config.yamlas described in docs/downloads.md. - Run through the worked example in docs/tutorial-1.md.
Documentation
- Workflow: the common prepare, combine, restrict, and project workflow.
- Summary statistics: metadata, SNP-only imports with
--id-lookup, projection, and clean projection. - Primitive tools and object model reference: lower-level tools plus
.vmap,.vtable, payloads, source-row mapping, object metadata, and allele ordering.
Specifications
For exact schema and edge-case rules, see SPEC.md and the detailed specs in spec/. Wrapper behavior for prepare_variants.py, prepare_variants_sharded.py, and project_payload.py is defined in spec/workflow.md. Payload-application semantics are defined in spec/payload-application.md.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file genomatch-0.4.3.tar.gz.
File metadata
- Download URL: genomatch-0.4.3.tar.gz
- Upload date:
- Size: 153.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e0aff6e26f2e228be7e3eea3b07155384e2f71a20f58424ef6c0362fa9aa4bcb
|
|
| MD5 |
2796c75d008eedce031b1787cfdd1fcc
|
|
| BLAKE2b-256 |
fa1ea2e03f3ef00c6ab20c6837416273402a982685011805bea56ef6f672b73e
|
Provenance
The following attestation bundles were made for genomatch-0.4.3.tar.gz:
Publisher:
release.yaml on precimed/genomatch
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
genomatch-0.4.3.tar.gz -
Subject digest:
e0aff6e26f2e228be7e3eea3b07155384e2f71a20f58424ef6c0362fa9aa4bcb - Sigstore transparency entry: 1575193350
- Sigstore integration time:
-
Permalink:
precimed/genomatch@e4978ba7663df874d52579f75ffc6e25c1922a26 -
Branch / Tag:
refs/tags/v0.4.3 - Owner: https://github.com/precimed
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@e4978ba7663df874d52579f75ffc6e25c1922a26 -
Trigger Event:
push
-
Statement type:
File details
Details for the file genomatch-0.4.3-py3-none-any.whl.
File metadata
- Download URL: genomatch-0.4.3-py3-none-any.whl
- Upload date:
- Size: 134.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
347afac3a405a7a8bff09b7f2efb6450aeca2a5c1d639444c5c242a9a67573af
|
|
| MD5 |
76f80dd856d5e3e810007b8d197cc363
|
|
| BLAKE2b-256 |
e29b41263a8c70b33a23d8e267fde35ef36a6c16ea943f551bb15cfac65095b9
|
Provenance
The following attestation bundles were made for genomatch-0.4.3-py3-none-any.whl:
Publisher:
release.yaml on precimed/genomatch
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
genomatch-0.4.3-py3-none-any.whl -
Subject digest:
347afac3a405a7a8bff09b7f2efb6450aeca2a5c1d639444c5c242a9a67573af - Sigstore transparency entry: 1575193366
- Sigstore integration time:
-
Permalink:
precimed/genomatch@e4978ba7663df874d52579f75ffc6e25c1922a26 -
Branch / Tag:
refs/tags/v0.4.3 - Owner: https://github.com/precimed
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yaml@e4978ba7663df874d52579f75ffc6e25c1922a26 -
Trigger Event:
push
-
Statement type: