Skip to main content

Tools for indexing GREGoR VCFs

Project description

vrsix: Indexing VRS-Annotated VCFs

Overview

vrsix provides a file-based indexing strategy to support fast lookup of AnVIL-hosted VCFs using IDs and annotations drawn from the GA4GH Variation Representation Specification.

See the vrsix Terra workflow for a readymade Terra implementation.

Usage

To get started, you will need a fully VRS-annotated VCF. Confirm that your VCF contains the following info fields:

  • VRS_Allele_IDs
  • VRS_Starts
  • VRS_Stops
  • VRS_States

For example, confirm that chr1.vcf has the required fields.

bcftools view -h chr1.vcf | grep '^##INFO='

From a VRSified VCF, ingest a VRS ID and the corresponding VCF-called location (i.e. sufficient inputs for a tabix lookup), and store them in a sqlite database.

vrsix load chr1.vcf

Each variation is stored with an associated file URI to support later retrieval. By default, this URI is simply the input VCF's location in the file system, but you may declare a custom URI instead as an optional argument:

vrsix load chr1.vcf gs://my_stuff/chr1.vcf

By default, all records are ingested into a sqlite file located at ~/.local/share/vrsix.db. This can be overridden with either the environment variable VRS_VCF_INDEX, or with an optional flag to the CLI:

vrsix load --db-location=./vrsix.db input.vcf

Development

Ensure that a recent version of the Rust toolchain is available.

Create a virtual environment and install developer dependencies:

python3 -m venv venv
source venv/bin/activate
python3 -m pip install -e '.[dev,tests]'

This installs Python code as editable, but after any changes to Rust code, run maturin develop to rebuild the Rust binary:

maturin develop

Be sure to install pre-commit hooks:

pre-commit install

Check Python style with ruff:

python3 -m ruff format . && python3 -m ruff check --fix .

Use cargo fmt to check Rust style (must be run from within the rust/ subdirectory):

cd rust/
cargo fmt

Run tests from the project root with pytest:

pytest

Some granular tests are written directly into the Rust backend as well:

cd rust/
cargo test

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vrsix-0.2.0.tar.gz (26.0 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

vrsix-0.2.0-cp313-cp313t-musllinux_1_2_x86_64.whl (3.0 MB view details)

Uploaded CPython 3.13tmusllinux: musl 1.2+ x86-64

vrsix-0.2.0-cp313-cp313t-musllinux_1_2_i686.whl (3.1 MB view details)

Uploaded CPython 3.13tmusllinux: musl 1.2+ i686

vrsix-0.2.0-cp313-cp313t-musllinux_1_2_armv7l.whl (2.9 MB view details)

Uploaded CPython 3.13tmusllinux: musl 1.2+ ARMv7l

vrsix-0.2.0-cp313-cp313t-musllinux_1_2_aarch64.whl (2.9 MB view details)

Uploaded CPython 3.13tmusllinux: musl 1.2+ ARM64

vrsix-0.2.0-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.9 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ x86-64

vrsix-0.2.0-cp313-cp313t-manylinux_2_17_s390x.manylinux2014_s390x.whl (3.0 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ s390x

vrsix-0.2.0-cp313-cp313t-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (2.9 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ ppc64le

vrsix-0.2.0-cp313-cp313t-manylinux_2_17_i686.manylinux2014_i686.whl (2.9 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ i686

vrsix-0.2.0-cp313-cp313t-manylinux_2_17_armv7l.manylinux2014_armv7l.whl (2.6 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ ARMv7l

vrsix-0.2.0-cp313-cp313t-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (2.7 MB view details)

Uploaded CPython 3.13tmanylinux: glibc 2.17+ ARM64

vrsix-0.2.0-cp313-cp313t-macosx_11_0_arm64.whl (2.5 MB view details)

Uploaded CPython 3.13tmacOS 11.0+ ARM64

vrsix-0.2.0-cp313-cp313t-macosx_10_12_x86_64.whl (2.6 MB view details)

Uploaded CPython 3.13tmacOS 10.12+ x86-64

vrsix-0.2.0-cp310-abi3-musllinux_1_2_x86_64.whl (3.0 MB view details)

Uploaded CPython 3.10+musllinux: musl 1.2+ x86-64

vrsix-0.2.0-cp310-abi3-musllinux_1_2_i686.whl (3.1 MB view details)

Uploaded CPython 3.10+musllinux: musl 1.2+ i686

vrsix-0.2.0-cp310-abi3-musllinux_1_2_armv7l.whl (2.9 MB view details)

Uploaded CPython 3.10+musllinux: musl 1.2+ ARMv7l

vrsix-0.2.0-cp310-abi3-musllinux_1_2_aarch64.whl (2.9 MB view details)

Uploaded CPython 3.10+musllinux: musl 1.2+ ARM64

vrsix-0.2.0-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.9 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ x86-64

vrsix-0.2.0-cp310-abi3-manylinux_2_17_s390x.manylinux2014_s390x.whl (3.0 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ s390x

vrsix-0.2.0-cp310-abi3-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl (2.9 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ ppc64le

vrsix-0.2.0-cp310-abi3-manylinux_2_17_i686.manylinux2014_i686.whl (2.9 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ i686

vrsix-0.2.0-cp310-abi3-manylinux_2_17_armv7l.manylinux2014_armv7l.whl (2.6 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ ARMv7l

vrsix-0.2.0-cp310-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (2.7 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.17+ ARM64

vrsix-0.2.0-cp310-abi3-macosx_11_0_arm64.whl (2.5 MB view details)

Uploaded CPython 3.10+macOS 11.0+ ARM64

vrsix-0.2.0-cp310-abi3-macosx_10_12_x86_64.whl (2.6 MB view details)

Uploaded CPython 3.10+macOS 10.12+ x86-64

File details

Details for the file vrsix-0.2.0.tar.gz.

File metadata

  • Download URL: vrsix-0.2.0.tar.gz
  • Upload date:
  • Size: 26.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: maturin/1.8.3

File hashes

Hashes for vrsix-0.2.0.tar.gz
Algorithm Hash digest
SHA256 3a11e970f4110dc5e1f37e0a56448bd6f8370c9f1522ea3902dcc62143c637d5
MD5 a9211a6debcaa94a2a38795be50d719f
BLAKE2b-256 8c1e2a4064235112902d30de5f76ae47719a72f9d447e8a71af81eab06fd599f

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 5e74c0f9cf31f343e715225d7a7107628b56ae9c2b1305acecc5dd02bd2e4ae3
MD5 4b61a8825e7a391b90a39c2503a833e5
BLAKE2b-256 35a3316f1661406ada41c182a3c714f357c7b6c8cdd2de065168922167c255ef

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-musllinux_1_2_i686.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-musllinux_1_2_i686.whl
Algorithm Hash digest
SHA256 61e25e0232ccd7dd3141cc665264c691309b45d45fdc9c911f6970d18bfc1c25
MD5 36cf0be3fb1a4055446fe48ce00e9f1a
BLAKE2b-256 02ee5e195b782f42763bf2bb8d452a0278fd98b4571556f2db163c8c3567c965

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-musllinux_1_2_armv7l.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-musllinux_1_2_armv7l.whl
Algorithm Hash digest
SHA256 567ff3bd3bcefab43928cba0d0667e78fa3a4f80ff51d9256a949a4a9c81afb4
MD5 f434d1228c1dfd5f5487d1c91cd6b4b7
BLAKE2b-256 4f980906a7d1a00b41f167e642398237c57749aaf2e9bf52528e511d9d0f1c09

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-musllinux_1_2_aarch64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-musllinux_1_2_aarch64.whl
Algorithm Hash digest
SHA256 144a6e08e507c0ae810c0c09a8b4fe8807bec64028c36ec0009f49b70cea9a27
MD5 ae8fdc7d09ef5f471a18da9c7b311a87
BLAKE2b-256 d3d28348d97e03e2b4f279ea86c0e27973f909d92b67277cbef95814fa2f6635

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 aad4311dcf8f00928006565f7ea9f36f06d50748649f30158407de0f2dc9ffc5
MD5 958aef04dc05d60ab5fc903e52cca458
BLAKE2b-256 7f18a42804dd3128066581466a6cc4f62047bcd652ef33d62cf2d4f286198d62

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-manylinux_2_17_s390x.manylinux2014_s390x.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-manylinux_2_17_s390x.manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 1dcdb8ca0c9af34f06f6a57f000c663be556956f5c5f2a4cb344939d06088f53
MD5 87802da3d51171492e0ead68e165becd
BLAKE2b-256 d767b37df6a82201a48169e18ec197d064903b28630e696243784651589ee86f

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 72d74b5895d091060328e7b04e3a4a2392def86b86ea97cd0bf053ea4b8c1385
MD5 3f193c5395a11432c7b41cef18144c4e
BLAKE2b-256 03593217fb99e107d3fe5188156a0641a0c49ea32c5c054275c93bab9930d9a3

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 50680c8aabaaa53af74344c2fc03fdf53a0c76456da2cf9ce0c92c23c46a0657
MD5 888b16c0c073489afe0441cbd9c1363b
BLAKE2b-256 07b0890658b97c7ff2d9f20c4cab51240ea240121e9648320f2b9a1c6c359d2f

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-manylinux_2_17_armv7l.manylinux2014_armv7l.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-manylinux_2_17_armv7l.manylinux2014_armv7l.whl
Algorithm Hash digest
SHA256 529a836f788a126a3fcf0f3bcf84254e6db3b9242de8473377a5972e0ac0b284
MD5 c60a67e05b8b247edaf77ef21f70042f
BLAKE2b-256 b5d91907d3c07ed00cb4aa997fcca92a959407fb14cd9973ae5f1b29db03341e

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 cafe0c423d36461d314edf4cb6e3e94387165a0a5b1ed638ffc61d8beb0eda3c
MD5 9390bf49cfb91405f334a1b4aae260a0
BLAKE2b-256 c4d9472a8b2a5fc8f71cf74da5e6cb716343c1eba669a1ee1118815142618c06

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 2ef6089e1973db9fd9cf05ac59f798c03a5050326f15e3d6175c87cb079a95b0
MD5 e108035bfda6832c8f93280b7907595e
BLAKE2b-256 b5cf89b48415a5ae1c248ebcdec702727fe818f2f246862ab79cac67dcbb0c52

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp313-cp313t-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp313-cp313t-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 5a7211169c2de247edf5d78670d62787608631ba9fa7d0039791a524446dea49
MD5 6fa790216682d14ba2c7e405a19c02d4
BLAKE2b-256 7f5c15cc4188cda70f5ff02b9eaac2d5cd6654909c832fdcb6debc94aedbebb5

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 20689d3b7b12d3230d3dbba3253797084f7381cb9b54448cb29f11c6fbb1cd78
MD5 8ece4cf6abaf96a0dac97d355faf715f
BLAKE2b-256 423ecdfb44efb2824ffa450b5dc54f8e30393fc8a21d8f853c747914faba2f31

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-musllinux_1_2_i686.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-musllinux_1_2_i686.whl
Algorithm Hash digest
SHA256 9b7fb0c51e0f471e1d4854942b4ef1b5b5d130bebbf21b32ed6d66a2a7d39d42
MD5 a656f837cf3a75ed679a7e084c387568
BLAKE2b-256 40e2240bf5e85292bbf28432cda9b4b44bc6435376969444baa31f9540e27a10

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-musllinux_1_2_armv7l.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-musllinux_1_2_armv7l.whl
Algorithm Hash digest
SHA256 aed3e5ae14b5fcdb9479784675fbcdcecd60b2c11e241ad83193cda2ae6ccc4d
MD5 d8754bfff12cfe75701c80b56ab96be9
BLAKE2b-256 e22d0c236a6611df4122cfbb481301dc58e3b8909a13207a102f7897ac039d02

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-musllinux_1_2_aarch64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-musllinux_1_2_aarch64.whl
Algorithm Hash digest
SHA256 6f6a4df363bd3faa82dec16d1f9523875d3bf0e34a9eca7aaac9bf1e2d5877c0
MD5 eb0ed0507595e3211396a64f6e8315e6
BLAKE2b-256 ec612657293644e09b4eef5f0082e238213a65a7fb636534eb2d5fb9ba72742b

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 248cdb9125f220e0ea16c8c908b5583e910c7867dde4da25fdc84d5c621d2b4c
MD5 eb8c494671171b14e83fe9faf46543e5
BLAKE2b-256 b8f2db01d58795776be86bfa4c98efbd6f5a654918d673d7fece49acbc4de0a4

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-manylinux_2_17_s390x.manylinux2014_s390x.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-manylinux_2_17_s390x.manylinux2014_s390x.whl
Algorithm Hash digest
SHA256 72d127d0df17319388ea03518c42da70a052d06a0e5cce702cca790910b7e028
MD5 b1ea60995552d5f93089bf9ccbaad1c8
BLAKE2b-256 60d1ee7715cad5679d209a6fed54020553c5f0b5abf1d15527db8d433ca74866

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-manylinux_2_17_ppc64le.manylinux2014_ppc64le.whl
Algorithm Hash digest
SHA256 4a3b5b3486b4c427b5ca23ec684853786f9be9bf5b0c9ea606b6e87efa83937c
MD5 0417856fec83dad8b84b22df9c5ec82a
BLAKE2b-256 705eb37b7edc816fc12476681611dc4b5e9d569e0d97435e9af74b7e3cac295e

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-manylinux_2_17_i686.manylinux2014_i686.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-manylinux_2_17_i686.manylinux2014_i686.whl
Algorithm Hash digest
SHA256 867e2060d5a18b883b61bd22042ec66024c6a91f8bec568e1ac02e137be04623
MD5 3caacbc2345d6d9d9ee32502aaae172f
BLAKE2b-256 0de4f3c02a872084bb275da6c3a34e888aaee0b2a8b19620ab7caccf2d8f7795

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-manylinux_2_17_armv7l.manylinux2014_armv7l.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-manylinux_2_17_armv7l.manylinux2014_armv7l.whl
Algorithm Hash digest
SHA256 dcfc316e105bd2249759012cad58658c042a025ae6f3102969f3df0504acfb5d
MD5 d226cea7f569f6c718e401d59a11c23f
BLAKE2b-256 bacb0472bca0cf4cd991865bcb306d6129dddcc142069d9d0ede1ea7ca80d92d

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 3728c579c88e1ded85e91a1a2b15113e5236ee5d6fe10d89d5a5f2f715edb0b8
MD5 bf22d4bf0466e2eacfd7e8aa3e8e0637
BLAKE2b-256 892756c4c4d0e1cb12917fb5287210b43a344bf14902b6483474e2d8816e7415

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 93b6881c1dd254bcc9d81c84068310f7cc21afb426f1e6b37e5b4a6f33676ea2
MD5 fb9cdfb3012ef0d1126d07a15b71bd30
BLAKE2b-256 6c96e3d6f6df94777f90093940ab9fa713e3253fe9393327987d6901b058ad5d

See more details on using hashes here.

File details

Details for the file vrsix-0.2.0-cp310-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for vrsix-0.2.0-cp310-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 fd6cae6d68c9a8841543706d5f1cda601d090036c897d6ec875d6e6d90d1965f
MD5 cc5f990a2d94a26e5fff45919f746366
BLAKE2b-256 a3c6c7adc9c33dace9f05300d893b723f73ed024e6f760cd6a425ec59606377a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page