Command-line tool to cut VCF (Variant Call Format) files into smaller batches, intended for multiprocessing or distributed computing.

📑️ VCF Batcher

This is a Rust crate to cut VCF (Variant Call Format) files into smaller batches, intended for multiprocessing or distributed computing.

🧰️ Installation

Depending on what your goals are, you can use this tool as a CLI or as a library in 🦀️ Rust or 🐍️ Python.

Installing the CLI

To install the program as a CLI, you will need to have cargo installed; see the official instructions for installing cargo.

Once you have it, you can run the following command in your terminal to install the VCF batcher.

cargo install vcf_batcher

Installing Rust Crate

To use the tool as a Rust crate, add it to your Cargo.toml dependencies or run:

cargo add vcf_batcher

You can find the crate documentation on docs.rs.

Installing Python bindings

We provide Python bindings for the VCF batcher, which can be installed via pip.

pip install vcf-batcher

🪄️ Usage

CLI

After installation, the CLI is available through the vcf_batcher_cli command.

vcf_batcher_cli path/to/your_file.vcf path/to/output/directory

By default, this will create batches with 25,000 samples each. To override this default, provide a custom --batch-size or -b argument:

vcf_batcher_cli -b 1000 path/to/your_file.vcf path/to/output/directory
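If you want to drive the CLI from a script (for example, as the first step of a larger pipeline), the invocation above could be wrapped roughly as follows. This is a minimal sketch, not part of the project: the helper names build_batcher_command and run_batcher are hypothetical, and it assumes that vcf_batcher_cli is on your PATH and reports failure through a non-zero exit code. Only the two positional arguments and the -b flag documented above are used.

```python
import subprocess
from typing import List, Optional


def build_batcher_command(
    vcf_path: str, output_dir: str, batch_size: Optional[int] = None
) -> List[str]:
    """Assemble the argument list for vcf_batcher_cli (hypothetical helper)."""
    command = ["vcf_batcher_cli"]
    if batch_size is not None:
        if batch_size <= 0:
            raise ValueError("batch_size must be a positive integer")
        # -b is the short form of --batch-size, as described above.
        command += ["-b", str(batch_size)]
    # Positional arguments: input VCF first, then the output directory.
    command += [str(vcf_path), str(output_dir)]
    return command


def run_batcher(
    vcf_path: str, output_dir: str, batch_size: Optional[int] = None
) -> None:
    """Run the CLI; raises CalledProcessError on a non-zero exit code.

    Assumption: the CLI signals failure through its exit code.
    """
    subprocess.run(
        build_batcher_command(vcf_path, output_dir, batch_size), check=True
    )
```

Calling run_batcher("path/to/your_file.vcf", "path/to/output/directory", batch_size=1000) would then be equivalent to the second command above. Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.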

Library

After installing either the Rust crate or the Python module, you can use the provided function.

🦀️ Rust

pub fn extract_variants_to_batches(
    file_path: &str,
    batch_size: usize,
    output_path: &Path,
    compression_level: Option<Compression>
)

🐍️ Python

vcf_batcher.py_extract_variants_to_batches(
    input_file,
    batches_folder,
    batch_size,
)
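As an illustration of how the Python function might be called in practice, here is a small wrapper. It is a sketch under several assumptions that the text above does not confirm: that the arguments are passed positionally in the order shown, that the paths are accepted as plain strings, and that the batch size is an integer. The helper name batch_vcf is hypothetical, and its default of 25,000 simply mirrors the CLI default; the binding itself may not have a default.

```python
from pathlib import Path


def batch_vcf(input_file: str, batches_folder: str, batch_size: int = 25_000) -> Path:
    """Split input_file into batches written to batches_folder (hypothetical helper)."""
    source = Path(input_file)
    if not source.is_file():
        raise FileNotFoundError(f"VCF file not found: {source}")

    target = Path(batches_folder)
    # Assumption: creating the folder up front is harmless even if the
    # library would create it on its own.
    target.mkdir(parents=True, exist_ok=True)

    # Imported lazily so this helper can be defined before the package is installed.
    import vcf_batcher

    # Assumption: positional arguments in the documented order, paths as strings.
    vcf_batcher.py_extract_variants_to_batches(str(source), str(target), batch_size)
    return target
```

A call such as batch_vcf("path/to/your_file.vcf", "path/to/batches", batch_size=1000) returns the output folder, whose contents can then be handed to worker processes.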

License

The software is licensed under the MIT License.
