Command-line tool to cut VCF (variant call files) into smaller batches, intended to be used for multiprocessing or distributed computing.
Project description
📑️ VCF Batcher
This is a Rust crate to cut VCF (variant call files) into smaller batches, intended to be used for multiprocessing or distributed computing.
🧰️ Installation
Depending on what your goals are, you can use this tool as a CLI or as a library in 🦀️ Rust or 🐍️ Python.
Installing the CLI
In order to install the program as a CLI, you will need to have cargo installed.
Instructions to install cargo
Once you have it, you can run the following command in your terminal to install the VCF batcher.
cargo install vcf_batcher
Installing Rust Crate
In order to install the tool as a rust crate, you can add it to your Cargo.toml dependencies or
run:
cargo add vcf_batcher
You can find the crate documentation on docs.rs.
Installing python bindings
We provide python bindings for the VCF batcher which can be installed via pip.
pip install vcf-batcher
🪄️ Usage
CLI
Using the CLI after installing can be done through the vcf_batcher_cli command.
vcf_batcher_cli path/to/your_file.vcf path/to/ouput/directory
By default, this will create batches with 25'000 samples each. If you'd like to override this
default, you can do so by providing a custom --batch-size or -b argument:
vcf_batcher_cli -b 1000 path/to/your_file.vcf path/to/ouput/directory
Library
After installing either the rust crate or python module, you can use the provided function.
🦀️ Rust
pub fn extract_variants_to_batches(
file_path: &str,
batch_size: usize,
output_path: &Path,
compression_level: Option<Compression>
)
🐍️ Python
vcf_batcher.py_extract_variants_to_batches(
input_file,
batches_folder,
batch_size,
)
License
The software is licensed under the MIT License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vcf_batcher-0.2.1-cp311-none-win_amd64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp311-none-win_amd64.whl
- Upload date:
- Size: 252.6 kB
- Tags: CPython 3.11, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e3cf0564d3fee01b8b83aea2bacb2e0da8a7e3e0d67cf9f7709476f3b707f504
|
|
| MD5 |
deb345fc99df63f38d63d5f7c8eb568b
|
|
| BLAKE2b-256 |
ea25612c1f57852792614e37310e71bc062cd70a4430317c5983ac0122e54c61
|
File details
Details for the file vcf_batcher-0.2.1-cp311-none-win32.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp311-none-win32.whl
- Upload date:
- Size: 247.1 kB
- Tags: CPython 3.11, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ee13e96d401271d6c72617729d4ed43df5cc088050c633eaeb750378f9160d57
|
|
| MD5 |
710f0cbf8f27ec6764a43226eb4ecf03
|
|
| BLAKE2b-256 |
ebe9784da21e4b9868dbf28212a58df675cd83f73820d53ef5dd641eb6035a6d
|
File details
Details for the file vcf_batcher-0.2.1-cp311-cp311-macosx_11_0_arm64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp311-cp311-macosx_11_0_arm64.whl
- Upload date:
- Size: 383.5 kB
- Tags: CPython 3.11, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cd59330cabc033322705bb970c711520f5a4c7a09ced4a77dcb21629b6968041
|
|
| MD5 |
1c1325a54394145fb91649acb5f49ac4
|
|
| BLAKE2b-256 |
21f77ec6160a4331500b5379b6ee628119c695c9c781202cf2aa82f81d4598cf
|
File details
Details for the file vcf_batcher-0.2.1-cp311-cp311-macosx_10_7_x86_64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp311-cp311-macosx_10_7_x86_64.whl
- Upload date:
- Size: 396.8 kB
- Tags: CPython 3.11, macOS 10.7+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ef5de7e9ea6bea9377ca275707000b7950768682589188ddad4fd5be8f34a9c0
|
|
| MD5 |
e4b99f513976293a5409c65bdca80e32
|
|
| BLAKE2b-256 |
4487960b06839a6ceaf56411dcd49fddc22734523fbe64da8070119f3f8f9e28
|
File details
Details for the file vcf_batcher-0.2.1-cp310-none-win_amd64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp310-none-win_amd64.whl
- Upload date:
- Size: 252.6 kB
- Tags: CPython 3.10, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
08d0a631ac350e242409b810eb90c3bf9e90b83d1ae84aac57cd8103d5687cec
|
|
| MD5 |
6350c6cb24327258f6a473514132274f
|
|
| BLAKE2b-256 |
9207cf9358ccb3b8b5089ea07bfa50f0883bd30bce5d9b227b0ef160b190c953
|
File details
Details for the file vcf_batcher-0.2.1-cp310-none-win32.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp310-none-win32.whl
- Upload date:
- Size: 247.0 kB
- Tags: CPython 3.10, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cc0aa2b1ebf7a0e993c2375e6adf68d05c3db1a57803e9d0efa5a11dce7c90d1
|
|
| MD5 |
31685dc0c855067ae0838e276d13d214
|
|
| BLAKE2b-256 |
f6ff14c504c2b4404dc38ff55f7bd57f2be8c378a8ed9aa5a7bc6468b0a99429
|
File details
Details for the file vcf_batcher-0.2.1-cp310-cp310-manylinux_2_34_x86_64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp310-cp310-manylinux_2_34_x86_64.whl
- Upload date:
- Size: 333.5 kB
- Tags: CPython 3.10, manylinux: glibc 2.34+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
47aa82c00bf29ee72610b9bd264031d2ad7733b31b1f234fbd2673be1f73b821
|
|
| MD5 |
986df20421dd29c3d26a4cedc64db538
|
|
| BLAKE2b-256 |
1e6d9dc3e459e26ccac73d73240f7f95797314ce6355e96d6d61e7d1c9a32149
|
File details
Details for the file vcf_batcher-0.2.1-cp310-cp310-macosx_11_0_arm64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp310-cp310-macosx_11_0_arm64.whl
- Upload date:
- Size: 383.5 kB
- Tags: CPython 3.10, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c7efd62d6b553bd9ef7e57583aa85e7da5faafe3a6951fea5718297568dc06c7
|
|
| MD5 |
525923acc964751bf05d6b76d28f244f
|
|
| BLAKE2b-256 |
4973d66f0f758f5331ee56285ba0e6545b6835600daa4e97015264611e34500b
|
File details
Details for the file vcf_batcher-0.2.1-cp310-cp310-macosx_10_7_x86_64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp310-cp310-macosx_10_7_x86_64.whl
- Upload date:
- Size: 396.8 kB
- Tags: CPython 3.10, macOS 10.7+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7d88190c31c776e38c9eb2094715fd352fe03489903950215340532dda8051a9
|
|
| MD5 |
85bff5fc46b8d1563b7c07d5e643a877
|
|
| BLAKE2b-256 |
fd864abcb6eb95c99b86ea65d64605c3530e23c8e16794a2eafc06a8ea868c2f
|
File details
Details for the file vcf_batcher-0.2.1-cp39-none-win_amd64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp39-none-win_amd64.whl
- Upload date:
- Size: 252.6 kB
- Tags: CPython 3.9, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b48225f1657c6b6d6804de40d7789491770fcc8fbedb003913e2ba0d25278394
|
|
| MD5 |
f6846f1d55c3c6983026f0cda4611232
|
|
| BLAKE2b-256 |
98595b49445cb96675508294992f3f786cf20d5e65360a92bc087957ea7c03df
|
File details
Details for the file vcf_batcher-0.2.1-cp39-none-win32.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp39-none-win32.whl
- Upload date:
- Size: 247.1 kB
- Tags: CPython 3.9, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a4f7d3d64274ac32ef5aafbe05be911174298fc20178e182fd0c60f18ec89e6d
|
|
| MD5 |
de5dfa627489ab00a021441951dd486f
|
|
| BLAKE2b-256 |
5640640f1797e5c0d2d634a2c27a76d53c9004a253a534f755acc928c7e6513c
|
File details
Details for the file vcf_batcher-0.2.1-cp38-none-win_amd64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp38-none-win_amd64.whl
- Upload date:
- Size: 252.6 kB
- Tags: CPython 3.8, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5f80351d0593b714b91c220f71822e5448b374637918dcdcf28ddf717d943df1
|
|
| MD5 |
cf486e02993993024bd849e0f1271b8a
|
|
| BLAKE2b-256 |
f1cc22761d4256119b4758bcce9adbf91b577ae9ae65b6ea784c848944dbc0c9
|
File details
Details for the file vcf_batcher-0.2.1-cp38-none-win32.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp38-none-win32.whl
- Upload date:
- Size: 247.0 kB
- Tags: CPython 3.8, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c642136cbab006e726d4795cfc314738106320e690f746c5f38562d05e31a557
|
|
| MD5 |
05aca07deb1ad3d985c5d449918d4264
|
|
| BLAKE2b-256 |
1d711de6ddf5507dc2935a1ed3b32ab3a6b2eab5b72dd90e0f191dc0494b18fb
|
File details
Details for the file vcf_batcher-0.2.1-cp37-none-win_amd64.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp37-none-win_amd64.whl
- Upload date:
- Size: 252.6 kB
- Tags: CPython 3.7, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0e6ca1de4a438530685fa999685be8e7754bfdf68c118c8610d93f160f0ad57b
|
|
| MD5 |
ea9eb7765eb27f696a04ad13805907d8
|
|
| BLAKE2b-256 |
7ede60e153b9e559e9fdbb7fdc2c132627e03064a9c3c9a9287118e2b0a7fe82
|
File details
Details for the file vcf_batcher-0.2.1-cp37-none-win32.whl.
File metadata
- Download URL: vcf_batcher-0.2.1-cp37-none-win32.whl
- Upload date:
- Size: 247.0 kB
- Tags: CPython 3.7, Windows x86
- Uploaded using Trusted Publishing? No
- Uploaded via: maturin/0.15.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
49b1e65a8c3509f8dce4ab98e46b9139c287da95744cec2056acca07f2c4149b
|
|
| MD5 |
c8c21ea998e76b0090439d3a38811c0c
|
|
| BLAKE2b-256 |
a43cac91c44e1cf519b04215af31ef2860241433b013f834441d050fe37c1333
|