Split VCF file into intelligible tab-delimited files
VCF stands for Variant Call Format and is widely used in bioinformatics to store sequence variant data, as it is flexible enough to hold a wide range of data. Although it was inspired by CSV files to (I suppose) make VCF easy for humans to read and easy for machines to parse, nowadays it does neither well. Its full specification cannot be given beforehand, because each VCF file is free to specify the format of its own fields. As a result, the first step in processing the data contained in a VCF file is often to extract a subset of it and convert the extracted data into a more amenable file format.
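To illustrate why the fields cannot be known in advance, here is a minimal sketch that reads the INFO and FORMAT declarations from a VCF header. It is not part of unvcf; the function name `declared_fields` and the sample header are made up for illustration, and the pattern assumes the usual `ID,Number,Type` key order in each declaration.

```python
import re

# Matches header declarations such as:
# ##INFO=<ID=DP,Number=1,Type=Integer,Description="Total Depth">
# Assumption: keys appear in the order ID, Number, Type.
_DECLARATION = re.compile(
    r"^##(INFO|FORMAT)=<ID=([^,>]+),Number=([^,>]+),Type=([^,>]+)"
)


def declared_fields(lines):
    """Return {(kind, field_id): (number, value_type)} declared in the header."""
    fields = {}
    for line in lines:
        if not line.startswith("##"):
            break  # meta-information lines stop at the #CHROM column header
        match = _DECLARATION.match(line)
        if match:
            kind, field_id, number, value_type = match.groups()
            fields[(kind, field_id)] = (number, value_type)
    return fields


# Hypothetical header used only for this example.
example_header = [
    "##fileformat=VCFv4.2",
    '##INFO=<ID=DP,Number=1,Type=Integer,Description="Total Depth">',
    '##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">',
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tsample1",
]

for key, value in declared_fields(example_header).items():
    print(key, value)
```

Two VCF files can declare entirely different sets of such fields, which is why a generic reader has to inspect the header before it can interpret the data lines.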
This command-line Python package aims to tackle the above problems. It splits a VCF file into standard CSV files, each containing a different field of the original VCF file, making the data easy to read by both humans and machines again.
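The idea of one table per field can be sketched as follows. This is only an illustration of the concept, not unvcf's actual implementation: the function names, the choice of the per-sample FORMAT keys as the fields, the table layout, and the comma delimiter are all assumptions.

```python
import csv
import io


def split_format_fields(vcf_lines):
    """Group per-sample values by FORMAT key: {key: [header_row, data_row, ...]}."""
    tables = {}
    samples = []
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank and meta-information lines
        columns = line.split("\t")
        if line.startswith("#CHROM"):
            samples = columns[9:]  # sample names follow the FORMAT column
            continue
        chrom, pos = columns[0], columns[1]
        keys = columns[8].split(":")
        per_sample = [value.split(":") for value in columns[9:]]
        for index, key in enumerate(keys):
            table = tables.setdefault(key, [["CHROM", "POS", *samples]])
            # A sample may omit trailing values; "." marks a missing value.
            values = [v[index] if index < len(v) else "." for v in per_sample]
            table.append([chrom, pos, *values])
    return tables


def to_csv_text(rows):
    """Serialize rows as CSV text."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


# Hypothetical two-sample file used only for this example.
example_vcf = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\ts1\ts2",
    "1\t100\t.\tA\tG\t50\tPASS\tDP=10\tGT:DP\t0/1:4\t1/1:6",
]

for key, rows in split_format_fields(example_vcf).items():
    print(key)
    print(to_csv_text(rows))
```

Each resulting table has one row per variant and one column per sample, which is the kind of shape that spreadsheet tools and data-frame libraries handle directly.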
Install
Assuming you have an up-to-date Python installation, you will almost certainly have everything needed to install unvcf. From a terminal, enter:
pip install unvcf
Usage
After the installation, you will have access to unvcf from the terminal. You can then start using it as follows:
unvcf path_to_file.vcf destination_folder/
This will produce CSV files that represent the original fields of path_to_file.vcf. For more information, you can enter:
unvcf --help
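If you want to call unvcf from a Python script rather than from the terminal, a thin wrapper around the command shown above might look like the sketch below. The helper names `unvcf_command` and `run_unvcf` are hypothetical, and the names of the files that unvcf writes are not assumed: the wrapper simply lists whatever ends up in the destination folder.

```python
import subprocess
from pathlib import Path


def unvcf_command(vcf_path, destination):
    """Build the argument list for the documented `unvcf <file> <folder>` call."""
    return ["unvcf", str(vcf_path), str(destination)]


def run_unvcf(vcf_path, destination):
    """Run unvcf, then return the files found in the destination folder."""
    # check=True raises CalledProcessError if unvcf exits with an error.
    subprocess.run(unvcf_command(vcf_path, destination), check=True)
    return sorted(p for p in Path(destination).iterdir() if p.is_file())


print(unvcf_command("path_to_file.vcf", "destination_folder/"))
```

Calling `run_unvcf("path_to_file.vcf", "destination_folder/")` requires unvcf to be installed and available on your PATH.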
If by any chance you face a problem or have a question, please create a new issue and we will try to sort it out as soon as possible.
License
This project is licensed under the MIT License.