Compression of binary table
CompressBinaryTable
Basic Usage:
cbt -i input_file -o output_file -c
Options:
-i = input, required
-o = output, required
-c = compression
-d = decompression
-t = output type, either csv or tsv
--override = overwrite the output file if it already exists
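The options above can also be assembled from Python when the tool is driven from a script. The following is a minimal sketch, not part of the package: `build_cbt_command` is a hypothetical helper, and it assumes that `-c` and `-d` are mutually exclusive and that `cbt` is on the PATH. It only uses the flags listed above.

```python
import shlex

def build_cbt_command(input_file, output_file, decompress=False,
                      output_type=None, override=False):
    """Assemble an argv list for the cbt CLI (hypothetical helper).

    Assumption: exactly one of -c / -d is passed per invocation.
    """
    if output_type not in (None, "csv", "tsv"):
        raise ValueError("output_type must be 'csv' or 'tsv'")
    cmd = ["cbt", "-i", input_file, "-o", output_file]
    # -c compresses, -d decompresses
    cmd.append("-d" if decompress else "-c")
    if output_type is not None:
        cmd += ["-t", output_type]
    if override:
        # Overwrite the output file if it already exists
        cmd.append("--override")
    return cmd

# Build a decompression command and show it as a shell line.
example = build_cbt_command("table.cbt", "table.tsv",
                            decompress=True, output_type="tsv", override=True)
print(shlex.join(example))
```

The resulting list can be handed to `subprocess.run(example, check=True)` to run the tool.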
Available Python functions:
# Returns the uncompressed file contents as a NumPy array, useful for ML (equivalent to loading the file as a pandas DataFrame and calling to_numpy())
cbt_to_array(compressed_file)
# Returns the mutation names (column names) as a NumPy array, useful for ML (equivalent to loading the file as a pandas DataFrame and reading df.columns)
cbt_columns(compressed_file)
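Both functions are described in terms of what you would get from an uncompressed table. The sketch below illustrates that split into a value matrix and column names using only the standard library. It does not call the package: `read_binary_table` and the sample data are hypothetical, and it assumes the first column holds row labels and the last column is the outcome.

```python
import csv
import io

# Hypothetical sample table: row label, binary features, outcome.
SAMPLE_CSV = """strain_name,mut1,mut2,outcome
strain1,0,1,1
strain2,1,0,0
"""

def read_binary_table(text, delimiter=","):
    """Split a delimited binary table into row names, column names, and values."""
    rows = list(csv.reader(io.StringIO(text), delimiter=delimiter))
    header, body = rows[0], rows[1:]
    columns = header[1:]                      # analogue of the column names
    names = [row[0] for row in body]          # row labels (assumed first column)
    matrix = [[int(v) for v in row[1:]] for row in body]  # analogue of the array
    return names, columns, matrix

names, columns, matrix = read_binary_table(SAMPLE_CSV)

# Typical ML split, assuming the last column is the label.
features = [row[:-1] for row in matrix]
labels = [row[-1] for row in matrix]
print(columns, features, labels)
```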
Required file example:
CSV:
strain_name,mut1,mut2,mut3,mut4,outcome
strain1,0,1,1,1,1,1
strain2,0,0,1,1,1,0
strain3,0,1,0,1,1,0
strain4,1,1,1,1,1,1
TSV:
strain_name	mut1	mut2	mut3	mut4	outcome
strain1	0	1	1	1	1	1
strain2	0	0	1	1	1	0
strain3	0	1	0	1	1	0
strain4	1	1	1	1	1	1
CBT:
1;mut1;mut2;mut3;mut4;outcome
strain1;6;43;87;102
strain2;16;43;87;102
strain3;6;53;78;112
strain4;61;413;824;942
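The CSV and TSV layouts above differ only in the delimiter. If a table is in one form and the other is needed as input, a small standard-library conversion is enough. This is a sketch independent of the package; `convert_delimited` is a hypothetical helper name.

```python
import csv
import io

def convert_delimited(text, src=",", dst="\t"):
    """Re-emit delimited text with a different field delimiter."""
    reader = csv.reader(io.StringIO(text), delimiter=src)
    out = io.StringIO()
    # Use "\n" line endings instead of the csv module's default "\r\n".
    writer = csv.writer(out, delimiter=dst, lineterminator="\n")
    writer.writerows(reader)
    return out.getvalue()

csv_text = "strain_name,mut1,mut2\nstrain1,0,1\n"
tsv_text = convert_delimited(csv_text)              # CSV -> TSV
back = convert_delimited(tsv_text, src="\t", dst=",")  # TSV -> CSV
```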
Hashes for compressbinarytable-0.1.7.tar.gz
Algorithm | Hash digest
---|---
SHA256 | c8008df2e5fce8a1db79ea7fecf6ec00963066f8f18e0b78b9ecb57de6e7ed59
MD5 | 24ec3d6f565b7b64a0f86aa8dae172da
BLAKE2b-256 | 0ca226a3d60c3f26600db8c9ba180e31b882e4975ab5ba08ada9b9a7f701e235
Hashes for compressbinarytable-0.1.7-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 931122895cf5e2bae3e1071b75f65da53ba3d888e5bfc7632711fb607e0ce89b
MD5 | e3b14feb677670ffe242db749e1bb765
BLAKE2b-256 | 0a4f082dd2a589cdf820a8dfc28c2ea27c1c1f5c9f1d26982ba663d6d2735788