Compression of binary table
Project description
CompressBinaryTable
Basic Usage:
cbt -i input_file -o output_file -c
Options:
-i = input, required
-o = output, required
-c = compression
-d = decompression
-t = output type, either csv or tsv
--override = to override output file if it exist
Available Python functions:
# Returns numpy array of file as uncompressed, useful for ML (same as load as pandas df, and to_numpy())
cbt_to_array(compressed_file)
# Returns numpy array of name of the mutations as uncompressed, useful for ML, (same as load pandas df and use df.colums)
cbt_columns(compressed_file)
Required file example:
CSV:
strain_name,mut1,mut2,mut3,mut4,outcome
strain1,0,1,1,1,1,1
strain2,0,0,1,1,1,0
strain3,0,1,0,1,1,0
strain4,1,1,1,1,1,1
TSV:
strain_name mut1 mut2 mut3 mut4 outcome
strain1 0 1 1 1 1 1
strain2 0 0 1 1 1 0
strain3 0 1 0 1 1 0
strain4 1 1 1 1 1 1
CBT:
1;mut1;mut2;mut3;mut4;outcome
strain1;6;43;87;102
strain2;16;43;87;102
strain3;6;53;78;112
strain4;61;413;824;942
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for compressbinarytable-0.2.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7fceb1bdf86ac5f2cf13906a90194ce0401489195c258bf95b3ad0cf92a9cf52 |
|
MD5 | 7873d354375be5e073bfcea193a795bb |
|
BLAKE2b-256 | de2d833741a05dd0dffb637c8475ac0e7e958a0911d60b4a680485be752d867b |
Close
Hashes for compressbinarytable-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2f6a345d82e69ff398dd85dadb0cb08d3139804a263469c7b6e4cd56f40ba967 |
|
MD5 | 591a15adb4f4cfccdd6fb3bd00e12768 |
|
BLAKE2b-256 | b6fc43cba1ab984a4d2a72ad6d5124895ebf2c0b6598097335a6e3153aa97dba |