Skip to main content

Compression of binary table

Project description

CompressBinaryTable

Basic Usage:

cbt -i input_file -o output_file -c 

Options:

-i = input, required 
-o = output, required 
-c = compression 
-d = decompression 
-t = output type, either csv or tsv 
--override = to override output file if it exist 

Available Python functions:

# Returns numpy array of file as uncompressed, useful for ML (same as load as pandas df, and to_numpy())
cbt_to_array(compressed_file)

# Returns numpy array of name of the mutations as uncompressed, useful for ML, (same as load pandas df and use df.colums)
cbt_columns(compressed_file)

Required file example:

CSV:

strain_name,mut1,mut2,mut3,mut4,outcome
strain1,0,1,1,1,1,1
strain2,0,0,1,1,1,0
strain3,0,1,0,1,1,0
strain4,1,1,1,1,1,1

TSV:

strain_name mut1    mut2    mut3    mut4    outcome
strain1 0   1   1   1   1   1
strain2 0   0   1   1   1   0
strain3 0   1   0   1   1   0
strain4 1   1   1   1   1   1

CBT:

1;mut1;mut2;mut3;mut4;outcome
strain1;6;43;87;102
strain2;16;43;87;102
strain3;6;53;78;112
strain4;61;413;824;942

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

compressbinarytable-0.2.0.tar.gz (3.6 kB view hashes)

Uploaded Source

Built Distribution

compressbinarytable-0.2.0-py3-none-any.whl (4.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page