Skip to main content

Compression of binary table

Project description

CompressBinaryTable

Basic Usage:

cbt -i input_file -o output_file -c 

Options:

-i = input, required 
-o = output, required 
-c = compression 
-d = decompression 
-t = output type, either csv or tsv 
--override = to override output file if it exist 

Available Python functions:


# Returns numpy array of file as uncompressed, useful for ML (same as load as pandas df, and to_numpy())
cbt_to_array(compressed_file)

# Returns numpy array of name of the mutations as uncompressed, useful for ML, (same as load pandas df and use df.colums)
cbt_columns(compressed_file)

Required file example:

CSV:

strain_name,mut1,mut2,mut3,mut4,outcome strain1,0,1,1,1,1,1 strain2,0,0,1,1,1,0 strain3,0,1,0,1,1,0 strain4,1,1,1,1,1,1

TSV:

strain_name mut1 mut2 mut3 mut4 outcome strain1 0 1 1 1 1 1 strain2 0 0 1 1 1 0 strain3 0 1 0 1 1 0 strain4 1 1 1 1 1 1

CBT:

1;mut1;mut2;mut3;mut4;outcome strain1;6;43;87;102 strain2;16;43;87;102 strain3;6;53;78;112 strain4;61;413;824;942

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

compressbinarytable-0.1.7.tar.gz (3.2 kB view hashes)

Uploaded Source

Built Distribution

compressbinarytable-0.1.7-py3-none-any.whl (4.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page