Tool for quick random and systematic changes (jumbling) to .csv files.
Project description
usage: main.py [-h] [-p JUMBLING_PERCENT] [-c CATEGORICAL_SWITCH_PROBABILITY]
[-d [DROP_COLUMNS ...]] [-l [CATEGORICAL_COLUMNS ...]]
[-o OUTPUT_FILE] [--not-all-categorical-parameters-present]
[--block-negative] [-n NUMBER_OF_HEADER_ROWS]
[--significant-figures SIGNIFICANT_FIGURES]
[--float-formatting FLOAT_FORMATTING]
[--int-formatting INT_FORMATTING]
csv_path
Tool for jumbling data, removing data and reformatting data in CSV format.
positional arguments:
csv_path Path to the .csv file. Note the file has to be comma
seperated with full-stop decimals , not semicolon with
comma decimal.
options:
-h, --help show this help message and exit
-p JUMBLING_PERCENT, --jumbling-percent JUMBLING_PERCENT
Percentage to jumble non-categorical values in %
-c CATEGORICAL_SWITCH_PROBABILITY, --categorical-switch-probability CATEGORICAL_SWITCH_PROBABILITY
Probability to change a categorical parameter in %
-d [DROP_COLUMNS ...], --drop-columns [DROP_COLUMNS ...]
List of column indices to drop in the new table
-l [CATEGORICAL_COLUMNS ...], --categorical-columns [CATEGORICAL_COLUMNS ...]
List of column indices containing a categorical
parameter
-o OUTPUT_FILE, --output-file OUTPUT_FILE
Write the resulting CSV file to this path
--not-all-categorical-parameters-present
Set this if all possible values of categorical
parameters are not present in the data. If set data
will be interpolated, so we assume the outer values
are represented.
--block-negative Caps values at a minimum of 0
-n NUMBER_OF_HEADER_ROWS, --number-of-header-rows NUMBER_OF_HEADER_ROWS
Number of header rows in the input CSV file
--significant-figures SIGNIFICANT_FIGURES
Number of significant figures to use when printing
floats. If neither '--significant-figures', '--float-
formatting' is set, the values will be represented
centrally, as wide as the header of the column
--float-formatting FLOAT_FORMATTING
Float formatting to use when printing. E.g. '4.2f', '
^8.2f'. Whatever is accepted by your python
interpreter 'print' function should work.
--int-formatting INT_FORMATTING
Integer formatting to use when printing. E.g. '4d', '
^8d'. Whatever is accepted by your python interpreter
'print' function should work. If not set the values
will be represented centrally, as wide as the header
of the column
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
jumblecsv-0.1.3.tar.gz
(4.8 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file jumblecsv-0.1.3.tar.gz.
File metadata
- Download URL: jumblecsv-0.1.3.tar.gz
- Upload date:
- Size: 4.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
602caab8605c367b3481b06b90f9f53afdfa450cf6e49d5e8450e0e8773983a5
|
|
| MD5 |
f89a4eb072c68c468869de5247427b3f
|
|
| BLAKE2b-256 |
516a552a0eb879ce04bdd0e996a9b51646cbbf9f94098d7e6461728c9c299130
|
File details
Details for the file jumblecsv-0.1.3-py3-none-any.whl.
File metadata
- Download URL: jumblecsv-0.1.3-py3-none-any.whl
- Upload date:
- Size: 5.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ab4f7c46b99180c04446334a2f7e37c2f888447cf03913549ab3817a2e418ced
|
|
| MD5 |
5d801892dabcb883df68d172ef44a482
|
|
| BLAKE2b-256 |
8b5503c0e8bf236fee5199cb526140a597d840d46b284709330d441b3b397bf6
|