clean CSVs
clean your CSVs!
This command line tool cleans CSV files by:
- detecting the encoding and converting it to UTF-8
- detecting the delimiter and safely converting it to a comma
- casting all values to JSON types, i.e. integers, floats, booleans, strings or null.
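The three steps above can be sketched with the standard library alone. This is a rough illustration under stated assumptions, not the tool's actual implementation: the helper names (`decode_bytes`, `cast_cell`, `clean_csv`), the list of candidate encodings, the candidate delimiters, and the casting rules are all hypothetical.

```python
import csv
import io
import json
import math


def decode_bytes(raw: bytes, candidates=("utf-8-sig", "cp1252", "latin-1")) -> str:
    """Try each candidate encoding in order (the list itself is an assumption)."""
    for encoding in candidates:
        try:
            return raw.decode(encoding)
        except UnicodeDecodeError:
            continue
    raise ValueError("no candidate encoding could decode the input")


def cast_cell(cell: str):
    """Map a raw CSV cell to a JSON-compatible Python value (assumed rules)."""
    text = cell.strip()
    if text == "":
        return None
    if text.lower() in ("true", "false"):
        return text.lower() == "true"
    try:
        return int(text)
    except ValueError:
        pass
    try:
        value = float(text)
    except ValueError:
        return cell
    # NaN and infinity are not valid JSON, so keep them as strings.
    return value if math.isfinite(value) else cell


def clean_csv(raw: bytes) -> str:
    """Decode, sniff the delimiter, and emit one comma-separated JSON row per line."""
    text = decode_bytes(raw)
    dialect = csv.Sniffer().sniff(text, delimiters=",;\t|")
    reader = csv.reader(io.StringIO(text, newline=""), dialect)
    lines = []
    for row in reader:
        # json.dumps produces "[...]"; drop the brackets to keep one row per line.
        encoded = json.dumps([cast_cell(c) for c in row], ensure_ascii=False)
        lines.append(encoded[1:-1])
    return "\n".join(lines)
```

Because every string is written as a quoted JSON string, a delimiter or quote character inside a value cannot be confused with a field separator when the output is read back.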
A PyPI build is not available yet, so:
- check out the code
- build it with
    poetry build
- and run it like
    poetry run bleach my-data.csv
The only option is the output file name. By default it is your original file name with a .scsv extension, so my-data.csv becomes my-data.scsv.
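As a small sketch of that default naming rule, assuming the .scsv extension replaces the original one (as the my-data.scsv name used in the parsing script suggests), the output path could be derived like this. The function name `default_output_path` is hypothetical and not part of the tool.

```python
from pathlib import Path


def default_output_path(input_path: str) -> Path:
    """Swap the input file's extension for .scsv (assumed behaviour)."""
    return Path(input_path).with_suffix(".scsv")


print(default_output_path("my-data.csv"))
```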
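You can now parse your cleaned CSV safely with a simple script like:

    import json

    def parse_row(text: str) -> list:
        # Each line is a comma-separated run of JSON values, so wrapping it in brackets gives a JSON array.
        return json.loads(f"[{text}]")

    with open("my-data.scsv", encoding="utf-8") as f:
        # The first line is the header; the remaining lines are data rows.
        header, *rows = map(parse_row, f)

    for row in rows:
        print(dict(zip(header, row)))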
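To see why this parsing is safe, here is a self-contained sketch that runs the same approach on an in-memory sample instead of a file. The sample content is invented for illustration and assumes the one-row-of-JSON-values-per-line layout implied by the script above.

```python
import io
import json


def parse_row(text: str) -> list:
    # Wrap the line in brackets so it parses as a JSON array.
    return json.loads(f"[{text}]")


# Hypothetical cleaned content: note the comma inside the quoted name.
sample = io.StringIO(
    '"name", "age", "active", "note"\n'
    '"Smith, John", 42, true, null\n'
)

header, *data = map(parse_row, sample)
records = [dict(zip(header, row)) for row in data]
print(records)
# [{'name': 'Smith, John', 'age': 42, 'active': True, 'note': None}]
```

The comma inside "Smith, John" stays part of the string, and the number, boolean and null arrive as `int`, `bool` and `None` rather than as text.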
Source Distribution
csv_bleach-0.1.1.tar.gz (5.2 kB)
Built Distribution
Hashes for csv_bleach-0.1.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 7d91f7a193abefe2d12234651712e1ed5f005786ccde204a93d650470513b9b2
MD5 | 9014dcd1c0dc53125532f998c0a2db58
BLAKE2b-256 | 00863cd9c957fd7dbf6cd061e992a8877ad99f898c10447b064c7a5ffc96e429