clean CSVs

Project description

clean your CSVs!

This command-line tool cleans CSV files by:

  1. detecting the encoding and converting it to UTF-8
  2. detecting the delimiter and safely converting it to a comma
  3. casting all values to JSON types, i.e. integers, floats, booleans, strings or null.
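
The three steps above can be sketched with the standard library alone. This is a minimal illustration, not the tool's actual implementation: the function names (decode, cast, clean), the list of candidate encodings, and the choice to map empty cells to null are all assumptions made for the example.

```python
import csv
import io
import json
import math


def decode(raw: bytes, candidates=("utf-8", "cp1252", "latin-1")) -> str:
    """Step 1 (assumed approach): try candidate encodings in order."""
    for encoding in candidates:
        try:
            return raw.decode(encoding)
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings fit")


def cast(value: str):
    """Step 3: map a CSV cell to a JSON-compatible Python value."""
    text = value.strip()
    if text == "":
        return None  # assumption: empty cells become null
    if text.lower() in ("true", "false"):
        return text.lower() == "true"
    try:
        return int(text)
    except ValueError:
        pass
    try:
        number = float(text)
    except ValueError:
        return value
    # NaN and infinity are not valid JSON numbers, so keep them as strings.
    return number if math.isfinite(number) else value


def clean(raw: bytes) -> str:
    text = decode(raw)
    # Step 2: let csv.Sniffer guess the delimiter from a sample.
    dialect = csv.Sniffer().sniff(text[:1024], delimiters=",;\t|")
    reader = csv.reader(io.StringIO(text), dialect)
    # json.dumps quotes and escapes strings, so commas inside values stay safe.
    lines = (",".join(json.dumps(cast(cell)) for cell in row) for row in reader)
    return "\n".join(lines)


sample = "name;age;active\nAda;36;true\nBob;;false\n".encode("utf-8")
print(clean(sample))
```

Writing each value with json.dumps is what makes the comma conversion safe in this sketch: a string containing a comma is quoted and escaped rather than split.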

A PyPI build is not available yet, so:

  • check out the code
  • build it with poetry build
  • run it with poetry run bleach my-data.csv

The only option is the output file name; by default it is your original file name with the .scsv extension.
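
As a rough illustration of that default, the output name can be derived by swapping the extension. The helper name default_output is hypothetical and is not part of the tool; it only mirrors the my-data.csv to my-data.scsv naming used in this description.

```python
from pathlib import Path


def default_output(input_name: str) -> Path:
    """Hypothetical helper: swap the input's extension for .scsv."""
    return Path(input_name).with_suffix(".scsv")


print(default_output("my-data.csv"))  # my-data.scsv
```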

You will now be able to parse your CSV safely with a simple script like:

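import json


def parse_row(text: str) -> list:
    # Each cleaned line is a comma-separated run of JSON values,
    # so wrapping it in brackets yields a valid JSON array.
    return json.loads(f"[{text}]")


# The cleaned file is UTF-8, so state the encoding explicitly.
with open("my-data.scsv", encoding="utf-8") as f:
    header, *rows = map(parse_row, f)
    for row in rows:
        print(dict(zip(header, row)))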
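
To see why this parsing is safer than splitting on commas, consider a single cleaned line. The sample line below is made up for illustration; it assumes strings are written as quoted JSON strings, as the parsing script implies.

```python
import json


def parse_row(text: str) -> list:
    return json.loads(f"[{text}]")


# Hypothetical cleaned line: the comma inside the quoted name is data.
line = '"Smith, John",42,3.5,true,null\n'

print(parse_row(line))       # ['Smith, John', 42, 3.5, True, None]
print(len(line.split(",")))  # 6, because a naive split breaks the name in two
```

The JSON parser also restores the types directly, so no per-column conversion is needed after reading.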

Download files

Source Distribution

csv_bleach-0.1.1.tar.gz (5.2 kB)

Built Distribution

csv_bleach-0.1.1-py3-none-any.whl (6.6 kB, Python 3)
