Skip to main content

Tools for working with delimited files.

Project description


Tools for working with delimited files.

1. DialectDetector
When passed delimited file path will sniff file for dialect and header information using specified detection settings for DialectDetector instance.
1. sniff_for_dialect:
Sniff delimited file and return csv.Dialect.
2. sniff_for_headers:
Sniff delimited file and return first row if the file has headers else list of numbered columns as headers in the form of 'Column' + column #.
3. sniff_for_encoding:
Sniff file to try and determine and return encoding else returns latin-1.

2. SchemaEngine
Engine for data-typing delimited file into field_names, their data type, and max length (if applicable).
1. build_schema_string:
Process delimited file and return schema string for file passed.

3. integrity_check
Take filepath and report on the number of columns detected per line, extra quotechars, etc. Helps to detect problems in source files which may cause issues when creating schema files and indicator for whether file will need preprocessor.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for csv_tools, version 0.4
Filename, size File type Python version Upload date Hashes
Filename, size csv_tools-0.4.tar.gz (8.5 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page