# detect_delimiter
## About
Detects the delimiter used in CSV, TSV and other ad hoc file formats.
## Installation
Install with pip:

    pip install detect_delimiter
## Usage
detect_delimiter exposes the detect() function, which takes a str as input and returns the detected delimiter.

    >>> from detect_delimiter import detect
    >>> detect("looks|like|the vertical bar\n is|the|delimiter\n")
    '|'
When detect() can’t determine a delimiter, it returns None:

    >>> text = "not really any delimiters in here.\nthis is just text.\n"
    >>> detect(text)
It’s possible to provide a default, which is returned in that case:

    >>> detect(text, default=',')
    ','
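Once a delimiter is known, a common next step is to hand it to the standard library's csv module. The following is a minimal sketch of that step. `parse_rows` is a hypothetical helper, not part of detect_delimiter, and the delimiter is hard-coded so that the example runs without the package installed.

```python
import csv
import io


def parse_rows(text, delimiter):
    """Split delimited text into rows using an already-known delimiter."""
    return list(csv.reader(io.StringIO(text), delimiter=delimiter))


# In real use the delimiter would come from detect(text, default=',').
# It is hard-coded here so the sketch has no third-party dependency.
text = "name|age\nAda|36\n"
rows = parse_rows(text, delimiter="|")
print(rows)  # [['name', 'age'], ['Ada', '36']]
```

Passing a default to detect() is useful here because csv.reader requires a one-character string as its delimiter and will not accept None.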
By default, detect() does not consider alphanumeric characters or the period/full stop character (“.”) as possible delimiters. This can be adjusted via the blacklist parameter.
If you have an idea of what the delimiter might be, you can provide a list of candidates to check via the whitelist parameter. If you don’t provide a value, [',', ';', ':', '|', '\t'] will be checked.
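To make the roles of whitelist, blacklist and default concrete, here is a minimal sketch of a frequency-based detector. It is an illustration under assumptions, not the package's actual implementation: `sketch_detect`, its fallback order and its tie-breaking are all hypothetical, and only the parameter names and default candidate list come from the description above.

```python
from collections import Counter
import string

# Defaults mirror the values described in this README.
DEFAULT_WHITELIST = [",", ";", ":", "|", "\t"]
DEFAULT_BLACKLIST = set(string.ascii_letters + string.digits + ".")


def sketch_detect(text, default=None, whitelist=DEFAULT_WHITELIST,
                  blacklist=DEFAULT_BLACKLIST):
    """Hypothetical detector: pick the most frequent plausible character."""
    counts = Counter(text)

    # First preference: the most frequent whitelisted candidate in the text.
    present = [c for c in (whitelist or []) if counts[c] > 0]
    if present:
        return max(present, key=lambda c: counts[c])

    # Assumed fallback: any character that is neither blacklisted nor whitespace.
    others = [c for c in counts if c not in blacklist and not c.isspace()]
    if others:
        return max(others, key=lambda c: counts[c])

    # Nothing plausible was found, so return the caller's default.
    return default


print(repr(sketch_detect("looks|like|the vertical bar\n is|the|delimiter\n")))
print(repr(sketch_detect("a#b#c\n1#2#3\n")))
print(repr(sketch_detect("this is just text.\n", default=",")))
```

In this sketch the whitelist is consulted first, and the blacklist only filters the fallback candidates. The real package may order or weight these checks differently.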
## Testing
Run the tests with either pytest or tox from the project’s root directory.