Detects the delimiter used in CSV, TSV and other ad hoc file formats.
Project description
detect_delimiter
About
Detects the delimiter used in CSV, TSV and other ad hoc file formats.
Installation
Use pip install detect_delimiter
Usage
detect_delimiter
exposes the detect()
functinon, which takes a str
as input and returns a delimiter.
>>> from detect_delimiter import detect
>>> detect("looks|like|the vertical bar\n is|the|delimiter\n")
'|'
When detect()
doesn't know, it returns None
:
>>> text = "not really any delimiters in here.\nthis is just text.\n"
>>> detect()
It's possible to provide a default, which will be used in that case:
>>> detect(text, default=',')
','
By default, detect()
will prevent avoid checking alpha-numeric characters
and the period/full stop character ("."). This can be adjusted via
the blacklist
parameter.
If you believe that you know the delimiter, it's possible to provide
a list of possible delimiters to check for via the whitelist
parameter.
If you don't provide a value, [',', ';', ':', '|', '\t']
will be checked.
Testing
You can either use pytest
or tox
directly from the project's root directory.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for detect_delimiter-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f5284821c65eb09acd3530ef4352535e37b2929522a5e11a72d571d7aba62a5d |
|
MD5 | c703d48cd518a79ff6ed29c770ffb255 |
|
BLAKE2b-256 | bc6fa2f175429daf03493cbb76e63fbb5681202a259ffe47c015457dcb18cf37 |