detect_delimiter
About
Detects the delimiter used in CSV, TSV and other ad hoc file formats.
Installation
Use pip install detect_delimiter
Usage
detect_delimiter exposes the detect() function, which takes a str as input and returns a delimiter.
>>> from detect_delimiter import detect
>>> detect("looks|like|the vertical bar\n is|the|delimiter\n")
'|'
When detect() doesn't know, it returns None:
>>> text = "not really any delimiters in here.\nthis is just text.\n"
>>> detect(text)
It's possible to provide a default, which will be used in that case:
>>> detect(text, default=',')
','
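A detected delimiter is typically handed straight to a parser. The sketch below shows one way to do that with the standard library's csv module. read_rows is a hypothetical helper, not part of the package, and it takes the detector as an argument, assuming only that it accepts the text and a default keyword as shown above.

```python
import csv
import io


def read_rows(text, detect, default=","):
    """Parse delimited text into rows, guessing the delimiter first.

    `detect` is assumed to behave like detect_delimiter.detect: it receives
    the text plus a `default` keyword and returns a one-character delimiter.
    `read_rows` is a hypothetical helper and not part of the package.
    """
    # Fall back to `default` when the detector cannot decide.
    delimiter = detect(text, default=default)
    # csv.reader handles quoting and line splitting for the chosen delimiter.
    return list(csv.reader(io.StringIO(text), delimiter=delimiter))
```

With the package installed, this could be called as read_rows(text, detect) after from detect_delimiter import detect.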
By default, detect() avoids checking alphanumeric characters and the period/full stop character ("."). This can be adjusted via the blacklist parameter.
If you believe that you know the delimiter, it's possible to provide a list of possible delimiters to check for via the whitelist parameter. If you don't provide a value, [',', ';', ':', '|', '\t'] will be checked.
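To make the idea of a candidate list concrete, here is a minimal sketch of one plausible heuristic: accept the first candidate that appears the same non-zero number of times on every non-empty line. This is an assumption for illustration only. It is not the package's implementation, and sketch_detect is a hypothetical name; only the default candidate list and the default parameter are taken from the text above.

```python
def sketch_detect(text, default=None, whitelist=(",", ";", ":", "|", "\t")):
    """Illustrative heuristic only; NOT how detect_delimiter is implemented.

    Returns the first whitelist character that occurs the same non-zero
    number of times on every non-empty line, or `default` if none does.
    """
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return default
    for candidate in whitelist:
        # Distinct per-line occurrence counts for this candidate.
        counts = {line.count(candidate) for line in lines}
        # A plausible delimiter splits every line into the same number of fields.
        if len(counts) == 1 and counts.pop() > 0:
            return candidate
    return default


print(repr(sketch_detect("looks|like|the vertical bar\n is|the|delimiter\n")))  # '|'
print(repr(sketch_detect("this is just text.\n", default=",")))  # ','
```

Passing a shorter whitelist, for example whitelist=("\t",), restricts the search to the delimiters you already suspect.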
Testing
Run either pytest or tox directly from the project's root directory.