DatCh checks data for inconsistancies.
Project description
DatCh
The DatCh data checker checks data files for any inconsistancies.
DatCh currently checks for:
- nan values
- white space in strings
- CAPS errors
- data types
Usage:
# check a pandas DataFrame
import pandas as pd
import datch
df = pd.DataFrame({
'id': [0,1,'x',3,4,5,'y',7,None,9],
'name': ['Rose', 'Lily', 'Tulip', 'Orchid', 'Carnation', 'Freesia', 3.14159265, 'Gladiolus', 'Anemone', 'Daffodil'],
})
datch.errors.nan(df['id'])
>> DataError(error_type='nan error', errors=1, values=10, error_ratio=0.10)
datch.errors.dtype(df['id'])
>> DataError(error_type='data type error', errors=3, values=10, error_ratio=0.30)
datch.errors.dtype(df['name'])
>> DataError(error_type='data type error', errors=1, values=10, error_ratio=0.10)
datch.check('data/data.xlsx')
>> DatCh is done! The output file is saved as /data/data_datched.xlsx
Wish list as optional checks:
- zero values (or maybe not)
- outliers (np.abs(float value) > mean + 3 * np.std)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
datch-0.1.4.tar.gz
(3.9 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
datch-0.1.4-py3-none-any.whl
(16.9 kB
view details)
File details
Details for the file datch-0.1.4.tar.gz.
File metadata
- Download URL: datch-0.1.4.tar.gz
- Upload date:
- Size: 3.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/51.0.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d21e8407e0e8ec93f28989ef051c92bcf3410fdddba3c371a567f894d960e640
|
|
| MD5 |
5bc5883766338ba927650dc4bb3306df
|
|
| BLAKE2b-256 |
eed9472e51da4e03e39828e139077919c08b596448bf561aeed909aaa1a4c377
|
File details
Details for the file datch-0.1.4-py3-none-any.whl.
File metadata
- Download URL: datch-0.1.4-py3-none-any.whl
- Upload date:
- Size: 16.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/51.0.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e036055c449c6a5f75a09cc9f61b6fe52580d80cd2544a6e6a612ee56037cc04
|
|
| MD5 |
db052174b9b93097f1446a48aac3c942
|
|
| BLAKE2b-256 |
2e83e3142051226f1fc3519d693c67684e448fc6f168ad5a34e4ba73d6fb8bb2
|