Skip to main content

DatCh checks data for inconsistancies.

Project description

DatCh

The DatCh data checker checks data files for any inconsistancies.

DatCh currently checks for:

  • nan values
  • white space in strings
  • CAPS errors
  • data types

Usage:

# check a pandas DataFrame
import pandas as pd
import datch

df = pd.DataFrame({
    'id': [0,1,'x',3,4,5,'y',7,None,9],
    'name': ['Rose', 'Lily', 'Tulip', 'Orchid', 'Carnation', 'Freesia', 3.14159265, 'Gladiolus', 'Anemone', 'Daffodil'],
})

datch.errors.nan(df['id'])
>> DataError(error_type='nan error', errors=1, values=10, error_ratio=0.10)

datch.errors.dtype(df['id'])
>> DataError(error_type='data type error', errors=3, values=10, error_ratio=0.30)

datch.errors.dtype(df['name'])
>> DataError(error_type='data type error', errors=1, values=10, error_ratio=0.10)

datch.check('data/data.xlsx')
>> DatCh is done! The output file is saved as /data/data_datched.xlsx

Wish list as optional checks:

  • zero values (or maybe not)
  • outliers (np.abs(float value) > mean + 3 * np.std)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datch-0.1.4.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datch-0.1.4-py3-none-any.whl (16.9 kB view details)

Uploaded Python 3

File details

Details for the file datch-0.1.4.tar.gz.

File metadata

  • Download URL: datch-0.1.4.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/51.0.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for datch-0.1.4.tar.gz
Algorithm Hash digest
SHA256 d21e8407e0e8ec93f28989ef051c92bcf3410fdddba3c371a567f894d960e640
MD5 5bc5883766338ba927650dc4bb3306df
BLAKE2b-256 eed9472e51da4e03e39828e139077919c08b596448bf561aeed909aaa1a4c377

See more details on using hashes here.

File details

Details for the file datch-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: datch-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 16.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/51.0.0 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for datch-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 e036055c449c6a5f75a09cc9f61b6fe52580d80cd2544a6e6a612ee56037cc04
MD5 db052174b9b93097f1446a48aac3c942
BLAKE2b-256 2e83e3142051226f1fc3519d693c67684e448fc6f168ad5a34e4ba73d6fb8bb2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page