Jiboia is a Python package for automatically normalizing DataFrames and optimizing their types efficiently.

Project description

Jiboia

Jiboia is a Python package for automatically normalizing and optimizing DataFrames efficiently.

Key features:

  • String normalization:
    • Removes extra spaces.
    • Strips leading and trailing spaces.
    • Detects data pollution (e.g., columns that should be numeric but contain strings).
  • Type conversion:
    • Numeric strings and floats ending in .0 → integers (int8, int16, int32, …).
    • Converts floats and integers to the most memory-efficient type.
    • Converts strings in various date formats to datetime (yyyy?mm?dd, dd?mm?yyyy, yyyymmdd, dd?mm?yy).
    • Converts time strings (hhmm UTC, hh:mm:ss, hh:mm:ss.s) to timedelta.
  • Null standardization → converts different null representations to pd.NA.
  • Automatic CSV detection:
    • Detects delimiter.
    • Detects encoding.
  • Memory optimization:
    • Provides memory usage information for DataFrames.
    • Converts columns to the most compact types possible.
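The features listed above can be approximated with plain pandas calls. The sketch below is not Jiboia's implementation; the helper names (tidy_strings, standardize_nulls, shrink_integers) and the list of null markers are assumptions made for illustration only.

```python
import pandas as pd


def tidy_strings(col: pd.Series) -> pd.Series:
    """Collapse runs of whitespace, then strip leading and trailing spaces."""
    return col.str.replace(r"\s+", " ", regex=True).str.strip()


def standardize_nulls(col: pd.Series, markers=("", "NA", "N/A", "null", "None")) -> pd.Series:
    """Replace textual null markers with pd.NA (the marker list is an assumption)."""
    return col.mask(col.isin(list(markers)), pd.NA)


def shrink_integers(col: pd.Series) -> pd.Series:
    """Parse numeric strings or whole-number floats and downcast to the smallest integer dtype."""
    return pd.to_numeric(col, downcast="integer")


raw = pd.DataFrame({
    "name": ["  Ana   Maria ", "Bob", "N/A"],
    "count": ["1", "2", "300"],   # numeric strings
    "score": [1.0, 2.0, 3.0],     # floats ending in .0
})

clean = pd.DataFrame({
    "name": standardize_nulls(tidy_strings(raw["name"])),
    "count": shrink_integers(raw["count"]),
    "score": shrink_integers(raw["score"]),
})
print(clean.dtypes)
```

Here count needs 16 bits because of the value 300, while score fits in 8 bits. A library such as Jiboia would presumably apply comparable steps to every column automatically.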

Example Usage

import pandas as pd
import jiboia as jb

df = pd.read_csv("data.csv")
df = jb.normalize_df(df)  # normalize strings, nulls, and column types
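The usage example reads the CSV with pandas defaults. Delimiter detection, listed among the key features, can be illustrated with the standard library's csv.Sniffer. This is a minimal sketch of the general technique, not a description of how Jiboia detects delimiters; detect_delimiter and its candidate list are hypothetical.

```python
import csv
import io


def detect_delimiter(sample: str, candidates: str = ",;\t|") -> str:
    """Guess the delimiter of a CSV text sample using csv.Sniffer."""
    dialect = csv.Sniffer().sniff(sample, delimiters=candidates)
    return dialect.delimiter


# A small semicolon-separated sample standing in for the first lines of a file.
sample = "id;name;city\n1;Ana;Recife\n2;Bob;Lisboa\n"

delimiter = detect_delimiter(sample)
rows = list(csv.reader(io.StringIO(sample), delimiter=delimiter))
print(repr(delimiter), rows[0])
```

The detected character could then be passed to pd.read_csv through its sep parameter before the DataFrame is normalized.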

Download files

Source Distribution

jiboia-0.1.0.tar.gz (13.2 kB)

Uploaded Source

Built Distribution

jiboia-0.1.0-py3-none-any.whl (16.0 kB)

Uploaded Python 3

File details

Details for the file jiboia-0.1.0.tar.gz.

File metadata

  • Download URL: jiboia-0.1.0.tar.gz
  • Size: 13.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for jiboia-0.1.0.tar.gz
Algorithm Hash digest
SHA256 74c76a00762c794e489ca995873727be3c5960b1a1312495c8c429dd0e78e0dc
MD5 b6d7b6b91cd80587d753faeb4f5d4e09
BLAKE2b-256 22fc4d1cad45f1f47195841af4337c29aeb149878e28d1accb3738cc7ad3c9d9

File details

Details for the file jiboia-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: jiboia-0.1.0-py3-none-any.whl
  • Size: 16.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for jiboia-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 64ea07ba7588b52080235dcea6e10b821b7ac21f28bc770031b4ecc5c259c215
MD5 d97bdd4b88a309e10b8522cd5b18d0b2
BLAKE2b-256 cb32c58715d03a21ab6022a35b67ca01291923657253f58e8514ddeacb5aab62
