Skip to main content

jiboia_gpu is a python package for automatically normalizing DataFrames and optimizing types efficiently.

Project description

Jiboia

jiboia-gpu is a Python package to normalize and optimize DataFrames automatically efficiently using the Nvidia GPU in the RAPIDS ecosystem.

Key features:

  • String normalization:
    • Removes extra spaces.
    • Strips leading and trailing spaces.
    • Detects data pollution (e.g., columns that should be numeric but contain strings).
  • Type conversion:
    • Numeric strings and floats ending in .0 → integers (int8, int16, int32, …).
    • Converts floats and integers to the most memory-efficient type.
    • Converts strings in various date formats to datetime (yyyy?mm?dd, dd?mm?yyyy, yyyymmd, dd?mm?yy).
    • Converts time strings (hhmm UTC, hh:mm:ss, hh:mm:ss.s) to timedelta.
  • Null standardization → converts different null representations to pd.NA.
  • Automatic CSV detection:
    • Detects delimiter.
    • Detects encoding.
  • Memory optimization:
    • Provides memory usage information for DataFrames.
    • Converts columns to the most compact types possible.

Example Usage

from jiboia-gpu import jiboia_gpu as jb

pd.normalize_category("data_frame_cudf")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jiboia_gpu-0.1.0.tar.gz (24.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

jiboia_gpu-0.1.0-py3-none-any.whl (33.5 kB view details)

Uploaded Python 3

File details

Details for the file jiboia_gpu-0.1.0.tar.gz.

File metadata

  • Download URL: jiboia_gpu-0.1.0.tar.gz
  • Upload date:
  • Size: 24.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for jiboia_gpu-0.1.0.tar.gz
Algorithm Hash digest
SHA256 64492a4fe26ccc161294b25b66bbba89c138d481ff65679d46e7c94984108afe
MD5 c06917ac4bf89db534f5fa3ce681f28e
BLAKE2b-256 2baaeab9d8b381ead76698027f6f2a2ccfa21a6f9db2b7c25163b32e860de7a9

See more details on using hashes here.

File details

Details for the file jiboia_gpu-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: jiboia_gpu-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 33.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.11

File hashes

Hashes for jiboia_gpu-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 48ae333ba6b0abded7c867adc81e4fb4f769515bf8e25b2374fc88e6ccf69c44
MD5 f55c711a68b603c60e425b9a70ab96e3
BLAKE2b-256 920d0b86a31921237cc848bf310f662fc851874db5a3e03a19be4287ba2d9a1a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page