Skip to main content

A Python toolkit for simplifying dataset preparation and preprocessing

Project description

datasetops

A Python toolkit for simplifying dataset preparation and preprocessing.

Installation

pip install datasetops

Usage

from datasetops import DatasetOps

ds = DatasetOps("data.csv")

ds.summary()
ds.missing_report()
ds.remove_duplicates()
ds.normalize()
ds.encode_categorical()
ds.save_csv("clean_data.csv")

Features

  • Load CSV, Excel, JSON datasets
  • Dataset summary and inspection
  • Missing value detection and filling
  • Duplicate detection and removal
  • Normalization and standardization
  • Categorical encoding
  • Correlation analysis
  • Outlier detection
  • Visualization (histogram, boxplot, scatter, heatmap)
  • Export to CSV and Excel

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasetops_toolkit-0.1.0.tar.gz (4.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datasetops_toolkit-0.1.0-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file datasetops_toolkit-0.1.0.tar.gz.

File metadata

  • Download URL: datasetops_toolkit-0.1.0.tar.gz
  • Upload date:
  • Size: 4.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for datasetops_toolkit-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3ef9ddc8f1f4671ab43a93ef316990613014e6a1b7c966d0151e8690f52067ea
MD5 b8c97d51041c59caeb4d30cfcd10c22c
BLAKE2b-256 e06475aae3674a6386c0376200db387dcc9961cfe5da73cd59c45d87a1665887

See more details on using hashes here.

File details

Details for the file datasetops_toolkit-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for datasetops_toolkit-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0d970195f98de8e6909aec61e6d559299fbe23a1813331daf7c114e835398e44
MD5 b733d19fe5829cc31e1084971244ca88
BLAKE2b-256 3626f940ef76051976856b5eef088106a4994eb6fe8ccf26ef16fa4038e50e70

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page