Skip to main content

A Python toolkit for simplifying dataset preparation and preprocessing

Project description

datasetops

A Python toolkit for simplifying dataset preparation and preprocessing.

Installation

pip install datasetops

Usage

from datasetops import DatasetOps

ds = DatasetOps("data.csv")

ds.summary()
ds.missing_report()
ds.remove_duplicates()
ds.normalize()
ds.encode_categorical()
ds.save_csv("clean_data.csv")

Features

  • Loading: load_csv, load_excel, load_json, reload_dataset
  • Inspection: summary, head, tail, shape, column_names, column_types, describe_stats
  • Missing Values: missing_report, missing_percentage, fill_missing_mean, fill_missing_median, fill_missing_mode, fill_missing_value, drop_missing_rows, drop_missing_columns
  • Duplicates: duplicate_report, count_duplicates, remove_duplicates
  • Column Operations: rename_column, drop_column, add_column, select_columns
  • Row Operations: filter_rows, sort_rows, sample_rows
  • Transformation: normalize, standardize, log_transform, encode_categorical, one_hot_encode
  • Analysis: data_quality_report, correlation_matrix, detect_outliers, column_unique_values, value_counts, group_by, aggregate_stats, top_values
  • Visualization: plot_histogram, plot_boxplot, plot_scatter, plot_correlation_heatmap
  • Export: save_csv, save_excel, export_report

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasetops_toolkit-0.1.1.tar.gz (4.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datasetops_toolkit-0.1.1-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file datasetops_toolkit-0.1.1.tar.gz.

File metadata

  • Download URL: datasetops_toolkit-0.1.1.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for datasetops_toolkit-0.1.1.tar.gz
Algorithm Hash digest
SHA256 5b791e40c370c5131fd060b7e461f2d927f10d3385e57b0e1ac8ebd25f0dd45f
MD5 6455d5ada573a201f627d17208c485c6
BLAKE2b-256 218eb27ef46dad445289c9d722d686bae08bf730bc8b5908689d18e2a814cd4d

See more details on using hashes here.

File details

Details for the file datasetops_toolkit-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for datasetops_toolkit-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e5406c17903458f7c93af944b901203d96308d2aa3eebe74e0025b8ecf8b966d
MD5 04bdd874e360fac217b78865b9450c6f
BLAKE2b-256 820a21d75298fcabdd42c60f0a276e37afaf6a57e28fe26dfa3583bc519943e5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page