Skip to main content

KeelDS is a package to load some datasets from KEEL repository with some normalizations and with split and discretization options.

Project description

KeelDS

KeelDS: A Python package for loading datasets from KEEL repository

KeelDS is a Python package that provides easy access to datasets from the KEEL repository, a popular source for machine learning datasets. This package simplifies the process of loading KEEL datasets, offering options for cross-validation and discretization.

Features

  • Load KEEL datasets with a single line of code
  • Access datasets pre-split into train and test sets
  • Discretization option using the Fayyad algorithm (MDLP)
  • Support for both balanced and imbalanced datasets
  • Easy integration with machine learning workflows

Installation

Dependencies

  • Python (>= 3.12)
  • pandas (>= 2.2.2)

You can install KeelDS using pip:

pip install keel-ds

Usage

Here's a simple example of how to use KeelDS with a machine learning model:

from keel_ds import load_data
import numpy as np
from catboost import CatBoostClassifier

file_name = 'iris'
folds = load_data(file_name)

evaluations = []
for x_train, y_train, x_test, y_test in folds:
    model = CatBoostClassifier(verbose=False)
    model.fit(x_train, y_train)
    evaluation = model.score(x_test, y_test)
    evaluations.append(evaluation)

print(np.mean(evaluations))  # Output: 0.933333333333

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keel_ds-0.2.4.tar.gz (14.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

keel_ds-0.2.4-py3-none-any.whl (14.3 MB view details)

Uploaded Python 3

File details

Details for the file keel_ds-0.2.4.tar.gz.

File metadata

  • Download URL: keel_ds-0.2.4.tar.gz
  • Upload date:
  • Size: 14.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-41-generic

File hashes

Hashes for keel_ds-0.2.4.tar.gz
Algorithm Hash digest
SHA256 d19a57877cb43812928245bfd3c725c5acda29c80853c464b0535cd62236242d
MD5 452e13e3d0f01f94669b5b7a2da65763
BLAKE2b-256 df6686404c0bedee700612cf4d7a7c7fc02183181d46000c0eefbecfc47a77dc

See more details on using hashes here.

File details

Details for the file keel_ds-0.2.4-py3-none-any.whl.

File metadata

  • Download URL: keel_ds-0.2.4-py3-none-any.whl
  • Upload date:
  • Size: 14.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-41-generic

File hashes

Hashes for keel_ds-0.2.4-py3-none-any.whl
Algorithm Hash digest
SHA256 ac809e3c946569402f3cee02a7367390e010576fd8ea73d51edd04e0829e086e
MD5 967208b027640a1d7fee0bf5f510c243
BLAKE2b-256 29b5f8960116f55c2157e64267fb45cf383475295a94ba2dbe7889d548119012

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page