Skip to main content

KeelDS is a package to load some datasets from KEEL repository with some normalizations and with split and discretization options.

Project description

KeelDS

KeelDS: A Python package for loading datasets from KEEL repository

KeelDS is a Python package that provides easy access to datasets from the KEEL repository, a popular source for machine learning datasets. This package simplifies the process of loading KEEL datasets, offering options for cross-validation and discretization.

Features

  • Load KEEL datasets with a single line of code
  • Access datasets pre-split into train and test sets
  • Discretization option using the Fayyad algorithm (MDLP)
  • Support for both balanced and imbalanced datasets
  • Easy integration with machine learning workflows

Installation

Dependencies

  • Python (>= 3.12)
  • pandas (>= 2.2.2)

You can install KeelDS using pip:

pip install keel-ds

Usage

Here's a simple example of how to use KeelDS with a machine learning model:

from keel_ds import load_data
import numpy as np
from catboost import CatBoostClassifier

file_name = 'iris'
folds = load_data(file_name)

evaluations = []
for x_train, y_train, x_test, y_test in folds:
    model = CatBoostClassifier(verbose=False)
    model.fit(x_train, y_train)
    evaluation = model.score(x_test, y_test)
    evaluations.append(evaluation)

print(np.mean(evaluations))  # Output: 0.933333333333

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keel_ds-0.2.3.tar.gz (14.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

keel_ds-0.2.3-py3-none-any.whl (14.3 MB view details)

Uploaded Python 3

File details

Details for the file keel_ds-0.2.3.tar.gz.

File metadata

  • Download URL: keel_ds-0.2.3.tar.gz
  • Upload date:
  • Size: 14.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-41-generic

File hashes

Hashes for keel_ds-0.2.3.tar.gz
Algorithm Hash digest
SHA256 4b28aab9f74c07d557d0323be7caff52220405d82537b5dd6ffccac86e41a19e
MD5 84001f768d2e6ff8c497faec145e7595
BLAKE2b-256 9be16423857fa22b7bfea6c0420171bef7b94223ac204c4445233e09cee79569

See more details on using hashes here.

File details

Details for the file keel_ds-0.2.3-py3-none-any.whl.

File metadata

  • Download URL: keel_ds-0.2.3-py3-none-any.whl
  • Upload date:
  • Size: 14.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-41-generic

File hashes

Hashes for keel_ds-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 54a251684e4ce270d9bb65c0b8f0c4a0b6fd8b4c266b3348aa57089f7d8c2bfb
MD5 385f23d9f7d828b2937422431be8a8db
BLAKE2b-256 a600a152db5b335d55d4650e3263ae1a290f6140678d34ec3b99423eddc513cf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page