Skip to main content

KeelDS is a package to load some datasets from KEEL repository with some normalizations and with split and discretization options.

Project description

KeelDS

KeelDS: A Python package for loading datasets from KEEL repository

KeelDS is a Python package that provides easy access to datasets from the KEEL repository, a popular source for machine learning datasets. This package simplifies the process of loading KEEL datasets, offering options for cross-validation and discretization.

Features

  • Load KEEL datasets with a single line of code
  • Access datasets pre-split into train and test sets
  • Discretization option using the Fayyad algorithm (MDLP)
  • Support for both balanced and imbalanced datasets
  • Easy integration with machine learning workflows

Installation

Dependencies

  • Python (>= 3.12)
  • pandas (>= 2.2.2)

You can install KeelDS using pip:

pip install keel-ds

Usage

Here's a simple example of how to use KeelDS with a machine learning model:

from keel_ds import load_data
import numpy as np
from catboost import CatBoostClassifier

file_name = 'iris'
folds = load_data(file_name)

evaluations = []
for x_train, y_train, x_test, y_test in folds:
    model = CatBoostClassifier(verbose=False)
    model.fit(x_train, y_train)
    evaluation = model.score(x_test, y_test)
    evaluations.append(evaluation)

print(np.mean(evaluations))  # Output: 0.933333333333

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

keel_ds-0.2.5.tar.gz (14.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

keel_ds-0.2.5-py3-none-any.whl (14.3 MB view details)

Uploaded Python 3

File details

Details for the file keel_ds-0.2.5.tar.gz.

File metadata

  • Download URL: keel_ds-0.2.5.tar.gz
  • Upload date:
  • Size: 14.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-47-generic

File hashes

Hashes for keel_ds-0.2.5.tar.gz
Algorithm Hash digest
SHA256 89e09293ccaeb72b49f4a7d602f6addc760e71064be9ff1410d069b1274776a2
MD5 a630d47f3e1752ce1be86b23fcabae50
BLAKE2b-256 8e080c8d3a7bcc7d114086f833a7b27773080b763e34e19da47a8150e48171f5

See more details on using hashes here.

File details

Details for the file keel_ds-0.2.5-py3-none-any.whl.

File metadata

  • Download URL: keel_ds-0.2.5-py3-none-any.whl
  • Upload date:
  • Size: 14.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-47-generic

File hashes

Hashes for keel_ds-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 79faf1bd2f3ac2082d16eb9c8c49b2b1a60a5182e94464c5d32c7c642ea9650e
MD5 c2d9c3298147e85c60af5fddc4a24c7c
BLAKE2b-256 7788c99136c61bb85663bd8cfb328fada55846eb10bd7271058160526e9674bf

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page