KeelDS is a package for loading datasets from the KEEL repository, with normalization, train/test split, and discretization options.

Project description

KeelDS

KeelDS: A Python package for loading datasets from KEEL repository

KeelDS is a Python package that provides easy access to datasets from the KEEL repository, a popular source for machine learning datasets. This package simplifies the process of loading KEEL datasets, offering options for cross-validation and discretization.

Features

  • Load KEEL datasets with a single line of code
  • Access datasets pre-split into train and test sets
  • Optional discretization using the Fayyad-Irani MDLP algorithm
  • Support for both balanced and imbalanced datasets
  • Easy integration with machine learning workflows
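For readers unfamiliar with MDLP, here is a minimal standard-library sketch of the Fayyad-Irani criterion for a single numeric feature. It is not KeelDS's own implementation, and the function names (`entropy`, `best_cut`, `mdlp_cuts`) are hypothetical, chosen only for illustration. The idea is to pick the cut point with the highest information gain and keep it only if the gain exceeds the MDL-based threshold, then recurse on both halves.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def best_cut(xs, ys):
    """Return (cut_point, gain, split_index) with the highest information gain.

    xs must be sorted ascending; returns None when no cut separates two values.
    """
    n = len(ys)
    base = entropy(ys)
    best = None
    for i in range(1, n):
        if xs[i] == xs[i - 1]:
            continue  # a cut must fall between two distinct values
        gain = base - (i / n) * entropy(ys[:i]) - ((n - i) / n) * entropy(ys[i:])
        if best is None or gain > best[1]:
            best = ((xs[i - 1] + xs[i]) / 2, gain, i)
    return best


def mdlp_accepts(ys, left, right, gain):
    """MDLP stopping rule: accept the cut only if the gain pays for its cost."""
    n = len(ys)
    k, k1, k2 = len(set(ys)), len(set(left)), len(set(right))
    delta = math.log2(3 ** k - 2) - (
        k * entropy(ys) - k1 * entropy(left) - k2 * entropy(right)
    )
    return gain > (math.log2(n - 1) + delta) / n


def mdlp_cuts(values, labels):
    """Recursively collect the cut points accepted by the MDLP criterion."""
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    xs = [v for v, _ in pairs]
    ys = [c for _, c in pairs]
    found = best_cut(xs, ys)
    if found is None:
        return []
    cut, gain, i = found
    if not mdlp_accepts(ys, ys[:i], ys[i:], gain):
        return []
    return mdlp_cuts(xs[:i], ys[:i]) + [cut] + mdlp_cuts(xs[i:], ys[i:])


# Two well-separated classes yield a single cut between them.
print(mdlp_cuts([1, 2, 3, 4, 10, 11, 12, 13], [0, 0, 0, 0, 1, 1, 1, 1]))  # [7.0]
```

A feature whose values carry no class signal (for example, strictly alternating labels) produces no accepted cuts, so it would collapse into a single bin.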

Installation

Dependencies

  • Python (>= 3.12)
  • pandas (>= 2.2.2)

You can install KeelDS using pip:

pip install keel-ds

Usage

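Here's a simple example of how to use KeelDS with a machine learning model. The example also uses catboost, which is not listed among the KeelDS dependencies and must be installed separately: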

from keel_ds import load_data
import numpy as np
from catboost import CatBoostClassifier

# Load the pre-split folds of the 'iris' dataset.
file_name = 'iris'
folds = load_data(file_name)

# Train and score one model per fold.
evaluations = []
for x_train, y_train, x_test, y_test in folds:
    model = CatBoostClassifier(verbose=False)
    model.fit(x_train, y_train)
    evaluation = model.score(x_test, y_test)  # accuracy on the held-out fold
    evaluations.append(evaluation)

# Mean accuracy across all folds.
print(np.mean(evaluations))  # Output: 0.933333333333
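Reporting the spread across folds alongside the mean is often useful. The sketch below assumes only what the example above shows, namely that the folds are an iterable of `(x_train, y_train, x_test, y_test)` tuples. The helpers `summarize_folds` and `majority_baseline` are hypothetical names and not part of KeelDS, and the synthetic folds use plain lists as a stand-in so the snippet runs without any dataset installed.

```python
import statistics
from collections import Counter


def summarize_folds(folds, fit_and_score):
    """Return (mean, sample standard deviation) of the per-fold scores."""
    scores = [
        fit_and_score(x_train, y_train, x_test, y_test)
        for x_train, y_train, x_test, y_test in folds
    ]
    spread = statistics.stdev(scores) if len(scores) > 1 else 0.0
    return statistics.mean(scores), spread


def majority_baseline(x_train, y_train, x_test, y_test):
    """Accuracy of always predicting the most frequent training label."""
    majority = Counter(y_train).most_common(1)[0][0]
    return sum(1 for y in y_test if y == majority) / len(y_test)


# Synthetic stand-in folds; in real use, pass the result of load_data(...) instead.
demo_folds = [
    ([[0.1], [0.2], [0.9]], [0, 0, 1], [[0.3], [0.4]], [0, 0]),
    ([[0.8], [0.7], [0.1]], [1, 1, 0], [[0.2], [0.6]], [0, 1]),
]

mean_score, std_score = summarize_folds(demo_folds, majority_baseline)
print(f"{mean_score:.3f} +/- {std_score:.3f}")
```

Passing the scoring step as a function keeps the loop independent of any particular model, so the same helper can wrap the CatBoost fit-and-score step from the example above.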

Download files

Source Distribution

keel_ds-0.2.0.tar.gz (14.0 MB view details)

Uploaded Source

Built Distribution

keel_ds-0.2.0-py3-none-any.whl (14.3 MB view details)

Uploaded Python 3

File details

Details for the file keel_ds-0.2.0.tar.gz.

File metadata

  • Download URL: keel_ds-0.2.0.tar.gz
  • Upload date:
  • Size: 14.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-41-generic

File hashes

Hashes for keel_ds-0.2.0.tar.gz
Algorithm Hash digest
SHA256 79d9ffe7ee18caf76d6a0a0663cfe249f824509b55123683e0aa4061122f2430
MD5 6f3b6bc1b3f4431d7d73812761d8a699
BLAKE2b-256 5f7b1a9aa68572e9922c63f771bc26265670e6eb1844dd8b1a88488fc7577fdf

File details

Details for the file keel_ds-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: keel_ds-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 14.3 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.2 CPython/3.12.3 Linux/6.8.0-41-generic

File hashes

Hashes for keel_ds-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 40b932a4727a0a434e6ae674557f538901dbb5c8c69df92345bdfa7676667041
MD5 98b4822cd12028a713cefd96a556b249
BLAKE2b-256 1384f2d60aa697603ff8359fdf87c0c7a4da9eee6162c518470e0b3ac9d261bb
