Skip to main content

Library to create simulation to find out what train test ratio is ideal

Project description

English | Español | Français | Deutsch | 中文 | Türkçe | 日本語 | 한국어

train_test_sim

A library to create quick simulation of optimal train-test size you can keep

Developed by Marcel Tino (c) 2024

Examples of How To Use the library

You can use this to alter according to your requirements

##syntax
model=RandomForestClassifier()
get_simulation(X,Y,model)

you can use any model on sklearn or xgboost. All you need to do is specify correct model name
from sklearn.datasets import load_diabetes
import numpy as np
from sklearn.ensemble import RandomForestClassifier
diabetes = load_diabetes()
X, y = diabetes.data, diabetes.target

# Convert the target variable to binary (1 for diabetes, 0 for no diabetes)
Y = (y > np.median(y)).astype(int)
model = RandomForestClassifier()

get_simulation(X, Y, model)

Note: We can create this for any model

  • Share retail_dictionary on these social media platforms if you like it! Reddit HackerNews Twitter Facebook LinkedIn

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

train_test_sim-0.1.0.tar.gz (3.1 kB view hashes)

Uploaded Source

Built Distribution

train_test_sim-0.1.0-py3-none-any.whl (3.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page