RapidoML is a simple Automated Machine Learning (AutoML) library
Project description
RapidoML: Automated Machine Learning Made Simple
RapidoML is an easy-to-use Python library that automates the process of building and optimizing machine learning models. It simplifies feature selection, hyperparameter tuning, and model selection, making it perfect for quickly prototyping and testing models. RapidoML relies on popular dependencies like scikit-learn, NumPy, TensorFlow, Keras, XGBoost, LightGBM, and CatBoost.
Installation
You can install RapidoML using pip
:
pip install RapidoML
This will install the required dependencies along with the RapidoML library.
Quickstart
First, import the required functions from the library:
import rapidomL
from rapidoml.datasets import sample_datasets
Load a sample dataset, such as Iris, Titanic, or others, and split it into training and testing sets with a 70/30 ratio. The function sample_datasets
also performs preprocessing, cleaning, and normalization.
df_train, df_test = sample_datasets('iris')
Create an instance of the AutoML
class and train the model using the training dataset. If you want to specify the model to use, you can pass it as an argument, for example: model='XGBClassifier'
. Otherwise, RapidoML will automatically pick the best model for the given dataset.
automl = RapidoML.AutoML()
automl.train(df_train)
Evaluate the model using the testing dataset. This will return a dictionary containing evaluation metrics for the trained model, such as accuracy, precision, recall, and F1 score for classification tasks, or R2, MSE, and MAE for regression tasks.
metrics = RapidoML.evaluate(automl, df_test, model_type='classifier')
print(metrics)
Finally, save the trained model to a pickle file for future use.
automl.save()
Custom Datasets
If you want to use your own dataset, follow these steps:
- Load your dataset using your preferred method (e.g.,
pandas.read_csv
). - Preprocess your dataset (cleaning, normalization, etc.).
- Split the dataset into training and testing sets (e.g., using
train_test_split
fromsklearn.model_selection
). - Pass the training dataset to the
train
function of theAutoML
instance. - Evaluate and save the model as described in the Quickstart example above.
Supported Models
RapidoML supports various models, including:
- DeepLearningClassifier and DeepLearningRegressor
- XGBClassifier and XGBRegressor
- LGBMClassifier and LGBMRegressor
- CatBoostClassifier and CatBoostRegressor
You can easily extend the library by adding more models to the models.py
file.
Contributing
If you'd like to contribute to RapidoML, feel free to submit pull requests, report issues, or suggest new features. Your feedback is highly appreciated!
Support
If you have any questions or need help using RapidoML, please post a question or open an issue on GitHub.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file rapidoml-0.1.3.tar.gz
.
File metadata
- Download URL: rapidoml-0.1.3.tar.gz
- Upload date:
- Size: 9.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 35501ebc5173f5383a657d713856c588b63f25aa631b0beba4a63f9c9a02b5e8 |
|
MD5 | d76b3a26851e0f0e94f38f71f8ce335f |
|
BLAKE2b-256 | ccca942884c99ae8125929a5e338f56c1de31c5711e11737b9500decf83a61e9 |
File details
Details for the file rapidoml-0.1.3-py3-none-any.whl
.
File metadata
- Download URL: rapidoml-0.1.3-py3-none-any.whl
- Upload date:
- Size: 12.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a687d79c93718efc0767c424685647113b9f94c46238232693db7aa5dbda25ff |
|
MD5 | 472997ca52a7a06099949e7c05266dd7 |
|
BLAKE2b-256 | ea816073ec8a3290330445bc9ebd32551973e2931bb177ef5795777bfd623a2d |