Templates to work with models for classification, regression and clustering with xgboost and sklearn.
Project description
Sinapsis Data Analysis
Module for machine learning model training, analysis, and inference, using the Scikit-learn and XGBoost libraries.
🐍 Installation • 🚀 Features • 📚 Usage Example • 📙 Documentation • 🔍 License
Sinapsis Data Analysis provides a comprehensive set of tools for machine learning model training, evaluation, and inference using industry-standard libraries like scikit-learn and XGBoost.
🐍 Installation
Install using your package manager of choice. We encourage the use of uv
Example with uv:
uv pip install sinapsis-data-analysis --extra-index-url https://pypi.sinapsis.tech
or with raw pip:
pip install sinapsis-data-analysis --extra-index-url https://pypi.sinapsis.tech
🚀 Features
Templates Supported
Sinapsis Data Analysis provides a variety of templates for machine learning workflows:
Scikit-Learn Models
The following model types are supported:
- Linear Models: LinearRegression, Ridge, Lasso, ElasticNet, LogisticRegression, etc.
- Neighbors Models: KNeighborsClassifier, KNeighborsRegressor, RadiusNeighborsClassifier, etc.
- Neural Network Models: MLPClassifier, MLPRegressor, BernoulliRBM
- Tree Models: DecisionTreeClassifier, DecisionTreeRegressor, ExtraTreeClassifier, etc.
Each template uses the same base attributes:
generic_field_key(str, required): Key of the generic field where datasets are storedmodel_save_path(str, required): Path where the trained model will be saved
XGBoost Models
XGBoost model templates include:
- XGBClassifier
- XGBRegressor
- XGBRanker
- XGBRFClassifier
- XGBRFRegressor
- Booster
Attributes are the same as those for Scikit-learn templates.
Manifold Learning
Templates for dimensionality reduction using scikit-learn's manifold learning techniques:
- SKLearnManifold: Base class for all manifold learning algorithms
generic_field_key(str, required): Key of the generic field where the input data is stored
Specific algorithms include t-SNE, MDS, Isomap, LocallyLinearEmbedding, and more.
Inference Templates
Templates for using trained models to make predictions on new data:
- SKLearnInference: For inference with scikit-learn models
- XGBoostInference: For inference with XGBoost models
To use these templates, you should replace the model_path to point to the path of the trained model.
[!TIP] Use CLI command
sinapsis info --all-template-namesto show a list with all the available Template names installed with Sinapsis Data Analysis.
[!TIP] Use CLI command
sinapsis info --example-template-config TEMPLATE_NAMEto produce an example Agent config for the Template specified in TEMPLATE_NAME.
For example, for LinearRegression use sinapsis info --example-template-config LinearRegression to produce an example config.
📚 Usage Example
Below is an example configuration for **Sinapsis Data Analysis** using LinearRegressionWrapper for regression.Example config
agent:
name: sklearn_linear_models_agent
description: agent to train a LinearRegression model from scikit-learn using the load_diabetes dataset
templates:
- template_name: InputTemplate
class_name: InputTemplate
attributes: {}
- template_name: load_diabetesWrapper
class_name: load_diabetesWrapper
template_input: InputTemplate
attributes:
split_dataset: true
train_size: 0.8
load_diabetes:
return_X_y: false
as_frame: true
- template_name: LinearRegressionWrapper
class_name: LinearRegressionWrapper
template_input: load_diabetesWrapper
attributes:
generic_field_for_data: load_diabetesWrapper
model_save_path: "artifacts/linear_regression.joblib"
linearregression_init:
fit_intercept: true
copy_X: true
n_jobs: null
positive: false
To run the config, use the CLI:
sinapsis run name_of_config.yml
📙 Documentation
Documentation for this and other sinapsis packages is available on the sinapsis website
Tutorials for different projects within sinapsis are available at sinapsis tutorials page
🔍 License
This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.
For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sinapsis_data_analysis-0.1.6.tar.gz.
File metadata
- Download URL: sinapsis_data_analysis-0.1.6.tar.gz
- Upload date:
- Size: 24.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a8b337e09c648258a425d4f089d031b6bc162b4fa2951c5b43d595419d106d02
|
|
| MD5 |
1b4864a25aff007a405f43e13de79896
|
|
| BLAKE2b-256 |
e286a4f04e56b6734f16c69e2c5a730970e72760c816674e199f5bbd4bff0755
|
File details
Details for the file sinapsis_data_analysis-0.1.6-py3-none-any.whl.
File metadata
- Download URL: sinapsis_data_analysis-0.1.6-py3-none-any.whl
- Upload date:
- Size: 28.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.5.16
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c92a7e1c8bb48e33ed4afdc81aa5a302bbc507121e03c7a59c4654611ff43dbc
|
|
| MD5 |
87da066c66ae00f2c16f8f15cc3d9b0c
|
|
| BLAKE2b-256 |
070aae4e66869019afbd4ba9be2e792a305a1ccea558b28e6de971dc1babfa58
|