Skip to main content

The core machine learning library for Skyulf.

Project description

Skyulf Core

Skyulf Core (skyulf-core) is the standalone machine learning library that powers the Skyulf MLOps platform. It provides a robust, type-safe, and modular set of tools for:

  • Data Preprocessing: A comprehensive suite of transformers for cleaning, scaling, encoding, and feature engineering.
  • Modeling: Unified interfaces for classification and regression models, wrapping Scikit-Learn and other libraries.
  • Pipeline Management: Tools to build, serialize, and execute complex ML pipelines.
  • Tuning: Advanced hyperparameter tuning capabilities with support for Grid Search, Random Search, and Optuna.
  • Evaluation: Standardized metrics and evaluation schemas for model performance tracking.

Docs PyPI License Downloads issues contributors

Website & Documentation

Visit the full documentation and project site for guides, API reference, and examples:

Installation

pip install skyulf-core

# EDA-focused install (core EDA + optional advanced EDA + visualization)
pip install skyulf-core[eda,viz]

# For visualization support (Rich dashboard + Matplotlib plots)
pip install skyulf-core[viz]

# For advanced EDA add-ons (sentiment + causal discovery)
pip install skyulf-core[eda]

# For hyperparameter tuning engines
pip install skyulf-core[tuning]

# For imbalance-aware preprocessing (e.g., SMOTE)
pip install skyulf-core[preprocessing-imbalanced]

# For XGBoost modeling nodes
pip install skyulf-core[modeling-xgboost]

Quick Start: Automated EDA

Skyulf Core includes a powerful automated Exploratory Data Analysis (EDA) module.

import polars as pl
from skyulf.profiling.analyzer import EDAAnalyzer
from skyulf.profiling.visualizer import EDAVisualizer

# 1. Load Data
df = pl.read_csv("data.csv")

# 2. Analyze
analyzer = EDAAnalyzer(df)
# Optional: Manually specify special columns if auto-detection fails
profile = analyzer.analyze(
    target_col="target",
    date_col="timestamp",  # Optional
    lat_col="latitude",    # Optional
    lon_col="longitude"    # Optional
)

# 3. Visualize
viz = EDAVisualizer(profile, df)
viz.summary()  # Prints rich terminal dashboard
viz.plot()     # Opens interactive plots

Features

  • Automated EDA: One-line profiling with Data Quality, Outliers, Time Series, and Geospatial analysis.
  • Type-Safe: Built with modern Python type hints and Pydantic models.
  • Modular: Use only the components you need.
  • Serializable: All components are designed to be easily serialized for storage and deployment.
  • Extensible: Easy to extend with your own custom transformers and models.

License

This project is licensed under the terms of the Apache 2.0 license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skyulf_core-0.1.13.tar.gz (132.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

skyulf_core-0.1.13-py3-none-any.whl (148.0 kB view details)

Uploaded Python 3

File details

Details for the file skyulf_core-0.1.13.tar.gz.

File metadata

  • Download URL: skyulf_core-0.1.13.tar.gz
  • Upload date:
  • Size: 132.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for skyulf_core-0.1.13.tar.gz
Algorithm Hash digest
SHA256 6ff720a5fc97023d9d807a214ce2e19f09b2ba470d00d764484f373d2387ef28
MD5 079f099290273c92182f36e70d2f4caf
BLAKE2b-256 1939647c9c970dfcf26fd0e240933b8584a496a3df7a751fe019d42e9dba564d

See more details on using hashes here.

File details

Details for the file skyulf_core-0.1.13-py3-none-any.whl.

File metadata

  • Download URL: skyulf_core-0.1.13-py3-none-any.whl
  • Upload date:
  • Size: 148.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for skyulf_core-0.1.13-py3-none-any.whl
Algorithm Hash digest
SHA256 bb68bd6d0d593887bf200fbacd70310aa1450898a9c36dfc002b8564698ca7c0
MD5 5a8f5ff351824228f3a9801492e0e320
BLAKE2b-256 4db2c24c7013ea7588e10128a77e5472d302e11a188dfcd3cf94c499b7360b97

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page