Skip to main content

YTsaurus pipeline framework with utilities and common modules

Project description

YT Framework

GitHub License PyPI - Version PyPI - Python Version

A powerful Python framework for building and executing data processing pipelines on YTsaurus (YT) clusters. YT Framework simplifies pipeline development with automatic stage discovery, seamless dev/prod mode switching, and comprehensive support for YT operations.

Key Features

  • Pipeline & Stage Architecture: Organize complex workflows into reusable stages
  • Automatic Stage Discovery: No manual registration needed - just create stages and run
  • Dev/Prod Modes: Develop locally with file system simulation, deploy to YT cluster seamlessly
  • Multiple Operation Types: Support for Map, Vanilla, YQL, and S3 operations
  • Code Upload: Automatic code packaging and deployment to YT cluster
  • Docker Support: Custom Docker images for GPU workloads and special dependencies
  • Checkpoint Management: Built-in support for ML model checkpoints
  • Configuration Management: Flexible YAML-based configuration with multiple config support

Quick Links

Installation

For Users

Install from PyPI:

pip install yt_framework

For Developers and Contributors

Install in editable mode from source:

pip install -e .

See Installation Guide for prerequisites and detailed setup instructions.

Quick Start

Create your first pipeline in 3 steps:

  1. Create pipeline structure:

    mkdir my_pipeline && cd my_pipeline
    mkdir -p stages/my_stage configs
    
  2. Create pipeline.py:

    from yt_framework.core.pipeline import DefaultPipeline
    
    if __name__ == "__main__":
        DefaultPipeline.main()
    
  3. Create stage and config:

    # stages/my_stage/stage.py
    from yt_framework.core.stage import BaseStage
    
    class MyStage(BaseStage):
        def run(self, debug):
            self.logger.info("Hello from YT Framework!")
            return debug
    

See Quick Start Guide for complete example.

Examples

The examples/ directory contains comprehensive examples demonstrating all framework features:

Documentation

Full documentation is available in the docs/ directory:

Requirements

  • Python 3.11+
  • YTsaurus cluster access (for production mode)
  • YT credentials (for production mode)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yt_framework-0.1.0.tar.gz (64.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yt_framework-0.1.0-py3-none-any.whl (79.8 kB view details)

Uploaded Python 3

File details

Details for the file yt_framework-0.1.0.tar.gz.

File metadata

  • Download URL: yt_framework-0.1.0.tar.gz
  • Upload date:
  • Size: 64.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for yt_framework-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7893ab70d10cf9b6a0c641ae99aeed74fb319b6b3ccf8767cc52fe6f5699cf5d
MD5 ef1cbd384f5c59671a4760825f505238
BLAKE2b-256 0f4204386a56afb6d8fb2b8c333b4588c26597994501c32a027a64c47ef7a361

See more details on using hashes here.

Provenance

The following attestation bundles were made for yt_framework-0.1.0.tar.gz:

Publisher: publish.yml on GregoryKogan/yt-framework

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file yt_framework-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: yt_framework-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 79.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for yt_framework-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 234d19598d82e601d5f1b6c508ea631a1983001ab6bbb9bb9cf26e86f3a73f46
MD5 0c717ea1a35673d25c795a51fe1d6dfa
BLAKE2b-256 e2459da86246825f2a5e3dee49caea60b0b5b2310b7932767f86cff13b64b99a

See more details on using hashes here.

Provenance

The following attestation bundles were made for yt_framework-0.1.0-py3-none-any.whl:

Publisher: publish.yml on GregoryKogan/yt-framework

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page