Skip to main content

RHOAI tool kit for managing and upgrading RHOAI

Project description

RHOAI Tool Kit

Python Version OpenShift Compatible

A comprehensive toolkit for managing and upgrading Red Hat OpenShift AI (RHOAI) installations with parallel installation support.

๐Ÿ“‹ Table of Contents

โœจ Features

  • Install single or multiple OpenShift operators
  • Parallel installation for faster deployments
  • Configurable timeouts and retries
  • Comprehensive logging system
  • Supports:
    • Serverless Operator
    • Service Mesh Operator
    • Authorino Operator
    • cert-manager Operator (Kueue dependency)
    • RHOAI Operator
    • Kueue Operator
    • KEDA (Custom Metrics Autoscaler) Operator
  • Automatic Dependency Resolution: Installs required operators in correct order
  • Smart Validation: Pre-installation compatibility and conflict detection

๐Ÿ“ Project Structure

rhoshift/
โ”œโ”€โ”€ rhoshift/              # Main package directory
โ”‚   โ”œโ”€โ”€ __init__.py
โ”‚   โ”œโ”€โ”€ main.py           # CLI entry point
โ”‚   โ”œโ”€โ”€ cli/              # Command-line interface
โ”‚   โ”‚   โ”œโ”€โ”€ __init__.py
โ”‚   โ”‚   โ”œโ”€โ”€ args.py      # Argument parsing
โ”‚   โ”‚   โ””โ”€โ”€ commands.py  # Command implementations
โ”‚   โ”œโ”€โ”€ logger/          # Logging utilities
โ”‚   โ”‚   โ”œโ”€โ”€ __init__.py
โ”‚   โ”‚   โ””โ”€โ”€ logger.py    # Logging configuration
โ”‚   โ””โ”€โ”€ utils/           # Core utilities
โ”‚       โ”œโ”€โ”€ __init__.py
โ”‚       โ”œโ”€โ”€ constants.py # Constants and configurations
โ”‚       โ”œโ”€โ”€ operator.py  # Operator management
โ”‚       โ””โ”€โ”€ utils.py     # Utility functions
โ”œโ”€โ”€ run_upgrade_matrix.sh  # Upgrade matrix execution script
โ”œโ”€โ”€ upgrade_matrix_usage.md # Upgrade matrix documentation
โ”œโ”€โ”€ pyproject.toml        # Project dependencies and configuration
โ””โ”€โ”€ README.md            # This document

๐Ÿ“‹ Components

Core Components

  • CLI: Command-line interface for operator management
  • Logger: Logging configuration and utilities (logs to /tmp/rhoshift.log)
  • Utils: Core utilities and operator management logic

RHOAI Components

  • RHOAI Upgrade Matrix: Utilities for testing RHOAI upgrades
  • Upgrade Matrix Scripts: Execution and documentation for upgrade testing

Maintenance Scripts

  • Cleanup Scripts: Utilities for cleaning up operator installations
  • Worker Node Scripts: Utilities for managing worker node configurations

๐Ÿš€ Installation

  1. Clone the repository:
git clone https://github.com/mwaykole/O.git
cd O
  1. Install dependencies:
pip install -e .
  1. Verify installation:
rhoshift --help

๐Ÿ”ง New CLI Options

rhoshift --help
usage: rhoshift [-h] [--serverless] [--servicemesh] [--authorino] [--cert-manager] 
                [--rhoai] [--kueue] [--keda] [--all] [--cleanup] [--deploy-rhoai-resources]
                [--oc-binary OC_BINARY] [--retries RETRIES] [--retry-delay RETRY_DELAY]
                [--timeout TIMEOUT] [--rhoai-channel RHOAI_CHANNEL] [--raw RAW]
                [--rhoai-image RHOAI_IMAGE] [-v]

Operator Selection:
  --cert-manager        Install cert-manager Operator
  --kueue              Install Kueue Operator (auto-installs cert-manager)
  --keda               Install KEDA (Custom Metrics Autoscaler) Operator
  [... other options ...]

๐Ÿ’ป Usage

Basic Commands

# Install single operator
rhoshift --serverless

# Install multiple operators
rhoshift --serverless --servicemesh

# Install cert-manager operator
rhoshift --cert-manager

# Install Kueue operator (automatically installs cert-manager dependency)
rhoshift --kueue

# Install KEDA (Custom Metrics Autoscaler) operator
rhoshift --keda

# Install RHOAI with raw configuration
rhoshift --rhoai --rhoai-channel=<channel> --rhoai-image=<image> --raw=True

# Install RHOAI with Serverless configuration
rhoshift --rhoai --rhoai-channel=<channel> --rhoai-image=<image> --raw=False --all

# Install all operators (including Kueue and KEDA)
rhoshift --all

# Create DSC and DSCI with RHOAI operator installation
rhoshift --rhoai --deploy-rhoai-resources

# Clean up all operators
rhoshift --cleanup

๐Ÿ”— Operator Dependencies & Validation

The tool automatically handles operator dependencies and provides smart validation:

Automatic Dependency Resolution

  • Kueue requires cert-manager: Installing Kueue automatically includes cert-manager
  • Dependencies are installed in the correct order to prevent failures
  • Missing dependencies are automatically detected and added
# This command will install BOTH cert-manager AND Kueue (in correct order)
rhoshift --kueue

# You'll see output like:
# ๐Ÿ“ฆ Auto-adding dependency: cert-manager
# Installing 2 operators in order: cert-manager โ†’ kueue

Smart Validation

  • Compatibility Checking: Warns about potential operator conflicts
  • Namespace Validation: Detects if operators conflict in shared namespaces
  • Pre-Installation Validation: Catches issues before installation starts
# Example validation warnings:
# โš ๏ธ  Note: Kueue and KEDA may have resource conflicts. Monitor for admission webhook issues.
# โš ๏ธ  Installation order will be adjusted for dependencies: cert-manager โ†’ kueue

Supported Dependencies

Primary Operator Required Dependencies
Kueue cert-manager

Note: When installing Kueue individually (--kueue), you will see dependency warnings. For automatic dependency installation, use batch mode (--cert-manager --kueue) or install dependencies manually first.

Advanced Options

# Custom oc binary path
rhoshift --serverless --oc-binary /path/to/oc

# Custom timeout (seconds)
rhoshift --all --timeout 900

# Install queue management and auto-scaling operators together
# (cert-manager will be automatically installed as Kueue dependency)
rhoshift --kueue --keda

# Install complete ML/AI stack with queue management
rhoshift --rhoai --kueue --keda --rhoai-channel=stable --rhoai-image=<image>

# Install only cert-manager for other uses
rhoshift --cert-manager

# Verbose output
rhoshift --all --verbose

Upgrade Matrix Testing

To run the upgrade matrix tests, you can use either method:

  1. Using the shell script:
./run_upgrade_matrix.sh [options] <current_version> <current_channel> <new_version> <new_channel>
  1. Using the Python command:
run-upgrade-matrix [options] <current_version> <current_channel> <new_version> <new_channel>

Options:

  • -s, --scenario: Run specific scenario(s) (serverless, rawdeployment, serverless,rawdeployment)
  • --skip-cleanup: Skip cleanup before each scenario
  • --from-image: Custom source image path
  • --to-image: Custom target image path

Example:

# Using shell script
./run_upgrade_matrix.sh -s serverless -s rawdeployment 2.10 stable 2.12 stable

# Using Python command
run-upgrade-matrix -s serverless -s rawdeployment 2.10 stable 2.12 stable

๐Ÿ“ Logging

The toolkit uses a comprehensive logging system:

  • Logs are stored in /tmp/rhoshift.log
  • Console output shows INFO level and above
  • File logging captures DEBUG level and above
  • Automatic log rotation (10MB max size, 5 backup files)
  • Colored output in supported terminals

To view logs:

tail -f /tmp/rhoshift.log

๐Ÿ”ง Configuration

Environment Variables

  • LOG_FILE_LEVEL: Set file logging level (default: DEBUG)
  • LOG_CONSOLE_LEVEL: Set console logging level (default: INFO)

Command Options

  • --oc-binary: Path to oc CLI (default: oc)
  • --retries: Max retry attempts (default: 3)
  • --retry-delay: Delay between retries in seconds (default: 10)
  • --timeout: Command timeout in seconds (default: 300)

๐Ÿ› ๏ธ Development

Prerequisites

  • Python 3.8 or higher
  • OpenShift CLI (oc)
  • Access to an OpenShift cluster

Running Tests

pytest tests/

๐Ÿ” Troubleshooting

Common Issues

  1. Operator Installation Fails

    • Check cluster access: oc whoami
    • Verify operator catalog: oc get catalogsource
    • Check logs: tail -f /tmp/rhoshift.log
  2. Permission Issues

    • Ensure you have cluster-admin privileges
    • Check namespace permissions
  3. Timeout Errors

    • Increase timeout: --timeout 900
    • Check cluster resources

๐Ÿค Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a Pull Request

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rhoshift-0.1.5.0.tar.gz (50.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rhoshift-0.1.5.0-py3-none-any.whl (47.3 kB view details)

Uploaded Python 3

File details

Details for the file rhoshift-0.1.5.0.tar.gz.

File metadata

  • Download URL: rhoshift-0.1.5.0.tar.gz
  • Upload date:
  • Size: 50.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.4

File hashes

Hashes for rhoshift-0.1.5.0.tar.gz
Algorithm Hash digest
SHA256 621d7f9c20e23467accd8a059a25d174a895ea435911c2a0c4ae4b626d961a7d
MD5 f848cab082ff4c6f34b2afe113aee358
BLAKE2b-256 ecca03d0c29a6ccae43a6016cd64b6223cf4648f3319c273dfe7ade6a2a4c27e

See more details on using hashes here.

File details

Details for the file rhoshift-0.1.5.0-py3-none-any.whl.

File metadata

  • Download URL: rhoshift-0.1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 47.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.4

File hashes

Hashes for rhoshift-0.1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d7615e4eae31cf97587faa11ec620e1062afd2287c15bfe2d90ebea7c39c4264
MD5 5556c0715663b0e6e8edd84686de0a67
BLAKE2b-256 9a558fbab3605ed2333ac7e44fb82cb42ec39bf5687e0a010c085de77023c2f4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page