Skip to main content

Run ray on tracto.ai

Project description

Tractoray

The tool for running Ray clusters on Tracto.ai. Allows you to easily deploy and manage Ray clusters within Tracto.ai infrastructure.

Features

  • Launch Ray clusters with configurable resources.
  • Support native ray dashboard and ray client.
  • Support Ray Debugger.
  • Flexible Docker image configuration.

To see all the configuration options for Ray, please check the output of the tractoray start --help command.

Installation

Install tractoray with Ray CLI and all required dependencies:

pip install -U "tractoray[ray]"

Usage

Basic Commands

To use tractoray, you need to specify the working directory, for example your homedir //home/<login>/tractoray.

Start a cluster:

tractoray start --workdir //your/cypress/path --node-count 2

An output of the command contains an instruction to connect to the cluster.

Check the cluster status:

tractoray status --workdir //your/yt/path

Also JSON format is supported:

tractoray status --workdir //your/cypress/path --format json

Stop the cluster:

tractoray stop --workdir //your/cypress/path

For detailed information about task submission, log reading, and other operations, please check the ray status command output.

Using Custom Docker Images

You have two options for Docker images:

  1. Use the default image as base (recommended):

    FROM cr.eu-north1.nebius.cloud/e00faee7vas5hpsh3s/tractoray/default:2025-06-12-16-59-42-9a2ce5611
    
    # Add your dependencies
    RUN pip install your-package
    
  2. Build from scratch:

    • Install tractoray via pip
    • Use the same version as in your local environment and install all necessary dependencies for CUDA and Infiniband.
    FROM python:3.12
    
    RUN pip install tractoray==<your-local-version>
    # Add other dependencies
    

The default image includes all necessary dependencies and configurations to run Ray clusters for ML workloads. Using it as a base image is recommended to ensure compatibility.

Connect to the Ray Head Node terminal

Connect to the Ray head node terminal via the yt run-job-shell command or the web terminal in the UI. The tractoratay status command shows all necessary instructions and links.

Ray Debugging

Currently, only the legacy Ray debugger is supported.

To use ray debugger on Tracto.ai:

  1. Start your Ray cluster with RAY_DEBUG=legacy environment variable and the --ray-debugger-external option: tractoray start --workdir //your/cypress/path --ray-head-params="--ray-debugger-external" --ray-worker-params="--ray-debugger-external" --env-var "RAY_DEBUG=legacy"
  2. Connect to the Ray head node terminal using yt run-job-shell or web UI (instructions and necessary links can be found in the output of ray start and ray status).
  3. Run ray debug in the terminal on the head node to start the Ray Debugger.

You can read more about debugging Ray tasks in the official Ray documentation.

Environment Variables

  • YT_LOG_LEVEL: Set logging level

Limitations

  • Some Ray CLI options, like ray status, may not work properly on a laptop due to Ray authentication constraints. Use the Ray dashboard and the Ray SDK, or run Ray CLI on the head node instead.
  • Ray Serve is not supported.
  • Observability features are disabled by default. You can enable and configure them in your custom image.
  • Only legacy Ray Debugger is supported.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tractoray-0.0.12.tar.gz (91.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

tractoray-0.0.12-py3-none-any.whl (15.9 kB view details)

Uploaded Python 3

File details

Details for the file tractoray-0.0.12.tar.gz.

File metadata

  • Download URL: tractoray-0.0.12.tar.gz
  • Upload date:
  • Size: 91.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for tractoray-0.0.12.tar.gz
Algorithm Hash digest
SHA256 5ec7ffee3d4ffb30ce71d5ef31c2d6a4977f2e1e4aa19638531679f14b9ded64
MD5 cb20896b6e80c3664b29b0d7ae42209e
BLAKE2b-256 ad9cabfe5be3cf5ea674ca869816d5849cdc2e96b0854bee75fe2cb947506924

See more details on using hashes here.

Provenance

The following attestation bundles were made for tractoray-0.0.12.tar.gz:

Publisher: tractoray-pypi.yaml on tractoai/farm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file tractoray-0.0.12-py3-none-any.whl.

File metadata

  • Download URL: tractoray-0.0.12-py3-none-any.whl
  • Upload date:
  • Size: 15.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for tractoray-0.0.12-py3-none-any.whl
Algorithm Hash digest
SHA256 ae7aef182c5efe6f28fea12b9dac0f707f81a44c1b4bcc68550e09ebc854ca82
MD5 4849a25c423ed88b9d391f39f3fd7550
BLAKE2b-256 14db701ec56824dee25acc04493f9d87fe27b6da9fe38dd23a73f9d3a4728939

See more details on using hashes here.

Provenance

The following attestation bundles were made for tractoray-0.0.12-py3-none-any.whl:

Publisher: tractoray-pypi.yaml on tractoai/farm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page