Skip to main content

dstack is an open-source orchestration engine for running AI workloads on any cloud or on-premises.

Project description

dstack is a streamlined alternative to Kubernetes, specifically designed for AI. It simplifies container orchestration for AI workloads both in the cloud and on-prem, speeding up the development, training, and deployment of AI models.

dstack is easy to use with any cloud providers as well as on-prem servers.

Accelerators

dstack supports NVIDIA GPU, AMD GPU, and Google Cloud TPU out of the box.

Major news ✨

Installation

Before using dstack through CLI or API, set up a dstack server.

Configure backends

To use dstack with your own cloud accounts, create the ~/.dstack/server/config.yml file and configure backends.

Start the server

Once backends are configured, proceed to start the server:

$ pip install "dstack[all]" -U
$ dstack server

Applying ~/.dstack/server/config.yml...

The admin token is "bbae0f28-d3dd-4820-bf61-8f4bb40815da"
The server is running at http://127.0.0.1:3000/

For more details on server configuration options, see the server deployment guide.

Set up the CLI

To point the CLI to the dstack server, configure it with the server address, user token, and project name:

$ pip install dstack
$ dstack config --url http://127.0.0.1:3000 \
    --project main \
    --token bbae0f28-d3dd-4820-bf61-8f4bb40815da
    
Configuration is updated at ~/.dstack/config.yml

Create SSH fleets

If you want the dstack server to run containers on your on-prem servers, use fleets.

How does it work?

Before using dstack, install the server and configure backends.

1. Define configurations

dstack supports the following configurations:

  • Dev environments — for interactive development using a desktop IDE
  • Tasks — for scheduling jobs (incl. distributed jobs) or running web apps
  • Services — for deployment of models and web apps (with auto-scaling and authorization)
  • Fleets — for managing cloud and on-prem clusters
  • Volumes — for managing persisted volumes
  • Gateways — for configuring the ingress traffic and public endpoints

Configuration can be defined as YAML files within your repo.

2. Apply configurations

Apply the configuration either via the dstack apply CLI command or through a programmatic API.

dstack automatically manages provisioning, job queuing, auto-scaling, networking, volumes, run failures, out-of-capacity errors, port-forwarding, and more — across clouds and on-prem clusters.

More information

For additional information and examples, see the following links:

Contributing

You're very welcome to contribute to dstack. Learn more about how to contribute to the project at CONTRIBUTING.md.

License

Mozilla Public License 2.0

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dstack-0.18.15rc1.tar.gz (14.9 MB view details)

Uploaded Source

Built Distribution

dstack-0.18.15rc1-py3-none-any.whl (15.1 MB view details)

Uploaded Python 3

File details

Details for the file dstack-0.18.15rc1.tar.gz.

File metadata

  • Download URL: dstack-0.18.15rc1.tar.gz
  • Upload date:
  • Size: 14.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.10

File hashes

Hashes for dstack-0.18.15rc1.tar.gz
Algorithm Hash digest
SHA256 bf9a011641e242075517601462ed969c16d7761cd3a3623280dbcab8f49dcde8
MD5 606269776b1ad1dd2c8fe5ff41cf1ebf
BLAKE2b-256 0cacc032dbf55ddcf562c4de7c30d145e90147b8a38caee78c2ce1acef1547e1

See more details on using hashes here.

File details

Details for the file dstack-0.18.15rc1-py3-none-any.whl.

File metadata

  • Download URL: dstack-0.18.15rc1-py3-none-any.whl
  • Upload date:
  • Size: 15.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.10

File hashes

Hashes for dstack-0.18.15rc1-py3-none-any.whl
Algorithm Hash digest
SHA256 3014b1fe54fe403bdbade21516b7474322fd7a64936139d9b98c0de14d1707c4
MD5 e25e4d4bdb3c04ebae3bae28235abf91
BLAKE2b-256 9903bd78f332999512ab6cb5e2c8609f6c251053fd327d799ba7276982262f5f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page