dstack is an open-source orchestration engine for running AI workloads on any cloud or on-premises.
Project description
dstack
is a streamlined alternative to Kubernetes, specifically designed for AI. It simplifies container orchestration
for AI workloads both in the cloud and on-prem, speeding up the development, training, and deployment of AI models.
dstack
is easy to use with any cloud providers as well as on-prem servers.
Accelerators
dstack
supports NVIDIA GPU
, AMD GPU
, and Google Cloud TPU
out of the box.
Major news ✨
- [2024/08] dstack 0.18.11: AMD, encryption, and more (Release)
- [2024/08] dstack 0.18.10: Control plane UI (Release)
- [2024/07] dstack 0.18.7: Fleets, RunPod volumes, dstack apply, and more (Release)
- [2024/05] dstack 0.18.4: Google Cloud TPU, and more (Release)
- [2024/05] dstack 0.18.2: On-prem clusters, private subnets, and more (Release)
Installation
Before using dstack
through CLI or API, set up a dstack
server.
Configure backends
To use dstack
with your own cloud accounts, create the ~/.dstack/server/config.yml
file and
configure backends.
Start the server
Once backends are configured, proceed to start the server:
$ pip install "dstack[all]" -U
$ dstack server
Applying ~/.dstack/server/config.yml...
The admin token is "bbae0f28-d3dd-4820-bf61-8f4bb40815da"
The server is running at http://127.0.0.1:3000/
For more details on server configuration options, see the server deployment guide.
Set up the CLI
To point the CLI to the dstack
server, configure it
with the server address, user token, and project name:
$ pip install dstack
$ dstack config --url http://127.0.0.1:3000 \
--project main \
--token bbae0f28-d3dd-4820-bf61-8f4bb40815da
Configuration is updated at ~/.dstack/config.yml
Create SSH fleets
If you want the dstack
server to run containers on your on-prem servers,
use fleets.
How does it work?
Before using
dstack
, install the server and configure backends.
1. Define configurations
dstack
supports the following configurations:
- Dev environments — for interactive development using a desktop IDE
- Tasks — for scheduling jobs (incl. distributed jobs) or running web apps
- Services — for deployment of models and web apps (with auto-scaling and authorization)
- Fleets — for managing cloud and on-prem clusters
- Volumes — for managing persisted volumes
- Gateways — for configuring the ingress traffic and public endpoints
Configuration can be defined as YAML files within your repo.
2. Apply configurations
Apply the configuration either via the dstack apply
CLI command or through a programmatic API.
dstack
automatically manages provisioning, job queuing, auto-scaling, networking, volumes, run failures,
out-of-capacity errors, port-forwarding, and more — across clouds and on-prem clusters.
More information
For additional information and examples, see the following links:
Contributing
You're very welcome to contribute to dstack
.
Learn more about how to contribute to the project at CONTRIBUTING.md.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dstack-0.18.15rc1.tar.gz
.
File metadata
- Download URL: dstack-0.18.15rc1.tar.gz
- Upload date:
- Size: 14.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bf9a011641e242075517601462ed969c16d7761cd3a3623280dbcab8f49dcde8 |
|
MD5 | 606269776b1ad1dd2c8fe5ff41cf1ebf |
|
BLAKE2b-256 | 0cacc032dbf55ddcf562c4de7c30d145e90147b8a38caee78c2ce1acef1547e1 |
File details
Details for the file dstack-0.18.15rc1-py3-none-any.whl
.
File metadata
- Download URL: dstack-0.18.15rc1-py3-none-any.whl
- Upload date:
- Size: 15.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3014b1fe54fe403bdbade21516b7474322fd7a64936139d9b98c0de14d1707c4 |
|
MD5 | e25e4d4bdb3c04ebae3bae28235abf91 |
|
BLAKE2b-256 | 9903bd78f332999512ab6cb5e2c8609f6c251053fd327d799ba7276982262f5f |