dstack is an open-source orchestration engine for running AI workloads on any cloud or on-premises.
Project description
dstack is an open-source container orchestration engine designed for AI workloads across any cloud or data center.
The supported cloud providers include AWS, GCP, Azure, Lambda, TensorDock, Vast.ai, CUDO, and RunPod.
You can also use dstack to run workloads on on-prem servers.
Latest news ✨
- [2024/04] dstack 0.18.1: On-prem servers, and new example (Release)
- [2024/04] dstack 0.18.0: RunPod, multi-node tasks, and more (Release)
- [2024/03] dstack 0.17.0: Auto-scaling, and other improvements (Release)
- [2024/02] dstack 0.16.0: Pools (Release)
- [2024/02] dstack 0.15.1: Kubernetes (Release)
Installation
Before using dstack through the CLI or API, set up a dstack server.
Install the server
The easiest way to install the server is via pip:
pip install "dstack[all]" -U
Configure backends
If you have default AWS, GCP, or Azure credentials on your machine, the dstack server will pick them up automatically.
Otherwise, you need to manually specify the cloud credentials in ~/.dstack/server/config.yml.
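For illustration, here is a minimal sketch of what ~/.dstack/server/config.yml might look like when relying on the default AWS credentials chain. The project name and backend choice are examples only; refer to the server setup guide for the full schema.

```yaml
# ~/.dstack/server/config.yml — minimal sketch (example project name, default AWS credentials)
projects:
  - name: main            # example project name
    backends:
      - type: aws
        creds:
          type: default   # pick up credentials from the environment or AWS CLI profile
```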
For further details on setting up the server, refer to installation.
Start the server
To start the server, use the dstack server command:
$ dstack server
Applying ~/.dstack/server/config.yml...
The admin token is "bbae0f28-d3dd-4820-bf61-8f4bb40815da"
The server is running at http://127.0.0.1:3000/
Note: It's also possible to run the server via Docker.
CLI & API
Once the server is up, you can use either dstack's CLI or API to run workloads.
Below is a live demo of how it works with the CLI.
Dev environments
You specify the required environment and resources, then run it. dstack provisions the dev environment in the cloud and enables access via your desktop IDE.
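As a rough sketch, a dev environment is described with a YAML configuration along these lines; the Python version, IDE, and GPU requirement below are illustrative, not prescriptive:

```yaml
# .dstack.yml — illustrative dev environment configuration
type: dev-environment
python: "3.11"     # example Python version
ide: vscode        # access the environment from your desktop VS Code
resources:
  gpu: 24GB        # example GPU memory requirement
```

You would then launch it with the dstack run command, and dstack provisions a matching instance in one of the configured backends.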
Tasks
Tasks make it convenient to schedule any kind of batch job, such as training, fine-tuning, or data processing, as well as to run web applications.
Specify the environment and resources, then run it. dstack executes the task in the cloud, enabling port forwarding to your local machine for convenient access.
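For example, a task configuration might look roughly like this; the script and requirements file names are hypothetical:

```yaml
# train.dstack.yml — illustrative task configuration
type: task
python: "3.11"
commands:
  - pip install -r requirements.txt   # hypothetical dependencies file
  - python train.py                   # hypothetical training script
resources:
  gpu: 24GB                           # example GPU memory requirement
```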
Services
Services make it easy to deploy any kind of model or web application as a public endpoint.
Use any serving frameworks and specify required resources. dstack deploys it in the configured backend, handles authorization, and provides an OpenAI-compatible interface if needed.
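A service configuration follows the same pattern, adding the port the application listens on. The serving command below is only a placeholder for whatever framework you use:

```yaml
# serve.dstack.yml — illustrative service configuration
type: service
python: "3.11"
commands:
  - python serve.py   # placeholder for your serving framework's start command
port: 8000            # the port the app listens on; dstack exposes it as an endpoint
resources:
  gpu: 24GB           # example GPU memory requirement
```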
Pools
Pools simplify managing the lifecycle of cloud instances and enable their efficient reuse across runs.
You can have instances provisioned in the cloud automatically, or add them manually, configuring the required resources, idle duration, etc.
Examples
Here are some featured examples:
Browse the examples for more.
More information
For additional information and examples, see the following links:
License
Project details
Release history
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dstack-0.18.2rc2.tar.gz.
File metadata
- Download URL: dstack-0.18.2rc2.tar.gz
- Upload date:
- Size: 262.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.9.19
File hashes
Algorithm | Hash digest
---|---
SHA256 | 113023fcb6d1c5c005275fae9b31b2aa9e44954042e192190756bce91001f759
MD5 | acf4c126872a68309fb2c5605b6836f3
BLAKE2b-256 | 5ca835c44fff77b80e9ccc790ceb867d60355b569a835fa1061a779d396d3aa5
File details
Details for the file dstack-0.18.2rc2-py3-none-any.whl.
File metadata
- Download URL: dstack-0.18.2rc2-py3-none-any.whl
- Upload date:
- Size: 399.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.9.19
File hashes
Algorithm | Hash digest
---|---
SHA256 | d61207ce0a54562d904810c845ec33fab3ab2ecac55a2b3b1402167be1f1337f
MD5 | 96ec8551107f779f23e53865372aaf3e
BLAKE2b-256 | a7e161584b11612d4f15755d44c2cf517b13dee78306458d7d26e2337d928bfd