A suite of AI libraries and tools that accelerates model serving and provides programmability all the way to the GPU kernels
Project description
Modular
The Modular Platform is an open and fully-integrated suite of AI libraries and tools that accelerates model serving and scales GenAI deployments. It abstracts away hardware complexity so you can run the most popular open AI models with industry-leading performance on GPUs and CPUs. You can also customize everything from the serving pipeline and model architecture all the way down to the metal by writing custom ops and GPU kernels in Mojo. Read more.
Get started
It takes only a moment to start an OpenAI-compatible endpoint or use our Python API to run inference with a GenAI model from Hugging Face.
Try it now with our quickstart guide.
Stay in touch
Check out our GitHub repo and join our community.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file modular-25.4.0-py3-none-any.whl
.
File metadata
- Download URL: modular-25.4.0-py3-none-any.whl
- Upload date:
- Size: 1.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
b9cd824e11759d1aa92919cd57d97e43b038b9452b169dd63d32e33bcdd7eb0b
|
|
MD5 |
5382eea535d38406b2ab48573ad2c0e2
|
|
BLAKE2b-256 |
dd4aefd4141f3094183a45160b29b247243a376a2d9cac0a1e823fe59405c48f
|