Skip to main content

A suite of AI libraries and tools that accelerates model serving and provides programmability all the way to the GPU kernels

Project description

Modular

The Modular Platform is an open and fully-integrated suite of AI libraries and tools that accelerates model serving and scales GenAI deployments. It abstracts away hardware complexity so you can run the most popular open AI models with industry-leading performance on GPUs and CPUs. You can also customize everything from the serving pipeline and model architecture all the way down to the metal by writing custom ops and GPU kernels in Mojo. Read more.

Get started

It takes only a moment to start an OpenAI-compatible endpoint or use our Python API to run inference with a GenAI model from Hugging Face.

Try it now with our quickstart guide.

Stay in touch

Check out our GitHub repo and join our community.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

modular-25.4.0-py3-none-any.whl (1.4 kB view details)

Uploaded Python 3

File details

Details for the file modular-25.4.0-py3-none-any.whl.

File metadata

  • Download URL: modular-25.4.0-py3-none-any.whl
  • Upload date:
  • Size: 1.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.3

File hashes

Hashes for modular-25.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b9cd824e11759d1aa92919cd57d97e43b038b9452b169dd63d32e33bcdd7eb0b
MD5 5382eea535d38406b2ab48573ad2c0e2
BLAKE2b-256 dd4aefd4141f3094183a45160b29b247243a376a2d9cac0a1e823fe59405c48f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page