Skip to main content

First OSS production-grade serving engine for diffusion language models

Project description

dlmserve

The first OSS production-grade serving engine for diffusion language models.

Coming soon. Diffusion LLM serving engine — LLaDA, DiffuLLaMA, and more.


Diffusion language models (LLaDA, DiffuLLaMA, Mercury Coder) are architecturally distinct from autoregressive transformers. They need their own scheduler, KV cache semantics, batching strategy, and sampling logic. dlmserve is the missing piece.

  • Bidirectional attention, not causal
  • Denoising-step-aware continuous batching
  • Committed/pending KV cache split
  • OpenAI-compatible HTTP API

Status

Pre-alpha. Not ready for use. Watch this space.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dlmserve-0.0.0.tar.gz (34.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dlmserve-0.0.0-py3-none-any.whl (7.1 kB view details)

Uploaded Python 3

File details

Details for the file dlmserve-0.0.0.tar.gz.

File metadata

  • Download URL: dlmserve-0.0.0.tar.gz
  • Upload date:
  • Size: 34.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"26.04","id":"resolute","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for dlmserve-0.0.0.tar.gz
Algorithm Hash digest
SHA256 13c68c9e03c41f2a5908e52cc7588fd2d9243df691d1de13d22eeda58901b8f0
MD5 3ddc3b958823e9f6f2b082ac7d6e6c81
BLAKE2b-256 f0ebcf9da9ca231009ccce1847ad73d8cca8d0db7a7b2e3332b939cd6a696e1c

See more details on using hashes here.

File details

Details for the file dlmserve-0.0.0-py3-none-any.whl.

File metadata

  • Download URL: dlmserve-0.0.0-py3-none-any.whl
  • Upload date:
  • Size: 7.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"26.04","id":"resolute","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for dlmserve-0.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 273216c4833d368f23f9f348b5e0713e0b7cfdda9a15c16a547c7ef74a7aa243
MD5 68b134e9a9d6173eab44fe19f107ed1e
BLAKE2b-256 0bba6bb12c25fd3685c7fe39ebdd08e95d9aee3c18ce2b358e40c31ce8c5f773

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page