
LongDLLM

🚀 Plug-and-play long context adaptation for diffusion language models

LongDLLM enables seamless extension of diffusion language models to handle long-context inputs (up to 128k tokens) with minimal code changes and a unified interface.

✨ Features

  • 🎯 Drop-in compatibility: Works with existing code - just add one function call
  • 🧠 Memory efficient: Handle 128k+ tokens on a single A6000 GPU (48GB VRAM)
  • Long-context performance: ships with pre-tuned rescale factors for context extension
  • 🔧 Unified interface: Same API for all supported models

🤖 Supported Models

  • Apple DiffuCoder-7B-Instruct - Code generation with long context
  • GSAI-ML LLaDA-8B-Instruct - General instruction following with extended context

📦 Installation

Basic Installation

pip install longdllm

Installing FlashAttention is highly recommended. You can install it separately via pip install flash-attn --no-build-isolation.
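
If you want to make sure FlashAttention is actually picked up, one option is to request it explicitly when loading the model. The snippet below is a minimal sketch using the standard transformers attn_implementation argument; whether the model's remote code honors it depends on the model.

import importlib.util
import torch
from transformers import AutoModel

# Fall back to the default attention implementation if flash-attn is not installed.
has_flash_attn = importlib.util.find_spec("flash_attn") is not None

model = AutoModel.from_pretrained(
    "apple/DiffuCoder-7B-Instruct",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True,
    attn_implementation="flash_attention_2" if has_flash_attn else "eager",
)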

🚀 Quick Start

DiffuCoder Example

import torch
from transformers import AutoModel, AutoTokenizer
from longdllm import adapt_for_long_context

# 1. Load your model as usual
model = AutoModel.from_pretrained(
    "apple/DiffuCoder-7B-Instruct",
    torch_dtype=torch.bfloat16,
    trust_remote_code=True
)

# 2. Adapt for long context (128k tokens)
model = adapt_for_long_context(model, target_length=131072)

# 3. Generate with long sequences
tokenizer = AutoTokenizer.from_pretrained("apple/DiffuCoder-7B-Instruct")
inputs = tokenizer("Your long prompt here...", return_tensors="pt")

output = model.diffusion_generate(
    inputs.input_ids,
    attention_mask=inputs.attention_mask,
    max_new_tokens=256,
    steps=32,  # Diffusion steps
    temperature=0.3,
    top_p=0.95,
    alg="entropy"
)
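
What diffusion_generate returns is defined by the model's remote code. Assuming a Dream-style output object with a sequences field (DiffuCoder is built on Dream), decoding the completion looks roughly like this:

# Assumption: output.sequences holds the full token ids (prompt + completion).
prompt_len = inputs.input_ids.shape[1]
completion = tokenizer.decode(
    output.sequences[0][prompt_len:].tolist(),
    skip_special_tokens=True,
)
print(completion)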

LLaDA Example

⚠️ LLaDA Note: Patched methods ignore attention_bias for memory efficiency. This is safe per LLaDA issue #90.

from transformers import AutoTokenizer, AutoModel
from longdllm import adapt_for_long_context

# 1. Load and adapt LLaDA model  
model = AutoModel.from_pretrained("GSAI-ML/LLaDA-8B-Instruct", trust_remote_code=True)
model = adapt_for_long_context(model, target_length=131072)

# 2. Use unified diffusion_generate interface
tokenizer = AutoTokenizer.from_pretrained("GSAI-ML/LLaDA-8B-Instruct")
inputs = tokenizer("Your instruction here...", return_tensors="pt")

outputs = model.diffusion_generate(
    input_ids=inputs.input_ids,
    max_new_tokens=512,
    temperature=0.0,
    steps=128,
    block_length=128,
    remasking='low_confidence'
)
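
Again, the exact return type depends on the patched model; assuming outputs is a plain tensor of token ids that includes the prompt, decoding is the usual slice-and-decode:

# Assumption: outputs has shape (batch, prompt_len + generated_len) and contains token ids.
prompt_len = inputs.input_ids.shape[1]
print(tokenizer.batch_decode(outputs[:, prompt_len:], skip_special_tokens=True)[0])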

💡 Examples

Check out our example scripts to see LongDLLM in action:

Running Examples

# Test DiffuCoder with 128k context
cd examples && python test_diffucoder.py

# Test LLaDA with 128k context  
cd examples && python test_llada.py

Both examples demonstrate passkey retrieval - finding a hidden number in long documents, a common benchmark for long-context capabilities.
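
For reference, a passkey prompt can be built along these lines. This is a simplified sketch, not the exact prompt used by the example scripts:

import random

# Bury a random passkey inside repeated filler text, then ask the model to recall it.
passkey = random.randint(10000, 99999)
filler = "The grass is green. The sky is blue. The sun is yellow. Here we go. " * 4000
prompt = (
    f"{filler}\nThe passkey is {passkey}. Remember it. {passkey} is the passkey.\n{filler}\n"
    "What is the passkey? The passkey is"
)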

⚙️ Advanced Configuration

Custom Rescale Factors

Want to experiment? You can provide custom factors:

# Example: mixed exponential/linear rescale factors (approximating optimized values)
import numpy as np
custom_factors = (
    list(np.logspace(0, 1.5, 34)) +    # 1.0 to ~31.6, exponentially spaced
    list(np.linspace(16.3, 31.3, 30))  # linear spacing for the remaining (lower-frequency) dimensions
)

model = adapt_for_long_context(
    model,
    target_length=65536,  # Custom length
    scaling_method='longrope',
    rescale_factors=custom_factors
)
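
The number of factors above (34 + 30 = 64) is intended to match the number of RoPE frequency pairs, i.e. head_dim / 2 for these 7B/8B models; this is an assumption about the adapter's expectations, so check it against your model's config.

# Assumption: one rescale factor per RoPE frequency pair (head_dim // 2), i.e. 64 here.
assert len(custom_factors) == 64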

License

MIT

Citation

If you use LongDLLM in your research, please cite:

@misc{ge2025longcontext,
  title = {Long-Context Extension for Language Diffusion Models up to 128k Tokens},
  url = {https://albertge.notion.site/longcontext},
  author = {Ge, Albert and Singh, Chandan and Zhang, Dinghuai and Peng, Letian and Shang, Ning and Zhang, Li Lyna and Liu, Liyuan and Gao, Jianfeng},
  journal = {Albert Ge's Notion},
  year = {2025},
  month = sep,
}

🤝 Support & Contributing

🐛 Issues & Questions
