Celery-like task queue for ML inference.
Project description
ModelQ
ModelQ is a lightweight Python library for scheduling and queuing machine learning inference tasks. It's designed as a faster and simpler alternative to Celery for ML workloads, using Redis and threading to efficiently run background tasks.
ModelQ is developed and maintained by the team at Modelslab.
About Modelslab: Modelslab provides powerful APIs for AI-native applications including:
- Image generation
- Uncensored chat
- Video generation
- Audio generation
- And much more
✨ Features
- ✅ Retry support (automatic and manual)
- ⏱ Timeout handling for long-running tasks
- 🔁 Manual retry using
RetryTaskException - 🎮 Streaming results from tasks in real-time
- 🧹 Middleware hooks for task lifecycle events
- ⚡ Fast, non-blocking concurrency using threads
- 🧵 Built-in decorators to register tasks quickly
- 💃 Redis-based task queueing
- 🖥️ CLI interface for orchestration
- 🔢 Pydantic model support for task validation and typing
- 🌐 Auto-generated REST API for tasks
🗆 Installation
pip install modelq
🚀 Auto-Generated REST API
One of ModelQ's most powerful features is the ability to expose your tasks as HTTP endpoints automatically.
By running a single command, every registered task becomes an API route:
modelq serve-api --app-path main:modelq_app --host 0.0.0.0 --port 8000
How It Works
- Each task registered with
@q.task(...)is turned into a POST endpoint under/tasks/{task_name} - If your task uses Pydantic input/output, the endpoint will validate the request and return a proper response schema
- The API is built using FastAPI, so you get automatic Swagger docs at:
http://localhost:8000/docs
Example Usage
curl -X POST http://localhost:8000/tasks/add \
-H "Content-Type: application/json" \
-d '{"a": 3, "b": 7}'
You can now build ML inference APIs without needing to write any web code!
🖥️ CLI Usage
You can interact with ModelQ using the modelq command-line tool. All commands require an --app-path parameter to locate your ModelQ instance in module:object format.
Start Workers
modelq run-workers main:modelq_app --workers 2
Start background worker threads for executing tasks.
Check Queue Status
modelq status --app-path main:modelq_app
Show number of servers, queued tasks, and registered task types.
List Queued Tasks
modelq list-queued --app-path main:modelq_app
Display a list of all currently queued task IDs and their names.
Clear the Queue
modelq clear-queue --app-path main:modelq_app
Remove all tasks from the queue.
Remove a Specific Task
modelq remove-task --app-path main:modelq_app --task-id <task_id>
Remove a specific task from the queue by ID.
Serve API
modelq serve-api --app-path main:modelq_app --host 0.0.0.0 --port 8000 --log-level info
Start a FastAPI server for ModelQ to accept task submissions over HTTP.
Version
modelq version
Print the current version of ModelQ CLI.
More commands like requeue-stuck, prune-results, and get-task-status are coming soon.
🧠 Basic Usage
from modelq import ModelQ
from modelq.exceptions import RetryTaskException
from redis import Redis
import time
imagine_db = Redis(host="localhost", port=6379, db=0)
q = ModelQ(redis_client=imagine_db)
@q.task(timeout=10, retries=2)
def add(a, b):
return a + b
@q.task(stream=True)
def stream_multiples(x):
for i in range(5):
time.sleep(1)
yield f"{i+1} * {x} = {(i+1) * x}"
@q.task()
def fragile(x):
if x < 5:
raise RetryTaskException("Try again.")
return x
q.start_workers()
task = add(2, 3)
print(task.get_result(q.redis_client))
🔢 Pydantic Support
ModelQ supports Pydantic models as both input and output types for tasks. This allows automatic validation of input parameters and structured return values.
Example
from pydantic import BaseModel, Field
from redis import Redis
from modelq import ModelQ
import time
class AddIn(BaseModel):
a: int = Field(ge=0)
b: int = Field(ge=0)
class AddOut(BaseModel):
total: int
redis_client = Redis(host="localhost", port=6379, db=0)
mq = ModelQ(redis_client=redis_client)
@mq.task(schema=AddIn, returns=AddOut)
def add(payload: AddIn) -> AddOut:
print(f"Processing addition: {payload.a} + {payload.b}.")
time.sleep(10) # Simulate some processing time
return AddOut(total=payload.a + payload.b)
Getting Result
output = job.get_result(mq.redis_client, returns=AddOut)
ModelQ will validate inputs using Pydantic and serialize/deserialize results seamlessly.
⚙️ Middleware Support
ModelQ allows you to plug in custom middleware to hook into events:
Supported Events
before_worker_bootafter_worker_bootbefore_worker_shutdownafter_worker_shutdownbefore_enqueueafter_enqueueon_error
Example
from modelq.app.middleware import Middleware
class LoggingMiddleware(Middleware):
def before_enqueue(self, *args, **kwargs):
print("Task about to be enqueued")
def on_error(self, task, error):
print(f"Error in task {task.task_id}: {error}")
Attach to ModelQ instance:
q.middleware = LoggingMiddleware()
🛠️ Configuration
Connect to Redis using custom config:
from redis import Redis
imagine_db = Redis(host="localhost", port=6379, db=0)
modelq = ModelQ(
redis_client=imagine_db,
delay_seconds=10, # delay between retries
webhook_url="https://your.error.receiver/discord-or-slack"
)
📜 License
ModelQ is released under the MIT License.
🤝 Contributing
We welcome contributions! Open an issue or submit a PR at github.com/modelslab/modelq.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file modelq-1.0.1.tar.gz.
File metadata
- Download URL: modelq-1.0.1.tar.gz
- Upload date:
- Size: 19.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.1 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c16d46baa250206268ee907ce3cc033df68750cc43eef841b380ed13ae44b18f
|
|
| MD5 |
f431fecd02712fce88074383c2ef71d9
|
|
| BLAKE2b-256 |
97e5248ff8a73c29d04fed8b6e61cce6f1cce044ff5088fe23e790952f224534
|
File details
Details for the file modelq-1.0.1-py3-none-any.whl.
File metadata
- Download URL: modelq-1.0.1-py3-none-any.whl
- Upload date:
- Size: 21.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.3 CPython/3.12.1 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ecb90caaf1064676c55ab1a85f5c7a57e0b3a455ac1e97d30586b7c58385939c
|
|
| MD5 |
8c4672d97419d26fd68a6111c1bb96a8
|
|
| BLAKE2b-256 |
78ac4cbcb0cc6e5a618417bfcc51d9a54468930be33dcd16bc3b4805a42e09a1
|