LLM inference SDK, for telemetry and internal model routing
Project description
Maniac Python Client
A minimal python client for Maniac's API. Supports chat completions and dataset uploads.
Installation
pip install maniac
Example Usage
from __future__ import annotations
import asyncio
from maniac import Maniac
async def main() -> None:
client = Maniac() # or Maniac({"apiKey": os.environ["MANIAC_API_KEY"]})
try:
# Run inference without a container
# Using kwargs
standard_response = await client.chat.completions.create(
model="openai/gpt-4o-mini",
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
print(standard_response["choices"][0]["message"]["content"]) # type: ignore[index]
# Create a container to collect telemetry
container = await client.containers.create(
label="local-test",
initial_model="openai/gpt-4o-mini",
initial_system_prompt="You are a helpful assistant that answers questions and discusses travel topics.",
)
container_response = await client.chat.completions.create(
container=container,
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
print(container_response["choices"][0]["message"]["content"]) # type: ignore[index]
# Stream responses as async iterable
gen = await client.chat.completions.stream(
container=container,
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
async for chunk in gen: # type: ignore[union-attr]
piece = (
(chunk.get("choices") or [{}])[0].get("delta", {}).get("content", "")
)
if piece:
print(piece, end="", flush=True)
print()
# Stream responses with callback
async def on_chunk(chunk) -> None:
piece = (
(chunk.get("choices") or [{}])[0].get("delta", {}).get("content", "")
)
if piece:
print(piece, end="", flush=True)
await client.chat.completions.stream(
{"container": container, "messages": [{"role": "user", "content": "Tell me a story about france"}]},
on_chunk,
)
# Get a container by label and run a completion
travel_agent = await client.containers.get("local-test")
email_resp = await client.chat.completions.create(
container=travel_agent,
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
print(email_resp["choices"][0]["message"]["content"]) # type: ignore[index]
# Models list / retrieve
models = await client.models.list()
print([m["id"] for m in models.get("data", [])])
model = await client.models.retrieve("openai/gpt-4o-mini")
print(model)
finally:
await client.aclose()
if __name__ == "__main__":
asyncio.run(main())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
maniac-0.3.3.tar.gz
(49.3 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
maniac-0.3.3-py3-none-any.whl
(39.6 kB
view details)
File details
Details for the file maniac-0.3.3.tar.gz.
File metadata
- Download URL: maniac-0.3.3.tar.gz
- Upload date:
- Size: 49.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
56ae549cae8c79c7f3958c7c36b13998655e7a6db79ded79e6b3eea37ba2c573
|
|
| MD5 |
c2f857a7dd4da64fbb2db5207fb38cc3
|
|
| BLAKE2b-256 |
9d61eb37ee7d268b5a75085dd39afcdfa71cd7f2326667f78135790cb06b4a20
|
File details
Details for the file maniac-0.3.3-py3-none-any.whl.
File metadata
- Download URL: maniac-0.3.3-py3-none-any.whl
- Upload date:
- Size: 39.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fe607d9313082488560fb066841a9f79f18ed5d45a8520d9480543988b4ae524
|
|
| MD5 |
0cbba45238c6df3c7572e6f57d9b5b1a
|
|
| BLAKE2b-256 |
bc99bab171f61a91371e08545fbbd26f9ae89b838bac0f061bc79f53c99ca034
|