LLM inference SDK, for telemetry and internal model routing
Project description
Maniac Python Client
A minimal python client for Maniac's API. Supports chat completions and dataset uploads.
Installation
pip install maniac
Example Usage
from __future__ import annotations
import asyncio
from maniac import Maniac
async def main() -> None:
client = Maniac() # or Maniac({"apiKey": os.environ["MANIAC_API_KEY"]})
try:
# Run inference without a container
# Using kwargs
standard_response = await client.chat.completions.create(
model="openai/gpt-4o-mini",
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
print(standard_response["choices"][0]["message"]["content"]) # type: ignore[index]
# Create a container to collect telemetry
container = await client.containers.create(
label="local-test",
initial_model="openai/gpt-4o-mini",
initial_system_prompt="You are a helpful assistant that answers questions and discusses travel topics.",
)
container_response = await client.chat.completions.create(
container=container,
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
print(container_response["choices"][0]["message"]["content"]) # type: ignore[index]
# Stream responses as async iterable
gen = await client.chat.completions.stream(
container=container,
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
async for chunk in gen: # type: ignore[union-attr]
piece = (
(chunk.get("choices") or [{}])[0].get("delta", {}).get("content", "")
)
if piece:
print(piece, end="", flush=True)
print()
# Stream responses with callback
async def on_chunk(chunk) -> None:
piece = (
(chunk.get("choices") or [{}])[0].get("delta", {}).get("content", "")
)
if piece:
print(piece, end="", flush=True)
await client.chat.completions.stream(
{"container": container, "messages": [{"role": "user", "content": "Tell me a story about france"}]},
on_chunk,
)
# Get a container by label and run a completion
travel_agent = await client.containers.get("local-test")
email_resp = await client.chat.completions.create(
container=travel_agent,
messages=[{"role": "user", "content": "Tell me a story about france"}],
)
print(email_resp["choices"][0]["message"]["content"]) # type: ignore[index]
# Models list / retrieve
models = await client.models.list()
print([m["id"] for m in models.get("data", [])])
model = await client.models.retrieve("openai/gpt-4o-mini")
print(model)
finally:
await client.aclose()
if __name__ == "__main__":
asyncio.run(main())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
maniac-0.3.6.tar.gz
(51.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
maniac-0.3.6-py3-none-any.whl
(42.1 kB
view details)
File details
Details for the file maniac-0.3.6.tar.gz.
File metadata
- Download URL: maniac-0.3.6.tar.gz
- Upload date:
- Size: 51.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
99290846975e2a5a1c6069de05c98ad6918956f521ae9669caf82f4559692883
|
|
| MD5 |
a8b29e94a1641c9eedfb6bfb1a8fcfcd
|
|
| BLAKE2b-256 |
de47a9f5f4a49909598216442730ca81ff87c38c51790ff2d7a6cd09331a62da
|
File details
Details for the file maniac-0.3.6-py3-none-any.whl.
File metadata
- Download URL: maniac-0.3.6-py3-none-any.whl
- Upload date:
- Size: 42.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2948aa0c99f95a379dd2a5f23f21ab6dac179726716d8709e965733923050032
|
|
| MD5 |
123f5363d3d30dda5fd7578162e1223c
|
|
| BLAKE2b-256 |
325201376687f52b43935ace23a2b03e41b6c5d40d08b21c325fbd2d3dca35ec
|