Lightweight TOXO Cloud client and CLI for .toxo layers

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

spiderdev27

These details have not been verified by PyPI

Project links

Project description

toxo-cloud

Lightweight Python client and CLI for TOXO Cloud — use TOXO without heavy local dependencies.

What is TOXO?

TOXO is a smart layer platform that turns any black-box LLM (Gemini, GPT, Claude) into a Context Augmented Language Model (CALM) — giving it domain expertise, memory, and better outputs without fine-tuning.

The idea

Base LLM (Gemini, GPT, Claude) = general intelligence
.toxo layer = domain specialization, memory, rules, soft prompt
TOXO runtime = runs the layer on top of the LLM

.toxo files are small (tens of KB): soft prompts, memory state, and configuration. No LLM weights. The heavy compute stays on your LLM provider.

Why TOXO vs fine-tuning

Fine-tuning	TOXO
$10k–$100k+ per domain	$5–$500
Days to weeks	Minutes to hours
GPU clusters	API-based, no GPUs
One model per domain	Unlimited layers
Permanent changes	Remove layer anytime
Model-specific	Works with any LLM

TOXO features and capabilities

TOXO provides:

Smart layer training

Train domain experts from Q&A examples and documents
No GPUs — uses your LLM API
Derives static rules, contexts, and example phrases from your data
Output: portable .toxo layer you can run anywhere

Advanced memory

Persistent knowledge storage and retrieval
Context-aware learning
Conversation history and domain expertise
Memory state baked into the .toxo layer

Quality enhancement

Response depth control: concise, balanced, detailed
Feedback loop: rate responses, add suggestions
Query-type intelligence (factual, procedural, comparison, etc.)
Domain-specific output refinement

Multimodal understanding

Images: PNG, JPG, screenshots
Documents: PDF, DOCX, HTML, TXT, CSV, RTF, EPUB
Direct-to-Gemini: no local rendering deps
MIME auto-inference for documents and images

RAG (retrieval-augmented generation)

Index documents into a vector store
Query with grounded answers and citations
Works with cloud or local index

Multi-agent orchestration

Pipeline, hierarchical, peer-to-peer coordination
Agents can ask each other questions (nudges)
Dynamic planning: auto-generate execution plan from goal
Final synthesis pass merges step outputs into one artifact

Universal LLM support

Gemini (recommended)
OpenAI GPT
Anthropic Claude

No-code layer creation

Create layers at toxotune.com
Upload examples, download .toxo
Use with any TOXO runtime (cloud or local)

toxo-cloud: thin client for TOXO Cloud

toxo-cloud is a lightweight Python client that talks to TOXO Cloud. Your .toxo layer and API key stay on your side; all heavy processing runs in the cloud.

What toxo-cloud supports (via TOXO Cloud)

Capability	Endpoint	Python method
Text query	`POST /v1/query`	`client.query()`
Multimodal (image/document)	`POST /v1/query_multimodal`	`client.query_multimodal()`
Feedback	`POST /v1/feedback`	`client.feedback()`
Train from data	`POST /v1/train_from_data`	`client.train_from_data()`
Resume training	`POST /v1/train_resume_from_data`	`client.train_resume_from_data()`
Extract contexts	`POST /v1/train/extract_contexts`	`client.train_extract_contexts()`
RAG index	`POST /v1/rag/index`	`client.rag_index()`
RAG query	`POST /v1/rag/query`	`client.rag_query()`
Agent workflow	`POST /v1/agent/run`	`client.agent_run()`

When to use toxo-cloud

You want minimal install (requests + pyyaml)
You prefer cloud execution — no local LLM, doc conversion, or vector store
You have a .toxo layer and want to call TOXO Cloud from Python or the CLI

Alternative: local runtime

The toxo package (pip install toxo) provides the full local runtime: ToxoLayer, training, RAG, agents — all on your machine. Use it when you need offline execution or heavy local workflows.

End-to-end flow with toxo-cloud

Create a .toxo layer (train via train_from_data, use toxotune.com, or use an existing layer)
Call TOXO Cloud with toxo-cloud (Python or CLI)
Get domain-aware responses; optionally send feedback or retrain

Install

pip install -U toxo-cloud

Quick start

Python

from toxo_cloud import ToxoCloudClient

client = ToxoCloudClient(api_key="YOUR_GEMINI_API_KEY")

answer = client.query(
    layer="finance_expert.toxo",
    question="Explain inflation in simple words.",
)
print(answer)

CLI

export GEMINI_API_KEY="YOUR_GEMINI_API_KEY"

toxo-cloud health
toxo-cloud query finance_expert.toxo "Explain inflation in simple words."

Use-case quick recipes

1) Build a domain expert API quickly

from toxo_cloud import ToxoCloudClient

client = ToxoCloudClient()
response = client.query(
    layer="my_support_expert.toxo",
    question="How should we handle a failed payment escalation?",
    response_depth="balanced",
)
print(response)

2) Analyze documents and images

doc_answer = client.query_multimodal(
    layer="ops_expert.toxo",
    question="Summarize this policy and list action items.",
    document_path="policy.pdf",
)

image_answer = client.query_multimodal(
    layer="ops_expert.toxo",
    question="Extract metrics from this dashboard screenshot.",
    image_path="dashboard.png",
)

3) Add cloud RAG over your docs

client.rag_index(
    index_id="handbook_v1",
    docs=["docs/employee_handbook.pdf", "docs/benefits.md"],
    domain="hr",
)

result = client.rag_query(
    layer="hr_assistant.toxo",
    index_id="handbook_v1",
    question="What is our parental leave policy?",
)
print(result["response"])

Authentication

toxo-cloud resolves API keys in this order:

explicit api_key=... argument
GEMINI_API_KEY
GOOGLE_API_KEY

If none is available, requests that require provider access will fail.

Python API

`ToxoCloudClient(...)`

from toxo_cloud import ToxoCloudClient
client = ToxoCloudClient(api_key="...", timeout=120)

api_key: optional, can come from env vars
timeout: default HTTP timeout in seconds

Text query

response = client.query(
    layer="my_layer.toxo",
    question="What can you help with?",
    provider="gemini",
    model="gemini-2.5-flash-lite",
    response_depth="balanced",  # concise | balanced | detailed
)

Multimodal query (image or document)

Exactly one source is required:

image_path
document_path
image_base64
document_base64

# Document from path (mime type auto-inferred)
doc_resp = client.query_multimodal(
    layer="my_layer.toxo",
    question="Summarize this report in 5 bullets.",
    document_path="report.pdf",
)

# Image from base64
img_resp = client.query_multimodal(
    layer="my_layer.toxo",
    question="Extract key values from this screenshot.",
    image_base64="iVBORw0KGgoAAAANSUhEUgAA...",
    mime_type="image/png",
)

Notes:

document_path now works without explicit mime_type (auto inference).
Raw base64 is automatically wrapped into a data:<mime>;base64,... format.

Feedback

result = client.feedback(
    layer="my_layer.toxo",
    question="Original user question",
    response="Model answer",
    rating=8.5,
    suggestions=["Be more concise", "Include caveats"],
)

Train from data

examples = [
    {"input": "What is inflation?", "output": "Inflation is a rise in prices over time."},
]

result = client.train_from_data(
    description="Finance helper for basic personal finance questions.",
    out_path="finance_layer.toxo",
    examples=examples,
    epochs=3,
    llm_provider="gemini",
    llm_model="gemini-2.5-flash-lite",
)

Extract contexts from documents

result = client.train_extract_contexts(
    docs=["docs/policy.md", "docs/faq.pdf"],
    index_id="billing_contexts",
    domain="billing",
)

Cloud RAG

index_result = client.rag_index(
    index_id="product_docs_v1",
    docs=["docs/guide.md", "docs/faq.pdf"],
    domain="product",
)

query_result = client.rag_query(
    layer="my_layer.toxo",
    index_id="product_docs_v1",
    question="What are the cancellation rules?",
    top_k=5,
)

Agent orchestration

result = client.agent_run(
    config="agent_config.yaml",
    layer_map={
        "research": "research_expert.toxo",
        "writer": "writer_expert.toxo",
    },
    llm_provider="gemini",
    llm_model="gemini-2.5-flash-lite",
)

print(result.get("final"))

CLI reference

toxo-cloud --help

Core commands:

toxo-cloud root
toxo-cloud health
toxo-cloud query <layer> "<question>"
toxo-cloud ask-doc <layer> <document> "<question>"
toxo-cloud ask-image <layer> <image> "<question>"
toxo-cloud feedback <layer> "<question>" "<response>" [--rating 8.5] [--suggestion "..."]

Training and retrieval:

toxo-cloud train-auto "<description>" --out layer.toxo
toxo-cloud train-from-data --description "..." --out layer.toxo --examples-jsonl examples.jsonl --contexts-file contexts.txt
toxo-cloud train-extract-contexts --docs docs/ --recursive --pattern "*.md"
toxo-cloud rag-index --index-id docs_v1 --docs docs/ --recursive
toxo-cloud rag-query --layer layer.toxo --index-id docs_v1 --question "..." --top-k 5

Agents:

toxo-cloud agent-run --config agent_config.yaml --layer research=research.toxo --layer writer=writer.toxo

Endpoint coverage

toxo-cloud supports:

GET /
GET /health
POST /v1/query
POST /v1/query_multimodal
POST /v1/feedback
POST /v1/train_from_data
POST /v1/train/extract_contexts
POST /v1/rag/index
POST /v1/rag/query
POST /v1/agent/run

Endpoint-by-endpoint examples

All examples below use the toxo-cloud Python client. For raw HTTP testing, curl examples are also included.

from toxo_cloud import ToxoCloudClient

client = ToxoCloudClient(api_key="YOUR_GEMINI_API_KEY")

1) `GET /` (service root)

info = client.root()
print(info)

curl -s https://toxo-api-cwsddklqcq-uc.a.run.app/

2) `GET /health` (health check)

health = client.health()
print(health)  # {"status": "ok"}

curl -s https://toxo-api-cwsddklqcq-uc.a.run.app/health

3) `POST /v1/query` (text query)

response = client.query(
    layer="finance_expert.toxo",
    question="Explain inflation in 3 bullets.",
    provider="gemini",
    model="gemini-2.5-flash-lite",
    response_depth="balanced",
)
print(response)

4) `POST /v1/query_multimodal` (image or document query)

# Document path
doc_response = client.query_multimodal(
    layer="finance_expert.toxo",
    question="Summarize this document in 5 bullets.",
    document_path="report.pdf",
)
print(doc_response)

# Image path
img_response = client.query_multimodal(
    layer="finance_expert.toxo",
    question="What key numbers are visible in this image?",
    image_path="dashboard.png",
)
print(img_response)

5) `POST /v1/feedback` (quality feedback)

result = client.feedback(
    layer="finance_expert.toxo",
    question="What is inflation?",
    response="Inflation is the rise in prices over time.",
    rating=9.0,
    suggestions=["Add one real-world example."],
)
print(result)

6) `POST /v1/train_from_data` (train and download `.toxo`)

examples = [
    {"input": "What is compound interest?", "output": "Interest on principal plus accumulated interest."},
    {"input": "What is diversification?", "output": "Spreading risk across asset types."},
]

train_result = client.train_from_data(
    description="Finance helper for beginner investing questions.",
    out_path="finance_trained.toxo",
    examples=examples,
    epochs=3,
    llm_provider="gemini",
    llm_model="gemini-2.5-flash-lite",
)
print(train_result.get("summary", {}))

7) `POST /v1/train_resume_from_data` (resume training from an existing `.toxo`)

This continues training from an existing layer (preserves its learned state) and applies the new examples/contexts on top.

resume_result = client.train_resume_from_data(
    base_layer="finance_trained.toxo",
    description="Finance helper for beginner investing questions (v2).",
    out_path="finance_trained_v2.toxo",
    examples=[
        {"input": "What is diversification?", "output": "Spreading risk across asset types."},
        {"input": "Explain compounding.", "output": "Interest on interest over time."},
    ],
    contexts=[
        "When the user asks for investing help, include risk + disclaimers.",
        "Prefer simple explanations for beginners unless they request details.",
    ],
    epochs=3,
    llm_provider="gemini",
    llm_model="gemini-2.5-flash-lite",
)
print(resume_result.get("summary", {}))

8) `POST /v1/train/extract_contexts` (extract contexts from docs)

extract_result = client.train_extract_contexts(
    docs=["docs/policies.md", "docs/faq.pdf"],
    index_id="finance_contexts",
    domain="finance",
)
print(extract_result)

9) `POST /v1/rag/index` (index documents for retrieval)

index_result = client.rag_index(
    index_id="finance_docs_v1",
    docs=["docs/handbook.pdf", "docs/fees.md"],
    domain="finance",
    layer="finance_expert.toxo",  # optional
)
print(index_result)

10) `POST /v1/rag/query` (query indexed docs)

rag_result = client.rag_query(
    layer="finance_expert.toxo",
    index_id="finance_docs_v1",
    question="What are the account closure charges?",
    top_k=5,
    llm_provider="gemini",
    llm_model="gemini-2.5-flash-lite",
    response_depth="balanced",
)
print(rag_result.get("response", rag_result))

11) `POST /v1/agent/run` (multi-agent workflow)

agent_config = {
    "goal": "Create a concise customer-facing refund policy summary.",
    "coordination": "pipeline",
    "agents": [
        {"id": "research", "layer": "unused", "role": "Policy researcher"},
        {"id": "writer", "layer": "unused", "role": "Customer-facing writer"},
    ],
    "steps": [
        {"agent": "research", "question": "Extract key policy constraints and timelines."},
        {"agent": "writer", "question": "Use {{prev}} and produce a concise final summary."},
    ],
}

agent_result = client.agent_run(
    config=agent_config,
    layer_map={
        "research": "finance_expert.toxo",
        "writer": "finance_expert.toxo",
    },
    llm_provider="gemini",
    llm_model="gemini-2.5-flash-lite",
)
print(agent_result.get("final", agent_result))

CLI one-liners for each endpoint

toxo-cloud root
toxo-cloud health
toxo-cloud query finance_expert.toxo "Explain inflation briefly."
toxo-cloud ask-doc finance_expert.toxo ./report.pdf "Summarize this report."
toxo-cloud ask-image finance_expert.toxo ./chart.png "What trend is shown?"
toxo-cloud feedback finance_expert.toxo "What is inflation?" "Inflation is..." --rating 8.5 --suggestion "Add an example"
toxo-cloud train-auto "Finance helper for beginner users" --out finance_auto.toxo
toxo-cloud train-from-data --description "Finance helper" --out finance_data.toxo --examples-jsonl examples.jsonl --contexts-file contexts.txt
toxo-cloud train-resume-from-data --base-layer finance_trained.toxo --description "Finance helper (v2)" --out finance_trained_v2.toxo --examples-jsonl examples.jsonl --contexts-file contexts.txt
toxo-cloud train-extract-contexts --docs docs/ --recursive --pattern "*.md"
toxo-cloud rag-index --index-id finance_docs_v1 --docs docs/ --recursive
toxo-cloud rag-query --layer finance_expert.toxo --index-id finance_docs_v1 --question "What are account fees?"
toxo-cloud agent-run --config agent_config.yaml --layer research=finance_expert.toxo --layer writer=finance_expert.toxo

SEO / GEO friendly search intents

This package is relevant for queries like:

TOXO Python client
how to use .toxo file in Python
smart layer for LLMs
Context Augmented Language Models (CALM)
Gemini document and image analysis Python
Python RAG client with cloud index
multi-agent LLM workflow Python
lightweight LLM cloud connector package

TOXO FAQ

Does `toxo-cloud` train foundation models?

No. TOXO uses smart layers (.toxo) that guide and specialize provider models.

Do I need to host my own inference servers?

No for this package. toxo-cloud is a thin client and the runtime executes in TOXO Cloud.

Can I use this for multimodal and RAG workflows?

Yes. toxo-cloud supports multimodal endpoints plus cloud RAG index/query endpoints.

Where should I start?

Start with query(...), then add:

query_multimodal(...) for documents/images
rag_index(...) + rag_query(...) for grounded retrieval
agent_run(...) for orchestrated multi-agent workflows

Troubleshooting

401/403 or provider errors: verify GEMINI_API_KEY / GOOGLE_API_KEY.
Layer not found: check layer path; the client reads layer bytes locally before sending.
Multimodal failures: pass exactly one source (image_path, document_path, image_base64, document_base64).
YAML config errors for agents: ensure pyyaml is installed (included by default dependency).

License

Proprietary. See repository license terms.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

spiderdev27

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.4

Mar 30, 2026

0.1.3

Mar 24, 2026

0.1.2

Mar 24, 2026

0.1.1

Mar 24, 2026

0.1.0

Mar 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

toxo_cloud-0.1.4.tar.gz (18.2 kB view details)

Uploaded Mar 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

toxo_cloud-0.1.4-py3-none-any.whl (13.2 kB view details)

Uploaded Mar 30, 2026 Python 3

File details

Details for the file toxo_cloud-0.1.4.tar.gz.

File metadata

Download URL: toxo_cloud-0.1.4.tar.gz
Upload date: Mar 30, 2026
Size: 18.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for toxo_cloud-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`21100266c8c3811a2d7750896d7fcf61bfbe18bc32c13efea8aa14f7bd6ec947`
MD5	`10510cb163bf70cb854080c5ba12383e`
BLAKE2b-256	`c11c466e27406232f0553b1b422dc49557dd75f6c31d7815aa8ac093cebec2e0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for toxo_cloud-0.1.4.tar.gz:

Publisher: publish.yml on spiderdev27/toxo-cloud

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: toxo_cloud-0.1.4.tar.gz
- Subject digest: 21100266c8c3811a2d7750896d7fcf61bfbe18bc32c13efea8aa14f7bd6ec947
- Sigstore transparency entry: 1198820350
- Sigstore integration time: Mar 30, 2026
Source repository:
- Permalink: spiderdev27/toxo-cloud@95aa0572d63e2aa4afe3d4e6a4c7e187a9ae1461
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/spiderdev27
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@95aa0572d63e2aa4afe3d4e6a4c7e187a9ae1461
- Trigger Event: push

File details

Details for the file toxo_cloud-0.1.4-py3-none-any.whl.

File metadata

Download URL: toxo_cloud-0.1.4-py3-none-any.whl
Upload date: Mar 30, 2026
Size: 13.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for toxo_cloud-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a65beb1f932cfcf5b1ce1e4aa1e8cf7f8584a3bda7ad6da046d789d4bec933b4`
MD5	`4c0a1a8459542fc8ccd7c87d7246bbc0`
BLAKE2b-256	`77b58fbf51bbb01b5818fcbe38d4425abcabbe39d2ccd133399801fe356ad796`

See more details on using hashes here.

Provenance

The following attestation bundles were made for toxo_cloud-0.1.4-py3-none-any.whl:

Publisher: publish.yml on spiderdev27/toxo-cloud

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: toxo_cloud-0.1.4-py3-none-any.whl
- Subject digest: a65beb1f932cfcf5b1ce1e4aa1e8cf7f8584a3bda7ad6da046d789d4bec933b4
- Sigstore transparency entry: 1198820355
- Sigstore integration time: Mar 30, 2026
Source repository:
- Permalink: spiderdev27/toxo-cloud@95aa0572d63e2aa4afe3d4e6a4c7e187a9ae1461
- Branch / Tag: refs/tags/v0.1.4
- Owner: https://github.com/spiderdev27
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@95aa0572d63e2aa4afe3d4e6a4c7e187a9ae1461
- Trigger Event: push

toxo-cloud 0.1.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

toxo-cloud

What is TOXO?

The idea

Why TOXO vs fine-tuning

TOXO features and capabilities

Smart layer training

Advanced memory

Quality enhancement

Multimodal understanding

RAG (retrieval-augmented generation)

Multi-agent orchestration

Universal LLM support

No-code layer creation

toxo-cloud: thin client for TOXO Cloud

What toxo-cloud supports (via TOXO Cloud)

When to use toxo-cloud

Alternative: local runtime

End-to-end flow with toxo-cloud

Install

Quick start

Python

CLI

Use-case quick recipes

1) Build a domain expert API quickly

2) Analyze documents and images

3) Add cloud RAG over your docs

Authentication

Python API

ToxoCloudClient(...)

Text query

Multimodal query (image or document)

Feedback

Train from data

Extract contexts from documents

Cloud RAG

Agent orchestration

CLI reference

Endpoint coverage

Endpoint-by-endpoint examples

1) GET / (service root)

2) GET /health (health check)

3) POST /v1/query (text query)

4) POST /v1/query_multimodal (image or document query)

5) POST /v1/feedback (quality feedback)

6) POST /v1/train_from_data (train and download .toxo)

7) POST /v1/train_resume_from_data (resume training from an existing .toxo)

8) POST /v1/train/extract_contexts (extract contexts from docs)

9) POST /v1/rag/index (index documents for retrieval)

10) POST /v1/rag/query (query indexed docs)

11) POST /v1/agent/run (multi-agent workflow)

CLI one-liners for each endpoint

SEO / GEO friendly search intents

TOXO FAQ

Does toxo-cloud train foundation models?

Do I need to host my own inference servers?

Can I use this for multimodal and RAG workflows?

Where should I start?

Troubleshooting

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

`ToxoCloudClient(...)`

1) `GET /` (service root)

2) `GET /health` (health check)

3) `POST /v1/query` (text query)

4) `POST /v1/query_multimodal` (image or document query)

5) `POST /v1/feedback` (quality feedback)

6) `POST /v1/train_from_data` (train and download `.toxo`)

7) `POST /v1/train_resume_from_data` (resume training from an existing `.toxo`)

8) `POST /v1/train/extract_contexts` (extract contexts from docs)

9) `POST /v1/rag/index` (index documents for retrieval)

10) `POST /v1/rag/query` (query indexed docs)

11) `POST /v1/agent/run` (multi-agent workflow)

Does `toxo-cloud` train foundation models?