The fastest way to build, chain, and reuse LLM agents and flows

These details have not been verified by PyPI

Project links

Project description

🏎️ cruise-llm

Quickly build and reuse LLM workflows/agents with a clean, composable API — inspired by scikit-learn's chainability and litellm's model flexibility.

from cruise_llm import LLM
LLM().user("Explain quantum computing").chat(stream=True)

⛓️ Multi-turn Prompt Queues

Build complex micro-workflows by queuing prompts that the model will execute sequentially.

# Automatic multi-step processing
news_processor = (
    LLM(model="fast")
    .user(f"Process this article: {raw_text}")
    .queue("Summarize the key points into 3 bullet points for an executive.")
    .queue("Translate those points into Spanish.")
    .queue("Format the Spanish summary as a Slack message with emojis.")
    .chat()
)

# Create reusable bot templates
def style_refiner(style):
    return LLM().sys(f"Rewrite in a {style} tone").queue("Make it half the length")

casual = style_refiner("casual")
formal = style_refiner("formal")

casual.user("We need to discuss Q3 deliverables").res()
formal.user("hey wanna grab coffee and chat about the project?").res()

🔧 Easy Tool Calling for Fast Agent Building

Simply define functions, no schema necessary:

def search_docs(query: str):
    """Search internal documentation."""
    return f"Found: '{query}' appears in onboarding.md and api-reference.md"

def create_ticket(title: str, priority: str):
    """Create a support ticket."""
    return f"Created ticket #{hash(title) % 1000}: {title} [{priority}]"

def send_slack(channel: str, message: str):
    """Send a Slack message."""
    return f"Sent to #{channel}: {message[:50]}..."

support_agent = (
    LLM()
    .sys("You are a support agent")
    .tools(fns=[search_docs, create_ticket, send_slack])
)

support_agent.user("User can't log in. Check docs, create a P1 ticket, and alert #incidents").chat()

🖼️ Image Support

Attach images to prompts - auto-switches to a vision-capable model if needed:

# Single image
LLM().user("What's in this image?", image="photo.jpg").chat()

# Multiple images
LLM().user("Compare these", image=["before.png", "after.png"]).chat()

# URL
LLM().user("Describe this", image="https://example.com/image.jpg").chat()

🔄 Flexible conversations

Chat instances with swappable models and minimal verbosity:

chat1 = (
    LLM(model="fast")
    .sys("You are a bitcoin analyst")
    .user("What is proof of work?").chat()
    .user("Steel man the case for bitcoin mining").chat()
    .user("Now steel man the case against").chat()
)

# Replay history with more intelligent yet expensive config
chat2 = chat1.run_history(model="best", reasoning=True, reasoning_effort="high")

# Save chat histories to analyze offline or load later
chat1.save_llm("chats/bitcoin_analysis_fast_model.json")
chat2.save_llm("chats/bitcoin_analysis_best_model.json")

🔀 Model Discovery & A/B Testing

Pick specific models or get up-to-date top-10 from category:

LLM(model="gpt-5.2")
LLM(model="best")     # top intelligence rankings
LLM(model="fast")     # optimized for speed
LLM(model="cheap")    
LLM(model="open")     # open-source models
LLM(model="optimal")  # balanced best+fast (default)
LLM(model="codex")    

# Deterministic selection by rank
LLM(model="best0")    # top model in best category
LLM(model="fast2")    # 3rd fastest model

# Discover and filter what's available
LLM().get_models("claude")
LLM().models_with_vision()
LLM().models_with_search()

💰 Cost Tracking

Track token usage and costs across your session:

llm = LLM(model="best")
llm.user("Explain quantum computing").chat()
llm.user("Summarize in one sentence").chat()

print(f"Last call: ${llm.last_cost():.6f}")
print(f"Session total: ${llm.total_cost():.6f}")
print(f"Breakdown: {llm.all_costs()}")

💾 Save, Load, Export

# Save an agent config
researcher = LLM("claude-sonnet-4-5").tools(search=True)
researcher.save_llm("agents/researcher.json")

# Load
r = LLM.load_llm("agents/researcher.json")
r.user(f"What happened in tech {todays_date}?").chat()

# Export conversation to markdown
r.to_md(f"tech_briefing/{todays_date}.md")

📦 Install

pip install cruise-llm

Your access to models is based on your API keys from the various providers—keys are available for free from most providers. Create a local .env file in your project root with at least one API key. Use litellm-specific variable names:

OPENAI_API_KEY=sk-proj-...
ANTHROPIC_API_KEY=sk-ant-...
GEMINI_API_KEY=AIza...
GROQ_API_KEY=gsk_...
XAI_API_KEY=xai-...

Caveat: Search, reasoning, and model categories/rankings (best, cheap, fast, open, etc.) has only been tested with the above listed providers. Calling other providers (perplexity, huggingface etc.) is still available with explicit litellm model strings but may require different search/reasoning setup.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.9.0

Feb 15, 2026

0.8.0

Feb 9, 2026

0.7.0

Feb 6, 2026

0.6.0

Feb 6, 2026

0.5.0

Feb 3, 2026

This version

0.4.0

Jan 28, 2026

0.3.0

Jan 22, 2026

0.2.1

Jan 16, 2026

0.2.0

Jan 1, 2026

0.1.3

Dec 19, 2025

0.1.2

Dec 17, 2025

0.1.0

Dec 15, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cruise_llm-0.4.0.tar.gz (16.9 kB view details)

Uploaded Jan 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cruise_llm-0.4.0-py3-none-any.whl (17.8 kB view details)

Uploaded Jan 28, 2026 Python 3

File details

Details for the file cruise_llm-0.4.0.tar.gz.

File metadata

Download URL: cruise_llm-0.4.0.tar.gz
Upload date: Jan 28, 2026
Size: 16.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for cruise_llm-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`0b99bab93af670c6e132be71643b12a550bc83b7d4f90c754839e24fc5ef6c48`
MD5	`6cc6e1fd934cb70e56a761c37c991496`
BLAKE2b-256	`a322fecfbdf6f8f68993a6403695eb36b288fcbb9df59f79c85211bcbc575f4f`

See more details on using hashes here.

File details

Details for the file cruise_llm-0.4.0-py3-none-any.whl.

File metadata

Download URL: cruise_llm-0.4.0-py3-none-any.whl
Upload date: Jan 28, 2026
Size: 17.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for cruise_llm-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`54f93edea8d409ab450e5aa76ab02292caf5d3b26f7b495120b5f9c0feac9bbf`
MD5	`44a5037d664e65157b76bbde7479cee7`
BLAKE2b-256	`0b63ef0b6ff2feade1c9df45b1f903d37ebddd67ecadfb9b8bc8c3f074d53f66`

See more details on using hashes here.

cruise-llm 0.4.0

Navigation

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Project description

🏎️ cruise-llm

⛓️ Multi-turn Prompt Queues

🔧 Easy Tool Calling for Fast Agent Building

🖼️ Image Support

🔄 Flexible conversations

🔀 Model Discovery & A/B Testing

💰 Cost Tracking

💾 Save, Load, Export

📦 Install

Project details

Verified details

Maintainers

Meta

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes