llcat

/usr/bin/cat for LLMs

These details have not been verified by PyPI

Project links

Project description

llcat
/usr/bin/cat for LLMs

You want to pipe something into or out of a model sitting on a server.

Existing tools require you to:

install plugins
pick from a pre-baked provider boutique
pick a list of models which don't update
swap around credentials like you're Indiana Jones with a bag of sand

It's wildly inconvenient.

This fixes all that noise.

llcat, part of the DAY50 suite of open-source tools for AI workflows, allows for targeted precision, focused interaction with models and servers in order to do things like

show a server's models
see each model capabilities
do tool-calls
manage context windows

llcat works through regular JSON files through a principle of "least magic" - prioritizing predictability, compatibility, coherency, transparency and functionality.

It exists as a general-purpose CLI-based OpenAI-compatible /chat/completions caller.

It is like cURL or cat for LLMs: a stateless, transparent, explicit, low-level, composable tool for scripting and glue.

Conversations, keys, servers and other configurations are explicitly specified each execution as command line arguments.

This makes building things with llcat direct.

There is no caching or state saved between runs. Everything gets surfaced and errors are JSON parsable. There's a --curlify option as well. It's also quite fast and permits custom timeouts.

Very Quick Start

List the models on OpenRouter:

uvx llcat -u openrouter.ai/api -m

What about just the qwen ones?

uvx llcat -u openrouter.ai/api -m '*qwen3*'

What about their capabilities in JSON?

uvx llcat -u openrouter.ai/api -m '*qwen3*' --info | jq .

Sure. What about a different protocol, say ollama?

uvx llcat -u localhost:11434 -m '*qwen3*' --info | jq .

You might think "That's funny ... it looks the same."

Correct. Welcome to llcat.

All the abstraction without those pesky leaks.

llcat can:

Use local or remote servers, authenticated or not.
Store conversation history optionally, as a JSON file.
Pipe things from stdin and/or be prompted on the command line.
Do tool calling using the OpenAI spec and MCP STDIO servers.
List and choose models, system prompts, and add attachments.

llcat's basic CLI parameters are also compatible with Simon Willison's llm.

Examples

Here's some examples of how to use llcat as a building block for many common use-cases:

Transferrable Conversations
Stateful Interaction
Interactive Chat
Structured Output
Evals
Tool Calling

Example: Transferrable Conversations

Because conversations, models and servers are decoupled, you can mix and match them at any time.

Here's one conversation, hopping across models and servers.

Start a chat with Deepseek:

$ llcat -u https://openrouter.ai/api \
        -m deepseek/deepseek-r1-0528:free \
        -c /tmp/convo.txt \
        -sk "$(cat openrouter.key)" \
        "What is the capital of France?"

Continue it with Qwen using MAS format and using the @ syntax for including the key by file:

$ llcat -u "https://openrouter.ai/api#m=qwen/qwen3-4b:free"
        -c /tmp/convo.txt \
        -sk @openrouter.key \
        "And what about Canada?"

And finish on the local network:

$ llcat -u http://192.168.1.21:8080 \
        -c /tmp/convo.txt \
        "And what about Japan?"

Since the conversation goes to the filesystem as JSON you can use things like inotify or fuse and push it off to a vector search backend or modify the context window between calls.

Example: Adding State

llcat's explicit syntax means lots of things are within reach.

For instance wrappers can be made custom to your workflow.

Here's a way to store state with environment variables to make invocation more convenient:

llf()        { llc "$@" 2> >(jq . >&2) | examples/spinner sd }
llc()        { llcat -m "$LLC_MODEL" -u "$LLC_SERVER" -sk "$LLC_KEY" "$@" }
llc-model()  { LLC_MODEL=$(llcat -m  -u "$LLC_SERVER" -sk "$LLC_KEY" | fzf) }
llc-server() { LLC_SERVER=$1 }
llc-key()    { LLC_KEY=$1 }

And now you can do things like this:

$ llc-server http://192.168.1.21:8080
$ llc "write a diss track where the knapsack problem hates on the towers of hanoi"

And what's that llf at the top? That uses jq to pretty print the errors and streamdown to pretty print the output along with a program to display a spinner while you wait.

There's no configuration files to parse or implicit states to manage.

Example: Interactive Chat

A conversation interface is also quick:

#!/usr/bin/env bash

# We pick a file for the conversation or allow a user to pass it in with a CONV environment variable
conv=${CONV:-$(mktemp)}
echo -e "  Using: $conv\n"

# Show the previous conversation if there is any, stylize it with streamdown
jq -r '.[] | "\n**\(.role)**: \(.content)"' $conv | sd

# Read prompts in a loop
while read -E -p "  >> " query; do

    # Take the command line arguments of the shell script, pass them to llcat
    llcat -c $conv "$@" "$query" |& sd
    echo
done

So now instead of

llcat -u http://myserver -k mykey -m model

Our conversation loop can be invoked like

conversation.sh -u http://myserver -k mykey -m model

Adding additional features is trivial.

Example: Structured Output

Using the schema feature you can pass json in to enforce a schema. Try something like

$ llcat -u http://localhost:11434 -sc @examples/schema.json "give me a person"

Example: Evals

Running the same thing on multiple models and assessing the outcome is straight forward. Here we're using ollama

pre="llcat -u http://localhost:11434"
for model in $($pre -m); do
   $pre -m $model "translate 国際化がサポートされています。to english" > ${model}.outcome
done

You can use patterns like that also for testing tool calling completion. Here's a bigger example: a humor eval to see if models know a funny joke when they see one

If an error happens contacting the server, you get the request, response, and a non-zero exit.

Try this to see what that looks like

uvx llcat -u fakecomputer

Example: Tool calling

The examples directory contains this music playing tool listing the contents of this album:

$ llcat -u http://127.1:8080 -tf tool_file.json -tp tool_program.py "what mp3s do i have in my ~/mp3 directory"
{"level": "debug", "class": "toolcall", "message": "request", "obj": {"id": "iwCGjcRic8GAFB2jUvBUOeF9NNrldfxz", "type": "function", "function": {"name": "list_mp3s", "arguments": {"path":"~/mp3"}}}}
{"level": "debug", "class": "toolcall", "message": "result", "obj": ["Elektrobopacek - Towards the final Battle.mp3", "Elektrobopacek - Escape the Labyrinth.mp3", "Elektrobopacek - Journey to the misty Lands.mp3", "Elektrobopacek - Mistral Forte.mp3", "Elektrobopacek - Leaving Spaceport X-19.mp3", "Elektrobopacek - Dracula Rising.mp3"]}
Here are the MP3 files in your `~/mp3` directory:

1. **Elektrobopacek - Towards the final Battle.mp3**
2. **Elektrobopacek - Escape the Labyrinth.mp3**
3. **Elektrobopacek - Journey to the misty Lands.mp3**
4. **Elektrobopacek - Mistral Forte.mp3**
5. **Elektrobopacek - Leaving Spaceport X-19.mp3**
6. **Elektrobopacek - Dracula Rising.mp3**

Would you like to play any of these? Just share the filename, and I can play it for you! 🎵

In this example you can see how nothing is hidden so if the model makes a mistake it is immediately identifiable.

The debug JSON objects are sent to stderr so routing it separately is trivial.

MCP

MCPFile

This file is what you usually need to make for an mcp server definition:

{
  "mcpServers": {
    "<some_server>": {
      "command": "<some_command>",
      "args": ["<some>", "<args>"]
    }
    ...
  }
}

There's a basic extension on MCP here. You can explicity disable an MCP server by adding a flag "disabled": true like so:

{
  "mcpServers": {
    "<some_server>": {
      "command": "<some_command>",
      "disabled": true,
      "args": ["<some>", "<args>"]
    }
    ...
  }
}

MCPCat

MCP can be simple with simple tools. There's one included here. mcpcat is a 22 line Bash script.

Here is an example of it in use:

$ mcpcat init list | \
  uv run python -m my-server | \
  jq .

Let's say there's a calculator mcp, you can do something like

$ mcpcat init call calculate '{"expression":"2+2"}' | \
   uv run python -m mcp_server_calculator \
   jq .

The beauty here is you can see the Emperor's new clothes up close. Simply omit the pipe.

$ mcpcat init call calculate '{"expression":"2+2"}'
{"jsonrpc":"2.0","id":4,"method":"initialize","params":{"protocolVersion":"2024-11-05","capabilities":{},"clientInfo":{"name":"mcpcat","version":"1.0"}}}
{"jsonrpc":"2.0","method":"notifications/initialized"}
{"jsonrpc":"2.0","id":3,"method":"tools/call","params":{"name":"calculate","arguments":{"expression":"2+2"}}}

That's all the STDIO Transport is.

There's ways of doing the network transports with this script as well. All you need is the appropriate network tools and compose away.

Usage

Now it's your turn.

usage: llcat [-h] [-su SERVER_URL] [-sk [@]SERVERKEY] [-to TIMEOUT]
             [-pr PROTO] [-m [MODEL]] [-s [@]SYSTEM] [-a ATTACH]
             [-c CONVERSATION] [-cr CONVERSATIONRO] [-eb [@]EXTRABODY]
             [-sc [@]SCHEMA] [-mf MCP_FILE] [-tp TOOL_PROGRAM]
             [-tf TOOL_FILE] [-ps] [-bq BE_QUIET] [-nt] [-ns] [-nw]
             [--curlify] [--dry] [--version] [--info [INFO]]
             [user_prompt ...]

llcat is /usr/bin/cat for LLMs. 

        🐱 Me-wow! 

https://github.com/day50-dev/llcat

Options with a [@] prefix can either be strings or paths to a file, curl style, @/like/this.

positional arguments:
  user_prompt           your prompt

options:
  -h, --help            show this help message and exit
  -su, -u, --server_url SERVER_URL
                        server URL (e.g., http://::1:8080). Also supports MAS
                        format
  -sk, -k, --server_key [@]SERVERKEY
                        server API key for authorization
  -to, --timeout TIMEOUT
                        timeout in seconds for the read
  -pr, --proto PROTO    protocol to use (ollama, llama.cpp, openai, auto)
  -m, --model [MODEL]   model to use (or list models if no value)
  -s, --system [@]SYSTEM
                        system prompt
  -a, --attach ATTACH   attach file(s)
  -c, --conversation CONVERSATION
                        conversation history file (r/w)
  -cr, --conversationro CONVERSATIONRO
                        the readonly conversation input (ro)
  -eb, --extra_body [@]EXTRABODY
                        JSON to add to the body, such as max_tokens or
                        temperature
  -sc, --schema [@]SCHEMA
                        set a schema to force structured output
  -mf, --mcp_file MCP_FILE
                        MCP file to use
  -tp, --tool_program TOOL_PROGRAM
                        program to execute tool calls
  -tf, --tool_file TOOL_FILE
                        JSON file with tool definitions
  -ps, --ps             currently running model (if supported)
  -bq, --be_quiet BE_QUIET
                        make it shutup about things
  -nt, --no_think       disable thinking
  -ns, --no_stream      disable streaming
  -nw, --no_wrap        do not wrap inputs in <xml-like-syntax>
  --curlify             write curl equivalents of calls to stdout
  --dry                 dry run
  --version             show program's version number and exit
  --info [INFO]         get the info for a model

We're excited to see what you build.

Brought to you by DA`/50: Make the future obvious.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.16.0

Jun 27, 2026

0.15.2

Jun 26, 2026

0.15.1

Jun 25, 2026

This version

0.15.0

Jun 15, 2026

0.14.11

May 31, 2026

0.14.9

May 29, 2026

0.14.8

May 28, 2026

0.14.7

May 26, 2026

0.14.6

May 16, 2026

0.14.5

May 16, 2026

0.14.4

May 2, 2026

0.14.3

Apr 24, 2026

0.14.2

Apr 24, 2026

0.14.1

Apr 13, 2026

0.14.0

Apr 10, 2026

0.13.19

Mar 26, 2026

0.13.18

Mar 22, 2026

0.13.17

Mar 14, 2026

0.13.16

Mar 9, 2026

0.13.15

Mar 8, 2026

0.13.14

Mar 2, 2026

0.13.13

Mar 2, 2026

0.13.12

Mar 1, 2026

0.13.11

Mar 1, 2026

0.13.10

Feb 17, 2026

0.13.9

Feb 15, 2026

0.13.8

Feb 15, 2026

0.13.7

Feb 14, 2026

0.13.6

Feb 9, 2026

0.13.5

Feb 6, 2026

0.13.4

Feb 5, 2026

0.13.3

Feb 4, 2026

0.13.2

Feb 4, 2026

0.13.1

Feb 4, 2026

0.13.0

Feb 4, 2026

0.12.5

Feb 2, 2026

0.12.4

Feb 2, 2026

0.12.3

Feb 2, 2026

0.12.2

Feb 1, 2026

0.12.1

Feb 1, 2026

0.12.0

Jan 31, 2026

0.11.5

Jan 27, 2026

0.11.4

Jan 25, 2026

0.11.3

Jan 22, 2026

0.11.2

Jan 22, 2026

0.11.1

Jan 20, 2026

0.11.0

Jan 20, 2026

0.10.1

Jan 20, 2026

0.10.0

Jan 19, 2026

0.9.7

Jan 18, 2026

0.9.6

Jan 18, 2026

0.9.5

Jan 17, 2026

0.9.4

Jan 13, 2026

0.9.3

Jan 12, 2026

0.9.2

Jan 11, 2026

0.9.1

Jan 11, 2026

0.9

Jan 11, 2026

0.8.3

Jan 11, 2026

0.8.2

Jan 11, 2026

0.8.1

Jan 11, 2026

0.8

Jan 11, 2026

0.7

Jan 10, 2026

0.6.2

Jan 10, 2026

0.6.1

Jan 9, 2026

0.6.0

Jan 9, 2026

0.5.0

Jan 9, 2026

0.4.0

Jan 9, 2026

0.3.0

Jan 9, 2026

0.2.0

Jan 9, 2026

0.1.0

Jan 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llcat-0.15.0.tar.gz (15.8 kB view details)

Uploaded Jun 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llcat-0.15.0-py3-none-any.whl (16.0 kB view details)

Uploaded Jun 15, 2026 Python 3

File details

Details for the file llcat-0.15.0.tar.gz.

File metadata

Download URL: llcat-0.15.0.tar.gz
Upload date: Jun 15, 2026
Size: 15.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llcat-0.15.0.tar.gz
Algorithm	Hash digest
SHA256	`215542d5a6cb92ce38a4679bb8864d3251747351c6cde55179d0d110f855cc75`
MD5	`a226a3235ae8798a3f6593701a9887c9`
BLAKE2b-256	`7926ce644f70197d72615422e758aeb069553e55d2afc1db712fc3a8419ea3c3`

See more details on using hashes here.

File details

Details for the file llcat-0.15.0-py3-none-any.whl.

File metadata

Download URL: llcat-0.15.0-py3-none-any.whl
Upload date: Jun 15, 2026
Size: 16.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for llcat-0.15.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4057d45c9b429ed0b04085b917a4ee7c75d5adaf065c5981ead2decedf6083c1`
MD5	`69fc4ba6b7058d95c2e28aba736c7b61`
BLAKE2b-256	`1c3c41b121068be243ddb41b2cfb97e2b525bdad48b6c81e7db8d9d07019e4e2`

See more details on using hashes here.

llcat 0.15.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Very Quick Start

Examples

Example: Transferrable Conversations

Example: Adding State

Example: Interactive Chat

Example: Structured Output

Example: Evals

Example: Tool calling

MCP

MCPFile

MCPCat

Usage

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes