Skip to main content

/usr/bin/cat for LLMs

Project description

/usr/bin/cat for LLMs

llcat is like cURL or cat for LLMs: a stateless, low-level, composable tool for scripting and glue.

Conversations, keys, servers and other configurations are intentionally left unmanaged.

They are specified using classic UNIX patterns. Nothing is hidden. There's no magic. it's transparent and explicit.

llcat

Very Quick Start

Got 0.3 seconds to spare?

List the models on OpenRouter:

uvx llcat -s https://openrouter.ai/api -m


llcat can:

  • Use local or remote servers, authenticated or not.
  • Store conversation history optionally, as a boring JSON file.
  • Pipe things from stdin and/or be prompted on the command line.
  • Do tool calling using the OpenAI spec that's also MCP compatible. (Example below)
  • List and choose models, system prompts, and add attachments.

Example: Transferrable conversations

Because conversations, models and servers are decoupled, you can easily mix and match them at any time.

Here's one conversation, hopping across models and servers.

Start a chat with Deepseek:

$ llcat -s https://openrouter.ai/api \
        -m deepseek/deepseek-r1-0528:free \
        -c /tmp/convo.txt \
        -k $(cat openrouter.key) \
        "What is the capital of France?"

Continue it with Qwen:

$ llcat -s https://openrouter.ai/api \
        -m qwen/qwen3-4b:free \
        -c /tmp/convo.txt \
        -k $(cat openrouter.key) \
        "And what about Canada?"

And finish on the local network:

$ llcat -s http://192.168.1.21:8080 \
        -c /tmp/convo.txt \
        "And what about Japan?"

Since the conversation goes to the filesystem as easily parsable JSON you can use things like inotify or fuse and push it off to a vector search backend or even modify the context window between calls simply by editing the file

Example: Adding State

Explicit specifica simple lots of things are within reach.

For instance, simple wrappers can be made custom to your workflow.

Here's one way you could store state with environment variables to make invocation more convenient:

llc()        { llcat -m "$LLC_MODEL" -s "$LLC_SERVER" -k "$LLC_KEY" "$@" }
llc-model()  { LLC_MODEL=$(llcat -m  -s "$LLC_SERVER" -k "$LLC_KEY" | fzf) }
llc-server() { LLC_SERVER=$1 }
llc-key()    { LLC_KEY=$1 }

And now you can do things like this:

$ llc-server http://192.168.1.21:8080
$ llc "write a diss track where the knapsack problem hates on the towers of hanoi"

There's no configuration files to parse or implicit states.

Example: Interactive Chat

A conversation interface is also quite quick:

#!/usr/bin/env bash
conv=${CONV:-$(mktemp)}
echo -e "  Using: $conv\n"
jq -r '.[] | "\n**\(.role)**: \(.content)"' $conv | sd
while read -E -p "  >> " query; do
    llcat -c $conv "$@" "$query" |& sd
    echo
done
2026-01-09_07-35

Example: Evals

This just means running the same thing on multiple models and assessing the outcome. Here's a very simple pattern, this time using ollama

pre="llcat -s http://localhost:11434"
for model in $($pre -m); do
   $pre -m $model "translate 国際化がサポートされています。to english" > ${model}.outcome
done

You can use patterns like that also for testing tool calling completion.

If an error happens contacting the server, you get the request, response, and exits non-zero.

Example: Tool calling

This example, a very strange way to play mp3s, uses a 21 line tool_program.py included in this repository.

tc

In this example you can see how nothing is hidden so when the LLM made the mistake it was immediately identifiable.

That meta information goes to stderr.

llcat's tool calling is also MCP compatible.

Usage

Now it's your turn.

usage: llcat  [-h] [-c CONVERSATION] [-m [MODEL]] [-k KEY] [-s SERVER]
              [-p PROMPT] [-tf TOOL_FILE] [-tp TOOL_PROGRAM] [-a ATTACH]
              [user_prompt ...]

positional arguments:
  user_prompt           Your prompt

options:
  -h, --help            show this help message and exit
  -c, --conversation CONVERSATION
                        Conversation history file
  -m, --model [MODEL]   Model to use (or list models if no value)
  -k, --key KEY         API key for authorization
  -s, --server SERVER   Server URL (e.g., http://::1:8080)
  -p, --prompt PROMPT   System prompt
  -tf, --tool_file TOOL_FILE
                        JSON file with tool definitions
  -tp, --tool_program TOOL_PROGRAM
                        Program to execute tool calls
  -a, --attach ATTACH   Attach file(s)

We're excited to see what you build.


Brought to you by DA`/50: Make the future obvious.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llcat-0.8.1.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

llcat-0.8.1-py3-none-any.whl (6.8 kB view details)

Uploaded Python 3

File details

Details for the file llcat-0.8.1.tar.gz.

File metadata

  • Download URL: llcat-0.8.1.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.11

File hashes

Hashes for llcat-0.8.1.tar.gz
Algorithm Hash digest
SHA256 7386ecb95d76da9e575e0d1fc957adfd53ad82bdcc053982694dcc23501a1586
MD5 bcc9540cf2cc6a588d85d77cf77572ff
BLAKE2b-256 29a77ead9a29abe42b4a2b8558f716270c26301ad1640d32cb88808bd7cf575d

See more details on using hashes here.

File details

Details for the file llcat-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: llcat-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 6.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.11

File hashes

Hashes for llcat-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 5495f530b1daa8bba5a52b4d2e0d15fa133a254d81b6c8ca0f945ad4e28d6921
MD5 fccbbf1b4bfb24c26911e946d3b6a8d8
BLAKE2b-256 99625beef12a717f5f597d8e49d43f7de464f950ee4eeb85df071b96dbb0eb2a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page