Making agents cuter
Project description
llmoji
Llmoji is a small CLI that makes your agents cuter. (´-ω-`)
Llmoji configures your agent to start each message with a kaomoji. It locally saves them, and provides optional tools to summarize and upload the aggregated meaning per face to contribute to a shared database.
The companion research repo llmoji-study is where this data is processed.
There are three main commands:
llmoji install <provider>: writes hooks to prompt for and record kaomojillmoji analyze: scrape and aggregate your logsllmoji upload --target {hf,email}: ship the bundle (HF: loose files; email: tarball)
analyze needs a synthesis backend. The default uses Anthropic Haiku and reads $ANTHROPIC_API_KEY; --backend openai uses GPT-5.4 mini and reads $OPENAI_API_KEY; --backend local runs against any OpenAI-compatible endpoint (Ollama, vLLM, etc.) and needs --base-url and --model. upload --target hf needs $HF_TOKEN. The email path tarballs the bundle and has you attach it manually.
What this is for
The shared HuggingFace dataset at a9lim/llmoji collects kaomoji counts and a single summarized description per face per source model, across many users' coding agents. The companion repo processes those descriptions. After you run analyze, you can inspect the files yourself under ~/.llmoji/bundle/ before you choose to upload.
Quick start
pip install llmoji
llmoji install claude_code # or: codex, hermes
From now on, your agent will use kaomoji at the start of each message.
After letting it run for a week or so:
export ANTHROPIC_API_KEY=...
llmoji status # check what's been logged
llmoji analyze # scrape + canonicalize + summarize
llmoji upload --target hf # commit to a9lim/llmoji
# or:
llmoji upload --target email # opens mailto:
You can pick a different synthesis backend:
export OPENAI_API_KEY=...
llmoji analyze --backend openai # GPT-5.4 mini via the Responses API
# or:
llmoji analyze --backend local \ # any OpenAI-compatible endpoint
--base-url http://localhost:11434/v1 \
--model llama3.1
analyze caches per-instance descriptions at ~/.llmoji/cache/per_instance.jsonl keyed by content hash plus synthesis model id. llmoji cache clear wipes it.
Install
pip install llmoji
This requires Python 3.11+. The runtime dependency footprint is three packages: anthropic, openai, and huggingface_hub. Hooks run in bash and need jq.
From source:
git clone https://github.com/a9lim/llmoji
cd llmoji
pip install -e ".[dev]" # adds pytest + ruff
How it works
Journal capture
Llmoji first registers a UserPromptSubmit hook that injects a reminder on every turn, asking the model to begin its reply with a kaomoji. It then registers a Stop hook that fires once per assistant turn, that extracts the reply, strips the kaomoji from the body, and appends one JSONL row to ~/.<harness>/kaomoji-journal.jsonl. The schema is the same across every provider:
{"ts": "...", "model": "...", "cwd": "...", "kaomoji": "(◕‿◕)", "user_text": "...", "assistant_text": "..."}
Synthesis pipeline
llmoji analyze scrapes every installed provider's journal plus any extra JSONL files under ~/.llmoji/journals/. For each entry a model wrote, the chosen synthesizer describes that specific instance. Then, it aggregates the descriptions for each unique kaomoji per model and writes an overall meaning. This summarized output is the only thing that ships in the bundle.
The synthesizer is one of three backends, chosen via --backend. The same synthesizer evaluates everything in a single analyze run, so the descriptions across source models are directly comparable.
| Backend | API | Default model |
|---|---|---|
anthropic |
Anthropic SDK, messages.create |
claude-haiku-4-5-20251001 |
openai |
OpenAI SDK, Responses API | gpt-5.4-mini-2026-03-17 |
local |
OpenAI-compatible Chat Completions endpoint | (set via --model) |
Bundle structure
analyze writes to ~/.llmoji/bundle/:
~/.llmoji/bundle/
manifest.json
claude-sonnet-4-5-20250929.jsonl
claude-haiku-4-5-20251001.jsonl
gpt-5.4-mini-2026-03-17.jsonl
manifest.json: package version, the synthesis backend and model id used, a salted submitter id, generation timestamp, list of providers seen, per-source-model row counts, total synthesized rows, and anything you include as--notes.<source-model>.jsonl: one row per kaomoji as that model used it, with the synthesized meaning. The filename stem is the sanitized model id (lowercase, slashes become double-underscores, colons become hyphens).
Privacy
| Tier | Where | Shipped on upload? |
|---|---|---|
| Raw user and assistant text | ~/.<harness>/kaomoji-journal.jsonl |
Never |
| Per-instance synthesizer paraphrase | ~/.llmoji/cache/per_instance.jsonl |
Never |
| Synthesized summaries and counts per model | ~/.llmoji/bundle/ |
Yes |
Please see SECURITY.md for the full privacy model.
Providers
llmoji install <provider> writes the hook script and registers it with the harness's settings file, idempotently.
| Provider | Hook events | Settings format | Notes |
|---|---|---|---|
claude_code |
Stop, UserPromptSubmit | JSON | Stable, in daily use. |
codex |
Stop, UserPromptSubmit | JSON | Stable, in daily use. |
hermes |
post_llm_call, pre_llm_call | YAML | Subagent traffic is not currently filtered (no child id on the upstream payload). |
install does not clobber existing config. llmoji uninstall <provider> removes the hooks and the settings entry. Journals and the per-instance cache are preserved; wipe those with llmoji cache clear.
Static dumps
To pull kaomoji out of a Claude.ai or ChatGPT data export:
llmoji parse --provider claude.ai ~/Downloads/data-...-batch-0000
llmoji parse --provider chatgpt ~/Downloads/chatgpt-export
Both exports happen to ship a file named conversations.json, with different schemas under the same filename; the parsers handle each. Rows land at ~/.llmoji/journals/claude_ai_export.jsonl or ~/.llmoji/journals/chatgpt_export.jsonl, and llmoji analyze picks them up alongside the live provider journals. The ChatGPT reader walks the message tree from current_node along the active branch only, so abandoned regenerations stay out of the corpus.
For Claude Code, Codex, or Hermes history that predates installing the live hook, the historical transcripts (~/.claude/projects/**/*.jsonl, ~/.codex/sessions/**/rollout-*.jsonl, ~/.hermes/sessions/session_*.json) can be replayed into the journals via the llmoji.backfill module.
Custom harness
For harnesses we don't ship a first-class adapter for (notably OpenClaw):
- Append one row per kaomoji-bearing assistant turn to
~/.llmoji/journals/<harness>.jsonl. - Use the canonical six-field schema:
{ts, model, cwd, kaomoji, user_text, assistant_text}. - Strip the leading kaomoji from
assistant_texton the way in (the prefix lives in thekaomojifield). - Validate the prefix the same way the package does:
llmoji.taxonomy.is_kaomoji_candidate(prefix).
llmoji analyze picks up everything under ~/.llmoji/journals/ automatically. Please see examples/openclaw_hook.ts for a worked example.
The Python module llmoji.taxonomy is the single source of truth for the validator and the leading-glyph set; rendered bash hooks (under llmoji._hooks/) read from it at install time. If you're porting the validator to another language for a harness like OpenClaw, please mirror the rules in is_kaomoji_candidate faithfully. Bumping any of them is a cross-corpus invariant change on the package side and your port needs to follow.
Tests
pytest tests/ # everything
pytest tests/test_canonicalize.py # rule-by-rule regression for canonicalize_kaomoji and extract
pytest tests/test_public_surface.py # locks the cross-corpus invariant contract
The full suite runs anywhere. CI runs ruff check . and pytest on every PR.
The public-surface test exercises taxonomy invariants, synth-prompt content checks, the synthesizer factory dispatch, provider rendering plus bash -n validation of every hook template, the bundle allowlist, the corrupt-config refusal paths, and the unified mask_kaomoji prepend contract. The canonicalize tests run rule-by-rule.
Prior art
Llmoji replicates and scales eriskii's Claude-faces catalog, the original post that came up with the idea of prompting and tracking Claude's kaomoji use. The shared HuggingFace dataset extends that pipeline across many users, many harnesses, and many model releases.
Contributing and security
Please see CONTRIBUTING.md for dev setup. For security and privacy, please see SECURITY.md.
License
GPL-3.0-or-later. See LICENSE. The companion research repo llmoji-study is CC-BY-SA-4.0. The shared corpus on HuggingFace is also CC-BY-SA-4.0; running llmoji upload --target hf contributes a bundle under those terms.
If you use llmoji or the central corpus in published research, please cite this repository.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llmoji-1.1.0.tar.gz.
File metadata
- Download URL: llmoji-1.1.0.tar.gz
- Upload date:
- Size: 121.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f3546a6dfba6e3e43853287509b6b48b4c76c4b44e5e3bc39a5349d1f9341241
|
|
| MD5 |
a572d71c8352180e1635ab21ae3b1848
|
|
| BLAKE2b-256 |
55d0dedc1c1fa5e5d30744b64556f07ede5164422fa3e5e7e1a031067a4d59f9
|
Provenance
The following attestation bundles were made for llmoji-1.1.0.tar.gz:
Publisher:
release.yml on a9lim/llmoji
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
llmoji-1.1.0.tar.gz -
Subject digest:
f3546a6dfba6e3e43853287509b6b48b4c76c4b44e5e3bc39a5349d1f9341241 - Sigstore transparency entry: 1397746626
- Sigstore integration time:
-
Permalink:
a9lim/llmoji@75c96e12c2f7e89a0c36af9676e20bf2ac5e23f1 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/a9lim
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@75c96e12c2f7e89a0c36af9676e20bf2ac5e23f1 -
Trigger Event:
push
-
Statement type:
File details
Details for the file llmoji-1.1.0-py3-none-any.whl.
File metadata
- Download URL: llmoji-1.1.0-py3-none-any.whl
- Upload date:
- Size: 96.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1cc600657d04c257c931d46355fa1dfe0d8a41cb98e9a5d56a862e72a17d7ccf
|
|
| MD5 |
9499bee6a4d3439b85f78243e7735316
|
|
| BLAKE2b-256 |
28e872316c83c49dca10b6ecdb59169e9f7144cfd09a355ed8c13a8fbe704a8f
|
Provenance
The following attestation bundles were made for llmoji-1.1.0-py3-none-any.whl:
Publisher:
release.yml on a9lim/llmoji
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
llmoji-1.1.0-py3-none-any.whl -
Subject digest:
1cc600657d04c257c931d46355fa1dfe0d8a41cb98e9a5d56a862e72a17d7ccf - Sigstore transparency entry: 1397746652
- Sigstore integration time:
-
Permalink:
a9lim/llmoji@75c96e12c2f7e89a0c36af9676e20bf2ac5e23f1 -
Branch / Tag:
refs/heads/main - Owner: https://github.com/a9lim
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@75c96e12c2f7e89a0c36af9676e20bf2ac5e23f1 -
Trigger Event:
push
-
Statement type: