Skip to main content

Generate agent-ready VOICE.md packs and CI benchmarks for website copy.

Project description

site2voice

Turn any website into agent-ready copy context. Then test whether the agent followed it.

PyPI CI License: MIT

site2voice terminal demo

pipx install site2voice

site2voice init https://example.com
site2voice bench https://example.com draft.md --strict

No LLM dependency. No browser dependency. Just a small Python CLI that extracts measurable voice signals and turns them into files coding agents can use.

Why

Agents are getting better at building interfaces, but they still guess the words: headings, CTAs, navigation labels, claim boundaries, and product tone.

DESIGN.md gives agents visual taste. site2voice gives them copy taste.

Voice Packs

The repo includes downloadable voice packs for well-known product and editorial sites: Apple, Stripe, Linear, Vercel, Notion, Figma, Shopify, GitHub, OpenAI, Anthropic, Hypebeast, Highsnobiety, Monocle, Wallpaper, and EYESMAG.

curl -L https://raw.githubusercontent.com/SihyeonJeon/site2voice/main/packs/stripe/VOICE.md -o VOICE.md

See packs.

What You Get

.site2voice/
  VOICE.md          human-readable writing brief
  voice.json        machine-readable voice profile
  agent-prompt.md   drop-in instruction for Codex, Claude Code, Cursor, etc.

The profile includes:

  • heading shape, sentence rhythm, paragraph length, CTA shape;
  • repeated vocabulary and CTA verbs;
  • deterministic tone labels;
  • unsupported-claim detection;
  • copied-span detection.

Benchmark

site2voice bench scores candidate copy against a source voice profile and can fail CI when the copy drifts.

site2voice bench examples/editorial-home.html examples/after-copy.md --strict

Example fixture:

Candidate Result Overall Lexicon Copy safety
after-copy PASS 83.8 70.0 93.2
before-copy FAIL 36.6 0.0 100.0

The score rewards measurable voice alignment without rewarding verbatim copying.

Commands

# Generate one Markdown brief
site2voice https://example.com --out VOICE.md

# Generate an agent-ready context pack
site2voice init https://example.com --dir .site2voice

# Write JSON for custom pipelines
site2voice https://example.com --format json --out voice.json

# Gate candidate copy in CI
site2voice bench https://example.com draft.md \
  --fail-under 75 \
  --min-copy-safety 85 \
  --min-claim-safety 75

What It Is Not

  • Not an official brand guideline.
  • Not a DESIGN.md visual-token extractor.
  • Not a crawler for private pages or authenticated apps.
  • Not a tool for copying another site's prose.

Develop

python3 -m pip install -e .
make test
make bench-ci

Docs

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

site2voice-0.4.0.tar.gz (48.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

site2voice-0.4.0-py3-none-any.whl (15.0 kB view details)

Uploaded Python 3

File details

Details for the file site2voice-0.4.0.tar.gz.

File metadata

  • Download URL: site2voice-0.4.0.tar.gz
  • Upload date:
  • Size: 48.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for site2voice-0.4.0.tar.gz
Algorithm Hash digest
SHA256 adf78fce18aa43e8c7b9ecffebf43be13aef053f117f743c40c373d38457066f
MD5 cf621c0a0a554b14b6d8d0ee102e56a6
BLAKE2b-256 259bfde22b55f039c5459a04e788b399a4818b6b5b4c8e49f47e6777bc17d6b0

See more details on using hashes here.

Provenance

The following attestation bundles were made for site2voice-0.4.0.tar.gz:

Publisher: publish.yml on SihyeonJeon/site2voice

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file site2voice-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: site2voice-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 15.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for site2voice-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 7c1e6022160fef67380aa2b75fad20b2da7e078b781992d3303d2cb0cc8c9b78
MD5 b2d0b414b97c7805ca8533fc50a2b985
BLAKE2b-256 0da031985b17e17a0378477c04c205c7305d326efdf4ef0da13c638e3764f941

See more details on using hashes here.

Provenance

The following attestation bundles were made for site2voice-0.4.0-py3-none-any.whl:

Publisher: publish.yml on SihyeonJeon/site2voice

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page