Heuristic Yield Websearch - LLM-powered web search assistant

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kumoSleeping

Project description

HYW — Heuristic Yield Websearch

An LLM-powered terminal assistant that searches, cross-validates, then answers.

English · 中文

Why

LLMs have a knowledge cutoff. Ask "what happened today" and they can only guess.

HYW lets the model decide what to search, how many rounds, and how to cross-validate — then gives you the answer. Not a simple "search and paste" — it's a multi-round heuristic search loop.

Features

Multi-round autonomous search — The model breaks down questions, crafts search queries, and validates results across up to 6 iterations
XML tag tool calling — No function calling dependency; works with any LLM provider
Streaming output — Think and display in real-time; search progress visible as it happens
Pluggable tool backends — Search / page extract / render are selected by capability, not hard-coded per module
Built-in retrieval runtime — Ships with ddgs, optional jina_ddgs search rendering, Jina AI page extraction, and non-browser Markdown render
Rich terminal UI — Gradient titles, Markdown rendering, live spinners
Multi-turn conversation — Context auto-carried; toggle mode with arrow keys
Any model via LiteLLM — OpenAI / Anthropic / Google / OpenRouter / local models

Quickstart

# Default install: CLI + ddgs + Jina AI + md2png-lite render
pip install hyw

# Add entari plugin support
pip install "hyw[entari]"

# Add Entari + Noto font sync support
pip install "hyw[entari,notosans]"

# Interactive mode
hyw

# Single question
hyw -q "What's the latest in tech news?"

The hyw command is available in the default install.

Configuration

Config file: ~/.hyw/config.yml. Use /config in interactive mode to edit directly. An example based on the multi-model layout lives at config.example.yml. In interactive mode, ← / → switches models, and ↑ / ↓ toggles multi-turn vs new session. Legacy single-model fields (model / api_key / api_base) still work. You can also define named transport presets via model_provider / model_providers for OpenAI-compatible relays.

# Shared provider defaults.
# `models[*]` inherit these unless they override them.
api_key: sk-or-xxx
api_base: https://openrouter.ai/api/v1

# Optional LiteLLM transport preset.
# `requires_openai_auth: true` means "use OPENAI_API_KEY if api_key is omitted".
# model_provider: mirror
# model_providers:
#   mirror:
#     base_url: https://chat.soruxgpt.com/codex
#     wire_api: responses
#     requires_openai_auth: true
#     custom_llm_provider: openai
# models:
#   - name: codex-fast
#     model: gpt-5.4
#     model_provider: mirror
#     reasoning_effort: xhigh
#     # `fast` is normalized to `priority` for Codex mirror on wire.
#     service_tier: fast

# Main controller model used at startup / single-shot mode.
# You can set this to either a profile `name` or a raw model id.
active_model: gemini-lite

models:
  - name: gemini-lite
    model: openrouter/google/gemini-3.1-flash-lite-preview
  - name: kimi-k2.5
    model: openrouter/moonshotai/kimi-k2.5
  - name: cerebras-gpt-oss
    model: cerebras/gpt-oss-120b
    api_key: csk-xxx
    api_base: https://api.cerebras.ai/v1

# Runtime options actually used by the app
language: zh-CN
# Set `false` if the upstream provider has streaming / tool-call compatibility issues.
stream: true
headless: true
# Maximum main-loop rounds. Default is 8.
max_rounds: 8
# Optional provider hint, for example Codex mirror can be configured as `fast`
# and will be normalized to the transport value it accepts on wire.
# service_tier: fast
# Per-search-provider timeout before falling back. Default is 4s.
search_handler_timeout_s: 4
# Model-call retries only; search / page_extract tools are not retried here.
model_retries: 2
model_retry_base_delay_s: 1.0
model_retry_max_delay_s: 8.0
# Custom system prompt appended to the main controller prompt.
system_prompt: ""

# Tool capability registry + default provider selection
tools:
  index:
    ddgs:
      search: core.search_ddgs:ddgs_search
    jina_ddgs:
      search: core.search_ddgs:jina_ddgs_search
    jina_ai:
      page_extract: core.search_jina_ai:jina_ai_page_extract
    md2png_lite:
      render: md2png_lite.provider:render_md2png_lite_result
  config:
    jina_ddgs:
      search:
        headers:
          Accept: text/plain
          X-Engine: browser
          X-Return-Format: markdown
    jina_ai:
      page_extract:
        headers:
          # Authorization: Bearer jina_xxx
          Accept: text/plain
  use:
    search: ddgs
    page_extract: jina_ai
    render: md2png_lite

# Legacy stage-specific model slots kept only for compatibility.
# The current main loop does not read them.
stages:
  search:
    model: ""
  fetch:
    model: ""
  summary:
    model: ""

What each block does now:

api_key / api_base: shared defaults inherited by models[*].
model_provider / model_providers: named transport presets that expand into LiteLLM fields such as api_base, custom_llm_provider, and api_key_env.
active_model: the main controller model currently selected; can match either a profile name or a raw model id.
models: switchable main-model profiles for CLI left/right model selection.
language / stream / headless / system_prompt: active runtime options used by the current flow.
max_rounds: maximum main-loop rounds; default is 8.
service_tier: optional provider hint. For Codex mirror, fast is normalized to the accepted wire value priority.
search_handler_timeout_s: per-search-provider timeout before fallback; default is 4.
model_retries / model_retry_base_delay_s / model_retry_max_delay_s: model-only retry budget and exponential backoff; tool providers are not retried here.
tools.index: capability-to-provider registry.
tools.config: per-provider extra options such as headers.
tools.use: which provider is selected by default for each capability.
stages.*: legacy stage-specific model slots kept only for old configs; the current main loop does not use them.

Runtime context carryover now keeps only Latest Round Raw:

Latest Round Raw: the previous round's full raw search/page results stay visible for the next round.

How It Works

User Question
  │
  ▼
┌─────────────────────────────────────┐
│  Main model plans the next step     │
│  Outputs <sub_agent ...> XML tags   │
└──────────────┬──────────────────────┘
               │
               ▼
┌─────────────────────────────────────┐
┌─────────────────────────────────────┐
│  Main model chooses concrete pages  │
│  page_extract returns numbered lines│
│  and the main model answers         │
└─────────────────────────────────────┘

Commands

Command	Description
`/config`	Open config file in editor
`/stats`	Show session statistics
`/exit`	Exit
`←` / `→`	Switch active model
`↑` / `↓`	Toggle Multi Turn / New Session mode

Project Structure

core/
├── config.py               # Model config + tool capability registry
├── main.py                 # Conversation loop, tool calls, LLM interaction
├── cli.py                  # Rich terminal UI, streaming output
├── __main__.py             # python -m core entry point
├── search_ddgs.py          # DDGS + jina_ddgs search providers
├── search_jina_ai.py       # Jina AI page extract provider
├── web_runtime.py          # WebToolSuite + retrieval runtime
└── render.py               # md2png-lite render dispatch

Requirements

Python ≥ 3.12
Default deps: litellm · pyyaml · loguru · rich · prompt-toolkit · ddgs · httpx · md2png-lite · Pillow
entari: arclet-alconna · arclet-entari · md2png-lite
notosans: md2png-lite[notosans]

Roadmap

Contributing

Issues and PRs welcome.

License

MIT

_{Built with curiosity and caffeine.}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

kumoSleeping

Release history Release notifications | RSS feed

This version

0.0.9

Mar 27, 2026

0.0.7

Mar 26, 2026

0.0.6

Mar 26, 2026

0.0.5

Mar 25, 2026

0.0.4

Mar 19, 2026

0.0.3

Mar 18, 2026

0.0.2

Mar 13, 2026

0.0.1

Mar 13, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hyw-0.0.9.tar.gz (366.1 kB view details)

Uploaded Mar 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

hyw-0.0.9-py3-none-any.whl (105.1 kB view details)

Uploaded Mar 27, 2026 Python 3

File details

Details for the file hyw-0.0.9.tar.gz.

File metadata

Download URL: hyw-0.0.9.tar.gz
Upload date: Mar 27, 2026
Size: 366.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for hyw-0.0.9.tar.gz
Algorithm	Hash digest
SHA256	`41e58852aed2718af7df62ad0e943df7f68a329485973c8974d95b4845811a58`
MD5	`25cf238acbac5cc19b745b84c1825de7`
BLAKE2b-256	`246f88bd9486f31db8d31a395d7a597a402b4ead2925e875514d49b62443fa42`

See more details on using hashes here.

Provenance

The following attestation bundles were made for hyw-0.0.9.tar.gz:

Publisher: workflow.yml on kumoSleeping/heuristic_yield_websearch

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: hyw-0.0.9.tar.gz
- Subject digest: 41e58852aed2718af7df62ad0e943df7f68a329485973c8974d95b4845811a58
- Sigstore transparency entry: 1188441534
- Sigstore integration time: Mar 27, 2026
Source repository:
- Permalink: kumoSleeping/heuristic_yield_websearch@7b694ffb36cffb2d1e5f08544a882fe93d5399c8
- Branch / Tag: refs/heads/main
- Owner: https://github.com/kumoSleeping
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@7b694ffb36cffb2d1e5f08544a882fe93d5399c8
- Trigger Event: push

File details

Details for the file hyw-0.0.9-py3-none-any.whl.

File metadata

Download URL: hyw-0.0.9-py3-none-any.whl
Upload date: Mar 27, 2026
Size: 105.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for hyw-0.0.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b59502ca8167d7b0d8189d1b308b81ace3ab8ab6a85e35e32e5d6667e2207ad7`
MD5	`bf13c9ea5eb4636eb31ed162d6d0d88d`
BLAKE2b-256	`6c22352380b7fd53ceae8497e97054cb2ec18c2fc7bdf7a5b22c38b203c33ae0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for hyw-0.0.9-py3-none-any.whl:

Publisher: workflow.yml on kumoSleeping/heuristic_yield_websearch

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: hyw-0.0.9-py3-none-any.whl
- Subject digest: b59502ca8167d7b0d8189d1b308b81ace3ab8ab6a85e35e32e5d6667e2207ad7
- Sigstore transparency entry: 1188441551
- Sigstore integration time: Mar 27, 2026
Source repository:
- Permalink: kumoSleeping/heuristic_yield_websearch@7b694ffb36cffb2d1e5f08544a882fe93d5399c8
- Branch / Tag: refs/heads/main
- Owner: https://github.com/kumoSleeping
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@7b694ffb36cffb2d1e5f08544a882fe93d5399c8
- Trigger Event: push

hyw 0.0.9

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

HYW — Heuristic Yield Websearch

Why

Features

Quickstart

Configuration

How It Works

Commands

Project Structure

Requirements

Roadmap

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance