Unified AI-first web_research and read-only web primitive kit surfaces for WeaveRT product kits.
Project description
Web Research Common Kit
Canonical import root: weavert_kit_common_web_research
What this package owns
- the unified public
web_researchentrypoint for read-only public-web information retrieval - low-level
web_search, single-pageweb_fetch, andweb_findprimitives backed byweavert-web-research - a package-owned goal-driven research loop behind
web_researchthat plans queries, selects pages, evaluates evidence coverage, and stops with explicit reasons - the package-owned
web-searcherdelegated worker reserved for bounded implementation-period fallback paths - first-party research profiles:
general,coding,business,academic,legal_compliance, andproduct_shopping - common result envelopes with sources, evidence, conflicts, gaps, freshness, provider metadata, research trace, and profile facets
Canonical names
- package root:
packages/product-kits/common/web-research - install name:
weavert-kit-common-web-research - import root:
weavert_kit_common_web_research - runtime activation:
weavert-shared-web-research
Boundary
Use web_research for goal-driven AI-first research and pass profile="coding" or another supported profile when the scenario needs profile-specific source ranking or facets. web_research is the supported path for multi-page source discovery and inspection: it derives bounded queries from the objective, ranks candidate pages, inspects ledger-verified sources, reports gaps or conflicts, and exposes loop decisions in research_trace and trace_summary. Use low-level web_fetch only for one explicit page at a time; callers that need manual multi-page inspection should issue repeated single-page fetches.
This package is read-only. Browser navigation, clicks, form filling, authenticated browsing, and DOM interaction remain browser-bridge responsibilities.
Search Provider Selection
Public tool names stay stable: callers continue to use web_research, web_search, web_fetch, and web_find. Search provider selection is handled by the shared weavert-web-research core.
google-search: setGOOGLE_SEARCH_API_KEYandGOOGLE_SEARCH_CX; optionally setWEAVERT_WEB_SEARCH_PROVIDER=google-search.brave-search: setBRAVE_SEARCH_API_KEYorWEAVERT_BRAVE_SEARCH_API_KEY; optionally setWEAVERT_WEB_SEARCH_PROVIDER=brave-search.bing-grounding: setFOUNDRY_PROJECT_ENDPOINT,FOUNDRY_MODEL_DEPLOYMENT_NAME,BING_PROJECT_CONNECTION_ID, andAGENT_TOKEN; optionally setWEAVERT_WEB_SEARCH_PROVIDER=bing-grounding.duckduckgo-html: no-credential fallback. It does not expose a stable freshness filter through this adapter.
Bing grounding uses Azure AI Foundry Responses API bing_grounding and normalizes stable public URL citations into the shared result shape. It is not the retired Bing Search API v7 endpoint. Google and Brave map domain constraints into provider query operators where supported, while Bing grounding and DuckDuckGo report those controls as framework-filtered. The shared core still revalidates accepted result URLs against allowed domains, blocked domains, and public-host policy. Freshness semantics are provider-specific: Google uses approximate dateRestrict, Brave uses its freshness parameter, Bing grounding maps supported 1/7/30 day freshness windows, and DuckDuckGo reports freshness as unsupported.
Research Profiles and Quality Signals
web_research applies profile strategy before inspecting pages. Coding prioritizes official documentation, release notes, changelogs, source repositories, and issue trackers, with facets for API names, versions, compatibility notes, and breaking changes. Legal compliance prioritizes statutes, regulations, standards, and official guidance, and preserves jurisdiction, authority, freshness, and effective-date gaps. Business research favors company sources, filings, announcements, credible news, reviews, competitors, timelines, comparison axes, and market claims. Academic research favors papers, publishers, institutions, preprints, methods, experiments, conclusions, and citation metadata. Product shopping favors official specs, current prices, reviews, alternatives, comparison axes, and purchase-risk signals.
Candidate sources receive traceable quality metadata before fetch: objective relevance, profile priority, provider metadata, freshness signals, preferred or allowed domains, duplicate clusters, and deterministic tie-breaking by domain and URL. After inspection, ledger evidence keeps source class and quality metadata so callers and tests can explain why a source was selected.
Claims, Conflicts, Gaps, and Limits
Claim annotations are accepted only when they bind to an inspected ledger source, page, or evidence item. Unbound annotations are dropped and traced. Rule-derived dates, versions, prices, numbers, source-type hints, and duplicate signals appear as auxiliary_signals; they help diagnostics and facets but do not prove claim correctness.
Conflicting ledger-bound claims are projected into conflicts. Unresolved conflicts lower confidence and produce stop_reason="unresolved_conflict"; resolved conflicts keep a resolution rationale when stronger evidence is identified. Gaps describe missing preferred evidence, unsupported freshness, provider fallback, policy blocks, or partial results.
Remaining limits are explicit: this kit does not drive a browser, click through pages, authenticate, inspect local workspaces, run shell-assisted searches, or guarantee truth beyond inspected public evidence. Host-level browser bridges, local workspace search, and shell tools remain separate surfaces.
See also
../README.md../../../framework-packs/capabilities/web-research/README.md
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file weavert_kit_common_web_research-0.1.0.tar.gz.
File metadata
- Download URL: weavert_kit_common_web_research-0.1.0.tar.gz
- Upload date:
- Size: 28.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5b98612931d8dbc7dff18a809cf2cbc517e4bde053f645348c675eb2ae876a4a
|
|
| MD5 |
65e130cafb679e6a379bfe4b19507266
|
|
| BLAKE2b-256 |
67c2027e82ebde3497b74795754fa3f3270a6e9e4289a585eda735f486eabaf1
|
Provenance
The following attestation bundles were made for weavert_kit_common_web_research-0.1.0.tar.gz:
Publisher:
publish-public-packages.yml on xyz2b/weave-ai-runtime
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
weavert_kit_common_web_research-0.1.0.tar.gz -
Subject digest:
5b98612931d8dbc7dff18a809cf2cbc517e4bde053f645348c675eb2ae876a4a - Sigstore transparency entry: 1563143165
- Sigstore integration time:
-
Permalink:
xyz2b/weave-ai-runtime@0f39e07008e531fb0b60175824f22019483b909b -
Branch / Tag:
refs/heads/main - Owner: https://github.com/xyz2b
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-public-packages.yml@0f39e07008e531fb0b60175824f22019483b909b -
Trigger Event:
workflow_dispatch
-
Statement type:
File details
Details for the file weavert_kit_common_web_research-0.1.0-py3-none-any.whl.
File metadata
- Download URL: weavert_kit_common_web_research-0.1.0-py3-none-any.whl
- Upload date:
- Size: 27.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2ae1b2cdd2e2d31dd00c350b9f30af60da40589beb078fb415ae93cefd2f3fbe
|
|
| MD5 |
5ed506fcd5f8900366b057041fc0c9a0
|
|
| BLAKE2b-256 |
5d09ae0ee24f322a4c87dbdd074f24cf8f826c897f1c6347783872eba5d4a995
|
Provenance
The following attestation bundles were made for weavert_kit_common_web_research-0.1.0-py3-none-any.whl:
Publisher:
publish-public-packages.yml on xyz2b/weave-ai-runtime
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
weavert_kit_common_web_research-0.1.0-py3-none-any.whl -
Subject digest:
2ae1b2cdd2e2d31dd00c350b9f30af60da40589beb078fb415ae93cefd2f3fbe - Sigstore transparency entry: 1563143210
- Sigstore integration time:
-
Permalink:
xyz2b/weave-ai-runtime@0f39e07008e531fb0b60175824f22019483b909b -
Branch / Tag:
refs/heads/main - Owner: https://github.com/xyz2b
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-public-packages.yml@0f39e07008e531fb0b60175824f22019483b909b -
Trigger Event:
workflow_dispatch
-
Statement type: