ovos wikipedia skill plugin

ovos-skill-wikipedia

Wikipedia skill for OpenVoiceOS. Adds a voice interface on top of ovos-wikipedia-plugin, which handles all Wikipedia search and retrieval.

Supports two answer modes:

  • Explicit intent — handles utterances that target Wikipedia directly (e.g. "search Wikipedia for X", "what does Wikipedia say about X"). These always go to this skill.
  • Common Query — handles general knowledge questions (e.g. "what is X", "tell me about X") via the OVOS Common Query pipeline. The pipeline asks all registered knowledge skills and picks the best answer — Wikipedia competes alongside Wolfram Alpha, WordNet, and others.

In other words: if you mention Wikipedia by name, this skill answers directly. If you just ask a general question, it enters the competition and wins only if it has the most confident answer.
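The routing rule above can be illustrated with a small sketch. The trigger patterns and the `route` function are hypothetical and only mimic the behaviour described here; the skill's real intent definitions are not shown in this README.

```python
import re

# Hypothetical trigger words; an assumption for illustration, not the skill's actual intent files.
EXPLICIT_PATTERNS = [r"\bwikipedia\b", r"\bwiki\b"]


def route(utterance: str) -> str:
    """Return which answer mode an utterance would fall into."""
    text = utterance.lower()
    if any(re.search(pattern, text) for pattern in EXPLICIT_PATTERNS):
        # Wikipedia is named explicitly, so this skill answers directly.
        return "explicit_intent"
    # Otherwise the question enters the Common Query competition.
    return "common_query"
```

Under these assumptions, "Search Wikipedia for Ada Lovelace" maps to the explicit intent, while "What is a black hole?" falls through to Common Query.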


Installation

pip install ovos-skill-wikipedia

Explicit intent utterances

These always route to this skill because they name Wikipedia explicitly:

  • "Search Wikipedia for Ada Lovelace"
  • "What does Wikipedia say about beans?"
  • "Look up the Pembroke Welsh Corgi on Wikipedia"
  • "Wiki chocolate"
  • "Check wiki for Elon Musk"

Common Query utterances

These go through the Common Query pipeline; Wikipedia answers only if its response has the highest confidence:

  • "What is a black hole?"
  • "Tell me about the Roman Empire"
  • "Who was Marie Curie?"

Common Query pipeline

When the Common Query pipeline plugin is active, this skill competes against other knowledge skills (e.g. Wolfram Alpha, WordNet) to answer general questions. The pipeline selects the response with the highest confidence score.
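As a rough illustration of that selection step, the sketch below picks the candidate with the highest confidence. The `Candidate` class and `pick_best` function are hypothetical names used for illustration, not part of the OVOS Common Query API, and the real pipeline may weigh answers differently.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Candidate:
    skill: str         # name of the skill that produced the answer
    answer: str        # text that would be spoken
    confidence: float  # score reported by the skill, higher is better


def pick_best(candidates: List[Candidate]) -> Optional[Candidate]:
    """Return the highest-confidence candidate, or None if nobody answered."""
    if not candidates:
        return None
    return max(candidates, key=lambda c: c.confidence)
```

With candidates from Wikipedia, Wolfram Alpha, and WordNet, only the one with the top score is returned; an empty list means no skill could answer.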


Sub-plugins

All sub-plugins are optional. Configure them via ~/.config/mycroft/skills/ovos-skill-wikipedia.openvoiceos/settings.json:

{
  "extractive_qa": "ovos-bm25-solver",
  "reranker": "ovos-bm25-reranker",
  "keyword_extractor": "ovos-rake-keywords"
}
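If you prefer to script this change, the following sketch merges plugin choices into settings.json without discarding other keys. The `update_settings` helper is hypothetical; only the file path and the three setting names come from this README.

```python
import json
from pathlib import Path

# Path as given in this README.
DEFAULT_SETTINGS_PATH = (
    Path.home()
    / ".config/mycroft/skills/ovos-skill-wikipedia.openvoiceos/settings.json"
)


def update_settings(path: Path, **plugins: str) -> dict:
    """Merge plugin choices into the settings file and return the result."""
    settings = {}
    if path.exists():
        raw = path.read_text().strip()
        if raw:
            settings = json.loads(raw)
    settings.update(plugins)
    # Create the skill's config directory if it does not exist yet.
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(settings, indent=2))
    return settings
```

For example, `update_settings(DEFAULT_SETTINGS_PATH, reranker="ovos-bm25-reranker")` would enable only the reranker and leave any other settings as they were.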

Reranker (reranker)

Wikipedia searches often return multiple candidate pages for ambiguous topics — a search for "Mercury" might return the planet, the element, and the god. Without a reranker, results are ordered only by fuzzy title/summary match.

A reranker re-scores all candidate pages against the original query and promotes the most contextually relevant page to the top. How the score is computed depends on the plugin: the BM25 reranker shown here uses lexical matching, while neural rerankers use a cross-encoder model. This is the single biggest quality improvement for voice use because a wrong page pick means a completely irrelevant spoken answer.

{ "reranker": "ovos-bm25-reranker" }

For better accuracy at the cost of more CPU, use a neural cross-encoder like ovos-flashrank-reranker-plugin.
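To make the idea of reranking concrete, here is a minimal, self-contained BM25 sketch. It is not the code of ovos-bm25-reranker; `tokenize`, `bm25_rerank`, and the parameter defaults `k1=1.5` and `b=0.75` are assumptions chosen for illustration.

```python
import math
import re
from collections import Counter
from typing import List


def tokenize(text: str) -> List[str]:
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def bm25_rerank(query: str, docs: List[str], k1: float = 1.5, b: float = 0.75) -> List[str]:
    """Return docs ordered by BM25 score against the query, best first."""
    tokenized = [tokenize(doc) for doc in docs]
    n = len(tokenized)
    if n == 0:
        return []
    avg_len = (sum(len(tokens) for tokens in tokenized) / n) or 1.0
    # Document frequency: in how many candidates each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))
    query_terms = set(tokenize(query))
    scored = []
    for doc, tokens in zip(docs, tokenized):
        tf = Counter(tokens)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / (tf[term] + length_norm)
        scored.append((score, doc))
    # Python's sort is stable, so ties keep their original search order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored]
```

Given summaries for the planet, the element, and the god, a query such as "how far is the planet mercury from the sun" should move the planet page to the front, because terms like "planet" and "sun" appear only in that summary.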

Extractive QA (extractive_qa)

By default the skill speaks the full introductory summary of the best Wikipedia article, which can be several sentences long. An extractive QA plugin reads that summary and extracts the single passage most likely to directly answer the user's question.

Without extractive QA — "Tell me about Ada Lovelace":

"Ada Lovelace was an English mathematician and writer, chiefly known for her work on Charles Babbage's proposed mechanical general-purpose computer, the Analytical Engine. She was the first to recognise that the machine had applications beyond pure calculation..."

With extractive QA — same query:

"Ada Lovelace was the first to recognise that the machine had applications beyond pure calculation."

The shorter answer is much more natural for a voice interface.

{ "extractive_qa": "ovos-bm25-solver" }

For better precision, use a reading-comprehension model like ovos-roberta-qa-plugin.
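The extraction step can be approximated with a simple word-overlap heuristic, sketched below. This is a stand-in for illustration only: it is neither the BM25 solver nor a reading-comprehension model, and `STOPWORDS`, `content_words`, and `extract_answer` are hypothetical names.

```python
import re
from typing import Set

# Small illustrative stopword list; real plugins use their own resources.
STOPWORDS = {"a", "an", "the", "to", "of", "was", "is", "who", "what", "that", "and", "for", "her", "she"}


def content_words(text: str) -> Set[str]:
    """Lowercased tokens with common function words removed."""
    return {t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOPWORDS}


def extract_answer(question: str, summary: str) -> str:
    """Return the sentence of the summary sharing the most content words with the question."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", summary) if s.strip()]
    if not sentences:
        return ""
    wanted = content_words(question)
    # max() returns the earliest sentence on ties, which favours the article's lead sentence.
    return max(sentences, key=lambda s: len(wanted & content_words(s)))
```

A question like "Who first saw applications beyond calculation?" would select the second sentence of the Ada Lovelace summary rather than reading both.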

Keyword Extractor (keyword_extractor)

When the initial Wikipedia search returns no results (common with conversational phrasing like "tell me more about that thing on the moon"), the keyword extractor pulls the most salient terms from the query and retries the search with those cleaner keywords.

Without it, over-specific or colloquial queries silently return nothing. With it, the skill gracefully recovers and finds the right page in most cases.

{ "keyword_extractor": "ovos-rake-keywords" }
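The retry behaviour can be sketched as follows. The stopword filter is a deliberately simple stand-in for a real keyword extractor such as RAKE, and `extract_keywords`, `search_with_fallback`, and the `search` callable are hypothetical names rather than part of the skill's API.

```python
import re
from typing import Callable, List

# Illustrative filler words only; an assumption, not the plugin's actual list.
STOPWORDS = {
    "tell", "me", "more", "about", "that", "thing", "on", "the", "a", "an",
    "of", "what", "is", "who", "was", "in", "please",
}


def extract_keywords(query: str) -> List[str]:
    """Drop filler words and keep the remaining terms in order."""
    return [t for t in re.findall(r"[a-z0-9]+", query.lower()) if t not in STOPWORDS]


def search_with_fallback(query: str, search: Callable[[str], List[str]]) -> List[str]:
    """Try the raw query first; if nothing comes back, retry with keywords only."""
    results = search(query)
    if results:
        return results
    keywords = extract_keywords(query)
    if not keywords:
        return []
    return search(" ".join(keywords))
```

Here `search` is any function that takes a query string and returns a list of page titles, so the fallback logic can be tried without network access.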

License

Apache 2.0 — see LICENSE.
