Dead-simple keyword-based intent parser

These details have not been verified by PyPI

Project links

Homepage

Project description

palavreado

Keyword-based intent parser for OVOS voice assistants — the drop-in replacement for Adapt.

Palavreado matches natural-language utterances against named intents built from required and optional keyword slots. Each slot holds a list of vocabulary words; if the right words are present in the utterance, the intent fires. Optional regex and simplematch autoregex patterns enable entity extraction.

Install

pip install palavreado

Quick start

Keyword intent

from palavreado import IntentContainer, IntentCreator

container = IntentContainer()

intent = (
    IntentCreator("lights_off")
    .require("off",   ["off", "disable", "shutdown"])
    .require("light", ["light", "lights", "lamp"])
)
container.add_intent(intent)

result = container.calc_intent("turn off the lights")
print(result["name"])      # lights_off
print(result["conf"])      # 0.9438
print(result["keywords"])  # {'off': ['off'], 'light': ['light']}
print(result["utterance_remainder"])  # 'turn the'

An intent only fires when every required slot has at least one keyword match in the utterance.

Optional slots

Optional slots increase confidence when matched but do not gate the intent:

intent = (
    IntentCreator("lights_off")
    .require("off",   ["off", "disable"])
    .require("light", ["light", "lights"])
    .optionally("room", ["kitchen", "bedroom", "bathroom"])
)
container.add_intent(intent)

result = container.calc_intent("turn off the bedroom lights")
print(result["keywords"]["room"])  # ['bedroom']

Raw regex intent

rx = r'\b(at|in|for) (?P<Location>.*)'
intent = (
    IntentCreator("time_in_location")
    .require_regex("Location", rx)
    .require("time", ["time"])
)
container.add_intent(intent)

result = container.calc_intent("what time is it in London")
print(result["keywords"]["Location"])  # ['London']

Autoregex / entity extraction

Simplematch {entity} patterns are compiled to regexes automatically:

intent = (
    IntentCreator("buy")
    .require_autoregex("item", ["buy {item}", "purchase {item}", "get {item}"])
)
container.add_intent(intent)

result = container.calc_intent("buy some milk")
print(result["keywords"]["item"])  # ['some milk']

Bracket/pipe expansion is supported in all sample strings:

IntentCreator("lights_on") \
    .require("action", ["turn on", "switch on", "flick on"]) \
    .require("light",  ["(the |)(lights|light|lamp)"])

`IntentCreator` API

Method	Description
`require(name, samples)`	Required keyword slot — plain strings, bracket/pipe notation
`optionally(name, samples)`	Optional keyword slot
`require_regex(name, patterns)`	Required slot matched with a raw regex string
`optional_regex(name, patterns)`	Optional slot matched with a raw regex string
`require_autoregex(name, patterns)`	Required slot using simplematch `{entity}` patterns
`optional_autoregex(name, patterns)`	Optional slot using simplematch patterns
`build()`	Serialise to a plain `dict`

All builder methods return self for fluent chaining. The result of build() can be passed directly to IntentContainer.add_intent().

Breaking changes

add_intent raises RuntimeError on duplicate names.
Previously, registering the same intent name twice silently overwrote the first entry. Now a RuntimeError is raised so accidental double-registration is caught early.

Callers that re-register intents (e.g. on skill reload) must call remove_intent first:

container.remove_intent("my_intent")   # no-op if not present
container.add_intent(new_creator)

`IntentContainer` API

Method / property	Description
`add_intent(intent)`	Register an `IntentCreator` or built dict
`remove_intent(name)`	Unregister by name, creator, or dict
`calc_intent(query)`	Return the single best-matching result dict
`calc_intents(query)`	Yield all matching result dicts (conf > 0)
`intent_names`	List of registered intent name strings
`set_context(intent, context)`	Mark a context as active for an intent
`unset_context(intent, context)`	Remove an active context
`require_context(intent, context)`	Gate intent on context being active
`exclude_context(intent, context)`	Suppress intent when context is active
`exclude_keywords(intent, words)`	Suppress intent when any word appears in the query

Result fields

Every dict returned by calc_intent / yielded by calc_intents:

Field	Type	Description
`name`	`str \| None`	Matched intent name, or `None` on no match
`conf`	`float`	Confidence score in `[0.0, 1.0]`, rounded to 4 decimal places
`keywords`	`dict[str, list]`	Matched slot values keyed by slot name
`utterance`	`str`	The normalised query string
`utterance_remainder`	`str`	Part of the utterance not consumed by any slot

Confidence scoring

Raw confidence is built up as:

+1 / n_required per matched required slot
+0.15 / n_optional per matched optional slot
×quality multiplier per slot: 1.0 for contiguous matches, 0.8 for non-contiguous multi-word matches (e.g. "turn down" found in "turn it down")

Then adjusted by:

Remainder penalty −0.2 × (unmatched_words / query_words) — more leftover words = lower confidence
Coverage bonus +0.05 × (matched_words / query_words) — reward intents that explain more of the query
Slot bonus +0.05 × (matched_slots / total_slots) — more matched slots = stronger signal

Result is clamped to [0.0, 1.0] and rounded to 4 decimal places.

A score of 1.0 means every slot was satisfied and nothing was left over.

Normalisation

Queries and training samples are normalised at match time:

Apostrophes (all Unicode variants including ', ', ʼ, `) are replaced with a space — "it's" → "it s".
Whitespace is collapsed to a single space.
Plural/singular matching uses a language-agnostic lemmatizer that strips a trailing "s" (not "ss") so "lights" matches the training sample "light" and vice versa.

Multi-word keyword matching

Palavreado supports both contiguous and non-contiguous multi-word keyword matching:

Contiguous (quality 1.0): "put on" matches "put on some music" exactly.
Non-contiguous (quality 0.8): "turn down" matches "turn it down a bit" even though "it" intervenes.

Non-contiguous matches carry a lower quality multiplier so they never override a precise contiguous match when both are present.

Context gating

Intents can be gated on named session contexts:

container.require_context("lights_off", "lights_active")
container.set_context("lights_off", "lights_active")
result = container.calc_intent("turn off the lights")  # fires

container.unset_context("lights_off", "lights_active")
result = container.calc_intent("turn off the lights")  # suppressed (context missing)

exclude_context suppresses an intent while a specific context is active:

container.exclude_context("lights_off", "lights_already_off")
container.set_context("lights_off", "lights_already_off")
result = container.calc_intent("turn off the lights")  # suppressed

Keyword exclusion

Suppress an intent when specific words appear in the query:

container.exclude_keywords("play_music", ["stop", "pause"])
result = container.calc_intent("stop the music")  # play_music suppressed

Single-word exclusions use whole-word matching; multi-word exclusions use \b word-boundary regex so "play" does not fire on "display".

OVOS pipeline plugin

Palavreado ships an OVOS pipeline plugin that replaces Adapt as the keyword intent engine. It responds to the same bus events (register_vocab, register_intent, detach_intent, detach_skill) so existing skills need no changes.

Configure in mycroft.conf:

{
  "intents": {
    "palavreado": {
      "conf_high": 0.65,
      "conf_med":  0.45,
      "conf_low":  0.25
    }
  }
}

Entry point: palavreado.opm:PalavreadoPipeline

Benchmark

Evaluated on a keyword-intent dataset of 284 cases (217 match utterances across 22 intents, 67 no-match utterances). The dataset spans short (1–3 words), medium (4–8), long (9–14), and very long (15+ word) utterances, plus multi-intent queries where two intents' keywords are both present. No-match cases cover easy off-topic utterances, single keyword in incidental context (past tense, reported speech, third-person, rhetorical), and harder traps with multiple keywords that are still not commands.

Engine	Accuracy	Precision	Recall	F1	TN / no-match	FP	Median latency
palavreado	81.7%	80.6%	94.0%	0.868	28 / 67	49	0.58 ms
adapt	80.3%	81.0%	90.3%	0.854	32 / 67	46	0.20 ms

TN / no-match = utterances that correctly returned no intent out of the 67 no-match cases.

Palavreado beats Adapt on accuracy, recall, and F1, but Adapt bails out more conservatively (32 vs 28 correct no-matches). Both engines share the same fundamental limitation of keyword-based matching: a vocabulary word appearing incidentally in an off-topic sentence triggers a false positive. The high FP rate reflects real hardness in the dataset — keyword parsers have no grammatical or pragmatic context, so past-tense, rhetorical, and third-person uses of vocabulary words are indistinguishable from commands.

Run the benchmark yourself:

python benchmark/compare.py

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

1.0.0a1 pre-release

Apr 22, 2026

0.2.0

Apr 29, 2021

0.1.0

Apr 29, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

palavreado-1.0.0a1.tar.gz (42.9 kB view details)

Uploaded Apr 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

palavreado-1.0.0a1-py3-none-any.whl (36.7 kB view details)

Uploaded Apr 22, 2026 Python 3

File details

Details for the file palavreado-1.0.0a1.tar.gz.

File metadata

Download URL: palavreado-1.0.0a1.tar.gz
Upload date: Apr 22, 2026
Size: 42.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for palavreado-1.0.0a1.tar.gz
Algorithm	Hash digest
SHA256	`cd2354aa9895f85c0344b9d94116c5f1244d30d4b245b700c6327896532cead2`
MD5	`eeed7b9d45b63fbd63eefd378643fd5e`
BLAKE2b-256	`d45b0b1f47990d2a931b4ddf52ca2fb1872814ff96e0410e853f78539ea53a20`

See more details on using hashes here.

File details

Details for the file palavreado-1.0.0a1-py3-none-any.whl.

File metadata

Download URL: palavreado-1.0.0a1-py3-none-any.whl
Upload date: Apr 22, 2026
Size: 36.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for palavreado-1.0.0a1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0dcb16e61acf4d5d7956a6d9e628d81b98c88e2cbfb7f7d859d5cd66e722c9c8`
MD5	`dab760546c4a2be18c71f6cdee3122c4`
BLAKE2b-256	`c10247f3dd39a1f10fe62a0accb7101e58747ce1547e71d1551b882feb58da82`

See more details on using hashes here.

palavreado 1.0.0a1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

palavreado

Install

Quick start

Keyword intent

Optional slots

Raw regex intent

Autoregex / entity extraction

IntentCreator API

Breaking changes

IntentContainer API

Result fields

Confidence scoring

Normalisation

Multi-word keyword matching

Context gating

Keyword exclusion

OVOS pipeline plugin

Benchmark

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`IntentCreator` API

`IntentContainer` API