ML classifiers for axor-core: TaskSignalClassifier + MLAnomalyDetector

These details have not been verified by PyPI

Project description

axor-classifier-simple

ML classifiers for axor-core: task signal classification and behavioral anomaly detection.

Two independent components, zero required dependencies — scikit-learn is an optional extra.

What's included

Component	Model	Inference target
`TaskSignalClassifier`	TF-IDF + LogisticRegression (3 independent heads)	< 1 ms
`MLAnomalyDetector`	GradientBoostingClassifier + optional LLM verifier	< 1 ms

Both implement protocols from axor-core and plug in with zero coupling to core internals.

Installation

pip install axor-core axor-classifier-simple[ml]

Without [ml], the package installs with no dependencies but raises ImportError on instantiation with an actionable message.

TaskSignalClassifier

ML replacement for the built-in HeuristicClassifier. Implements the SignalClassifier ABC from axor_core.contracts.policy.

Architecture

Three independent TF-IDF (char n-grams 1–3, char_wb analyzer) + LogisticRegression heads:

Head	Labels
`complexity`	`focused` · `moderate` · `expansive`
`nature`	`generative` · `mutative` · `readonly`
`domain`	`analysis` · `coding` · `general` · `research` · `support`

Confidence is reported as min(complexity_confidence, nature_confidence). Domain is a hint used by GovernedSession for policy defaults.

Train

# train and save to ~/.axor/models/task_signal.joblib
python -m axor_classifier_simple.train_task_signal

# custom output path
python -m axor_classifier_simple.train_task_signal --out /path/to/model.joblib

# fail if validation accuracy < threshold (default 0.85)
python -m axor_classifier_simple.train_task_signal --min-accuracy 0.90

Training generates synthetic data, trains three pipelines, validates each head, and saves a joblib bundle. Raises RuntimeError if any head falls below min_accuracy.

Use with axor-core

from axor_classifier_simple import TaskSignalClassifier
from axor_core import GovernedSession, CapabilityExecutor

classifier = TaskSignalClassifier()  # loads ~/.axor/models/task_signal.joblib

session = GovernedSession(
    executor=my_executor,
    capability_executor=cap_executor,
    classifier=classifier,
    # heuristic runs first; escalates to ML only when confidence < 0.75
)
result = await session.run("refactor the auth module")

Custom model path or environment variable:

export AXOR_TASK_SIGNAL_MODEL=/path/to/task_signal.joblib

classifier = TaskSignalClassifier(model_path="/path/to/task_signal.joblib")

Inspect raw scores

signal, confidence, scores = await classifier.classify_with_scores("write a test for /login")
# signal.complexity → TaskComplexity.FOCUSED
# signal.nature     → TaskNature.GENERATIVE
# confidence        → 0.91
# scores            → {"complexity.focused": 0.91, "nature.generative": 0.88, "domain.coding": 0.76, ...}

MLAnomalyDetector

GradientBoostingClassifier that scores behavioral trajectories from sequences of NormalizedIntent objects. Implements the AnomalyDetector Protocol from axor_core.contracts.anomaly.

Optionally delegates gray-zone cases to an LLMVerifier (e.g. LLMAnomalyVerifier from axor-classifier-llm).

Score thresholds

Class	Score range
`NORMAL`	`[0.0, 0.40)`
`SUSPICIOUS`	`[0.40, 0.75)`
`CRITICAL`	`[0.75, 1.0]`

Train

# train and save to ~/.axor/models/anomaly_detector.joblib
python -m axor_classifier_simple.train_anomaly

# custom options
python -m axor_classifier_simple.train_anomaly \
    --out /path/to/model.joblib \
    --n-estimators 300 \
    --max-depth 5 \
    --min-accuracy 0.92

Raises RuntimeError if validation accuracy falls below min_accuracy (default 0.90).

Basic use

from axor_classifier_simple import MLAnomalyDetector

detector = MLAnomalyDetector()  # loads ~/.axor/models/anomaly_detector.joblib

result = await detector.score(
    window=normalized_intents,
    task_signal_hint="focused_mutative",
    policy_name="focused_mutative",
)
print(result.score)    # float 0.0 – 1.0
print(result.cls)      # AnomalyClass.NORMAL / SUSPICIOUS / CRITICAL
print(result.reasons)  # ("external_read_seen", "executes_generated_code", ...)

Custom model path:

export AXOR_ANOMALY_MODEL=/path/to/anomaly_detector.joblib

detector = MLAnomalyDetector(model_path="/path/to/anomaly_detector.joblib")

With LLM verifier for gray-zone escalation

import anthropic
from axor_classifier_simple import MLAnomalyDetector
from axor_classifier_llm import LLMAnomalyVerifier

verifier = LLMAnomalyVerifier(client=anthropic.AsyncAnthropic())

detector = MLAnomalyDetector(
    gray_zone_verifier=verifier,
    gray_zone_threshold=0.50,   # call verifier when score >= 0.50 and class is SUSPICIOUS
)

If the LLM call fails, the detector falls back to the ML score with a warning log.

Constructor parameters

Parameter	Default	Description
`model_path`	`~/.axor/models/anomaly_detector.joblib`	Path to trained model
`gray_zone_verifier`	`None`	`LLMVerifier` for uncertain cases
`window_size`	`10`	Number of intents per observation window
`gray_zone_threshold`	`0.50`	Min score to invoke verifier in suspicious range
`suspicious_threshold`	`0.40`	Score boundary NORMAL / SUSPICIOUS
`critical_threshold`	`0.75`	Score boundary SUSPICIOUS / CRITICAL
`score_weights`	`{"critical": 1.0, "suspicious": 0.55, "normal": 0.0}`	Class probability weights

Feature encoding

Each NormalizedIntent is encoded to a fixed-length feature vector. Windows of 10 intents produce 300 features total (shorter windows are zero-padded on the left).

Group	Fields	Size
Boolean flags	`reads_secret_like_data`, `writes_outside_workdir`, `executes_generated_code`, `after_external_read`, `after_secret_access`	5
`operation`	`execute_generated_code`, `file_read`, `file_write`, `network_request`, `other`, `package_install`, `search`, `test`	8 (one-hot)
`target_kind`	`cloud_metadata`, `docker_socket`, `external_url`, `localhost`, `private_network`, `secret`, `system_path`, `workdir`	8 (one-hot)
`data_flow`	`external_to_shell`, `local_to_external`, `local_to_local`, `none`	4 (one-hot)
`provenance`	`external_web`, `official_docs`, `repo`, `unknown`, `user`	5 (one-hot)

Feature order is fixed — it must match between training and inference. The encoding is defined in anomaly_detector.py and data/anomaly_data.py.

Development

git clone https://github.com/Bucha11/axor-classifier-simple
cd axor-classifier-simple
pip install -e ".[dev]"
pytest tests/

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.1

Jun 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

axor_classifier_simple-0.2.1.tar.gz (39.8 kB view details)

Uploaded Jun 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

axor_classifier_simple-0.2.1-py3-none-any.whl (41.6 kB view details)

Uploaded Jun 2, 2026 Python 3

File details

Details for the file axor_classifier_simple-0.2.1.tar.gz.

File metadata

Download URL: axor_classifier_simple-0.2.1.tar.gz
Upload date: Jun 2, 2026
Size: 39.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for axor_classifier_simple-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`f2fe01440e93d7bb8b8e6c43901829d18484cc11e2ebaec41c0c581ffc8f9658`
MD5	`efdfc5cf3b82311a4c6afb5b63345d77`
BLAKE2b-256	`675cb782b53ddc689fe40a1f88f4e9f6d5ad119160cf7fe0a380bc4ad893b77d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for axor_classifier_simple-0.2.1.tar.gz:

Publisher: ci.yml on Bucha11/axor-classifier-simple

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: axor_classifier_simple-0.2.1.tar.gz
- Subject digest: f2fe01440e93d7bb8b8e6c43901829d18484cc11e2ebaec41c0c581ffc8f9658
- Sigstore transparency entry: 1706214224
- Sigstore integration time: Jun 2, 2026
Source repository:
- Permalink: Bucha11/axor-classifier-simple@98b00e2f56b5fe1f479fb5e2a8819f61ae56464c
- Branch / Tag: refs/tags/v0.2.1
- Owner: https://github.com/Bucha11
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@98b00e2f56b5fe1f479fb5e2a8819f61ae56464c
- Trigger Event: push

File details

Details for the file axor_classifier_simple-0.2.1-py3-none-any.whl.

File metadata

Download URL: axor_classifier_simple-0.2.1-py3-none-any.whl
Upload date: Jun 2, 2026
Size: 41.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for axor_classifier_simple-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f001d6dfc12154fdeb6e5331197733407046d9cbca65fdfad0eb822f15c4a0eb`
MD5	`604c066f1b848e0d00bd8997fdbc77f6`
BLAKE2b-256	`e1b385c2e09fc37b2493dcd68884e091e5dc1775505c44bba670891479e93995`

See more details on using hashes here.

Provenance

The following attestation bundles were made for axor_classifier_simple-0.2.1-py3-none-any.whl:

Publisher: ci.yml on Bucha11/axor-classifier-simple

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: axor_classifier_simple-0.2.1-py3-none-any.whl
- Subject digest: f001d6dfc12154fdeb6e5331197733407046d9cbca65fdfad0eb822f15c4a0eb
- Sigstore transparency entry: 1706214286
- Sigstore integration time: Jun 2, 2026
Source repository:
- Permalink: Bucha11/axor-classifier-simple@98b00e2f56b5fe1f479fb5e2a8819f61ae56464c
- Branch / Tag: refs/tags/v0.2.1
- Owner: https://github.com/Bucha11
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@98b00e2f56b5fe1f479fb5e2a8819f61ae56464c
- Trigger Event: push

axor-classifier-simple 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

axor-classifier-simple

What's included

Installation

TaskSignalClassifier

Architecture

Train

Use with axor-core

Inspect raw scores

MLAnomalyDetector

Score thresholds

Train

Basic use

With LLM verifier for gray-zone escalation

Constructor parameters

Feature encoding

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance