Shared utilities for Security Verifiers RL environments

These details have not been verified by PyPI

Project links

Project description

security-verifiers-utils

Shared utilities and components for Security Verifiers RL environments.

Overview

This package provides common functionality used across all Security Verifiers environments, including:

Dataset Loading: Multi-tiered dataset loading with automatic fallback (local → HuggingFace → synthetic)
Response Parsing: Strict JSON schema validation for classification outputs
Reward Functions: Reusable reward components for accuracy, calibration, and asymmetric costs
Logging: Structured rollout logging with Weave and Weights & Biases integration
Weave Integration: Automatic tracing initialization for Verifiers operations

Installation

pip install security-verifiers-utils
# or with uv
uv add security-verifiers-utils

Usage

Dataset Loading

from sv_shared import load_dataset_with_fallback, DatasetSource

# Automatic fallback: local → hub → synthetic
dataset = load_dataset_with_fallback(
    dataset_name="my-dataset.jsonl",
    local_path="./data",
    hf_repo="org/repo",
    dataset_source="auto",  # or "local", "hub", "synthetic"
    synthetic_factory=lambda: [{"example": "data"}],
    max_examples=1000,
)

Response Parsing

from sv_shared import JsonClassificationParser

parser = JsonClassificationParser(
    allowed_labels=["Benign", "Malicious", "Abstain"]
)

# Parse and validate JSON response
result = parser(response)
# Returns: {"label": str, "confidence": float, "rationale": str}
# Or None if invalid

Reward Functions

from sv_shared import (
    reward_accuracy,
    reward_calibration,
    reward_asymmetric_cost,
)

# Accuracy reward (0.0 to 1.0)
acc_reward = reward_accuracy(
    predicted="Malicious",
    ground_truth="Malicious",
    abstain_label="Abstain"
)

# Calibration reward (encourages well-calibrated confidence)
cal_reward = reward_calibration(
    predicted="Malicious",
    ground_truth="Malicious",
    confidence=0.85,
    abstain_label="Abstain"
)

# Asymmetric cost (penalizes false negatives more than false positives)
cost_reward = reward_asymmetric_cost(
    predicted="Benign",
    ground_truth="Malicious",
    fn_cost=10.0,  # False negative cost
    fp_cost=1.0,   # False positive cost
    abstain_label="Abstain"
)

Rollout Logging

from sv_shared import build_rollout_logger

# Build logger with Weave and W&B backends
logger = build_rollout_logger({
    "weave_project": "my-project",
    "wandb_project": "my-project",
    "wandb_entity": "my-org",
})

# Use with environment
env = load_environment(logger=logger)

Weave Auto-tracing

# Import before verifiers to enable automatic tracing
from sv_shared import weave_init  # Initializes Weave if enabled
import verifiers as vf

# Configure via environment variables:
# WEAVE_AUTO_INIT=true/false (default: true)
# WEAVE_PROJECT=<name> (default: security-verifiers)
# WEAVE_DISABLED=true/false

Components

Dataset Loader (`dataset_loader.py`)

load_dataset_with_fallback() - Multi-tiered dataset loading
DatasetSource - Type alias for dataset source modes
DEFAULT_E1_HF_REPO, DEFAULT_E2_HF_REPO - Default HuggingFace repositories

Parsers (`parsers.py`)

JsonClassificationParser - Validates JSON classification outputs with strict schema adherence

Rewards (`rewards.py`)

reward_accuracy() - Classification accuracy reward
reward_calibration() - Confidence calibration reward
reward_asymmetric_cost() - Asymmetric false positive/negative costs

Rollout Logging (`rollout_logging.py`)

RolloutLogger - Base logger class
build_rollout_logger() - Factory for creating loggers with backends
RolloutLoggingConfig - Configuration dataclass
DEFAULT_ROLLOUT_LOGGING_CONFIG - Default configuration

Weave Initialization (`weave_init.py`)

initialize_weave_if_enabled() - Conditional Weave initialization
Automatic tracing setup for Verifiers

Utilities (`utils.py`)

get_response_text() - Extract text from Verifiers response objects

Development

Setup

git clone https://github.com/intertwine/security-verifiers.git
cd security-verifiers/sv_shared
uv venv && source .venv/bin/activate
uv sync --extra dev

Testing

pytest

Linting

ruff check .
ruff format .

Environment Variables

WEAVE_AUTO_INIT - Enable/disable automatic Weave initialization (default: true)
WEAVE_PROJECT - Weave project name (default: security-verifiers)
WEAVE_DISABLED - Completely disable Weave (default: false)
WANDB_API_KEY - Weights & Biases API key (required for W&B logging)
HF_TOKEN - HuggingFace token (required for private dataset access)

License

MIT License - see LICENSE file for details.

Related Packages

verifiers - Core Verifiers RL framework
sv-env-network-logs - Network log anomaly detection environment
sv-env-config-verification - Configuration security verification environment

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.1

Feb 28, 2026

0.3.0

Feb 17, 2026

0.2.3

Jan 24, 2026

0.2.2

Jan 23, 2026

0.2.1

Nov 5, 2025

This version

0.2.0

Oct 30, 2025

0.1.3

Oct 30, 2025

0.1.2

Oct 30, 2025

0.1.1

Oct 24, 2025

0.1.0

Oct 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

security_verifiers_utils-0.2.0-py3-none-any.whl (15.2 kB view details)

Uploaded Oct 30, 2025 Python 3

File details

Details for the file security_verifiers_utils-0.2.0-py3-none-any.whl.

File metadata

Download URL: security_verifiers_utils-0.2.0-py3-none-any.whl
Upload date: Oct 30, 2025
Size: 15.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for security_verifiers_utils-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`290cc05991fd7538e0179303213e8741ccbb7384bd0d51bff652c4126b5a746a`
MD5	`4963cacd955d25445d335b1481c63d6e`
BLAKE2b-256	`3f61a5323ea630c11ab89124ea72b6661c495d4f16bb8995fe27564aeb77c502`

See more details on using hashes here.

security-verifiers-utils 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

security-verifiers-utils

Overview

Installation

Usage

Dataset Loading

Response Parsing

Reward Functions

Rollout Logging

Weave Auto-tracing

Components

Dataset Loader (dataset_loader.py)

Parsers (parsers.py)

Rewards (rewards.py)

Rollout Logging (rollout_logging.py)

Weave Initialization (weave_init.py)

Utilities (utils.py)

Development

Setup

Testing

Linting

Environment Variables

License

Links

Related Packages

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

Dataset Loader (`dataset_loader.py`)

Parsers (`parsers.py`)

Rewards (`rewards.py`)

Rollout Logging (`rollout_logging.py`)

Weave Initialization (`weave_init.py`)

Utilities (`utils.py`)