Public Preview — Agent-Lightning RL integration for the Agent Governance Toolkit: governed training with policy enforcement

These details have not been verified by PyPI

Project links

Project description

Agent Lightning — RL Training Governance

[!IMPORTANT] Public Preview — The agentmesh-lightning package on PyPI is a Microsoft-signed public preview release. APIs may change before GA.

Train AI agents with RL while maintaining 0% policy violations.

Part of the Agent Governance Toolkit

🎯 Overview

This package provides governed RL training integration:

Agent-Lightning = Training/Optimization (the "brains")
Agent-OS = Governance/Safety (the "guardrails")

Result: Agents learn to be smart AND safe from the start.

Note: This package was extracted from agent_os.integrations.agent_lightning. The old import path still works via a backward-compatibility shim but new code should import from agent_lightning_gov directly.

🚀 Quick Start

pip install agentmesh-lightning
# Optional: pip install agent-os-kernel  # for kernel integration

from agent_lightning_gov import GovernedRunner, PolicyReward
from agent_os import KernelSpace
from agent_os.policies import SQLPolicy, CostControlPolicy

# 1. Create governed kernel
kernel = KernelSpace(policy=[
    SQLPolicy(deny=["DROP", "DELETE"]),
    CostControlPolicy(max_cost_usd=100)
])

# 2. Create governed runner
runner = GovernedRunner(kernel)

# 3. Create policy-aware reward function
def base_accuracy(rollout):
    return rollout.task_output.accuracy if rollout.success else 0.0

reward_fn = PolicyReward(kernel, base_reward_fn=base_accuracy)

# 4. Train with Agent-Lightning
from agentlightning import Trainer
trainer = Trainer(
    runner=runner,
    reward_fn=reward_fn,
    algorithm="GRPO"
)

trainer.train(num_epochs=100)

📊 Key Benefits

Metric	Without Agent-OS	With Agent-OS
Policy Violations	12.3%	0.0%
Task Accuracy	76.4%	79.2%
Training Stability	Variable	Consistent

🔧 Components

GovernedRunner

Agent-Lightning runner that enforces policies during execution:

from agent_lightning_gov import GovernedRunner

runner = GovernedRunner(
    kernel,
    fail_on_violation=False,   # Continue but penalize
    log_violations=True,        # Log all violations
)

# Execute a task
rollout = await runner.step(task_input)
print(f"Violations: {len(rollout.violations)}")
print(f"Total penalty: {rollout.total_penalty}")

PolicyReward

Converts policy violations to RL penalties:

from agent_lightning_gov import PolicyReward, RewardConfig

config = RewardConfig(
    critical_penalty=-100.0,  # Harsh penalty for critical violations
    high_penalty=-50.0,
    medium_penalty=-10.0,
    low_penalty=-1.0,
    clean_bonus=5.0,          # Bonus for no violations
)

reward_fn = PolicyReward(kernel, config=config)

# Calculate reward
reward = reward_fn(rollout)  # Base reward + policy penalties

GovernedEnvironment

Gym-compatible training environment:

from agent_lightning_gov import GovernedEnvironment

env = GovernedEnvironment(
    kernel,
    config=EnvironmentConfig(
        max_steps=100,
        terminate_on_critical=True,
    )
)

# Standard Gym interface
state, info = env.reset()
while not env.terminated:
    action = agent.get_action(state)
    state, reward, terminated, truncated, info = env.step(action)

FlightRecorderEmitter

Export audit logs to LightningStore:

from agent_os import FlightRecorder
from agent_lightning_gov import FlightRecorderEmitter

recorder = FlightRecorder()
emitter = FlightRecorderEmitter(recorder)

# Export to LightningStore
emitter.emit_to_store(lightning_store)

# Or export to file for analysis
emitter.export_to_file("training_audit.json")

# Get violation summary
summary = emitter.get_violation_summary()
print(f"Violation rate: {summary['violation_rate']:.1%}")

Ecosystem

Agent Lightning is one of 7 packages in the Agent Governance Toolkit:

Package	Role
Agent OS	Policy engine — deterministic action evaluation
AgentMesh	Trust infrastructure — identity, credentials, protocol bridges
Agent Runtime	Execution supervisor — rings, sessions, sagas
Agent SRE	Reliability — SLOs, circuit breakers, chaos testing
Agent Compliance	Regulatory compliance — GDPR, HIPAA, SOX frameworks
Agent Marketplace	Plugin lifecycle — discover, install, verify, sign
Agent Lightning	RL training governance — governed runners, policy rewards (this package)

📋 License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

3.6.0

May 12, 2026

3.5.0

May 8, 2026

3.4.0

May 5, 2026

3.3.0

Apr 30, 2026

3.2.2

Apr 22, 2026

3.2.1

Apr 22, 2026

3.2.0

Apr 22, 2026

3.1.0

Apr 13, 2026

3.0.2

Apr 2, 2026

3.0.1

Apr 1, 2026

3.0.0

Mar 26, 2026

2.3.0

Mar 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentmesh_lightning-3.6.0.tar.gz (28.1 kB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentmesh_lightning-3.6.0-py3-none-any.whl (19.6 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file agentmesh_lightning-3.6.0.tar.gz.

File metadata

Download URL: agentmesh_lightning-3.6.0.tar.gz
Upload date: May 12, 2026
Size: 28.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: RestSharp/106.13.0.0

File hashes

Hashes for agentmesh_lightning-3.6.0.tar.gz
Algorithm	Hash digest
SHA256	`639164e39e30567b2bddde2edbdda8266b56ac88a8f3576ac89f0a033667233f`
MD5	`71826deb5fefcab20ca07453c393e5cc`
BLAKE2b-256	`b115f79172c66ad1f4a947d357203997086bfdc0de0c034a20237f7ae084c828`

See more details on using hashes here.

File details

Details for the file agentmesh_lightning-3.6.0-py3-none-any.whl.

File metadata

Download URL: agentmesh_lightning-3.6.0-py3-none-any.whl
Upload date: May 12, 2026
Size: 19.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: RestSharp/106.13.0.0

File hashes

Hashes for agentmesh_lightning-3.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9408c40a691b59c46de2940ba421e5894f35ead5ea7272f408a78d730d3183a4`
MD5	`91c23889684d3b70bed01566064a2936`
BLAKE2b-256	`ce9ae0acde1934fe1eaf6e1947b122f679892a5fbd146ecb0f4dc139adf36803`

See more details on using hashes here.

agentmesh_lightning 3.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Agent Lightning — RL Training Governance

🎯 Overview

🚀 Quick Start

📊 Key Benefits

🔧 Components

GovernedRunner

PolicyReward

GovernedEnvironment

FlightRecorderEmitter

Ecosystem

📋 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes