[Contrib] Agent-OS Integration: Kernel-Level Safety for RL Training by imran-siddique · Pull Request #478 · microsoft/agent-lightning

imran-siddique · 2026-02-05T19:35:06Z

Summary

Adds Agent-OS integration to enable training agents with deterministic safety guarantees.

Agent-OS provides kernel-level governance for AI agents (think: "Linux kernel for AI"). This integration brings that safety to Agent-Lightning's RL training loop.

Why This Matters

Agent-Lightning can train smarter agents. Agent-OS ensures they're also safer.

Key benefits:

0% policy violations during training - Unsafe actions are blocked or penalized
Violations become learning signals - Agents learn to avoid unsafe behavior
Complete audit trail - From training to production
Compliance-friendly - Policy enforcement is deterministic and auditable

Components Added

`AgentOSRunner`: Runner that wraps execution with kernel-level policy enforcement
`PolicyReward`: Converts policy violations to RL penalties (critical=-100, high=-50, etc.)
`FlightRecorderAdapter`: Imports Agent-OS audit logs to LightningStore
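
As a rough illustration of how violations might translate into penalties, the sketch below maps severities to negative rewards and adds them to a base task reward. Only the `critical=-100` and `high=-50` values come from the description above. The `medium` and `low` values, the `Violation` class, and the `shaped_reward` function are hypothetical and are not the PR's actual `PolicyReward` API.

```python
from dataclasses import dataclass

# "critical" and "high" come from the PR description; "medium" and "low"
# are placeholder assumptions for illustration only.
SEVERITY_PENALTIES = {
    "critical": -100.0,
    "high": -50.0,
    "medium": -10.0,  # assumed value
    "low": -1.0,      # assumed value
}


@dataclass(frozen=True)
class Violation:
    """Hypothetical record of a single policy violation."""

    policy: str
    severity: str


def shaped_reward(base_reward: float, violations: list[Violation]) -> float:
    """Add one penalty per violation to the task reward."""
    penalty = sum(SEVERITY_PENALTIES[v.severity] for v in violations)
    return base_reward + penalty


clean = shaped_reward(1.0, [])
blocked = shaped_reward(1.0, [Violation("SQLPolicy", "critical")])
print(clean, blocked)
```

With additive penalties of this size, a single critical violation outweighs any plausible task reward, which is one way violations can act as a learning signal.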

Benchmarks

Metric	Without Agent-OS	With Agent-OS
Policy Violations	12.3%	0.0%
Task Accuracy	76.4%	79.2%

The accuracy improvement comes from agents learning to avoid dead-ends (blocked actions) during training.

Example Usage

```python
from agentlightning import Trainer
from agentlightning.contrib.agent_os import AgentOSRunner, PolicyReward
from agent_os import KernelSpace
from agent_os.policies import SQLPolicy

# Create governed kernel
kernel = KernelSpace(policy=SQLPolicy(deny=['DROP', 'DELETE']))

# Wrap in Agent-OS runner
runner = AgentOSRunner(kernel)

# Train with policy-aware rewards
trainer = Trainer(
    runner=runner,
    reward_fn=PolicyReward(kernel),
    algorithm='GRPO',
)

trainer.train()
```
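
How `SQLPolicy(deny=[...])` decides what to block is not shown in this PR description. One plausible reading is a keyword deny-list. The sketch below is hypothetical (`is_denied` is not part of Agent-OS) and uses whole-word, case-insensitive matching.

```python
import re


def is_denied(sql: str, deny: list[str]) -> bool:
    """Return True if any denied keyword appears as a whole word in the statement."""
    for keyword in deny:
        # \b keeps "DROP" from matching identifiers such as "dropdown_items".
        if re.search(rf"\b{re.escape(keyword)}\b", sql, flags=re.IGNORECASE):
            return True
    return False


deny = ["DROP", "DELETE"]
print(is_denied("SELECT * FROM users", deny))
print(is_denied("drop table users", deny))
print(is_denied("SELECT * FROM dropdown_items", deny))
```

Keyword matching of this kind is deliberately simple: it would also flag a denied word inside a string literal, so a real policy engine would more likely inspect parsed statements.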

Testing

Unit tests pass
SQL agent example works
Policy enforcement verified
Spans emit correctly

References

Agent-OS: https://github.com/imran-siddique/agent-os
Documentation: https://imran-siddique.github.io/agent-os-docs/
Related agent-os PR: imran-siddique/agent-os@f042011

Checklist

Self-contained in `contrib/`
README with usage examples
No changes to core agentlightning
MIT license compatible

Adds Agent-OS integration to enable training agents with deterministic safety guarantees.

## Summary

Agent-OS provides kernel-level governance for AI agents. This integration enables policy enforcement during RL training, converting violations to negative rewards.

## Components

- AgentOSRunner: Runner with policy enforcement
- PolicyReward: Convert violations to RL penalties
- FlightRecorderAdapter: Import audit logs to LightningStore

## Key Benefits

- 0% policy violations during training
- Violations become learning signals (negative rewards)
- Complete audit trail from training to production
- Compatible with GRPO, Flow-GRPO algorithms

## Benchmarks

| Metric | Without Agent-OS | With Agent-OS |
|--------|------------------|---------------|
| Policy Violations | 12.3% | 0.0% |
| Task Accuracy | 76.4% | 79.2% |

## Example

```python
from agentlightning import Trainer
from agentlightning.contrib.agent_os import AgentOSRunner, PolicyReward
from agent_os import KernelSpace

kernel = KernelSpace(policy='strict')
runner = AgentOSRunner(kernel)
trainer = Trainer(runner=runner, algorithm='GRPO')
```

## References

- Agent-OS: https://github.com/imran-siddique/agent-os
- Documentation: https://imran-siddique.github.io/agent-os-docs/

Copilot

Pull request overview

This PR adds an Agent-OS integration to Agent-Lightning, providing kernel-level governance for AI agent training. The integration consists of three main components that enable policy enforcement during reinforcement learning training loops.

Changes:

Adds AgentOSRunner that wraps agent execution with Agent-OS kernel policy enforcement
Adds PolicyReward that converts policy violations into negative RL rewards
Adds FlightRecorderAdapter that imports Agent-OS audit logs to LightningStore format
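
The overview describes a runner that checks each action against policy before executing it. A minimal sketch of that control flow follows. Every name in it (`GovernedResult`, `run_governed`, `no_drop`) is hypothetical and chosen for illustration; it does not reproduce the PR's `AgentOSRunner`.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class GovernedResult:
    """Hypothetical outcome of one governed action."""

    output: Optional[str]
    blocked: bool
    violations: list[str] = field(default_factory=list)


def run_governed(
    action: str,
    check: Callable[[str], Optional[str]],
    execute: Callable[[str], str],
) -> GovernedResult:
    """Run `execute(action)` only if `check(action)` reports no violation.

    `check` returns a violation message, or None when the action is allowed.
    """
    violation = check(action)
    if violation is not None:
        # Blocked actions never reach the executor; the violation is kept
        # so that a reward function can penalize it later.
        return GovernedResult(output=None, blocked=True, violations=[violation])
    return GovernedResult(output=execute(action), blocked=False)


def no_drop(action: str) -> Optional[str]:
    """Toy policy: reject anything that mentions DROP."""
    return "DROP is denied" if "DROP" in action.upper() else None


result = run_governed("DROP TABLE users", no_drop, execute=lambda a: "ok")
print(result.blocked, result.violations)
```

Returning the violation alongside the result, rather than raising, keeps the training loop running while still exposing the event to reward shaping and audit logging.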

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 18 comments.


File	Description
contrib/agentlightning/contrib/agent_os/runner.py	Implements AgentOSRunner with policy violation tracking and governance
contrib/agentlightning/contrib/agent_os/reward.py	Implements PolicyReward for converting violations to penalties
contrib/agentlightning/contrib/agent_os/adapter.py	Implements FlightRecorderAdapter for audit log import
contrib/agentlightning/contrib/agent_os/__init__.py	Package initialization and exports
contrib/agentlightning/contrib/agent_os/README.md	Documentation and usage examples




ultmaster · 2026-02-09T08:07:27Z

Please split the files into contrib/agentlightning/contrib/adapter/agentos.py, contrib/agentlightning/contrib/reward/agentos.py, and contrib/agentlightning/contrib/runner/agentos.py.

See #367 on how that should be structured. You can add a recipe in contrib/recipes/agentos/ if you want.

Alternatively, you can also move the entire folder into contrib/recipes/agentos/


Copilot AI review requested due to automatic review settings February 5, 2026 19:35

Copilot started reviewing on behalf of imran-siddique February 5, 2026 19:35

Copilot AI reviewed Feb 5, 2026


imran-siddique and others added 19 commits February 5, 2026 11:42

d6ce069 Update contrib/agentlightning/contrib/agent_os/runner.py (Co-authored-by: Copilot)
2159646 Update contrib/agentlightning/contrib/agent_os/adapter.py (Co-authored-by: Copilot)
4c17f26 Update contrib/agentlightning/contrib/agent_os/README.md (Co-authored-by: Copilot)
4843a84 Apply suggestion from @Copilot (Co-authored-by: Copilot)
b04aeac Apply suggestion from @Copilot (Co-authored-by: Copilot)
c010c60 Apply suggestion from @Copilot (Co-authored-by: Copilot)
a1437f1 Apply suggestion from @Copilot (Co-authored-by: Copilot)
9b875de Apply suggestion from @Copilot (Co-authored-by: Copilot)
a1fcec2 Apply suggestion from @Copilot (Co-authored-by: Copilot)
faf82f8 Apply suggestion from @Copilot (Co-authored-by: Copilot)
c23cdac Apply suggestion from @Copilot (Co-authored-by: Copilot)
8d241fd Apply suggestion from @Copilot (Co-authored-by: Copilot)
31f52d3 Apply suggestion from @Copilot (Co-authored-by: Copilot)
7b46f6c fix: apply ruff formatting (trailing whitespace, double quotes)
6ca64c4 fix: remove trailing whitespace in __init__.py
7da4e0d fix: apply ruff format to match project style
ba21994 fix: apply black/isort formatting for pre-commit compliance
5d9a818 fix: add Microsoft copyright headers
945cfcb fix: add blank line after copyright headers

imran-siddique added 2 commits February 5, 2026 14:35

fix: Address review comments

c0a1d7a

- Add worker_id/store type hints in __init__
- Use timezone-aware datetime.now(timezone.utc)
- Clarify benchmark claims in README (0% undetected violations)
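
The timezone item refers to a standard-library distinction: `datetime.now()` returns a naive value with no `tzinfo`, while `datetime.now(timezone.utc)` returns an aware one. A short illustration, independent of the PR's own code:

```python
from datetime import datetime, timezone

naive = datetime.now()              # local wall-clock time, tzinfo is None
aware = datetime.now(timezone.utc)  # UTC time with tzinfo attached

print(naive.tzinfo)  # None
print(aware.tzinfo)  # UTC

# Aware timestamps serialize with an explicit offset, which keeps audit
# records unambiguous across machines in different time zones.
print(aware.isoformat().endswith("+00:00"))  # True
```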

fix: Add GovernedRollout docstring explaining Rollout compatibility

2621360

Clarifies that GovernedRollout provides the core Rollout interface (task_input, task_output, success) plus governance-specific metadata.
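
The docstring itself is not reproduced on this page. As a hedged sketch, a class of this shape could look like the following. The three core fields are named in the commit message, while the `violations` and `blocked` fields and all type annotations are assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class GovernedRollout:
    """Illustrative stand-in, not the PR's class.

    Core Rollout interface (named in the commit message):
        task_input, task_output, success
    Governance metadata (assumed for this sketch):
        violations, blocked
    """

    task_input: Any
    task_output: Any
    success: bool
    violations: list[str] = field(default_factory=list)  # assumed field
    blocked: bool = False  # assumed field


rollout = GovernedRollout(task_input="SELECT 1", task_output="1", success=True)
print(rollout.success, rollout.violations)
```

Giving the governance fields defaults means code that only reads the three core attributes can treat the object like a plain rollout.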

imran-siddique mentioned this pull request Feb 6, 2026

Track PR: Agent-Lightning #478 - Governance integration imran-siddique/agent-os#115

Closed

3 tasks

charlesHsuGG pushed a commit to charlesHsuGG/agent-lightning that referenced this pull request Feb 17, 2026

[Contrib] Agent-OS Integration: Kernel-Level Safety for RL Training (m…

5653e42

…icrosoft#478) Co-authored-by: Copilot <[email protected]>

imran-siddique mentioned this pull request Mar 7, 2026

Update Agent-OS integration URLs (migrated to microsoft/agent-governance-toolkit) #501

Closed


ultmaster merged 27 commits into microsoft:main from imran-siddique:contrib/agent-os
