Seraph Master Roadmap

Seraph is an AI guardian that remembers, watches, and acts. Today that means a browser cockpit with persistent memory, screen awareness, proactive workflows, tool use, and MCP integration.

Summary

Use this page when you want the delivery-side truth:

docs/implementation/ is the shipped-state and delivery surface
docs/implementation/STATUS.md is the fastest current snapshot
docs/research/ is the design, benchmark, and product-thesis surface

This implementation tree is the canonical delivery-side answer to four questions:

What is shipped on develop?
How does the research-defined target product shape translate into delivery on develop?
What is still left on develop before Seraph reaches that research-defined target?
What are the next most valuable PRs?

When these docs are updated on an open feature branch, they describe the intended post-merge develop state for that branch. Until merge, the open PR and its validation remain the integration truth.

Docs Contract

docs/research/ defines target product shape, evidence rules, benchmark logic, and superiority program logic.
docs/implementation/STATUS.md is the fastest shipped-state snapshot.
this roadmap owns the live implementation queue and queue refresh rule.
docs/implementation/08-docs-contract.md explains the boundary between research truth and implementation truth.
docs/implementation/09-benchmark-status.md mirrors the benchmark axes from research as shipped implementation status.
docs/implementation/10-superiority-delivery.md mirrors the superiority program from research as delivery ownership and implementation translation.
docs/implementation/01 through 07 are the only workstream docs; 08 through 10 are cross-cutting implementation mirrors, not extra workstreams.
if research adds a new benchmark/program layer without an implementation mirror, the docs are incomplete.
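
The completeness rule in the last item can be expressed as a small check. The following is a minimal sketch, not tooling from the Seraph repo: the mirror paths come from the contract above, while the layer names used as keys and the `missing_mirrors` helper are hypothetical labels chosen for illustration.

```python
# Mirror paths are taken from the contract above; the layer names used as
# keys are illustrative labels, not identifiers from the Seraph repo.
MIRRORS = {
    "evidence": "docs/implementation/08-docs-contract.md",
    "benchmark": "docs/implementation/09-benchmark-status.md",
    "superiority": "docs/implementation/10-superiority-delivery.md",
}


def missing_mirrors(research_layers, mirrors=MIRRORS):
    """Return research layers that have no implementation mirror, sorted."""
    return sorted(layer for layer in set(research_layers) if layer not in mirrors)
```

An empty result means every research layer passed in has a mirror; any name returned marks the docs as incomplete under the contract.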

Current Status

Read this roadmap together with Development Status. For the implementation-side mirrors of the evidence, benchmark, and superiority layers, also read 08. Docs Contract, 09. Benchmark Status, and 10. Superiority Delivery.

Legend for the checklist column:

[x] shipped on develop
[ ] not fully shipped on develop

Workstream	Checklist	Notes
01. Trust Boundaries	`[ ]`	Policy modes, approvals, audit logging, and secret handling are shipped; deeper isolation and narrower privileged execution paths are still left
02. Execution Plane	`[ ]`	Real tools, MCP, browser, shell, filesystem, goals, vault, web search, first-class reusable workflows, starter packs, threaded workflow history, step diagnostics, parameterized replay context, capability preflight/bootstrap, cockpit-native extension authoring, and first branch/resume workflow control are shipped; stronger execution safety and deeper visual workflow control are still left
03. Runtime Reliability	`[ ]`	Fallback chains, routing rules, local runtime paths, weighted provider scoring, capability/cost/latency/task/budget safeguards, richer routing explainability, routing-summary audit visibility, and guardian-behavior runtime evals are shipped; simulation-grade policy planning and broader eval depth are still left
04. Presence And Reach	`[ ]`	Browser UI, WebSocket chat, proactive delivery, observer refresh, native daemon foundations, a first coherent desktop presence surface, unified browser/native continuity, and native action-card resume payloads are shipped; broader channel reach and deeper cross-surface continuity are still left
05. Guardian Intelligence	`[ ]`	Guardian record, memory, goals, strategist, briefings, reviews, observer-driven state, observer salience/confidence scoring, explicit guardian state, corroboration-aware world-model fusion, continuity-thread memory signals, project timelines, obligations, collaborators, intervention policy, and learned timing/suppression/thread guidance are shipped foundations; stronger long-horizon learning loops are still left
06. Embodied Interface	`[ ]`	The guardian cockpit is now the active browser shell, with a pane workspace, drag/resize plus grid snap, saved layout composition, session continuity restore, linked evidence, a searchable capability surface, a separate activity ledger window, richer live operator views, preflight/repair flows, and an extension studio shipped; deeper visual workflow debugging and denser keyboard-first control are still left
07. Ecosystem And Delegation	`[ ]`	Skills, MCP, catalog/install surfaces, delegation foundations, reusable workflow composition, starter packs, capability discovery, threaded workflow history plus a separate activity ledger, parameterized runbooks, preflight/autorepair, bounded bootstrap, cockpit-native extension authoring, and first branch/resume control are shipped; stronger extension ergonomics, versioning, and clearer workflow visual control are still left

Progress Summary

Completed 10-PR Batches

Completed batches stay visible instead of being deleted on queue refresh.

Latest Completed 10-PR Batch

Previous Completed 10-PR Batch

Earlier Completed 10-PR Batch

Previous Completed 10-PR Batch

Older Completed 10-PR Batch

Archived Completed 10-PR Batch

Legacy Completed 10-PR Batch

Historical Completed 10-PR Batch

Current Extension Platform Transition Queue

This is the authoritative PR list for the implementation side. For this architecture migration, the roadmap keeps the full multi-batch transition queue visible instead of truncating it to 10 items.

every entry below is a numbered PR-sized slice
the queue below is the full canonical transition program; shipped slices stay visible here until the entire migration is complete
this roadmap is the canonical queue for the transition; Workstream 07 summarizes the same program by phase and deliverable set rather than restating every item

Queue Maintenance Rule

keep the full active queue for architecture-transition programs visible here until the transition is complete
keep the most recent completed 10-PR batch visible above with checkmarks
do not delete the immediately previous completed batch until a later cleanup pass
keep landed slices in the active queue marked [x] until a full 10-slice completed batch is ready to move into the completed-batches section
when 5 slices from the published queue have landed, rerank only the remaining open items while leaving the landed items in place
when 10 slices from the published queue have landed, move that completed set into the completed-batches history and renumber the remaining active queue
rerank earlier if new evidence from docs/research/ materially changes the priority order
each internal slice must close with a subagent review pass against bugs, missing tests, design drift, and hallucinated assumptions before it is marked complete or rolled into a final GitHub PR
the result of that subagent review must be recorded in the eventual GitHub PR Validation section before any slice is marked complete in these docs
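
The two threshold rules above (rerank at 5 landed slices, archive at 10) can be sketched as a small state update. This is an illustrative model only, under stated assumptions: the `Slice`, `Queue`, `refresh`, and `render` names and the numeric `priority` field are hypothetical, a slice's number is taken to be its position in the active queue, and the sketch does not model the transition-program exception that keeps shipped slices visible until the migration is complete.

```python
from dataclasses import dataclass, field

RERANK_AT = 5    # landed slices that trigger a rerank of open items
ARCHIVE_AT = 10  # landed slices that trigger a move into history


@dataclass
class Slice:
    title: str
    priority: float = 0.0  # hypothetical score; higher means more valuable
    landed: bool = False


@dataclass
class Queue:
    active: list[Slice]
    completed_batches: list[list[Slice]] = field(default_factory=list)
    reranked: bool = False  # True once the 5-slice rerank ran for this batch


def refresh(queue: Queue) -> str:
    """Apply the threshold rules once and report which one fired."""
    landed = [s for s in queue.active if s.landed]

    if len(landed) >= ARCHIVE_AT:
        batch = landed[:ARCHIVE_AT]
        batch_ids = {id(s) for s in batch}
        # Newest completed batch goes first so it stays the most visible.
        queue.completed_batches.insert(0, batch)
        # Dropping the batch renumbers the rest, since number = position.
        queue.active = [s for s in queue.active if id(s) not in batch_ids]
        queue.reranked = False
        return "archived"

    if len(landed) >= RERANK_AT and not queue.reranked:
        ranked_open = iter(
            sorted(
                (s for s in queue.active if not s.landed),
                key=lambda s: s.priority,
                reverse=True,
            )
        )
        # Landed slices keep their slots; open slots are refilled by priority.
        queue.active = [s if s.landed else next(ranked_open) for s in queue.active]
        queue.reranked = True
        return "reranked"

    return "unchanged"


def render(queue: Queue) -> list[str]:
    """Number slices by position and mark landed ones with [x]."""
    return [
        f"{i}. [{'x' if s.landed else ' '}] {s.title}"
        for i, s in enumerate(queue.active, start=1)
    ]
```

Calling `refresh` after each landed slice keeps the rules idempotent: the rerank fires once per batch, and the archive step resets that flag so the next batch starts fresh. An earlier rerank driven by new evidence from docs/research/ would be a manual change to `priority` followed by the same sort.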

Delivery Order

Trust Boundaries
Execution Plane
Runtime Reliability
Presence And Reach
Guardian Intelligence
Embodied Interface
Ecosystem And Delegation

Implementation docs 08 through 10 are supporting mirror layers for this roadmap, not additional workstreams.

Stable Interfaces Outside This Transition

the browser and WebSocket chat surface
the observer daemon ingest path
runtime-path-based LLM routing and fallback settings
runtime audit and eval harness contracts

Transitional Interfaces Slated For Migration

SKILL.md-based skill loading
loose workflow loading from the current workspace file layout
MCP server configuration and server-management APIs as they exist before connector manifests and packaged install flows land
