Capabilities

Agentic Capabilities: A Tutorial Introduction

Deployment patterns from local execution to autonomous systems

We provide here example code that implements a set of agentic capabilities of increasing sophistication, from local execution to governed, autonomous scientific systems.

The code makes use of two complementary agent frameworks:

LangGraph — LLM reasoning, tool calling, and structured workflows. Handles the intelligence layer.
Academy — Distributed execution, federation, and HPC integration. Handles the distribution layer.

Why both? Production scientific agents need both capabilities. LangGraph excels at LLM orchestration but runs in a single process. Academy excels at distributed execution but doesn't handle LLM reasoning. Together, they enable intelligent agents that run anywhere—from laptops to federated DOE infrastructure.

See these slides for a brief review of these two systems, and one more, Microsoft Agent Framework.

There are excellent tutorial materials online for LangGraph and for Academy. Here we focus on showing how to use the two systems, independently and together, to realize scalable agentic systems for science.

Getting Started: Tutorial Review of Agentic Patterns

Building Scientific Agents — Learn to build production scientific agents:

LLM Agents — Build agents that reason and call tools (LangGraph)
Distributed Agents — Run agents across machines (Academy)
Production Agents — Combine both for real deployments

Additional Resources
LLM Configuration	Configure OpenAI, Ollama, or FIRST backends

Code Samples by Area

The sample code shows how to implement a series of increasingly sophisticated agentic patterns.

1. Local Agent Execution

LangGraph, Academy

Your on-ramp to CAF. Run persistent, stateful agents on a laptop or workstation—no federation required.

Status: Mature

2. Federated Agent Execution

LangGraph + Academy

Cross-institutional agent execution under federated identity and policy.

Status: Mature

3. Massively Parallel Agent Inference

LangGraph, Aegis

Fan out thousands of LLM requests in parallel on HPC.

Status: Prototype

4. Governed Tool Use at Scale

Academy governance

Invoke expensive, stateful, or dangerous tools under proactive policy enforcement.

Status: WIP

5. Multi-Agent Coordination

Shared state + policy + budgets

Many agents under shared governance—within one institution or across many.

Status: Emerging

6. Long-Lived Autonomous Agents

Lifecycle management

Agents that persist for days to months, maintaining state, memory, and goals.

Status: Emerging

7. Agent-Mediated Scientific Workflows

Dynamic workflow construction

Agents dynamically construct, adapt, and execute scientific workflows.

Status: Early

Maturity Levels

Level	Meaning
Mature	Documented with working examples on this site
Prototype	Demonstrated on DOE systems; documentation in progress
WIP	Work in progress
Emerging	Active development; early adopters welcome
Early	Early stage; design and prototyping

Capability Matrix

Stage	Capability	What you can do	CAF Components	Where it runs	Scale	Status
1	Local Agent Execution	Run persistent, stateful agents	LangGraph	Laptop, workstation, VM	Single agent	Mature
2	Federated Agent Execution	Invoke tools under federated identity	LangGraph + Academy	DOE HPC	Multi-resource	Mature
3	Parallel Agent Inference	Fan out thousands of LLM requests	LangGraph + FIRST	HPC accelerators	O(10³–10⁴) streams	Prototype
4	Governed Tool Use	Invoke tools under policy enforcement	Academy governance	DOE HPC	O(10²–10³) tools	WIP
5	Multi-Agent Coordination	Coordinate agents under shared governance	Shared state + policy	Distributed	O(10²–10³) agents	Emerging
6	Long-Lived Agents	Persistent agents with memory and goals	Lifecycle management	Any	Days–months	Emerging
7	Agent Workflows	Dynamic workflow construction	Workflow integration	DOE infrastructure	Varies	Early

Scale notation: O(10²) means "order of 100" (tens to hundreds), O(10³) means "order of 1,000" (hundreds to thousands), etc. These indicate typical operating ranges, not hard limits.

Examples Index

All examples support multiple LLM backends (OpenAI, FIRST, Ollama) and include a mock mode for testing without API keys.

Local Agents

Example	Tech	Description	Key Pattern
AgentsCalculator	LangGraph	Minimal tool-calling agent	`@tool` decorator
AgentsRAG	LangGraph	Retrieval-augmented generation	Vector search
AgentsDatabase	LangGraph	Natural language data queries	Pandas integration
AgentsAPI	LangGraph	External API calls	PubChem REST API
AgentsConversation	LangGraph	Stateful conversation	Short/long-term memory
AgentsLangGraph	LangGraph	5-agent pipeline	StateGraph orchestration
AgentsAcademyBasic	Academy	Minimal Academy example	Two-agent messaging
AgentsRemoteTools	Academy	Remote tool invocation	Coordinator + ToolProvider
AgentsPersistent	Academy	Persistent workflows	Checkpoint and resume
AgentsFederated	Academy	Federated collaboration	Cross-institutional (DOE labs)
AgentsAcademy	Academy	5-agent pipeline	Agent-to-agent messaging
AgentsAcademyHubSpoke	Academy	Hub-and-spoke pattern	Central orchestrator
AgentsAcademyDashboard	Academy	Live progress dashboard	Rich TUI
AgentsHybrid	Both	Academy + LangGraph hybrid	Distributed LLM agents

Federated Agents

Example	Tech	Description	Key Pattern
AgentsHPCJob	LangGraph	HPC job submission	Batch scheduler lifecycle
CharacterizeChemicals	Academy	Molecular properties	LLM-planned RDKit + xTB

Governed Tool Use

Example	Tech	Description	Key Pattern
AgentsGovernedTools	LangGraph	Policy enforcement	Budget, rate limits, approval

Multi-Agent Coordination

Example	Tech	Description	Key Pattern
AgentsCoordination	LangGraph	Shared resources	Budget, blackboard, claims

Long-Lived Agents

Example	Tech	Description	Key Pattern
AgentsCheckpoint	LangGraph	Persistent workflows	Checkpoint/resume

Agent Workflows

Example	Tech	Description	Key Pattern
AgentsWorkflow	LangGraph	Dynamic DAG construction	Adaptive execution

Credits

Thanks to Kyle Chard, Yadu Babuji, Ian Foster, Suman Raj, and others for material and feedback.

Name		Name	Last commit message	Last commit date
parent directory ..
Assets		Assets
agent-workflows		agent-workflows
federated-agents		federated-agents
governed-tool-use		governed-tool-use
local-agents		local-agents
long-lived-agents		long-lived-agents
multi-agent-coordination		multi-agent-coordination
scale-inference		scale-inference
NOTES		NOTES
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Agentic Capabilities: A Tutorial Introduction

Getting Started: Tutorial Review of Agentic Patterns

Code Samples by Area

1. Local Agent Execution

2. Federated Agent Execution

3. Massively Parallel Agent Inference

4. Governed Tool Use at Scale

5. Multi-Agent Coordination

6. Long-Lived Autonomous Agents

7. Agent-Mediated Scientific Workflows

Maturity Levels

Capability Matrix

Examples Index

Local Agents

Federated Agents

Governed Tool Use

Multi-Agent Coordination

Long-Lived Agents

Agent Workflows

Credits

FilesExpand file tree

Capabilities

Directory actions

More options

Directory actions

More options

Latest commit

History

Capabilities

Folders and files

parent directory

README.md

Agentic Capabilities: A Tutorial Introduction

Getting Started: Tutorial Review of Agentic Patterns

Code Samples by Area

1. Local Agent Execution

2. Federated Agent Execution

3. Massively Parallel Agent Inference

4. Governed Tool Use at Scale

5. Multi-Agent Coordination

6. Long-Lived Autonomous Agents

7. Agent-Mediated Scientific Workflows

Maturity Levels

Capability Matrix

Examples Index

Local Agents

Federated Agents

Governed Tool Use

Multi-Agent Coordination

Long-Lived Agents

Agent Workflows

Credits