Why Actionbook?
Without Actionbook
Building reliable browser agents is difficult and expensive:- Slow Execution: Agents waste time parsing full HTML pages to find elements
- High Token Costs: Sending entire DOM trees to LLMs consumes massive context windows
- Brittle Selectors: Updates to website UIs break hardcoded selectors and agent logic immediately
- Hallucinations: LLMs often guess incorrect actions when faced with complex, unstructured DOMs
With Actionbook
Actionbook places up-to-date action manuals with the relevant DOM selectors directly into your LLM’s context:- 10x Faster: Agents access pre-computed “Action manuals” to know exactly what to do without exploring
- 100x Token Savings: Instead of whole HTML page, agents receive only related DOM elements in concise, semantic JSON definitions
- Resilient Automation: Action manuals are maintained and versioned. If a site changes, the manual is updated, not your agent
- Universal Compatibility: Works with any LLM (OpenAI, Anthropic, Gemini) and any AI operator framework
How It Works
Search for Actions
Find available actions for any website using the CLI (
actionbook search "airbnb login").Get Action Details
Retrieve the full action manual with precise selectors and step-by-step
instructions (
actionbook get "google.com:/:default").Integration Options
CLI
Default mode — launches browser automatically
Browser Automation
Isolated or extension — two modes for AI agents
MCP Server
Integrate with Cursor, Claude Code, VS Code, etc.
Quick Start
Quickstart Guide
Get started in under 2 minutes
API Reference
CLI commands and API documentation
Community
GitHub
Star us and contribute
Discord
Get help and share your agents
X / Twitter
Product updates and announcements