Inspiration
Email + calendar is the “death by a thousand clicks” loop: triage, reply, schedule, follow up—repeat forever. We wanted a robot computer that doesn’t just chat about work, but finishes the work end-to-end with enterprise-grade safety and receipts. Amazon Nova’s stack made that possible: real-time voice, strong reasoning, and reliable UI automation.
What it does
Nova Clerk is a voice-driven auto-worker that turns natural language into verified outcomes:
- Triage inbox: classifies unread email into Reply / Schedule / Follow-up / FYI.
- Draft (or send) replies with the right tone, correct context, and clear next steps.
- Propose times and update calendar: checks conflicts, suggests slots, creates holds/invites when allowed.
- Robot-computer fallback: when APIs aren’t enough, Nova Act can operate web UIs to complete or verify steps.
- Receipts bundle for every run: a structured log of what happened, what changed, what was verified, and what still needs approval.
Modes: READ_ONLY, DRAFT, EXECUTE—so it can be safe by default and still go full auto when permitted.
How we built it
We built Nova Clerk as a modular agent system on AWS:
- Amazon Nova 2 Sonic for real-time voice interaction (hands-free “operator” experience).
- Amazon Nova 2 Lite as the orchestrator: planning, tool selection, policy gating, and structured outputs (tool/function calling).
- Amazon Nova Act for browser automation (“robot computer” moments) and visual verification when needed.
- FastAPI backend exposing a simple
/api/runworkflow endpoint plus run history / receipts retrieval. - Avant-garde operator console UI (web app) designed around a Command Bar, live Timeline, and a Receipts Viewer so users can trust what the agent did.
- Mock mode fixtures so the demo runs reliably without credentials, plus “real mode” connectors for actual inbox/calendar integrations.
Challenges we ran into
- Reliability vs. wow-factor: it’s easy to make agents look magical; harder to make them consistently correct. We prioritized deterministic flows and verification.
- Safety boundaries: sending email or creating invites can be high-impact. We implemented modes + approvals so the agent never over-commits.
- Tooling realism: UI automation is powerful but brittle; API integrations are robust but not always available. We designed a hybrid routing approach.
- Auditability: enterprise-grade automation needs “receipts,” not vibes. We had to standardize logs, artifacts, and verification checks into a clean bundle per run.
Accomplishments that we're proud of
- A full end-to-end auto-worker demo: voice/text command → triage → drafts → scheduling → receipts.
- A clean operator-grade UI that makes agent behavior legible: timeline, artifacts, verification, approvals.
- A pragmatic hybrid execution engine: API-first for reliability, Nova Act for browser automation and proof.
- A Receipts Bundle format that makes the system auditable and enterprise-ready.
What we learned
- The killer feature isn’t “automation”—it’s automation you can trust.
- Agents need explicit operating modes, guardrails, and verification loops to be viable in real organizations.
- UI matters: a strong console turns an opaque agent into a controllable system (and makes demos dramatically better).
- Nova Act shines as a robot-computer layer when you keep tasks atomic and add verification.
What's next for Nova Clerk
- Multi-user / org mode: team inboxes, delegated roles, approval chains, and workspace policies.
- Agent fleets: parallel workers handling multiple workflows (inbox, CRM, support queues) with a unified receipts ledger.
- Richer approvals: granular “two-person rule,” policy templates, and risk scoring per action.
- Streaming timeline + replay: step-by-step playback of decisions and UI actions for debugging and compliance.
- Broader connectors: Slack, Jira, Salesforce, Zendesk, and internal tools via MCP—while keeping Nova Act as universal fallback.
Built With
- amazonbedrock
- amazoncloudwatch
- amazonnova2lite
- amazonnova2sonic
- amazonnovaact
- amazons3
- awsiam
- awssecretsmanager
- docker
- fastapi
- googlecalendarapi
- googlegmailapi
- next.js
- oauth2
- openapi
- playwright
- python
- react
- tailwindcss
- typescript
Log in or sign up for Devpost to join the conversation.