🦞🧪 TensorClaw is an ML research harness on top of pi, inspired by autoresearch.
pip install -e .Requires:
- Python 3.10+
pion yourPATH(brew install pi-coding-agent)OPENAI_API_KEYorANTHROPIC_API_KEY
tensorclaw [--dry-run]Run from the target repo root.
TensorClaw enforces a metric-gated loop:
- Chat with the agent.
- Agent proposes one experiment plan.
- You approve (
Y) or reject (N). - Harness runs the experiment and parses objective metrics.
- Harness keeps improvements, reverts regressions/crashes, and records the iteration.
- Chat pane: streamed user/agent conversation + pending proposal card
- Metrics pane: objective sparkline, live monitor, run context, and
pitoken usage - Iterations pane: baseline + keep/discard/crash history with best/latest markers
- Output pane: streamed setup/apply/experiment output
- Status bar: phase, metric snapshot, and total
pitokens
| Command | Description |
|---|---|
Enter (with text) |
Send a chat message |
Y |
Approve pending proposal and start run |
N |
Reject pending proposal |
/status |
Print current state summary |
/reset |
Clear local TensorClaw history in .tensorclaw/ |
/help |
Show command help |
TensorClaw stores run state under .tensorclaw/ in the target project:
.tensorclaw/
results.tsv # iteration ledger
journal.md # readable run journal
metrics.jsonl # live metric samples
memory.jsonl # retrieved memory for future prompts
logs/ # run/apply logs
instructions/ # saved prompts/instructions
session.jsonl # chat + pending plan state
We use the TinyStories dataset. See autoresearch for more info.
For a demo on Apple Silicon, run TensorClaw from your autoresearch-mlx project root:
tensorclawMIT
