Autoresearch: Execution engine optimization (large in-memory query) #4348

Draft
dondonz wants to merge 3 commits into master from claude/autoresearch-execution-large

@dondonz dondonz commented Mar 22, 2026

This is a bit of fun to see what Autoresearch can do.

Summary

  • Autoresearch setup for optimizing LargeInMemoryQueryPerformance.benchMarkSimpleQueriesThroughput
  • Targets the core execution engine: ExecutionStrategy, ResultNodesInfo, field resolution, result assembly
  • 10M scalar field benchmark — cleanly isolates execution overhead
  • Designed for overnight autonomous optimization runs (200 iterations)

Approach

  • Profile-guided optimization using async-profiler
  • Targeted local test subset (graphql.execution.*) for fast iteration
  • Lessons from ENF autoresearch baked into program.md
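The approach above amounts to a keep-or-revert loop: propose a change, run the fast test subset, re-run the benchmark, and keep the change only if throughput improves. The sketch below is a rough illustration of that control flow, not the harness in this PR. Every function name is hypothetical, and the agent, Gradle, and benchmark steps are stubbed with canned scores so the script runs on its own.

```shell
#!/usr/bin/env bash
# Sketch of a keep-or-revert optimization loop.
# All function names are hypothetical stand-ins; nothing here calls
# the real agent, Gradle, or the JMH benchmark.

# Stub: a real harness would invoke the coding agent here.
propose_change() { :; }

# Stub: a real harness might run something like
#   ./gradlew test --tests "graphql.execution.*"
run_test_subset() { return 0; }

# Stub: prints a throughput score (higher is better) for iteration $1.
# The scores are made-up numbers, not measurements.
SCORES=(98 105 105 110)
run_benchmark() { echo "${SCORES[$1]}"; }

# Stubs: a real harness would commit the change or discard it.
keep_change()   { KEPT=$((KEPT + 1)); }
revert_change() { REVERTED=$((REVERTED + 1)); }

BEST=100                      # hypothetical baseline score
KEPT=0
REVERTED=0
ITERATIONS=${#SCORES[@]}      # the PR describes runs of 200 iterations

for ((i = 0; i < ITERATIONS; i++)); do
  propose_change
  # Keep only changes that pass the tests AND strictly beat the best score.
  if run_test_subset && score=$(run_benchmark "$i") && (( score > BEST )); then
    BEST=$score
    keep_change
  else
    revert_change
  fi
done

echo "best=$BEST kept=$KEPT reverted=$REVERTED"
```

Requiring a strict improvement means a change that merely matches the current best is reverted, which keeps the diff small over many iterations. A real harness would likely also need a noise threshold, since benchmark scores vary between runs.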

Test plan

  • Run autoresearch loop locally
  • Verify improvements with full test suite on EC2
  • Review and cherry-pick winning optimizations

🤖 Generated with Claude Code

…ptimization

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
github-actions bot commented Mar 22, 2026

Test Report

Test Results

| Java Version | Total | Passed | Failed | Errors | Skipped |
|---|---|---|---|---|---|
| Java 11 | 5708 (±0) | 5652 (±0) | 0 (±0) | 0 (±0) | 56 (±0) |
| Java 17 | 5708 (±0) | 5651 (±0) | 0 (±0) | 0 (±0) | 57 (±0) |
| Java 21 | 5708 (±0) | 5651 (±0) | 0 (±0) | 0 (±0) | 57 (±0) |
| Java 25 | 5708 (±0) | 5651 (±0) | 0 (±0) | 0 (±0) | 57 (±0) |
| jcstress | 32 (±0) | 32 (±0) | 0 (±0) | 0 (±0) | 0 (±0) |
| Total | 22864 (±0) | 22637 (±0) | 0 (±0) | 0 (±0) | 227 (±0) |

Code Coverage (Java 25)

| Metric | Covered | Missed | Coverage | vs Master |
|---|---|---|---|---|
| Lines | 28774 | 3118 | 90.2% | ±0.0% |
| Branches | 8353 | 1505 | 84.7% | ±0.0% |
| Methods | 7698 | 1221 | 86.3% | ±0.0% |
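The Coverage column is consistent with covered / (covered + missed), rounded to one decimal place. A minimal check of that arithmetic, where the helper name `coverage_pct` is made up for illustration:

```shell
#!/usr/bin/env bash
# coverage % = 100 * covered / (covered + missed), one decimal place.
# coverage_pct is a hypothetical helper, not part of the report tooling.
coverage_pct() {
  awk -v c="$1" -v m="$2" 'BEGIN { printf "%.1f", 100 * c / (c + m) }'
}

LINE_PCT=$(coverage_pct 28774 3118)
BRANCH_PCT=$(coverage_pct 8353 1505)
METHOD_PCT=$(coverage_pct 7698 1221)

echo "lines=$LINE_PCT branches=$BRANCH_PCT methods=$METHOD_PCT"
```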

No per-class coverage changes detected.

Full HTML report: build artifact jacoco-html-report

Updated: 2026-03-22 06:53:39 UTC

echo "--- Asking Claude to make an optimization ---"
CLAUDE_OUTPUT=$(claude \
--model sonnet \
--dangerously-skip-permissions \
Member Author

Yolo again!

I gave it a try with Opus and it was fine too.

dondonz and others added 2 commits March 22, 2026 17:35
Use --permission-mode plan with explicit --allowedTools whitelist instead
of bypassing all permissions. The agent can read files, edit source code,
and run gradle (for profiling), but cannot run tests, git commit, or
perform other destructive operations. The outer harness handles those.
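As a rough illustration of the change this commit describes, the sketch below builds a restricted command line as an array and prints it instead of running it. The flags `--model`, `--permission-mode plan`, and `--allowedTools` come from this PR. The prompt text and the exact whitelist entries, including the `Bash(./gradlew jmh:*)` pattern, are assumptions, since the actual values are not shown here.

```shell
#!/usr/bin/env bash
# Sketch only: assembles the agent arguments and prints them.
# It does not invoke claude.

# Hypothetical prompt; the real one is not shown in this PR.
PROMPT="Make one optimization to the execution engine."

# Assumed whitelist: read files, edit source, run one Gradle task.
# No entry permits running tests or git commands.
ALLOWED_TOOLS=("Read" "Edit" "Bash(./gradlew jmh:*)")

CLAUDE_ARGS=(
  --model sonnet
  --permission-mode plan
  --allowedTools "${ALLOWED_TOOLS[@]}"
  -p "$PROMPT"
)

# One argument per line, so the quoting of each entry is visible.
printf '%s\n' claude "${CLAUDE_ARGS[@]}"
```

Keeping the arguments in an array avoids word-splitting problems with entries that contain spaces or parentheses, and makes it easy to confirm that `--dangerously-skip-permissions` is no longer passed.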

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>