Skip to content

fix(docs): correct cross-model coefficient count from 4 to 7#479

Merged
sriumcp merged 1 commit intoinference-sim:mainfrom
sriumcp:fix-coeff-count
Mar 3, 2026
Merged

fix(docs): correct cross-model coefficient count from 4 to 7#479
sriumcp merged 1 commit intoinference-sim:mainfrom
sriumcp:fix-coeff-count

Conversation

@sriumcp
Copy link
Copy Markdown
Collaborator

@sriumcp sriumcp commented Mar 3, 2026

Summary

The cross-model backend documentation incorrectly states "4 globally-fitted coefficients" when it actually uses 7: 4 beta (step time) + 3 alpha (CPU overhead). All 7 are model-independent and stored in crossmodel_defaults in defaults.yaml.

Changes

  • docs/guide/latency-models.md: "4 globally-fitted physics coefficients" → "7 globally-fitted coefficients (4 beta for step time + 3 alpha for CPU overhead)"
  • docs/reference/configuration.md: same fix in mode selection description
  • CLAUDE.md: expanded to list all 7 coefficient roles

Context

The "4 coefficients" framing propagated from the issue design (#472) which focused on the StepTime formula (4 beta terms) and treated the alpha coefficients as a footnote. But from the user's perspective, all 7 come from crossmodel_defaults — they're all part of the cross-model prediction.

🤖 Generated with Claude Code

…+ 3 alpha)

The cross-model backend uses 7 globally-fitted coefficients, not 4:
- 4 beta coefficients for step time (per-layer, KV bandwidth, MoE dispatch, TP sync)
- 3 alpha coefficients for CPU overhead (pre-scheduling, per-token, output processing)

All 7 are model-independent and stored in crossmodel_defaults. The "4 coefficients"
framing incorrectly excluded the alpha parameters which are equally global.

Co-Authored-By: Claude Opus 4.6 (1M context) <[email protected]>
@sriumcp sriumcp merged commit eea0821 into inference-sim:main Mar 3, 2026
4 checks passed
@sriumcp sriumcp deleted the fix-coeff-count branch March 6, 2026 21:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant