Optimize ConnectionCostMatrix layout and Viterbi hot loop performance by mosuka · Pull Request #600 · lindera/lindera

mosuka · 2026-01-10T07:43:38Z

Optimize ConnectionCostMatrix layout and Viterbi hot loop performance

Transpose ConnectionCostMatrix memory layout from [forward_id][backward_id] to [backward_id][forward_id] to improve cache hit rate in the Viterbi search loop.
Introduce a version flag in Compiled ConnectionCostMatrix (matrix.mtx) to support the new layout while maintaining backward compatibility for older dictionary formats.
Add #[inline] attributes to hot methods in ConnectionCostMatrix, Mode, and Penalty.
Specialize Lattice::add_edge_in_lattice for Mode::Normal to eliminate penalty calculation overhead during standard tokenization.
Significant performance improvements observed in IPADIC benchmarks:
- bench-tokenize-ipadic: about 30 percent faster (15.4 us to 13.3 us)
- bench-tokenize-long-text-ipadic: about 16 percent faster
- bench-constructor-ipadic: about 35 percent faster

Optimize ConnectionCostMatrix layout and Viterbi hot loop performance

04647cf

mosuka force-pushed the refactoring branch from 9f37112 to 04647cf Compare January 10, 2026 14:02

mosuka changed the title ~~Optimize Lattice structure using flat buffer and linked list~~ Optimize ConnectionCostMatrix layout and Viterbi hot loop performance Jan 10, 2026

mosuka merged commit d36fd3a into main Jan 10, 2026
8 checks passed

mosuka deleted the refactoring branch January 10, 2026 14:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize ConnectionCostMatrix layout and Viterbi hot loop performance#600

Optimize ConnectionCostMatrix layout and Viterbi hot loop performance#600
mosuka merged 1 commit intomainfrom
refactoring

mosuka commented Jan 10, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

mosuka commented Jan 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mosuka commented Jan 10, 2026 •

edited

Loading