Tags · ml-rust/boostr

v0.1.0

feat(quant/cpu): add NEON fused dequant+dot kernels for aarch64

Implement fused dequantization and dot product kernels for Q4_K,
Q5_K, and Q6_K formats using ARM NEON intrinsics. Adds a shared
horizontal sum helper (dot_f32.rs) used across all three kernels.

These kernels are the aarch64 counterpart to the existing x86 AVX2
implementations and are wired into the dispatch logic added in the
previous commit.

Mar 15, 2026
c8ffd21
zip
tar.gz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v0.1.0

Tags: ml-rust/boostr

v0.1.0