-
Notifications
You must be signed in to change notification settings - Fork 502
Pull requests: fla-org/flash-linear-attention
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: add Quasar Attention and standalone model implementation
#805
opened Mar 31, 2026 by
troy12x
Loading…
[GDN] Tricked kernels: ungated KKT + fused inference via similarity transform
#797
opened Mar 28, 2026 by
hypnopump
Contributor
Loading…
5 tasks
[Layernorm] Fix autotuner crash and OOB writes in layer_norm_bwd on high-SM GPUs
#796
opened Mar 28, 2026 by
mpurland
Contributor
Loading…
5 tasks done
Add fused short convolution kernel with L2 norm
#661
opened Nov 24, 2025 by
sustcsonglin
Collaborator
Loading…
[kda] add recursive block intra implementation
#656
opened Nov 22, 2025 by
sustcsonglin
Collaborator
Loading…
[Deltaformer] kernel improvement; if-else optimization; change w to fp32; add 1e-9 to avoid nan
#603
opened Sep 30, 2025 by
foreverpiano
Loading…
Update README.md of ops delta_rule
#595
opened Sep 17, 2025 by
SeepingFragranceLock
Contributor
Loading…
ProTip!
Follow long discussions with comments:>50.