Tags: SharpAI/mlx
Tags
[CUDA] Use qmv kernel for fp quantizations (ml-explore#3239)
suppress gcc 10.1 warnings (ml-explore#2679) * suppress gcc 10.1 warnings * suppress gcc 10.1 warnings
PreviousNext
[CUDA] Use qmv kernel for fp quantizations (ml-explore#3239)
suppress gcc 10.1 warnings (ml-explore#2679) * suppress gcc 10.1 warnings * suppress gcc 10.1 warnings