All major platform APIs have now released a similar extensions for cooperative matrix: * Metal introduced simdgroup_matrix in MSL 3.1 * HLSL has support in [SM6.8](https://microsoft.github.io/DirectX-Specs/d3d/HLSL_SM_6_8_WaveMatrix.html) (currently experimental release) * SPIR-V/Vulkan released [SPV_KHR_cooperative_matrix](https://github.com/KhronosGroup/SPIRV-Registry/pull/207) This feature is very useful for accelerating ML operations by performing matrix calculations across multiple threads.