Mojo module
matmul_kernels
Structs
- HopperMatmulSM90Kernel: Hopper SM90 Matrix Multiplication kernel optimized for NVIDIA H100 GPUs.
- HopperMatmulSM90Kernel_SMem: Shared memory layout for Hopper SM90 matrix multiplication kernel.
Functions
- find_K_alignment_upto_16B: Find alignment among 1B, 2B, 4B, 16B based on the row's bytes.
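
The alignment choice described for find_K_alignment_upto_16B can be illustrated with a minimal Python sketch. This is not the module's implementation or signature: the helper name `find_alignment_sketch`, its single `row_bytes` argument, and the assumption that a row's size is K multiplied by the element size are all hypothetical. It only shows one plausible reading of the summary, picking the largest of 16, 4, 2, or 1 bytes that evenly divides the row size.

```python
def find_alignment_sketch(row_bytes: int) -> int:
    """Hypothetical helper: largest of 16, 4, 2, 1 that divides row_bytes."""
    # Candidates mirror the sizes listed in the summary (1B, 2B, 4B, 16B);
    # 8B is deliberately absent because the summary does not list it.
    for candidate in (16, 4, 2):
        if row_bytes % candidate == 0:
            return candidate
    # Every byte count is divisible by 1, so 1B is the fallback.
    return 1


# Assumed example: a row of K = 100 elements, 2 bytes each (200 bytes).
k, element_size = 100, 2
print(find_alignment_sketch(k * element_size))  # prints 4
```

Under this reading, a 200-byte row is not a multiple of 16 but is a multiple of 4, so the sketch reports 4-byte alignment.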