Mojo module
matmul_kernels
Structs
- HopperMatmulSM90Kernel: Hopper SM90 Matrix Multiplication kernel optimized for NVIDIA H100 GPUs.
- HopperMatmulSM90Kernel_SMem: Shared memory layout for Hopper SM90 matrix multiplication kernel.
Functions
- find_K_alignment_upto_16B: Find alignment among 1B, 2B, 4B, 16B based on the row's bytes.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!
