Mojo module
matmul
SM100 Matmul CPU entry points - TMA setup and kernel launch wrappers.
This module contains the CPU-side code for SM100 matrix multiplication:
- TMA (Tensor Memory Accelerator) descriptor creation
- Kernel instantiation and launch via ctx.enqueue_function
All GPU code (kernel structs, runtime functions) is in matmul_kernels.mojo.
comptime values
UnsafePointer
comptime UnsafePointer = LegacyUnsafePointer[?, address_space=?, origin=?]
Functions