Mojo module
pingpong_kernel
Structs
-
AMDPingPongMatmul: 8-warp ping-pong matmul for AMD MI355X. -
KernelConfig: -
MmaOp: Encapsulates MMA register tiles and operations for matrix multiplication. -
TileBuffers: Double-buffered LDS tiles and TileLoaders for ping-pong matmul. -
TileLoaderLDS: Cooperative global→LDS tile loader with swizzle support.
Functions
-
load_lds_fragment: Load LDS → registers with MMA access pattern. -
make_mma_swizzle: Create swizzle pattern for MMA LDS access. -
ping_pong_matmul:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!