Mojo module

dispatch

RDNA dispatch for 2-D convolution.

Provides two paths for 2-D convolution on RDNA 3+:

  1. Implicit GEMM (preferred): Fuses im2col into the A-tile loader of the WMMA matmul kernel, so the large intermediate im2col buffer is never materialized. Requires C_in % BLOCK_K == 0 so that the im2col loads can be vectorized.

  2. Explicit im2col + matmul (fallback): Materializes the im2col buffer, then calls the standard RDNA matmul. Used when the C_in alignment requirement is not met.
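
The choice between the two paths can be sketched in a few lines. This is an illustrative Python model, not the module's Mojo code: the function names, the string return values, and the `BLOCK_K = 16` default are assumptions made for the example. The only rule taken from the description above is the `C_in % BLOCK_K == 0` check. The second helper uses the standard im2col shape (M = N * H_out * W_out rows, K = R * S * C_in columns) to show how large the buffer avoided by the implicit path would be.

```python
# Hypothetical sketch of the dispatch rule; names and BLOCK_K value are
# assumptions for illustration, not the module's actual API.
BLOCK_K = 16  # assumed K-tile width; the real value is set by the kernel


def choose_conv2d_path(c_in: int, block_k: int = BLOCK_K) -> str:
    """Pick the path a dispatcher of this kind would select."""
    if c_in % block_k == 0:
        # Aligned input channels: im2col can be fused into the A-tile loader.
        return "implicit_gemm"
    # Unaligned input channels: materialize im2col, then run a plain matmul.
    return "im2col_matmul"


def im2col_buffer_elems(n: int, h_out: int, w_out: int,
                        r: int, s: int, c_in: int) -> int:
    """Element count of an explicit im2col buffer of shape M x K,
    where M = n * h_out * w_out and K = r * s * c_in."""
    return (n * h_out * w_out) * (r * s * c_in)
```

For instance, under these assumptions a layer with 64 input channels would take the implicit GEMM path, while a 3-channel first layer would fall back to explicit im2col. The buffer-size helper makes the motivation concrete: the explicit buffer grows with the product of output positions and filter taps, which is why the fused path is preferred whenever the alignment check passes.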

Functions
