IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo module

tile_loader

TMA tile loader for SM100 matrix multiplication.

Provides a wrapper around TMA async_multicast_load operations, following the SM90 TileLoaderTMA pattern. Orchestration logic (k-group iteration, expect_bytes, barrier management) is handled by the kernel, not the loader.

Usage: # In kernel - create separate A and B loaders var a_loader = ATileLoaderType(Pointer(to=a_tma_op), ctx.a_multicast_mask) var b_loader = BTileLoaderType(Pointer(to=b_tma_op), ctx.b_multicast_mask)

# Load tiles using TileTensor
a_loader.load(a_tile, barrier, k_coord, m_coord)
b_loader.load(b_tile, barrier, k_coord, n_coord)

# TileTensor tiles are passed directly to TMA ops

Structs​