IMPORTANT: To view this page as Markdown, append `.md` to the URL (e.g. /max/get-started.md). For the complete documentation index, see llms.txt.
Skip to main content
For the complete documentation index, see llms.txt. Markdown versions of all pages are available by appending .md to any URL (e.g. /max/get-started.md).

Mojo module

blockwise_fp8_output_writer

Output writer for blockwise FP8 SM100 matmul.

Handles Register β†’ SMEM β†’ GMEM (via TMA) flow. Unlike standard matmul which reads from TMEM, blockwise FP8 accumulators are already in registers.

Supports two write modes:

  • write(): TMA store for standard non-grouped matmul
  • write_absolute_with_bounds_check(): Element-by-element store for 1D2D grouped matmul with expert boundary bounds checking

Structs​