Skip to main content

Mojo module

blockwise_fp8_output_writer

Output writer for blockwise FP8 SM100 matmul.

Handles Register β†’ SMEM β†’ GMEM (via TMA) flow. Unlike standard matmul which reads from TMEM, blockwise FP8 accumulators are already in registers.

Supports two write modes:

  • write(): TMA store for standard non-grouped matmul
  • write_absolute_with_bounds_check(): Element-by-element store for 1D2D grouped matmul with expert boundary bounds checking

Structs​

Was this page helpful?