Mojo module
blockwise_fp8_output_writer
Output writer for blockwise FP8 SM100 matmul.
Handles the Register → SMEM → GMEM (via TMA) flow. Unlike standard matmul, which reads accumulators from TMEM, the blockwise FP8 accumulators are already in registers.
Structs
- BlockwiseFP8TileWriter: Write register accumulators to GMEM via SMEM and TMA.
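
The Register → SMEM → GMEM flow described above can be pictured with a small host-side model. This is only a conceptual sketch in plain Python, not the Mojo API: the tile shape, the one-column-per-thread layout, and every function name here are hypothetical, and plain lists stand in for registers, shared memory, and global memory.

```python
# Conceptual model only. Nothing here is taken from the Mojo module;
# all names, shapes, and the thread-to-column layout are assumptions.

TILE_M, TILE_N = 4, 8   # hypothetical output tile shape
NUM_THREADS = TILE_N    # hypothetical: one "thread" owns one tile column


def make_register_fragments(tile):
    """Split a tile so each thread holds one column in its 'registers'."""
    return [[tile[r][t] for r in range(TILE_M)] for t in range(NUM_THREADS)]


def registers_to_smem(fragments):
    """Step 1: each thread stores its register fragment into a shared staging tile."""
    smem = [[0.0] * TILE_N for _ in range(TILE_M)]
    for t, frag in enumerate(fragments):
        for r, value in enumerate(frag):
            smem[r][t] = value
    return smem


def smem_to_gmem(smem, gmem, row0, col0):
    """Step 2: copy the whole staged tile into the output matrix in one bulk
    operation per tile (the role TMA plays on the GPU)."""
    for r in range(TILE_M):
        gmem[row0 + r][col0:col0 + TILE_N] = smem[r]


def write_tile(fragments, gmem, row0, col0):
    """Register -> SMEM -> GMEM for a single output tile."""
    smem = registers_to_smem(fragments)
    smem_to_gmem(smem, gmem, row0, col0)


# Example: write one 4x8 tile into the lower-right corner of an 8x16 output.
tile = [[float(r * TILE_N + c) for c in range(TILE_N)] for r in range(TILE_M)]
gmem = [[0.0] * 16 for _ in range(8)]
write_tile(make_register_fragments(tile), gmem, row0=4, col0=8)
```

The staging step is what turns many per-thread fragments into one contiguous tile, so the final store can be issued as a single tile-sized transfer instead of scattered per-thread writes.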