Skip to main content

Mojo module

tile_writer

TileWriter components for SM100 matrix multiplication epilogue.

This module provides modular components for the output pipeline:

  1. store_fragment_to_smem: Register to shared memory via st.matrix instructions
  2. TMEMToSMemWriter: Write TMEM accumulators to shared memory
  3. TMAStoreExecutor: Execute TMA stores with proper SMEM tiling
  4. EpilogueApplier: Apply element-wise operations on fragments

The SM100 epilogue pipeline flows as: TMEM (accumulators) → Registers → SMEM → GMEM (via TMA)

comptime values

RLayout32Bits

comptime RLayout32Bits[layout: Layout] = RuntimeLayout[layout, element_type=DType.uint32, linear_idx_type=DType.uint32]

Parameters

Structs

Functions

Was this page helpful?