Mojo module
block_scaled_smem
Shared memory layout for block-scaled SM100 matmul.
Provides A/B/C tile storage plus scale-factor tile storage for A and B (SFA, SFB) following MXFP8 layout conventions. Also includes pipeline barriers and TMEM state.
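To give a feel for how the scale-factor tiles compare in size to the data tiles, here is a rough byte-budget sketch in plain Python. It is not this module's API: `TileShape`, `smem_bytes`, the tile dimensions, the stage count, and the 2-byte C element are all hypothetical. It assumes the OCP MX convention of one 1-byte E8M0 scale per 32 FP8 elements along K, and it ignores any padding, swizzling, barrier storage, or TMEM state that the real layout may add.

```python
from dataclasses import dataclass

MX_BLOCK = 32    # elements covered by one scale factor (OCP MX convention)
FP8_BYTES = 1    # one FP8 element
E8M0_BYTES = 1   # one E8M0 scale factor


@dataclass(frozen=True)
class TileShape:
    """Hypothetical tile configuration; not taken from the module."""
    bm: int      # rows of the A tile / C tile
    bn: int      # rows of the B tile / columns of the C tile
    bk: int      # reduction-dimension extent of the A and B tiles
    stages: int  # pipeline stages for A, B, SFA, SFB (assumed)


def smem_bytes(shape: TileShape, c_elem_bytes: int = 2) -> dict:
    """Estimate unpadded tile bytes; C is assumed to be single-buffered."""
    if shape.bk % MX_BLOCK != 0:
        raise ValueError("bk must be a multiple of the MX block size")
    scales_per_row = shape.bk // MX_BLOCK
    sizes = {
        "a": shape.bm * shape.bk * FP8_BYTES * shape.stages,
        "b": shape.bn * shape.bk * FP8_BYTES * shape.stages,
        "sfa": shape.bm * scales_per_row * E8M0_BYTES * shape.stages,
        "sfb": shape.bn * scales_per_row * E8M0_BYTES * shape.stages,
        "c": shape.bm * shape.bn * c_elem_bytes,
    }
    sizes["total"] = sum(sizes.values())
    return sizes


budget = smem_bytes(TileShape(bm=128, bn=128, bk=128, stages=4))
print(budget)
```

Under these assumptions the scale-factor tiles are 1/32 the size of their data tiles, so they add little to the footprint, but they still need their own storage and layout.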
The tile storage, derived constants, layouts, and accessors are factored into BlockScaledTileCore and shared with GroupedBlockScaledSmem and Grouped1D1DSmem. Each SMEM struct is a thin wrapper that adds the appropriate pipeline bundle.
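The core-plus-wrapper factoring described above can be illustrated with a small composition sketch in plain Python. Every name here (`TileCore`, `ClcPipeline`, `GroupedPipeline`, `SmemWithClc`, `SmemGrouped`) is a hypothetical stand-in, not the module's actual structs or fields. It only shows the general pattern of one shared storage type reused by several thin wrappers that differ in their pipeline bundle.

```python
from dataclasses import dataclass, field


@dataclass
class TileCore:
    """Hypothetical shared tile storage with a staged accessor."""
    num_stages: int
    tile_bytes: int
    storage: bytearray = field(init=False)

    def __post_init__(self):
        # One contiguous buffer holding every pipeline stage.
        self.storage = bytearray(self.num_stages * self.tile_bytes)

    def stage(self, i: int) -> memoryview:
        # Stages wrap around, as in a circular pipeline buffer.
        start = (i % self.num_stages) * self.tile_bytes
        return memoryview(self.storage)[start:start + self.tile_bytes]


@dataclass
class ClcPipeline:
    """Hypothetical pipeline bundle for one scheduler variant."""
    num_barriers: int


@dataclass
class GroupedPipeline:
    """Hypothetical pipeline bundle for a grouped variant."""
    num_barriers: int


@dataclass
class SmemWithClc:
    """Thin wrapper: shared core plus its own pipeline bundle."""
    core: TileCore
    pipeline: ClcPipeline

    def stage(self, i: int) -> memoryview:
        return self.core.stage(i)  # accessors delegate to the core


@dataclass
class SmemGrouped:
    """Second thin wrapper reusing the same core type."""
    core: TileCore
    pipeline: GroupedPipeline

    def stage(self, i: int) -> memoryview:
        return self.core.stage(i)


smem = SmemWithClc(TileCore(num_stages=2, tile_bytes=16), ClcPipeline(num_barriers=4))
smem.stage(0)[0] = 7
print(smem.stage(2)[0])  # stage 2 wraps around to stage 0
```

Keeping the layout logic in one place means a change to tile sizes or accessors applies to every wrapper, while each wrapper stays free to choose its own barriers.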
Structs
- BlockScaledSmem: SMEM struct for block-scaled matmul with CLC scheduler pipeline.
- BlockScaledTileCore: Core tile storage for block-scaled matmul SMEM structs.
Functions