Mojo struct
OutputRegisterBufferRDNA
struct OutputRegisterBufferRDNA[dtype: DType, num_m_mmas: Int, num_n_mmas: Int]
Output accumulator register buffer. Layout is (num_n_mmas * num_m_mmas, RDNA_CD_FRAG_SIZE) row_major β one row per MMA tile, one column per per-lane C/D register.
Fieldsβ
- βreg_tile (
OutputRegisterBufferRDNA[dtype, num_m_mmas, num_n_mmas].RegisterTileType):
Implemented traitsβ
AnyType,
ImplicitlyDestructible
comptime membersβ
output_frag_sizeβ
comptime output_frag_size = RDNA_CD_FRAG_SIZE
reg_dtypeβ
comptime reg_dtype = dtype
reg_tile_layoutβ
comptime reg_tile_layout = row_major[(num_n_mmas * num_m_mmas), 8]()
RegisterTileTypeβ
comptime RegisterTileType = TileTensor[dtype, Layout[*?, *?], MutAnyOrigin, address_space=AddressSpace.LOCAL]
Methodsβ
__init__β
__init__(out self)
get_dtypeβ
zeroβ
zero(self)
get_reg_tileβ
get_reg_tile[stage: Int = 0](self) -> OutputRegisterBufferRDNA[dtype, num_m_mmas, num_n_mmas].RegisterTileType
Returns:
OutputRegisterBufferRDNA[dtype, num_m_mmas, num_n_mmas].RegisterTileType
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!