Skip to main content

Mojo struct

OutputRegisterBufferRDNA

struct OutputRegisterBufferRDNA[dtype: DType, num_m_mmas: Int, num_n_mmas: Int]

Output accumulator register buffer. Layout is (num_n_mmas * num_m_mmas, RDNA_CD_FRAG_SIZE) row_major β€” one row per MMA tile, one column per per-lane C/D register.

Fields​

  • ​reg_tile (OutputRegisterBufferRDNA[dtype, num_m_mmas, num_n_mmas].RegisterTileType):

Implemented traits​

AnyType, ImplicitlyDestructible

comptime members​

output_frag_size​

comptime output_frag_size = RDNA_CD_FRAG_SIZE

reg_dtype​

comptime reg_dtype = dtype

reg_tile_layout​

comptime reg_tile_layout = row_major[(num_n_mmas * num_m_mmas), 8]()

RegisterTileType​

comptime RegisterTileType = TileTensor[dtype, Layout[*?, *?], MutAnyOrigin, address_space=AddressSpace.LOCAL]

Methods​

__init__​

__init__(out self)

get_dtype​

static get_dtype() -> DType

Returns:

DType

zero​

zero(self)

get_reg_tile​

get_reg_tile[stage: Int = 0](self) -> OutputRegisterBufferRDNA[dtype, num_m_mmas, num_n_mmas].RegisterTileType

Returns:

OutputRegisterBufferRDNA[dtype, num_m_mmas, num_n_mmas].RegisterTileType