Mojo function
register_epilogue
register_epilogue[
    MMA_M: Int,
    data_paths: Int,
    num_stages: Int,
    bits: Int,
    stage: Int,
    stageN: Int,
    compute_lambda_fn: def[dtype: DType, width: Int, *, alignment: Int = 1](IndexList[2], SIMD[dtype, width]) capturing -> SIMD[dtype, width],
    num_output_warps: Int,
    epilogue_dtype: DType,
    frag_size: Int,
    repeats: Int,
    transpose_c: Bool,
    cta_group: Int,
    is_lower_frag_required: Bool
](
    mut upper_frag_casted: InlineArray[Scalar[epilogue_dtype], frag_size],
    mut lower_frag_casted: InlineArray[Scalar[epilogue_dtype], frag_size],
    c_row: UInt32,
    c_col: UInt32,
    N: UInt32
)