Mojo function
matmul_gpu_qint4
matmul_gpu_qint4[
    c_type: DType,
    a_type: DType,
    //,
    group_size: Int,
    target: StringSlice[StaticConstantOrigin],
    elementwise_lambda_fn: Optional[elementwise_epilogue_type] = None
](
    c_tt: TileTensor[c_type, c_tt.LayoutType, c_tt.origin, linear_idx_type=c_tt.linear_idx_type, element_size=c_tt.element_size],
    a_tt: TileTensor[a_type, a_tt.LayoutType, a_tt.origin, linear_idx_type=a_tt.linear_idx_type, element_size=a_tt.element_size],
    b_tt: TileTensor[DType.uint8, b_tt.LayoutType, b_tt.origin, linear_idx_type=b_tt.linear_idx_type, element_size=b_tt.element_size],
    ctx: DeviceContextPtr = DeviceContextPtr()
)