Mojo package
linalg
Provides CPU and GPU implementations of linear algebra functions.
Packagesβ
- β
arch: Provides architecture specific utility functions. - β
matmul: Provides the backend implementation for matmuls.
Modulesβ
- β
accumulate: - β
bmm: - β
fp4_quantization: - β
fp4_utils: - β
fp8_quantization: - β
fp8_utils: Re-exports FP8 quantization utilities from comm.fp8_utils. - β
gemv: - β
grouped_matmul: - β
grouped_matmul_block_scaled_dispatch: General dispatch for grouped block-scaled matmul. - β
grouped_matmul_sm100: - β
grouped_matmul_sm100_1d1d: - β
grouped_matmul_sm100_blockwise_fp8: - β
grouped_matmul_tile_scheduler: - β
lora: - β
matrix_band_part: The module implements matrix band part functions. - β
mxfp4_dequant: MXFP4 dequantization kernel for H100 (SM90). - β
mxfp4_matmul_sm90: MXFP4 matmul on H100 (SM90) via dequant-to-FP8 + FP8 GEMM. - β
packing: - β
qr_factorization: - β
structuring: - β
transpose: The module implements Transpose functions. - β
utils: - β
utils_gpu:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!