Mojo module
ps_metadata
comptime values
WORKINFO_DW
comptime WORKINFO_DW = 8
Structs
- PsMetadata
- QTile: Per ps.h:69-86; qo_start/qo_end are global TOKEN offsets.
Functions
- build_ps_metadata: Port of get_ps_metadata_v1_2_host (v1_2_host.cuh:265-314): the host wrapper that GCD-clusters heads across TGs, then calls the per-cluster kn_generate_ps_metadata and concatenates the results.
- build_uniform: Build metadata for uniform-seqlen self-attention, FP8 causal MLA-prefill.
- ceil_div
- kn_generate_ps_metadata: Faithful port of v1_2_host.cuh:61-241 (single-cluster: cluster_id=0, current_work_idx=0 -- the nkv=1 prefill case).
- pack_dword
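This page gives no description for ceil_div and does not show how QTile offsets are laid out, so the following is only a minimal sketch under stated assumptions, written in Python for illustration rather than Mojo. It assumes ceil_div is conventional ceiling integer division, that sequences of uniform length are packed back to back in one token stream, and that qo_end is exclusive. The names QTileSketch and uniform_q_tiles and the tile parameter are hypothetical and do not come from the module.

```python
from dataclasses import dataclass


def ceil_div(a: int, b: int) -> int:
    """Ceiling integer division, assuming a >= 0 and b > 0."""
    return (a + b - 1) // b


@dataclass
class QTileSketch:
    # Hypothetical stand-in for QTile; only the two documented fields appear.
    qo_start: int  # global token offset of the first query token in the tile
    qo_end: int    # global token offset one past the last token (assumed exclusive)


def uniform_q_tiles(batch: int, seqlen: int, tile: int) -> list[QTileSketch]:
    """Split `batch` packed sequences of length `seqlen` into query tiles."""
    tiles = []
    for b in range(batch):
        base = b * seqlen  # first token of sequence b in the packed stream
        for t in range(ceil_div(seqlen, tile)):
            start = base + t * tile
            end = min(start + tile, base + seqlen)  # clamp the last, partial tile
            tiles.append(QTileSketch(start, end))
    return tiles


if __name__ == "__main__":
    for q in uniform_q_tiles(batch=2, seqlen=5, tile=4):
        print(q.qo_start, q.qo_end)
```

The point of the sketch is the meaning of "global": the second sequence's tiles start at offset 5 rather than 0, because offsets index the packed token stream and not the individual sequence.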