Mojo module
moe
Functions
-
calculate_warp_offset: -
group_limited_router_kernel: A manually fused MoE router with the group-limited strategy. It divides all the experts inton_groupsgroups and then finds the toptopk_groupgroups with the highest scores. The final experts for each token are selected from the experts in the selected groups. The bias will be applied to the scores during the selection process, but the final weights will not include the bias. -
moe_create_indices: -
moe_create_indices_bucket_group_kernel: Create indices for MoE routing using bucket sort algorithm. -
moe_create_indices_kernel: -
router_group_limited: A manually fused MoE router with the group-limited strategy.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!