Mojo module
device_query
Provides device query utilities for communication primitives.
Structsβ
- β
CommTuningConfig: Parameters: ngpus: Number of GPUs for running allreduce. num_bytes: Total number of input bytes supported by the config. sm_version: SM version (as string). num_blocks: Number of thread blocks for running allreduce.
Functionsβ
- β
dispatch_max_num_blocks: This function searches for tuning configs with matching sm_version and ngpus. If such configs are found, then the search continues for finding the config x where num_bytes <= x.num_bytes. - β
get_sm_version:
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!