Mojo function
warpgroup_reg_dealloc
warpgroup_reg_dealloc[count: Int]()
Deallocates additional registers for the executing warp group.
Hints to the system to decrease per-thread registers owned by the executing warp. Releases extra registers to reduce the absolute per-thread maximum register count from its current value to the specified count.
Note:
- Only supported on NVIDIA SM90+ GPUs.
- Performance optimization hint that may be ignored by the hardware.
- Pair with warpgroup_reg_alloc() when extra registers are needed.
Parameters:
- count (Int): The desired number of registers per thread. Must be:
  - A multiple of 8.
  - Between 24 and 256 (inclusive).
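
As a rough illustration of how this call might be paired with warpgroup_reg_alloc(), the sketch below shows a warp-specialized kernel in which one warp group gives up registers and the others request more. Several things here are assumptions rather than documented behavior: the import paths (gpu.intrinsics and gpu), the kernel name, the 128-thread warp group size used to derive the group index, and the counts 40 and 232, which are illustrative values chosen only because they satisfy the constraints above. The kernel body is a placeholder and the launch code is omitted.

```mojo
# Assumed import locations; adjust to match your Mojo version.
from gpu import thread_idx
from gpu.intrinsics import warpgroup_reg_alloc, warpgroup_reg_dealloc


fn specialized_kernel():
    # Hypothetical kernel: assumes 128 threads (4 warps) per warp group.
    var warp_group_idx = thread_idx.x // 128

    if warp_group_idx == 0:
        # Producer-style warp group: it mostly issues memory operations,
        # so it hints that its per-thread register maximum can drop to 40.
        # 40 is a multiple of 8 and lies within [24, 256].
        warpgroup_reg_dealloc[40]()
        # ... data-movement work would go here ...
    else:
        # Consumer-style warp groups: request a larger per-thread maximum.
        # 232 is also a multiple of 8 and lies within [24, 256].
        warpgroup_reg_alloc[232]()
        # ... compute-heavy work would go here ...
```

Because count is a compile-time parameter, each branch fixes its register target when the kernel is compiled. Since the call is only a hint, the kernel should remain correct even if the hardware ignores it.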
 
 