Skip to main content

Python class

TransformerBlock

TransformerBlock

class max.nn.TransformerBlock(attention, mlp, attention_norm, mlp_norm, residual_multiplier=1.0)

source

Bases: Module

Stack of Attention, FeedForward, and RMSNorm layers.

Parameters: