Python class
Conv2d
Conv2d
class max.nn.Conv2d(kernel_size, in_channels, out_channels, dtype, stride=1, padding=0, dilation=1, num_groups=1, device=None, has_bias=False, permute=False, name=None)
A 2D convolution over an input signal composed of several input planes.
When called, Conv2d accepts a TensorValue of shape
(batch, height, width, in_channels) and returns a
TensorValue of shape (batch, new_height, new_width, out_channels). If permute=True, the input and output follow PyTorch
channel-first layout: (batch, in_channels, height, width) and (batch, out_channels, new_height, new_width).
conv = nn.Conv2d(
kernel_size=3,
in_channels=64,
out_channels=128,
dtype=DType.float32,
stride=1,
padding=0,
has_bias=False,
name="conv2d_weight",
device=DeviceRef.GPU(),
)Initializes the Conv2d layer with weights and optional bias.
-
Parameters:
-
- kernel_size (int | tuple[int, int]) – Size of the convolving kernel. Can be a single int (square kernel) or tuple (height, width).
- in_channels (int) – Number of channels in the input image.
- out_channels (int) – Number of channels produced by the convolution.
- dtype (DType) – The data type for both weights and bias.
- stride (tuple[int, int]) – Stride of the convolution for height and width dimensions. Can be int (applied to both dimensions) or tuple (stride_h, stride_w). Default: 1
- padding (tuple[int, int, int, int]) – Padding added to input. Can be int (applied to all sides), tuple of 2 ints (pad_h, pad_w), or tuple of 4 ints (pad_top, pad_bottom, pad_left, pad_right) to support asymmetric padding. Default: 0
- dilation (tuple[int, int]) – Spacing between kernel elements for height and width dimensions. Can be int (applied to both dimensions) or tuple (dilation_h, dilation_w). Default: 1
- num_groups (int) – Number of blocked connections from input channels to output channels. Input channels and output channels are divided into groups. Default: 1
- device (DeviceRef | None) – The target device for computation. If None, defaults to CPU. Weights are initially stored on CPU and moved to target device during computation.
- name (str | None) – Base name for weights. If provided, weights are named
{name}.weightand{name}.bias(if bias is enabled). If None, uses “weight” and “bias”. - has_bias (bool) – If true, adds a learnable bias vector to the layer.
Defaults to
False. - permute (bool) – If true, permutes weights from PyTorch format to MAX format.
PyTorch order: (out_channels, in_channels / num_groups, height, width).
MAX API order: (height, width, in_channels / num_groups, out_channels).
Defaults to
False.
bias
The optional bias vector stored on CPU with shape (out_channels,).
Model init moves the bias to device if present.
device
The device where matrix operations are performed.
dilation
Controls the dilation rate.
filter
filter: Weight
The weight matrix stored on CPU with shape (height, width, in_channels / num_groups, out_channels).
Model init moves the weight to device.
num_groups
num_groups: int
Number of blocked connections from input channels to output channels.
padding
Controls the amount of padding applied before and after the input for height and width dimensions.
Format: (pad_top, pad_bottom, pad_left, pad_right).
permute
permute: bool = False
bool controls whether self.filter is permuted from PyTorch order to max order. PyTorch order is: (out_channels, in_channels / num_groups, height, width) Max API order: (height, width, in_channels / num_groups, out_channels).
shard()
shard(devices)
Creates sharded views of this Conv2d layer across multiple devices.
sharding_strategy
property sharding_strategy: ShardingStrategy | None
Get the Conv2d sharding strategy.
stride
Controls the stride for the cross-correlation.
Was this page helpful?
Thank you! We'll create more content like this.
Thank you for helping us improve!