CNN Output Size Calculator
Trace CNN feature map dimensions through conv and pooling layers
One of the most common friction points when building convolutional neural networks is keeping track of feature map dimensions as data flows through conv, pooling, and transposed conv layers. A single wrong padding or stride setting causes shape mismatches that crash training. This calculator traces the spatial dimensions through each layer so you can verify your architecture before writing a single line of model code.
How to Use This Calculator
- Input dimensions — enter your input height and width (e.g. 224 × 224 for ImageNet).
- Add layers — click "Add layer" and choose Conv2d, MaxPool2d, or Transposed Conv2d. Set kernel size, stride, padding, and dilation for each.
- Review the output — the table shows input and output dimensions for each layer, plus a cumulative receptive field estimate.
The Output Size Formula
For Conv2d and MaxPool2d (applied independently to height and width):
output = floor((input + 2 × padding - dilation × (kernel - 1) - 1) / stride + 1)
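This formula is straightforward to express as a small helper. A minimal sketch (the function name `conv_out` is our own; the formula matches PyTorch's Conv2d/MaxPool2d shape equation):

```python
import math

def conv_out(size, kernel, stride=1, padding=0, dilation=1):
    """Output size for Conv2d or MaxPool2d along one spatial dimension."""
    return math.floor((size + 2 * padding - dilation * (kernel - 1) - 1) / stride + 1)

# A 3x3 conv with stride=1, padding=1 preserves a 224x224 input
print(conv_out(224, kernel=3, stride=1, padding=1))  # 224
```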
For ConvTranspose2d (upsampling):
output = (input - 1) × stride - 2 × padding + dilation × (kernel - 1) + 1
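The transposed-conv formula can be sketched the same way (the name `deconv_out` is our own; the optional `output_padding` term is PyTorch's ConvTranspose2d parameter, which defaults to 0 and is simply added to the result):

```python
def deconv_out(size, kernel, stride=1, padding=0, dilation=1, output_padding=0):
    """Output size for ConvTranspose2d along one spatial dimension."""
    return (size - 1) * stride - 2 * padding + dilation * (kernel - 1) + output_padding + 1

# A 2x2 transposed conv with stride=2 doubles spatial size, as in a U-Net decoder
print(deconv_out(56, kernel=2, stride=2))  # 112
```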
Common Architectures for Reference
VGG-16 first block: Input 224×224. Two 3×3 conv layers (padding=1, stride=1) keep the size at 224×224. A 2×2 max pool (stride=2) reduces it to 112×112.
ResNet stem: 7×7 conv with stride=2 and padding=3 reduces 224×224 to 112×112. A 3×3 max pool with stride=2 and padding=1 reduces it further to 56×56.
U-Net decoder: Transposed convolutions with stride=2 double spatial dimensions at each scale level.
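The reference numbers above can be checked by chaining the output-size formula, one layer at a time. A sketch, assuming the standard torchvision padding values (1 for the VGG 3×3 convs, 3 for the ResNet 7×7 conv, 1 for its 3×3 pool):

```python
import math

def out(size, kernel, stride, padding):
    """Conv2d/MaxPool2d output size along one spatial dimension (dilation=1)."""
    return math.floor((size + 2 * padding - (kernel - 1) - 1) / stride + 1)

# VGG-16 first block: two 3x3 convs keep 224, then 2x2 pool halves it
s = out(224, kernel=3, stride=1, padding=1)  # 224
s = out(s, kernel=3, stride=1, padding=1)    # 224
s = out(s, kernel=2, stride=2, padding=0)    # 112
print(s)  # 112

# ResNet stem: 7x7 conv stride 2, then 3x3 max pool stride 2
s = out(224, kernel=7, stride=2, padding=3)  # 112
s = out(s, kernel=3, stride=2, padding=1)    # 56
print(s)  # 56
```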
Tips for Architecture Design
- Use padding = kernel_size // 2 on odd kernels to maintain spatial size (same padding).
- Pooling with stride=2 halves the spatial dimensions each time — plan the number of downsampling stages based on your minimum feature map size.
- For segmentation and detection heads, ensure the feature map at the prediction layer matches your output stride requirements.
- Dilated convolutions increase the receptive field without reducing spatial resolution — useful for dense prediction tasks.
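The first and last tips can be verified numerically: padding = kernel_size // 2 preserves spatial size for any odd kernel at stride 1, and a dilated kernel widens its effective window without shrinking the output if padding is adjusted to match (the helper name `out` is our own):

```python
import math

def out(size, kernel, stride=1, padding=0, dilation=1):
    """Conv2d/MaxPool2d output size along one spatial dimension."""
    return math.floor((size + 2 * padding - dilation * (kernel - 1) - 1) / stride + 1)

# "Same" padding: kernel // 2 keeps a 64x64 input at 64x64 for odd kernels
for k in (3, 5, 7):
    assert out(64, kernel=k, padding=k // 2) == 64

# A 3x3 conv with dilation=2 covers a 5x5 window; padding=2 keeps resolution
print(out(64, kernel=3, padding=2, dilation=2))  # 64
```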