When working with convolution layers, the key metrics to understand are the output size and receptive field. These depend on kernel size, stride, and padding.
Output size tells us how big the feature map will be after convolution. It affects model speed and memory.
Receptive field is how much of the input each output pixel 'sees'. Larger receptive fields capture more context.
We measure output size using this formula:
Output = floor((Input + 2 * Padding - Kernel Size) / Stride) + 1
Understanding these helps us design models that balance detail and efficiency.