Convolution layer

Slide small kernel over input. Kernel weights shared across positions → translation equivariance + parameter efficiency.

Advertisement

Pooling

Downsample via max or average over regions. Reduces spatial size + adds slight invariance. Modern architectures use strided conv instead.

Advertisement

Classic architectures

LeNet (1998, MNIST). AlexNet (2012, ImageNet breakthrough). VGG. ResNet (skip connections). EfficientNet. ConvNeXt.