Kernel Size and Why Everyone Loves 3x3 - Neural Network Convolution

Published 2022-07-14
Patreon: www.patreon.com/Animated_AI

Find out what the Kernel Size option controls and which values you should use in your neural network.

All Comments (21)
  • The basic reason we don't use (even number) x (even number) layers is that those layers don't have a "center". Having a center pixel (as in a 3x3 configuration) is very useful for max and average pooling; it's much more convenient for us.
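A quick way to check the point above in plain Python (the helper name `conv_out` is mine; the formula is the standard convolution output-size rule):

```python
# Output length of a 1-D convolution: floor((n + 2p - k) / s) + 1.
def conv_out(n, k, p, s=1):
    return (n + 2 * p - k) // s + 1

# For an odd kernel, p = (k - 1) // 2 pads both sides equally and
# preserves the input size ("same" padding), with a single center tap.
for k in (1, 3, 5, 7):
    p = (k - 1) // 2
    assert conv_out(32, k, p) == 32

# For an even kernel, (k - 1) / 2 is not an integer: "same" padding
# must be asymmetric (e.g. 1 left, 2 right for k=4), and the kernel
# has no single center pixel.
print((4 - 1) / 2)  # 1.5 -> center falls between two pixels
```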
  • @pritomroy2465
    In UNet and GAN architectures, when a feature map half the size of the input needs to be generated, a 4x4 kernel is used.
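The exact halving this comment refers to follows from the standard output-size formula. A small sketch (the function name is mine):

```python
def conv_out(n, k, p, s):
    """Spatial output size of a convolution: floor((n + 2p - k) / s) + 1."""
    return (n + 2 * p - k) // s + 1

# A 4x4 kernel with stride 2 and padding 1 halves any even input size
# exactly -- the usual downsampling block in DCGAN-style discriminators
# and UNet encoders.
for n in (8, 16, 64, 256):
    assert conv_out(n, k=4, p=1, s=2) == n // 2

# A 3x3 kernel with stride 2 and padding 1 also halves even sizes,
# but on odd sizes it gives ceil(n / 2) instead:
assert conv_out(7, k=3, p=1, s=2) == 4
assert conv_out(7, k=4, p=1, s=2) == 3
```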
  • @maxlawwk
    Perhaps the 2x2 kernel is a common trick for learnable stride-2 downsampling kernels and upsampling deconvolution kernels. It is more likely about computational efficiency than network performance, because such kernels are almost equivalent to a downsample/upsample followed by a 3x3 kernel. In this regard, a 2x2 kernel combined with stride-2 down/upsampling does not shrink the resulting feature map the way a 3x3 kernel does, which can be beneficial for image generation tasks. In GANs, 2x2 or 4x4 kernels are commonly found in discriminators, which favor non-overlapping kernels to avoid grid artifacts.
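The non-overlapping property mentioned here can be checked with the transposed-convolution size formula (as used e.g. by PyTorch's `ConvTranspose2d`, ignoring output padding and dilation; the helper name is mine):

```python
def deconv_out(n, k, p, s):
    """Output size of a transposed convolution: (n - 1) * s - 2p + k."""
    return (n - 1) * s - 2 * p + k

# k=2, s=2, p=0: exact doubling, and each output pixel is written by
# exactly one kernel placement (no overlap -> no checkerboard artifacts).
for n in (4, 8, 16):
    assert deconv_out(n, k=2, p=0, s=2) == 2 * n

# k=4, s=2, p=1 also doubles, but adjacent kernel placements overlap.
assert deconv_out(8, k=4, p=1, s=2) == 16

# k=3, s=2, p=1 gives 2n - 1, which is why 3x3 pairs less cleanly
# with stride-2 upsampling.
assert deconv_out(8, k=3, p=1, s=2) == 15
```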
  • @Firestorm-tq7fy
    I don't see a reason for 1x1. All you achieve is losing information, while also creating N features, each scaled by a certain factor. This can also be achieved within a normal layer (the scaling, I mean). There is really no point, obviously outside of the depthwise-pointwise combo. Please correct me if I'm missing something.
  • I think odd-sized filters are mainly used because we often use a stride of 1. Each pixel (except at the edges) is then filtered based on its surrounding pixels (defined by the kernel size). If the kernel size is even, the pixel the kernel represents would be the average of the 4 middle pixels, which introduces a sort of 0.5-pixel shift. It might be fine mathematically speaking, but it feels odd or wrong. Also, if you have worked with Gaussian filters (which I assume many CNN researchers have), you are literally forced to use odd-sized filters there.
  • This is honestly the best video related to machine learning I have seen in general, amazing work. Most people just pull architectures out of thin air or make a clumsy disclaimer to experiment with the numbers. This video shows 3D visual representations of popular CNN architectures and really helps you understand how to build CNNs in general.
  • @schorsch7400
    Thanks for the effort of making this excellent visualization! It creates a very good intuition for how convolutions work and why 3x3 is dominant.
  • @kznsq77
    The even size of the kernel does not allow symmetrical coverage of the area around the pixel
  • @naevan1
    Wow, really beautiful animations, great job! However, I got kind of confused since I always saw convolution in 2D haha
  • @bangsa_puja
    What about the 1x7 and 7x1 kernels in Inception module C? Please help me
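Short answer to the question above: 1x7 followed by 7x1 is a factorized 7x7 convolution; it covers the same receptive field with far fewer parameters. A back-of-the-envelope check in plain Python (the channel width is an illustrative assumption, not taken from the video):

```python
# Parameter cost of a kh x kw convolution, ignoring bias.
def conv_params(kh, kw, c_in, c_out):
    return kh * kw * c_in * c_out

c = 192  # illustrative channel width
full = conv_params(7, 7, c, c)                                # 49 * c^2
factored = conv_params(1, 7, c, c) + conv_params(7, 1, c, c)  # 14 * c^2

# 1x7 then 7x1 spans the same 7x7 window with 14*C^2 weights
# instead of 49*C^2 -- a 3.5x reduction, for any channel width.
assert full == 49 * c * c
assert factored == 14 * c * c
print(full / factored)  # 3.5
```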
  • @josephpark2093
    There was no reason I should have had this very question, and yet there is a great video on the internet telling me the exact reason why. Bless!
  • @alansart5147
    Freaking love your videos! Keep up the awesome work! :D
  • @Antagon666
    Wait, so why do we need larger filters in the first layer? To extract more features from only the 3 channels? And which is better: more chained filters with a lower channel count, or fewer chained filters with more channels?