Outline of CNNs
- Spatial locality & invariance
- Convolution and filters
- Max Pooling
- Example architecture
- Illustrations: what do CNNs not learn?
ConvNets: high-level illustration: build structure into the map.
map $x$ to $h_L \circ h_{L-1} \circ \cdots \circ h_1(x)$
$h_i(z)=\sigma(w_iz+b_i).$
Note that here $w_i$ cannot be an arbitrary matrix as in an FFNN; it must be a matrix with a special structure.
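A minimal NumPy sketch of the composition above (illustrative only; the choice of ReLU as $\sigma$ and the layer sizes are assumptions, not from the lecture):

```python
import numpy as np

def sigma(z):
    # elementwise nonlinearity (ReLU chosen here as an illustrative sigma)
    return np.maximum(0.0, z)

def layer(W, b):
    # returns the function h_i(z) = sigma(W z + b)
    return lambda z: sigma(W @ z + b)

rng = np.random.default_rng(0)
x = rng.standard_normal(4)
h1 = layer(rng.standard_normal((3, 4)), np.zeros(3))
h2 = layer(rng.standard_normal((2, 3)), np.zeros(2))
y = h2(h1(x))   # h_2(h_1(x)) -- composition of layers
print(y.shape)  # (2,)
```

In a CNN the matrices `W` would be constrained to the special (shared, sparse) structure discussed below, rather than dense random matrices.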
How to input an image into a neural network?
Flatten the image into a vector, or keep it as a matrix/tensor.
Why not use a fully connected FNN for image classification?
Image has locality and translation invariance.
Main ideas of CNNs
Convolution: local detectors exploit spatial locality
Weight sharing: apply the same detector to all image patches
- efficiency (far fewer parameters!)
- translation invariance
Pooling
- abstracts away exact location
Convolutional layer: 1D example
Filter: detects a local signal/pattern in the input
- Stride: the step size by which the filter moves.
- Weight sharing: the same filter is applied across the entire vector.
- Because of weight sharing, a CNN needs far fewer unique weights than an FCNN.
Padding
Padding keeps the output from shrinking layer after layer.
Pad with what? Typically zeros (zero padding).
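A minimal NumPy sketch of the 1D convolutional layer described above (stride, weight sharing, zero padding); the filter `[1, 0, -1]` is just an illustrative edge-detector, not from the lecture:

```python
import numpy as np

def conv1d(x, w, stride=1, pad=0):
    # 1D convolution (cross-correlation): the SAME filter w is slid
    # across x at the given stride -- this is weight sharing.
    x = np.pad(x, pad)                 # zero padding on both ends
    k = len(w)
    out_len = (len(x) - k) // stride + 1
    return np.array([x[i*stride : i*stride + k] @ w
                     for i in range(out_len)])

x = np.array([1., 2., 3., 4., 5.])
w = np.array([1., 0., -1.])            # simple edge-detector filter
print(conv1d(x, w))                    # [-2. -2. -2.]  (output shrinks: 5 -> 3)
print(conv1d(x, w, pad=1))             # padding keeps the output length at 5
```

Note the whole layer uses only `len(w)` unique weights, versus a full weight matrix in an FCNN.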
Key point
Convolution is a linear operation: we slide the same window of weights over all patches and compute linear combinations.
That is, convolution is "linear" in the mathematical sense, because it only computes $\sum_{ij} w_{ij} x_{ij}$.
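The linearity claim can be checked directly: a 1D convolution is equivalent to multiplying by a banded matrix whose rows are shifted copies of the filter (a sketch, using the same illustrative filter as before):

```python
import numpy as np

def conv_matrix(w, n):
    # Build the (n-k+1) x n matrix whose rows are shifted copies of w,
    # so that convolution becomes a matrix-vector product: conv(x) = M @ x.
    k = len(w)
    M = np.zeros((n - k + 1, n))
    for i in range(n - k + 1):
        M[i, i:i+k] = w
    return M

x = np.array([1., 2., 3., 4., 5.])
w = np.array([1., 0., -1.])
M = conv_matrix(w, len(x))
print(M @ x)   # [-2. -2. -2.] -- same as sliding the filter over x
```

This `M` is exactly the "matrix with special structure" that replaces the arbitrary FFNN weight matrix.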
Max Pooling
- abstracts away exact locations, keeping the strongest local response.
- The convolution layers do the feature engineering.
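A minimal sketch of 1D max pooling (window size and stride of 2 are illustrative defaults, not from the lecture):

```python
import numpy as np

def maxpool1d(x, size=2, stride=2):
    # Keep only the strongest response in each local window:
    # this "abstracts away" the exact position of a detected feature.
    out_len = (len(x) - size) // stride + 1
    return np.array([x[i*stride : i*stride + size].max()
                     for i in range(out_len)])

h = np.array([0.1, 0.9, 0.2, 0.8, 0.3, 0.7])
print(maxpool1d(h))   # [0.9 0.8 0.7]
```

Note that pooling has no weights at all; it only summarizes the features the convolution layer produced.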
The CNN idea
Keep learning more and more complex features,
until those features are informative enough to classify with confidence.
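The stacking idea can be sketched end to end: alternate convolution, nonlinearity, and pooling so that each stage summarizes the previous one (a toy example with hand-picked filters, purely illustrative):

```python
import numpy as np

def conv1d(x, w):
    k = len(w)
    return np.array([x[i:i+k] @ w for i in range(len(x) - k + 1)])

def maxpool1d(x, size=2):
    return np.array([x[i:i+size].max()
                     for i in range(0, len(x) - size + 1, size)])

relu = lambda z: np.maximum(0.0, z)

x = np.array([0., 0., 1., 1., 1., 0., 0., 1., 0., 0.])
h = maxpool1d(relu(conv1d(x, np.array([1., -1.]))))  # layer 1: edge features
h = maxpool1d(relu(conv1d(h, np.array([1., 1.]))))   # layer 2: coarser features
print(h)   # a short vector of increasingly abstract features
```

Each stage shrinks the representation while (in a trained network) making it more task-relevant; a final fully connected layer would map `h` to class scores.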
What affects CNNs?
- Input rotations?
- Deep CNNs?
Finally, a figure to end on a happy note: