1. What CNNs Can Do
2. Image Classification
Different lighting, contrast, viewpoints, etc.
This is hard for traditional methods like multi-layer perceptrons: after flattening the image into a vector, the prediction is essentially based on a weighted sum of individual pixel intensities, with no notion of spatial structure.
3. Convolutional Neural Network Basics
Relational Inductive Biases:
Independence, Locality, Sequentiality
LeNet-5:
1989, LeCun et al., Backpropagation Applied to Handwritten Zip Code Recognition (the LeNet-5 architecture itself was described in the 1998 follow-up, Gradient-Based Learning Applied to Document Recognition)
Main Concepts Behind Convolutional Neural Networks
- Sparse-connectivity: A single element in the feature map is connected to only a small patch of pixels.
- Parameter-sharing: The same weights are used for different patches of the input image (see the parameter-count sketch after this list).
- Many layers: Combining extracted local patterns into global patterns
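To make sparse connectivity and parameter sharing concrete, here is a minimal PyTorch sketch (the layer sizes are made up for illustration) comparing the parameter count of a convolutional layer against a fully connected layer producing an output of the same size:

```python
import torch.nn as nn

# Hypothetical 32x32 RGB input; both layers produce 8*30*30 output values
conv = nn.Conv2d(in_channels=3, out_channels=8, kernel_size=3)     # local 3x3 patches, weights shared
fc = nn.Linear(in_features=3 * 32 * 32, out_features=8 * 30 * 30)  # every pixel connected to every output

n_conv = sum(p.numel() for p in conv.parameters())  # 8*3*3*3 + 8 = 224 parameters
n_fc = sum(p.numel() for p in fc.parameters())      # ~22.1 million parameters
print(n_conv, n_fc)
```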
4. Convolutional Filters and Weight-Sharing
Weight Sharing
Rationale: A feature detector that works well in one region may also work well in other regions. Plus, it greatly reduces the number of parameters to fit.
Multiple "feature detectors" (kernels) are used to create multiple feature maps, as in the shape check below.
Size Before and After Convolutions
Feature map size:
\[ \text{output\_width} = \left\lfloor \frac{\text{input\_width} - \text{kernel\_width} + 2 \cdot \text{padding}}{\text{stride}} \right\rfloor + 1 \]
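A small sanity check of the formula against an actual PyTorch layer (the helper name conv_output_width is my own; the integer division implements the floor):

```python
import torch
import torch.nn as nn

def conv_output_width(input_width, kernel_width, padding=0, stride=1):
    # Feature map width after a convolution; // handles non-even fits
    return (input_width - kernel_width + 2 * padding) // stride + 1

print(conv_output_width(32, 5))                # 28

conv = nn.Conv2d(1, 1, kernel_size=5)
print(conv(torch.randn(1, 1, 32, 32)).shape)   # torch.Size([1, 1, 28, 28])
```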
Note that CNNs are not really invariant to scale, rotation, translation, etc.
Pooling Layers Can Help With Local Invariance
Downside: Information is lost.
May not matter for classification, but it does matter for applications where relative position is important.
In practice for CNNs: some image preprocessing is still recommended.
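A minimal sketch of the local-invariance point above: a one-pixel shift that stays inside the same pooling window does not change the max-pooled output.

```python
import torch
import torch.nn.functional as F

x1 = torch.zeros(1, 1, 4, 4)
x1[0, 0, 0, 0] = 1.0  # strong activation at the top-left pixel
x2 = torch.zeros(1, 1, 4, 4)
x2[0, 0, 0, 1] = 1.0  # same activation, shifted one pixel to the right

# 2x2 max pooling: both activations fall into the same pooling window
print(torch.equal(F.max_pool2d(x1, 2), F.max_pool2d(x2, 2)))  # True
```

This also illustrates the downside: the pooled output no longer records where inside the window the activation occurred.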
5. Cross-Correlation vs Convolution
Why are Convolutional Nets Using Cross-Correlation?
Deep Learning Jargon: convolution in DL is actually cross-correlation
Cross-Correlation:
\[Z[i,j]=\sum_{u=-k}^{k}\sum_{v=-k}^{k}K[u,v]\,A[i+u,j+v] \]
(kernel entries applied in their original order: 1 2 3 / 4 5 6 / 7 8 9)
Convolution:
\[Z[i,j]=\sum_{u=-k}^{k}\sum_{v=-k}^{k}K[u,v]\,A[i-u,j-v] \]
(kernel entries applied in flipped order: 9 8 7 / 6 5 4 / 3 2 1)
Basically, we are flipping the kernel (or the receptive field) horizontally and vertically.
In DL, we usually don't care about the flip, because the kernel weights are learned anyway (the network could just as well learn the flipped kernel); this is in contrast to many traditional computer vision and signal processing applications.
Also, cross-correlation is easier to implement.
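A quick NumPy/SciPy check of the relationship: cross-correlating with a kernel gives the same result as convolving with that kernel flipped along both axes.

```python
import numpy as np
from scipy.signal import convolve2d, correlate2d

A = np.arange(25, dtype=float).reshape(5, 5)     # toy 5x5 "image"
K = np.arange(1, 10, dtype=float).reshape(3, 3)  # 3x3 kernel with entries 1..9

xcorr = correlate2d(A, K, mode="valid")
conv_flipped = convolve2d(A, np.flip(K), mode="valid")  # flip both axes first
print(np.allclose(xcorr, conv_flipped))  # True
```

PyTorch's nn.Conv2d computes exactly this cross-correlation; since the kernel is learned, the missing flip makes no practical difference.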
6. CNNs & Backpropagation
Same overall concept as before: multivariable chain rule, but now with an additional weight-sharing constraint: because the same weight is applied at many positions, its gradient is the sum of the gradients contributed by all of those positions.
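A minimal autograd sketch of that constraint: one shared scalar weight is applied at every input position, and its gradient accumulates over all of them.

```python
import torch

x = torch.tensor([1.0, 2.0, 3.0])
w = torch.tensor(0.5, requires_grad=True)  # one weight shared across all positions

out = (w * x).sum()  # the same w multiplies every input position
out.backward()

print(w.grad)  # tensor(6.) = 1 + 2 + 3, summed over all positions where w is used
```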
7. CNN Architectures
Main breakthrough for CNNs: AlexNet winning the ImageNet (ILSVRC) competition in 2012.
Note that the actual network inputs were still 224x224 images (random crops from downsampled 256x256 images)
224x224 is still a good/reasonable size today (224 x 224 x 3 = 150,528 input features)
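A sketch of this kind of input pipeline with torchvision (the transform choices are illustrative, not necessarily AlexNet's exact recipe):

```python
from torchvision import transforms

train_transform = transforms.Compose([
    transforms.Resize(256),      # downsample so the shorter side is 256 px
    transforms.RandomCrop(224),  # random 224x224 crop as the network input
    transforms.ToTensor(),       # 224 * 224 * 3 = 150,528 input features
])
```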
8. What a CNN Can See
Zeiler & Fergus, 2014, Visualizing and Understanding Convolutional Networks.
Method: backpropagate strong activation signals in hidden layers back to the input, then apply "unpooling" to map the values to the original pixel space for visualization.
Grad-CAM...
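Grad-CAM weights each feature map in the last convolutional layer by the spatially averaged gradient of a class score. A rough, self-contained sketch (not a reference implementation; it assumes a torchvision ResNet-18, with random weights just to keep it runnable):

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

model = resnet18(weights=None).eval()  # use pretrained weights for real heatmaps

store = {}
model.layer4.register_forward_hook(lambda m, i, o: store.update(act=o))
model.layer4.register_full_backward_hook(lambda m, gi, go: store.update(grad=go[0]))

x = torch.randn(1, 3, 224, 224)           # stand-in for a preprocessed image
logits = model(x)
logits[0, logits[0].argmax()].backward()  # gradient of the top-class score

# Weight each feature map by its spatially averaged gradient, then combine
weights = store["grad"].mean(dim=(2, 3), keepdim=True)  # (1, 512, 1, 1)
cam = F.relu((weights * store["act"]).sum(dim=1))       # (1, 7, 7)
cam = F.interpolate(cam[None], size=(224, 224), mode="bilinear")[0]
print(cam.shape)  # (1, 224, 224) heatmap over the input
```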
https://thegradient.pub/a-visual-history-of-interpretation-for-image-recognition/