[深度學習論文筆記] Convolutional Neuron Networks and its Applications

阿新 • • 發佈：2019-01-26

In artificial intelligence, there exists a Moravec’s Paradox, 1 “High-level reasoning requires very little computation, but low-level sensorimotor skills require enormous computational resources”. It is comparatively easy to make computers exhibit adult level performance on intelligence tests or playing checkers, and difficult or impossible to give them the skills of a one-year-old when it comes to perception and mobility.

Computer vision is one of the such low-level sensorimotor skills. The task of recognizing an object is trivial for human, but it is quite hard for computers due to the semantic gap. Computers only see a collection of integers from 0 to 255. It is hard to write an explicit algorithm for compute to identity object from a 3D array of numbers. Therefore, inspired by the human learning process, we are going to provide the compute with many examples of each class and let the compute learn from data. This is called data-driven approach.

Convolution Neural Network (CNN) is the state-of-the-art approach to object recognition, and it has show greatly advance on the performance of many compute vision tasks. To have a deep understanding of CNN and to inspire ideas for cutting-edge research, I think the most fundamental and effective way is to look at recent CNN publications from top-tier vision conferences and journals. Therefore, I decided to write a note to take down the basic ideas and my understandings of those publications. At present, this note contains around 60 papers from ICCV, ECCV, CVPR, NIPS, ICML, ICLR and so on. The content covers the basic topics in computer vision including image classification, object localization, object detection, object segmentation, image and language, video classification, GAN, etc.

I would like to give acknowledgment to the followings for providing fabulous materials on CNN/deep learning.

• Andrew Ng et al. “UFLDL: Deep Learning Tutorial.” Stanford.
• Fei-Fei Li, Andrej Karpathy, and Justin Johnson. “cs231n: Convolutional Neural Networks for Visual Recognition.” Stanford.
• Andrea Vedaldi, Andrew Zisserman. “VGG Convolutional Neural Networks Practical.” Oxford Visual Geometry Group.
• Ian Goodfellow, Aaron Courville, and Yoshua Bengio. “Deep Learning.” Book in preparation for MIT Press. 2015.

• Jianxin Wu. “Introduction to Convolutional Neural Networks”. Nanjing University.

This note is still under continuous update. If you have any question or advice, please feel free to contact with me via email.

The pdf file can be download at here.

1 https://en.wikipedia.org/wiki/Moravec’s_paradox.

[深度學習論文筆記] Convolutional Neuron Networks and its Applications

In artificial intelligence, there exists a Moravec’s Paradox, 1 “High-level reasoning requires very little computation, but low-level sen

深度學習論文筆記（六）--- FCN-2015年（Fully Convolutional Networks for Semantic Segmentation）

深度學習論文筆記（六）--- FCN 全連線網路 FullyConvolutional Networks for Semantic Segmentation Author：J Long ， E Shelhamer， T Darrell Year： 2015 1、導

深度學習論文筆記：Deep Residual Networks with Dynamically Weighted Wavelet Coefficients for Fault Diagnosis of Planetary Gearboxes

這篇文章將深度學習演算法應用於機械故障診斷，採用了“小波包分解+深度殘差網路(ResNet)”的思路，將機械振動訊號按照故障型別進行分類。文章的核心創新點：複雜旋轉機械系統的振動訊號包含著很多不同頻率的衝擊和振盪成分，而且不同頻帶內的振動成分在故障診斷中的重要程度經常是不同的，因此可以按照如下步驟設計深度

【深度學習論文筆記】Deep Neural Networks for Object Detection

論文:<<Deep Neural Networks for Object Detection>> 作者:Christian Szegedy Al

[深度學習論文筆記][總結]Invariant gait feature extraction based on image transformation

近期有兩篇來自於同一第一作者單位的工作，使用基於神經網路的影象變換模型來處理不同視角、不同衣著或手持物的CEI特徵到統一的90°正常特徵(SPAE與GaitGAN)。在這裡加以簡單總結與對比。 [Neurocomputing 17] Invariant fea

[深度學習論文筆記][AAAI 18]Accelerated Training for Massive Classification via Dynamic Class Selection

[AAAI 18] Accelerated Training for Massive Classification via Dynamic Class Selection Xingcheng Zhang, Lei Yang, Junjie Yan, Dahua

[深度學習論文筆記][Image Classification] 影象分類部分論文導讀

[ImageNet] • Over 15M labeled high resolution images. • Roughly 22k categories.• Collected from web and labeled by Amazon Mechanical Turk

[深度學習論文筆記][Visualizing] 網路視覺化部分論文導讀

There are several ways to understanding and visualing CNN 1 Visualizing Activations Show the activations of the network during the forwar

[深度學習論文筆記][arxiv 1804]ExFuse: Enhancing Feature Fusion for Semantic Segmentation

[arxiv 1804]ExFuse: Enhancing Feature Fusion for Semantic Segmentation Zhenli Zhang, Xiangyu Zhang, Chao Peng, Dazhi Cheng, Jian S

[深度學習論文筆記][CVPR 18]Path Aggregation Network for Instance Segmentation

[CVPR 18]Path Aggregation Network for Instance Segmentation Shu Liu, Lu Qi, Haifang Qin, Jianping Shi and Jiaya Jia from CUHK, P

深度學習論文翻譯解析（十）：Visualizing and Understanding Convolutional Networks

論文標題：Visualizing and Understanding Convolutional Networks 　　標題翻譯：視覺化和理解卷積網路論文作者：Matthew D. Zeiler Rob Fergus 論文地址：https://arxiv.org/pdf/1311.2901v3.

深度學習論文翻譯解析（十一）：OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks

論文標題：OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks 　　　　標題翻譯：OverFeat：使用卷積神經網路整合識別，定位和檢測論文作者：Pierre Sermanet&nb

深度學習論文隨記（二）---VGGNet模型解讀-2014年（Very Deep Convolutional Networks for Large-Scale Image Recognition）

深度學習論文隨記（二）---VGGNet模型解讀 Very Deep Convolutional Networks forLarge-Scale Image Recognition Author: K Simonyan ， A Zisserman Year: 2014

深度學習論文翻譯解析（六）：MobileNets：Efficient Convolutional Neural Networks for Mobile Vision Appliications

論文標題：MobileNets：Efficient Convolutional Neural Networks for Mobile Vision Appliications 論文作者：Andrew G.Howard Menglong Zhu Bo Chen ..... 論文地址：ht

深度學習論文翻譯解析（九）：Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

論文標題：Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition　　　　　　標題翻譯：用於視覺識別的深度卷積神經網路中的空間金字塔池論文作者：Kaiming He, Xiangyu Zhang, Shao

深度學習論文翻譯解析（十五）：Densely Connected Convolutional Networks

論文標題：Densely Connected Convolutional Networks 論文作者：Gao Huang Zhuang Liu Laurens van der Maaten Kilian Q. Weinberger 論文地址：https://arxiv.org/pdf/1608.0

深度學習論文翻譯解析（十六）：Squeeze-and-Excitation Networks

論文標題：Squeeze-and-Excitation Networks 論文作者：Jie Hu Li Shen Gang Sun 論文地址：https://openaccess.thecvf.com/content_cvpr_2018/papers/Hu_Squeeze-and-E

深度學習論文翻譯解析（十七）：MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

論文標題：MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications 論文作者：Andrew G. Howard, Menglong Zhu, Bo Chen, Dmitry

深度學習論文翻譯解析（二）：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

論文標題：An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition 論文作者： Baoguang Shi, Xiang B

[深度學習論文閱讀]Facenet論文閱讀筆記（包括GoogLenet引數計算方式）

1 統述功能：face verification (is this the same person) recognition (who is this person) clustering (find common people among

[深度學習論文筆記] Convolutional Neuron Networks and its Applications

相關推薦