
Large Kernel Matters—— Improve Semantic Segmentation by Global Convolutional Network

The large kernel (and effective receptive field) plays an important role when we have to perform the classification and localization tasks simultaneously.
We propose a Global Convolutional Network (GCN) to address both the classification and localization issues for semantic segmentation.
We also suggest a residual-based boundary refinement to further refine the object boundaries.
The approach achieves 82.2% (vs. 80.2%) mIoU on the PASCAL VOC 2012 dataset and 76.9% (vs. 71.8%) on the Cityscapes dataset.

  1. Introduction
    For the classification task, the models are required to be invariant to various transformations like translation and rotation. But for the localization task, models should be transformation-sensitive.

  2. Method
    First, from the localization view, the structure must be fully convolutional, without any fully-connected layer or global pooling layer as used by many classification networks, since the latter would discard localization information. Second, from the classification view, motivated by the densely-connected structure of classification models, the kernel size of the convolutional structure should be as large as possible.

    The GCN module employs a combination of 1 x k + k x 1 and k x 1 + 1 x k convolutions, which enables dense connections within a large k x k region of the feature map. We do not use any nonlinearity after the convolution layers.
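A minimal sketch of such a module is shown below, assuming PyTorch as the framework; the channel sizes and the default k are illustrative choices, not values fixed by this summary.

```python
import torch
import torch.nn as nn

class GCN(nn.Module):
    """Two parallel branches of separable large-kernel convolutions,
    1 x k -> k x 1 and k x 1 -> 1 x k, summed together, with no
    nonlinearity after the convolutions."""

    def __init__(self, in_channels, out_channels, k=15):
        super().__init__()
        pad = k // 2  # "same" padding for an odd kernel size k
        self.branch_a = nn.Sequential(
            nn.Conv2d(in_channels, out_channels, kernel_size=(1, k), padding=(0, pad)),
            nn.Conv2d(out_channels, out_channels, kernel_size=(k, 1), padding=(pad, 0)),
        )
        self.branch_b = nn.Sequential(
            nn.Conv2d(in_channels, out_channels, kernel_size=(k, 1), padding=(pad, 0)),
            nn.Conv2d(out_channels, out_channels, kernel_size=(1, k), padding=(0, pad)),
        )

    def forward(self, x):
        # the sum of the two branches densely connects a full k x k neighbourhood
        return self.branch_a(x) + self.branch_b(x)


# usage: map 2048-channel backbone features to per-class score maps (sizes illustrative)
scores = GCN(in_channels=2048, out_channels=21, k=15)(torch.randn(1, 2048, 16, 16))
print(scores.shape)  # torch.Size([1, 21, 16, 16])
```

Because each branch stacks a 1 x k and a k x 1 convolution, the summed output covers a dense k x k window while the parameter count grows only linearly in k rather than quadratically.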

    We model boundary alignment as a residual structure: S' = S + R(S), where S is the coarse score map, R(.) is the residual branch, and S' is the refined score map.
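A corresponding sketch of the boundary refinement block, again assuming PyTorch, with the residual branch written as conv -> ReLU -> conv on the score map (layer widths and kernel sizes chosen for illustration):

```python
import torch
import torch.nn as nn

class BoundaryRefinement(nn.Module):
    """Residual block applied to the coarse score map S: S' = S + R(S)."""

    def __init__(self, channels):
        super().__init__()
        self.residual = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
        )

    def forward(self, s):
        return s + self.residual(s)  # S' = S + R(S)


# usage on a 21-class coarse score map
refined = BoundaryRefinement(21)(torch.randn(1, 21, 16, 16))
```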

3. Experiments
We evaluate our approach on the standard benchmarks PASCAL VOC 2012 [11, 10] and Cityscapes [8]. PASCAL VOC 2012 has 1,464 images for training, 1,449 for validation and 1,456 for testing, covering 20 object classes plus one background class. We also use the Semantic Boundaries Dataset [13] as an auxiliary dataset, resulting in 10,582 training images. We choose ResNet-152 [14] (pretrained on ImageNet [28]) as the base model for fine-tuning. During training, we use standard SGD [20] with batch size 1, momentum 0.99 and weight decay 0.0005. Data augmentations such as mean subtraction and horizontal flip are also applied during training.
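For reference, the optimizer settings above translate roughly into the following PyTorch sketch; the torchvision backbone call and the learning-rate value are assumptions, since the base learning rate is not given in this summary.

```python
import torch
import torchvision

# ResNet-152 pretrained on ImageNet as the base model for fine-tuning
backbone = torchvision.models.resnet152(weights="IMAGENET1K_V1")

# standard SGD with batch size 1, momentum 0.99 and weight decay 0.0005
optimizer = torch.optim.SGD(
    backbone.parameters(),
    lr=1e-3,              # placeholder learning rate, not specified in this summary
    momentum=0.99,
    weight_decay=0.0005,
)
# mean subtraction and random horizontal flip would be applied as dataset transforms.
```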

We pad each input image to 512 x 512 so that the top-most feature map is 16 x 16.
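The 16 x 16 size follows directly from ResNet-152's overall output stride of 32:

```python
# conv1, the max-pool and three of the four residual stages each downsample by 2,
# giving an overall stride of 32, so a 512 x 512 input yields a 16 x 16 top-most feature map.
input_size, overall_stride = 512, 32
print(input_size // overall_stride)  # 16
```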

Only odd kernel sizes are used, simply to avoid alignment errors (with an even k, the symmetric "same" padding of (k - 1)/2 is not an integer, so the output would be shifted by half a pixel). The range of k can be chosen according to the size of the final feature map.
Although the GCN structure increases the number of parameters, the comparison with experiment C shows that the performance gain is not simply due to the larger parameter count.
The GCN model mainly improves the accuracy of the internal regions of objects, while its effect on boundary regions is smaller (for pixels near the center of a large object, GCN makes the prediction closer to a "pure" classification problem).