1. 程式人生 > >CVPR2018】論文整理(收藏這一篇就夠了)

CVPR2018】論文整理(收藏這一篇就夠了)

CVPR 2018

CVPR作為CV界最受關注的三大頂會之一,每一個CVer都應該好好關注CVPR的論文。CVPR2018在今年6月18日-22日在美國鹽湖城舉行。

先介紹一下CVPR2018的一些資料:

今年一共收到3309篇文章,其中979篇被錄用。投錄比約為29.5%。
收錄論文按專家評分,分為三個層次:Poster, Spotlight, Oral。
Spotlight(亮點論文)一共有224篇,佔收錄論文(224/979)的22.88%。
Oral(演示論文)一共有70篇,佔收錄論文(70/979)的7.1%。

用一張韋恩圖表示收錄文章佔比:

所以說,不光中篇CVPR難,中篇spotlight更難,中篇oral基本可以說是灰常難了。就這麼說吧,今年國內所有高校加起來中的CVPR oral是個位數。

當然,最牛的還是Best paper 和best student paper,只會分別選出1篇。

今年的best paper給了來自Stanford和Berkeley的合作論文,論文標題為:

Taskonomy: Disentangling Task Transfer Learning

下載地址為:https://arxiv.org/abs/1804.08328

最佳學生論文來自CMU,標題為:

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies

下載地址為:https://arxiv.org/abs/1801.01615v1

當然,就像奧斯卡頒獎一樣,最佳論文獎提名也可以突出文章質量很高。今年四篇最佳論文提名獎如下:

標題 
第一單位 
下載地址


 Deep_Learning_of_Graph_Matching 
 Lund University

http://openaccess.thecvf.com
/content_cvpr_2018
/CameraReady/1830.pdf

SPLATNet: Sparse Lattice Networks for Point Cloud Processing 
UMass Amherst
 https://arxiv.org/pdf/1802.08275.pdf


 CodeSLAM-learning a Compact, Optimisable Representation for Dense Visual SLAM 
 帝國理工
  https://arxiv.org/pdf/1804.00874.pdf 


Efficient Optimization for Rank-based Loss Functions 
IIIT Hyderabad
 https://arxiv.org/pdf/1604.08269.pdf 

所以,客觀認為的論文含金量是:
best paper (2篇) > honorable mention(提名獎 4篇) > Oral (70篇) > Spotlight(224篇) > poster(其他)

CVPR2018雖好,可不要貪杯,一共有979篇,每天看1篇也得看3年,待你看完之日也是演算法過時之時。所以,給各位CVer(包括自己)一些建議:

從高質量論文開始看,至少優先看spotlight或者oral論文。
在自己的領域找論文看,別想做什麼CVPR的集大成者,如果你是CVPR oral大神,那麼當我這條沒說過。
哪裡有CVPR論文分享會就去聽,聽原作者自己講一個小時,比自己看一禮拜更管用。如果沒有現場版,看看視訊也是好的。

最後

附上68篇oral論文標題:(文末有下載連結)

1

DensePose: Multi-Person Dense Human Pose Estimation In The Wild

2

Context Encoding for Semantic Segmentation

3

Augmented Skeleton Space Transfer for Depth-based Hand Pose Estimation

4

Semi-parametric Image Synthesis

5

Practical Block-wise Neural Network Architecture Generation

6

Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning

7

PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume

8

Illuminant Spectra-based Source Separation Using Flash Photography

9

SPLATNet: Sparse Lattice Networks for Point Cloud Processing

10

Total Capture: A 3D Deformation Model for Tracking Faces, Hands, and Bodies

11

Deep Layer Aggregation

12

Left-Right Comparative Recurrent Model for Stereo Matching

13

Analytic Expressions for Probabilistic Moments of PL-DNN with Gaussian Input

14

An Analysis of Scale Invariance in Object Detection - SNIP

15

Finding Tiny Faces in the Wild with Generative Adversarial Network

16

Taskonomy: Disentangling Task Transfer Learning

17

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

18

Finding “It”: Weakly-Supervised Reference-Aware Visual Grounding in Instructional Video

19

Unsupervised Discovery of Object Landmarks as Structural Representations

20

Rotation Averaging and Strong Duality

21

Im2Flow: Motion Hallucination from Static Images for Action Recognition

22

Group Consistent Similarity Learning via Deep CRFs for Person Re-Identification

23

3D-RCNN: Instance-level 3D Scene Understanding via Render-and-Compare

24

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

25

Context Contrasted Feature and Gated Multi-scale Aggregation for Scene Segmentation

26

Squeeze-and-Excitation Networks

27

DoubleFusion: Real-time Capture of Human Performance with Inner Body Shape from a Single Depth Sensor

28

Learning to Find Good Correspondences

29

Actor and Action Video Segmentation from a Sentence

30

Maximum Classifier Discrepancy for Unsupervised Domain Adaptation

31

Detail-Preserving Pooling in Deep Networks

32

Convolutional Neural Networks with Alternately Updated Clique

33

Deep Learning of Graph Matching

34

Synthesizing Images of Humans in Unseen Poses

35

Neural Inverse Kinematics for Unsupervised Motion Retargetting

36

Direction-aware Spatial Context Features for Shadow Detection

37

Density Adaptive Point Set Registration

38

Hybrid Camera Pose Estimation

39

Relation Networks for Object Detection

40

Revisiting Salient Object Detection: Simultaneous Detection, Ranking, and Subitizing of Multiple Salient Objects

41

Im2Pano3D: Extrapolating 360 Structure and Semantics Beyond the Field of View

42

Polarimetric Dense Monocular SLAM

43

Wasserstein Introspective Neural Networks

44

The Perception-Distortion Tradeoff

45

Discriminative Learning of Latent Features for Zero-Shot Recognition

46

Photometric Stereo in Participating Media Considering Shape-Dependent Forward Scatter

47

Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net

48

Trapping Light for Time of Flight

49

Feature Space Transfer for Data Augmentation

50

Self-supervised Multi-level Face Model Learning for Monocular Reconstruction at over 250Hz

51

CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM

52

FlipDial: A Generative Model for Two-Way Visual Dialogue

53

OATM: Occlusion Aware Template Matching by Consensus Set Maximization

54

Surface Networks

55

VirtualHome: Simulating Household Activities via Programs

56

Egocentric Activity Recognition on a Budget

57

Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering

58

Efficient Optimization for Rank-based Loss Functions

59

MakeupGAN: Makeup Transfer via Cycle-Consistent Adversarial Networks

60

Revisiting Deep Intrinsic Image Decompositions

61

StarGAN: Unified Generative Adversarial Networks for Controllable Multi-Domain Image-to-Image Translation

62

Ordinal Depth Supervision for 3D Human Pose Estimation

63

Multi-Cell Classification by Convolutional Dictionary Learning with Class Proportion Priors

64

Accurate and Diverse Sampling of Sequences based on a “Best of Many” Sample Objective

65

MapNet: An Allocentric Spatial Memory for Mapping Environments

66

A Globally Optimal Solution to the Non-Minimal Relative Pose Problem

67

A Volumetric Descriptive Network for 3D Object Synthesis

68

Learning Face Age Progression: A Pyramid Architecture of GANs

我已經整理出所有oral文章,想打包下載的可以點選part1、part2