CVPR 2017 Paper Roundup

Date Time Location # Session Session Title Paper ID Paper Title Authors
Saturday, July 22, 2017 0900–1030 Kamehameha III 1 Spotlight 1-1A Machine Learning 1 305 Exclusivity-Consistency Regularized Multi-View Subspace Clustering Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, Stan Z. Li

373 Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning Weifeng Ge, Yizhou Yu

968 The More You Know: Using Knowledge Graphs for Image Classification Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta

1358 Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs Martin Simonovsky, Nikos Komodakis

2704 Convolutional Neural Network Architecture for Geometric Matching Ignacio Rocco, Relja Arandjelović, Josef Sivic

2715 Deep Affordance-Grounded Sensorimotor Object Recognition Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos

3235 Discovering Causal Signals in Images David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou

3653 On Compressing Deep Models by Low Rank and Sparse Decomposition Xiyu Yu, Tongliang Liu, Xinchao Wang, Dacheng Tao

Oral 1-1A
201 PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas

649 Universal Adversarial Perturbations Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard

1385 Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan

1948 Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi
Saturday, July 22, 2017 0900–1030 Kalākaua Ballroom A-B 2 Spotlight 1-1B 3D Vision 1 147 Global Hypothesis Generation for 6D Object Pose Estimation Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother

380 A Practical Method for Fully Automatic Intrinsic Camera Calibration Using Directionally Encoded Light Mahdi Abbaspour Tehrani, Thabo Beeler, Anselm Grundhöfer

1075 CATS: A Color and Thermal Stereo Benchmark Wayne Treible, Philip Saponaro, Scott Sorensen, Abhishek Kolagunda, Michael O’Neal, Brian Phelan, Kelly Sherbondy, Chandra Kambhamettu

1248 Elastic Shape-From-Template With Spatially Sparse Deforming Forces Abed Malti, Cédric Herzet

1473 Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao

2250 Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, Nicu Sebe

2685 Dynamic Time-Of-Flight Michael Schober, Amit Adam, Omer Yair, Shai Mazor, Sebastian Nowozin

2827 Training Object Class Detectors With Click Supervision Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

Oral 1-1B
646 Semantic Scene Completion From a Single Depth Image Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser

668 3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, Jianxiong Xiao, Thomas Funkhouser

950 Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency Shubham Tulsiani, Tinghui Zhou, Alexei A. Efros, Jitendra Malik

1821 On-The-Fly Adaptation of Regression Forests for Online Camera Relocalisation Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr
Saturday, July 22, 2017 0900–1030 Kalākaua Ballroom C 3 Spotlight 1-1C Low- & Mid-Level Vision 23 Designing Effective Inter-Pixel Information Flow for Natural Image Matting Yağiz Aksoy, Tunç Ozan Aydin, Marc Pollefeys

460 Deep Video Deblurring for Hand-Held Cameras Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, Oliver Wang

877 Instance-Level Salient Object Segmentation Guanbin Li, Yuan Xie, Liang Lin, Yizhou Yu

1509 Deep Multi-Scale Convolutional Neural Network for Dynamic Scene Deblurring Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee

1532 Diversified Texture Synthesis With Feed-Forward Networks Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang

1969 Radiometric Calibration for Internet Photo Collections Zhipeng Mo, Boxin Shi, Sai-Kit Yeung, Yasuyuki Matsushita

2866 Deeply Aggregated Alternating Minimization for Image Restoration Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn

3051 End-To-End Instance Segmentation With Recurrent Attention Mengye Ren, Richard S. Zemel

Oral 1-1C
369 SRN: Side-output Residual Network for Object Symmetry Detection in the Wild Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye

1081 Deep Image Matting Ning Xu, Brian Price, Scott Cohen, Thomas Huang

1574 Wetness and Color From a Single Multispectral Image Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, Ko Nishino, Imari Sato

1641 FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling Yuanming Hu, Baoyuan Wang, Stephen Lin
Saturday, July 22, 2017 1030–1230 Kamehameha I 4 Poster 1-1 3D Computer Vision 24 Face Normals “In-The-Wild” Using Fully Convolutional Networks George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou

39 A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting Yvain Quéau, Tao Wu, François Lauze, Jean-Denis Durou, Daniel Cremers

179 A Linear Extrinsic Calibration of Kaleidoscopic Imaging System From Single 3D Point Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama

579 Polarimetric Multi-View Stereo Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, Jan Kautz

702 An Exact Penalty Method for Locally Convergent Maximum Consensus Huu Le, Tat-Jun Chin, David Suter

2317 Deep Supervision With Shape Concepts for Occlusion-Aware 3D Object Parsing Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker

2455 Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes From 2D Ones in RGB-Depth Images Zhuo Deng, Longin Jan Latecki

Analyzing Humans in Images 143 Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection Guillermo Garcia-Hernando, Tae-Kyun Kim

189 Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition With Convolutional Neural Networks Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona

970 Detecting Masked Faces in the Wild With LLE-CNNs Shiming Ge, Jia Li, Qiting Ye, Zhao Luo

1289 A Domain Based Approach to Social Relation Recognition Qianru Sun, Bernt Schiele, Mario Fritz

1698 Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition Junwu Weng, Chaoqun Weng, Junsong Yuan

2928 Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, Hanspeter Pfister

Applications 237 Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab

1304 Multi-Scale FCN With Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, Alexander G. Ororbia II, Daniel Kifer, C. Lee Giles

2662 Viraliency: Pooling Local Virality Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci

Biomedical Image/Video Analysis 2395 A Non-Local Low-Rank Framework for Ultrasound Speckle Reduction Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng

Image Motion & Tracking 162 Video Acceleration Magnification Yichao Zhang, Silvia L. Pintea, Jan C. van Gemert

670 Superpixel-Based Tracking-By-Segmentation Using Markov Chains Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han

1251 BranchOut: Regularization for Online Ensemble Tracking With Convolutional Neural Networks Bohyung Han, Jack Sim, Hartwig Adam

1257 Learning Motion Patterns in Videos Pavel Tokmakov, Karteek Alahari, Cordelia Schmid

Low- & Mid-Level Vision 839 Deep Level Sets for Salient Object Detection Ping Hu, Bing Shuai, Jun Liu, Gang Wang

1789 Binary Constraint Preserving Graph Matching Bo Jiang, Jin Tang, Chris Ding, Bin Luo

1818 From Local to Global: Edge Profiles to Camera Motion in Blurred Images Subeesh Vasu, A. N. Rajagopalan

2033 What Is the Space of Attenuation Coefficients in Underwater Computer Vision? Derya Akkaynak, Tali Treibitz, Tom Shlesinger, Yossi Loya, Raz Tamir, David Iluz

2360 Robust Energy Minimization for BRDF-Invariant Shape From Light Fields Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker

2429 Boundary-Aware Instance Segmentation Zeeshan Hayder, Xuming He, Mathieu Salzmann

2497 Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes S. Alireza Golestaneh, Lina J. Karam

2606 Model-Based Iterative Restoration for Binary Document Image Compression With Dictionary Learning Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman

2965 FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn

Machine Learning 38 Learning by Association — A Versatile Semi-Supervised Training Method for Neural Networks Philip Haeusser, Alexander Mordvintsev, Daniel Cremers

153 Dilated Residual Networks Fisher Yu, Vladlen Koltun, Thomas Funkhouser

367 Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction Richard Zhang, Phillip Isola, Alexei A. Efros

755 Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting Mariano Tepper, Guillermo Sapiro

875 Truncated Max-Of-Convex Models Pankaj Pansari, M. Pawan Kumar

908 Additive Component Analysis Calvin Murdock, Fernando De la Torre

1065 Subspace Clustering via Variance Regularized Ridge Regression Chong Peng, Zhao Kang, Qiang Cheng

1074 The Incremental Multiresolution Matrix Factorization Algorithm Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh

1299 Transformation-Grounded Image Generation Network for Novel 3D View Synthesis Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg

1424 Learning Dynamic Guidance for Depth Image Enhancement Shuhang Gu, Wangmeng Zuo, Shi Guo, Yunjin Chen, Chongyu Chen, Lei Zhang

1857 A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment Shuang Ma, Jing Liu, Chang Wen Chen

2083 Teaching Compositionality to CNNs Austin Stone, Huayan Wang, Michael Stark, Yi Liu, D. Scott Phoenix, Dileep George

2148 Using Ranking-CNN for Age Estimation Shixing Chen, Caojin Zhang, Ming Dong, Jialiang Le, Mike Rao

2293 Accurate Single Stage Detector Using Recurrent Rolling Convolution Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong Yan, Yu-Wing Tai, Li Xu

2412 A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, Yiran Chen, Hai (Helen) Li

2539 The Impact of Typicality for Informative Representative Selection Jawadul H. Bappy, Sujoy Paul, Ertem Tuncel, Amit K. Roy-Chowdhury

2542 Infinite Variational Autoencoder for Semi-Supervised Learning M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel

2635 SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani

2723 Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri

2966 Variational Bayesian Multiple Instance Learning With Gaussian Processes Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir

3096 Temporal Attention-Gated Model for Robust Sequence Classification Wenjie Pei, Tadas Baltrušaitis, David M.J. Tax, Louis-Philippe Morency

3138 Non-Uniform Subset Selection for Active Learning in Structured Data Sujoy Paul, Jawadul H. Bappy, Amit K. Roy-Chowdhury

3160 Colorization as a Proxy Task for Visual Understanding Gustav Larsson, Michael Maire, Gregory Shakhnarovich

3260 Shading Annotations in the Wild Balazs Kovacs, Sean Bell, Noah Snavely, Kavita Bala

3345 LCNN: Lookup-Based Convolutional Neural Network Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi

Object Recognition & Scene Understanding 17 Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang

144 Pixelwise Instance Segmentation With a Dynamically Instantiated Network Anurag Arnab, Philip H. S. Torr

225 Object Detection in Videos With Tubelet Proposal Networks Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang

226 AMVH: Asymmetric Multi-Valued Hashing Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan

372 Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang

483 Deep Visual-Semantic Quantization for Efficient Image Retrieval Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu

758 Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondřej Chum

777 Feature Pyramid Networks for Object Detection Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie

828 Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, Wangmeng Zuo

1161 StyleNet: Generating Attractive Visual Captions With Styles Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng

1665 Fine-Grained Recognition of Thousands of Object Categories With Single-Example Training Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok

1742 Improving Interpretability of Deep Neural Networks With Semantic Information Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang

2927 Video Captioning With Transferred Semantic Attributes Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei

3054 Fast Boosting Based Detection Using Scale Invariant Multimodal Multiresolution Filtered Features Arthur Daniel Costea, Robert Varga, Sergiu Nedevschi

Video Analytics 60 Temporal Convolutional Networks for Action Segmentation and Detection Colin Lea, Michael D. Flynn, René Vidal, Austin Reiter, Gregory D. Hager

141 Surveillance Video Parsing With Single Frame Supervision Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun

471 Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso

790 Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles

1035 Zero-Shot Action Recognition With Error-Correcting Output Codes Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang

2488 Enhancing Video Summarization via Vision-Language Embedding Bryan A. Plummer, Matthew Brown, Svetlana Lazebnik

3327 Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet Jianwen Xie, Song-Chun Zhu, Ying Nian Wu
Saturday, July 22, 2017 1330–1500 Kamehameha III 5 Spotlight 1-2A Object Recognition & Scene Understanding - Computer Vision & Language 99 Context-Aware Captions From Context-Agnostic Supervision Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik

121 Visual Dialog Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra

178 Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries Yuting Zhang, Luyao Yuan, Yijie Guo, Zhiyuan He, I-An Huang, Honglak Lee

617 Automatic Understanding of Image and Video Advertisements Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka

1145 Discover and Learn New Objects From Documentaries Kai Chen, Hang Song, Chen Change Loy, Dahua Lin

1962 Spatial-Semantic Image Search by Visual Feature Synthesis Long Mai, Hailin Jin, Zhe Lin, Chen Fang, Jonathan Brandt, Feng Liu

2243 Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogerio Feris

2390 Semantic Compositional Networks for Visual Captioning Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng

Oral 1-2A
108 Deep Reinforcement Learning-Based Image Captioning With Embedding Reward Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li

665 From Red Wine to Red Tomato: Composition With Context Ishan Misra, Abhinav Gupta, Martial Hebert

2454 Captioning Images With Diverse Objects Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, Trevor Darrell, Kate Saenko

3266 Self-Critical Sequence Training for Image Captioning Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jerret Ross, Vaibhava Goel
Saturday, July 22, 2017 1330–1500 Kalākaua Ballroom A-B 6 Spotlight 1-2B Analyzing Humans 1 210 Crossing Nets: Combining GANs and VAEs With a Shared Latent Space for Hand Pose Estimation Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao

554 Predicting Behaviors of Basketball Players From First Person Videos Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park

1270 LCR-Net: Localization-Classification-Regression for Human Pose Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid

1626 Learning Residual Images for Face Attribute Manipulation Wei Shen, Rujie Liu

2433 Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing Jin Sun, David W. Jacobs

2684 Deep Learning on Lie Groups for Skeleton-Based Action Recognition Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool

3245 Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

3269 Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

Oral 1-2B
244 Weakly Supervised Action Learning With RNN Based Fine-To-Coarse Modeling Alexander Richard, Hilde Kuehne, Juergen Gall

528 Disentangled Representation Learning GAN for Pose-Invariant Face Recognition Luan Tran, Xi Yin, Xiaoming Liu

2880 ArtTrack: Articulated Multi-Person Tracking in the Wild Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele

3550 Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh
Saturday, July 22, 2017 1330–1500 Kalākaua Ballroom C 7 Spotlight 1-2C Image Motion & Tracking; Video Analysis 67 Template Matching With Deformable Diversity Similarity Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor

140 Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-Identification Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang

799 Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, Min Sun

959 Bidirectional Multirate Reconstruction for Temporal Modeling in Videos Linchao Zhu, Zhongwen Xu, Yi Yang

985 Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi

1002 TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim

2894 Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing Yu-Chuan Su, Kristen Grauman

3289 Unsupervised Adaptive Re-Identification in Open World Dynamic Camera Networks Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

Oral 1-2C
526 Context-Aware Correlation Filter Tracking Matthias Mueller, Neil Smith, Bernard Ghanem

1277 Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos Hou-Ning Hu, Yen-Chen Lin, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun

1321 Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data Joel Janai, Fatma Güney, Jonas Wulff, Michael J. Black, Andreas Geiger

2448 CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, Shih-Fu Chang
Saturday, July 22, 2017 1500–1700 Kamehameha I 8 Poster 1-2 3D Computer Vision 111 Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment Erik Wijmans, Yasutaka Furukawa

346 A Combinatorial Solution to Non-Rigid 3D Shape-To-Image Matching Florian Bernard, Frank R. Schmidt, Johan Thunberg, Daniel Cremers

536 NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Piniés, Paul Newman

853 End-To-End Training of Hybrid CNN-CRF Models for Stereo Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock

951 Learning Shape Abstractions by Assembling Volumetric Primitives Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik

1117 Locality-Sensitive Deconvolution Networks With Gated Fusion for RGB-D Indoor Semantic Segmentation Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, Kaiqi Huang

1314 Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh

2144 Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Gérard Medioni

2554 End-To-End 3D Face Reconstruction With Deep Neural Networks Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris

2775 DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction Antonio Agudo, Francesc Moreno-Noguer

Analyzing Humans in Images 314 Finding Tiny Faces Peiyun Hu, Deva Ramanan

574 Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz

851 Deep Temporal Linear Encoding Networks Ali Diba, Vivek Sharma, Luc Van Gool

1006 Joint Registration and Representation Learning for Unconstrained Face Identification Munawar Hayat, Salman H. Khan, Naoufel Werghi, Roland Goecke

1034 3D Human Pose Estimation From a Single Image via Distance Matrix Regression Francesc Moreno-Noguer

1100 One-Shot Metric Learning for Person Re-Identification Slawomir Bąk, Peter Carr

1192 Generalized Rank Pooling for Activity Recognition Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould

2714 Deep Representation Learning for Human Motion Prediction and Classification Judith Bütepage, Michael J. Black, Danica Kragic, Hedvig Kjellström

3171 Interspecies Knowledge Transfer for Facial Keypoint Detection Maheen Rashid, Xiuye Gu, Yong Jae Lee

3624 Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization Runpeng Cui, Hu Liu, Changshui Zhang

Applications 3469 Modeling Sub-Event Dynamics in First-Person Action Recognition Hasan F. M. Zaki, Faisal Shafait, Ajmal Mian

Computational Photography 137 Turning an Urban Scene Video Into a Cinemagraph Hang Yan, Yebin Liu, Yasutaka Furukawa

2806 Light Field Reconstruction Using Deep Convolutional Network on EPI Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, Tianyou Chai, Yebin Liu

Image Motion & Tracking 900 FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, Thomas Brox

Low- & Mid-Level Vision 213 Attention-Aware Face Hallucination via Deep Reinforcement Learning Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li

286 Simple Does It: Weakly Supervised Instance and Semantic Segmentation Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele

450 Anti-Glare: Tightly Constrained Optimization for Eyeglass Reflection Removal Tushar Sandhan, Jin Young Choi

492 Deep Joint Rain Detection and Removal From a Single Image Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan

1149 Radiometric Calibration From Faces in Images Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi

1333 Webly Supervised Semantic Segmentation Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk

1495 Removing Rain From Single Images via a Deep Detail Network Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley

1512 Deep Crisp Boundaries Yupei Wang, Xin Zhao, Kaiqi Huang

1722 Coarse-To-Fine Segmentation With Shape-Tailored Continuum Scale Spaces Naeemullah Khan, Byung-Woo Hong, Anthony Yezzi, Ganesh Sundaramoorthi

1770 Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun

1825 Single Image Reflection Suppression Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Süsstrunk

2581 CASENet: Deep Category-Aware Semantic Edge Detection Zhiding Yu, Chen Feng, Ming-Yu Liu, Srikumar Ramalingam

3113 Reflectance Adaptive Filtering Improves Intrinsic Image Estimation Thomas Nestmeyer, Peter V. Gehler

Machine Learning 273 Conditional Similarity Networks Andreas Veit, Serge Belongie, Theofanis Karaletsos

360 Spatially Adaptive Computation Time for Residual Networks Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov

451 Xception: Deep Learning With Depthwise Separable Convolutions François Chollet

474 Feedback Networks Amir R. Zamir, Te-Lin Wu, Lin Sun, William B. Shen, Bertram E. Shi, Jitendra Malik, Silvio Savarese

661 Online Summarization via Submodular and Convex Optimization Ehsan Elhamifar, M. Clara De Paolis Kaluza

753 Deep MANTA: A Coarse-To-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis From Monocular Image Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Céline Teulière, Thierry Chateau

1329 Improving Pairwise Ranking for Multi-Label Image Classification Yuncheng Li, Yale Song, Jiebo Luo

1709 Active Convolution: Learning the Shape of Convolution for Image Classification Yunho Jeon, Junmo Kim

1916 Linking Image and Text With 2-Way Nets Aviv Eisenschtat, Lior Wolf

2100 Stacked Generative Adversarial Networks Xun Huang, Yixuan Li, Omid Poursaeed, John Hopcroft, Serge Belongie

2110 Image Splicing Detection via Camera Response Function Analysis Can Chen, Scott McCloskey, Jingyi Yu

2121 Building a Regular Decision Boundary With Deep Networks Edouard Oyallon

2516 More Is Less: A More Complicated Network With Less Inference Complexity Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan

2621 Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres

2721 Scale-Aware Face Detection Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu

3350 Deep Unsupervised Similarity Learning Using Partially Ordered Sets Miguel A. Bautista, Artsiom Sanakoyeu, Björn Ommer

3692 Generative Hierarchical Learning of Sparse FRAME Models Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, Song-Chun Zhu

Object Recognition & Scene Understanding 69 Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, Vlad I. Morariu, Larry S. Davis

446 Perceptual Generative Adversarial Networks for Small Object Detection Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, Shuicheng Yan

605 Emotion Recognition in Context Ronak Kosti, Jose M. Alvarez, Adria Recasens, Agata Lapedriza

610 Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework Jongyoo Kim, Sanghoon Lee

792 Dense Captioning With Joint Inference and Visual Context Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li

1062 CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick

1325 Cross-View Image Matching for Geo-Localization in Urban Environments Yicong Tian, Chen Chen, Mubarak Shah

1440 Matrix Tri-Factorization With Manifold Regularizations for Zero-Shot Learning Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, Heng Tao Shen, Jingkuan Song

1721 Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

2328 Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang

2650 Semantically Consistent Regularization for Zero-Shot Recognition Pedro Morgado, Nuno Vasconcelos

2689 Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes? Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle

Video Analytics 159 Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, Jie Chen, Guillermo Sapiro, Baochang Zhang

328 Predictive-Corrective Networks for Action Detection Achal Dave, Olga Russakovsky, Deva Ramanan

356 Budget-Aware Deep Semantic Video Segmentation Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

379 Unified Embedding and Metric Learning for Zero-Exemplar Event Detection Noureldien Hussein, Efstratios Gavves, Arnold W.M. Smeulders

569 Spatiotemporal Pyramid Network for Video Action Recognition Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu

810 ER3: A Unified Framework for Event Retrieval, Recognition and Recounting Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng

1345 FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos Suyog Dutt Jain, Bo Xiong, Kristen Grauman

1990 Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach Aidean Sharghi, Jacob S. Laurel, Boqing Gong

2945 Flexible Spatio-Temporal Networks for Video Prediction Chaochao Lu, Michael Hirsch, Bernhard Schölkopf

3131 Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos Konstantinos Papoutsakis, Costas Panagiotakis, Antonis A. Argyros
Sunday, July 23, 2017 0830–1000 Kamehameha III 9 Spotlight 2-1A Machine Learning 2 110 Dual Attention Networks for Multimodal Reasoning and Matching Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim

122 DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker

347 Interpretable Structure-Evolving LSTM Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing

804 ShapeOdds: Variational Bayesian Learning of Generative Shape Models Shireen Elhabian, Ross Whitaker

1339 Fast Video Classification via Adaptive Cascading of Deep Models Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy

2271 Deep Metric Learning via Facility Location Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy

3050 Semi-Supervised Deep Learning for Monocular Depth Map Prediction Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe

3568 Weakly Supervised Semantic Segmentation Using Web-Crawled Videos Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han

Oral 2-1A
720 Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, Lizhen Qu

767 Learning From Simulated and Unsupervised Images Through Adversarial Training Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb

932 Inverse Compositional Spatial Transformer Networks Chen-Hsuan Lin, Simon Lucey

1954 Densely Connected Convolutional Networks Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger
Sunday, July 23, 2017 0830–1000 Kalākaua Ballroom A-B 10 Spotlight 2-1B Computational Photography
205 Video Frame Interpolation via Adaptive Convolution Simon Niklaus, Long Mai, Feng Liu

340 FastMask: Segment Multi-Scale Object Candidates in One Shot Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha

571 Reconstructing Transient Images From Single-Photon Sensors Matthew O’Toole, Felix Heide, David B. Lindell, Kai Zang, Steven Diamond, Gordon Wetzstein

1043 Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao

1637 DeshadowNet: A Multi-Context Embedding Deep Network for Shadow Removal Liangqiong Qu, Jiandong Tian, Shengfeng He, Yandong Tang, Rynson W. H. Lau

2120 Illuminant-Camera Communication to Observe Moving Objects Under Strong External Light by Spread Spectrum Modulation Ryusuke Sagawa, Yutaka Satoh

2133 Photorealistic Facial Texture Inference Using Deep Neural Networks Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, Hao Li

3428 The Geometry of First-Returning Photons for Non-Line-Of-Sight Imaging Chia-Yin Tsai, Kiriakos N. Kutulakos, Srinivasa G. Narasimhan, Aswin C. Sankaranarayanan

Oral 2-1B
833 Unrolling the Shutter: CNN to Correct Motion Distortions Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan

1572 Light Field Blind Motion Deblurring Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi

2872 Computational Imaging on the Electric Grid Mark Sheinin, Yoav Y. Schechner, Kiriakos N. Kutulakos

3565 Deep Outdoor Illumination Estimation Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, Jean-François Lalonde
Sunday, July 23, 2017 0830–1000 Kalākaua Ballroom C 11 Spotlight 2-1C 3D Vision 2
270 Efficient Solvers for Minimal Problems by Syzygy-Based Reduction Viktor Larsson, Kalle Åström, Magnus Oskarsson

439 HSfM: Hybrid Structure-from-Motion Hainan Cui, Xiang Gao, Shuhan Shen, Zhanyi Hu

1071 Efficient Global Point Cloud Alignment Using Bayesian Nonparametric Mixtures Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III

1992 A New Rank Constraint on Multi-View Fundamental Matrices, and Its Application to Camera Location Recovery Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri

2129 IM2CAD Hamid Izadinia, Qi Shan, Steven M. Seitz

2513 ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner

2807 Noise Robust Depth From Focus Using a Ring Difference Filter Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, Hyowon Ha, In So Kweon

3074 Group-Wise Point-Set Registration Based on Rényi’s Second Order Entropy Luis G. Sanchez Giraldo, Erion Hasanbelliu, Murali Rao, Jose C. Principe

Oral 2-1C
190 A Point Set Generation Network for 3D Object Reconstruction From a Single Image Haoqiang Fan, Hao Su, Leonidas J. Guibas

1931 3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder Gil Elbaz, Tamar Avraham, Anath Fischer

2630 Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua

3060 DSAC - Differentiable RANSAC for Camera Localization Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother
Sunday, July 23, 2017 1000–1200 Kamehameha I 12 Poster 2-1 3D Computer Vision
299 Scalable Surface Reconstruction From Point Clouds With Extreme Scale and Density Diversity Christian Mostegel, Rudolf Prettenthaler, Friedrich Fraundorfer, Horst Bischof

555 Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, Joshua B. Tenenbaum

718 General Models for Rational Cameras and the Case of Two-Slit Projections Matthew Trager, Bernd Sturmfels, John Canny, Martial Hebert, Jean Ponce

1030 Accurate Depth and Normal Maps From Occlusion-Aware Focal Stack Symmetry Michael Strecke, Anna Alperovich, Bastian Goldluecke

1213 A Multi-View Stereo Benchmark With High-Resolution Images and Multi-Camera Videos Thomas Schöps, Johannes L. Schönberger, Silvano Galliani, Torsten Sattler, Konrad Schindler, Marc Pollefeys, Andreas Geiger

1343 Non-Contact Full Field Vibration Measurement Based on Phase-Shifting Hiroyuki Kayaba, Yuji Kokumai

2619 A Minimal Solution for Two-View Focal-Length Estimation Using Two Affine Correspondences Daniel Barath, Tekla Toth, Levente Hajder

3085 PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother

3398 An Efficient Background Term for 3D Reconstruction and Tracking With Smooth Surface Models Mariano Jaimez, Thomas J. Cashman, Andrew Fitzgibbon, Javier Gonzalez-Jimenez, Daniel Cremers

Analyzing Humans in Images
1040 Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild Shan Li, Weihong Deng, JunPing Du

1978 Procedural Generation of Videos to Train Deep Action Recognition Networks César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López

2015 BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis Shanxin Yuan, Qi Ye, Björn Stenger, Siddhant Jain, Tae-Kyun Kim

3115 DenseReg: Fully Convolutional Dense Shape Regression In-The-Wild Rıza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos

3839 Adaptive Class Preserving Representation for Image Classification Jian-Xun Mi, Qiankun Fu, Weisheng Li

Applications
1639 Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval Devraj Mandal, Kunal N. Chaudhury, Soma Biswas

2343 EAST: An Efficient and Accurate Scene Text Detector Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang

3151 VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen

Biomedical Image/Video Analysis
2808 Improving RANSAC-Based Segmentation Through CNN Encapsulation Dustin Morley, Hassan Foroosh

Computational Photography
32 Position Tracking for Virtual Reality Using Commodity WiFi Manikanta Kotaru, Sachin Katti

787 Designing Illuminant Spectral Power Distributions for Surface Classification Henryk Blasinski, Joyce Farrell, Brian Wandell

1627 One-Shot Hyperspectral Imaging Using Faced Reflectors Tsuyoshi Takatani, Takahito Aoto, Yasuhiro Mukaigawa

Image Motion & Tracking
884 Direct Photometric Alignment by Mesh Deformation Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, Minh Do, Jiangbo Lu

1208 CNN-Based Patch Matching for Optical Flow With Thresholded Hinge Embedding Loss Christian Bailer, Kiran Varanasi, Didier Stricker

1696 Optical Flow Estimation Using a Spatial Pyramid Network Anurag Ranjan, Michael J. Black

3208 Deep Network Flow for Multi-Object Tracking Samuel Schulter, Paul Vernaza, Wongun Choi, Manmohan Chandraker

Low- & Mid-Level Vision
36 Material Classification Using Frequency- and Depth-Dependent Time-Of-Flight Distortion Kenichiro Tanaka, Yasuhiro Mukaigawa, Takuya Funatomi, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi

585 Benchmarking Denoising Algorithms With Real Photographs Tobias Plötz, Stefan Roth

645 A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation Jinsun Park, Yu-Wing Tai, Donghyeon Cho, In So Kweon

703 StyleBank: An Explicit Representation for Neural Image Style Transfer Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua

1148 Specular Highlight Removal in Facial Images Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi

1162 Image Super-Resolution via Deep Recursive Residual Network Ying Tai, Jian Yang, Xiaoming Liu

1434 Deep Image Harmonization Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang

1535 Learning Deep CNN Denoiser Prior for Image Restoration Kai Zhang, Wangmeng Zuo, Shuhang Gu, Lei Zhang

1636 A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, Yao Wang

1703 GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence JiaWang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, Tan-Dat Nguyen, Ming-Ming Cheng

1710 Video Desnowing and Deraining Based on Matrix Decomposition Weihong Ren, Jiandong Tian, Zhi Han, Antoni Chan, Yandong Tang

1989 Real-Time Video Super-Resolution With Spatio-Temporal Networks and Motion Compensation Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi

2179 Deep Watershed Transform for Instance Segmentation Min Bai, Raquel Urtasun

2210 AnchorNet: A Weakly Supervised Network to Learn Geometry-Sensitive Features for Semantic Matching David Novotny, Diane Larlus, Andrea Vedaldi

3134 Learning Diverse Image Colorization Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, David Forsyth

3867 Awesome Typography: Statistics-Based Text Effects Transfer Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

Machine Learning
71 Unsupervised Video Summarization With Adversarial LSTM Networks Behrooz Mahasseni, Michael Lam, Sinisa Todorovic

221 Deep TEN: Texture Encoding Network Hang Zhang, Jia Xue, Kristin Dana

365 Order-Preserving Wasserstein Distance for Sequence Matching Bing Su, Gang Hua

700 Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid

1128 Hierarchical Multimodal Metric Learning for Multimodal Classification Heng Zhang, Vishal M. Patel, Rama Chellappa

1227 Efficient Linear Programming for Dense CRFs Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar

1355 Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold YoungJoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, Jin Young Choi

3380 Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation Paul Vernaza, Manmohan Chandraker

3384 Adversarial Discriminative Domain Adaptation Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

3854 Low-Rank-Sparse Subspace Representation for Robust Regression Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng

Object Recognition & Scene Understanding
353 Generating the Future With Adversarial Transformers Carl Vondrick, Antonio Torralba

547 Semantic Amodal Segmentation Yan Zhu, Yuandong Tian, Dimitris Metaxas, Piotr Dollár

741 Learning a Deep Embedding Model for Zero-Shot Learning Li Zhang, Tao Xiang, Shaogang Gong

757 BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition Jacob Chan, Jimmy Addison Lee, Qian Kemao

902 Growing a Brain: Fine-Tuning by Increasing Model Capacity Yu-Xiong Wang, Deva Ramanan, Martial Hebert

946 A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta

1036 Multiple Instance Detection Network With Online Instance Classifier Refinement Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu

1064 Kernel Pooling for Convolutional Neural Networks Yin Cui, Feng Zhou, Jiang Wang, Xiao Liu, Yuanqing Lin, Serge Belongie

1110 Learning Cross-Modal Embeddings for Cooking Recipes and Food Images Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, Ferda Ofli, Ingmar Weber, Antonio Torralba

1896 Zero-Shot Learning - the Good, the Bad and the Ugly Yongqin Xian, Bernt Schiele, Zeynep Akata

2150 DeepNav: Learning to Navigate Large Cities Samarth Brahmbhatt, James Hays

2289 Scene Graph Generation by Iterative Message Passing Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

2334 Visual Translation Embedding Network for Visual Relation Detection Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua

2780 Unsupervised Part Learning for Visual Recognition Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frédéric Jurie

3329 Comprehension-Guided Referring Expressions Ruotian Luo, Gregory Shakhnarovich

3426 Top-Down Visual Saliency Guided by Captions Vasili Ramanishka, Abir Das, Jianming Zhang, Kate Saenko

Theory
2208 Grassmannian Manifold Optimization Assisted Sparse Spectral Clustering Qiong Wang, Junbin Gao, Hong Li

Video Analytics
146 Video Propagation Networks Varun Jampani, Raghudeep Gadde, Peter V. Gehler

327 ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell

543 SCC: Semantic Context Cascade for Efficient Action Detection Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem

601 Hierarchical Boundary-Aware Neural Encoder for Video Captioning Lorenzo Baraldi, Costantino Grana, Rita Cucchiara

887 HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos Tan Yu, Yuwei Wu, Junsong Yuan

1146 Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe

1357 Temporal Action Localization by Structured Maximal Sums Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng

1805 Predicting Salient Face in Multiple-Face Videos Yufan Liu, Songyang Zhang, Mai Xu, Xuming He
Sunday, July 23, 2017 1300–1430 Kamehameha III 13 Spotlight 2-2A Object Recognition & Scene Understanding 1
6 Graph-Structured Representations for Visual Question Answering Damien Teney, Lingqiao Liu, Anton van den Hengel

133 Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher

780 Learned Contextual Feature Reweighting for Image Geo-Localization Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm

1165 End-To-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering Youngjae Yu, Hyungjin Ko, Jongwook Choi, Gunhee Kim

1194 Deep Cross-Modal Hashing Qing-Yuan Jiang, Wu-Jun Li

2326 Unambiguous Text Localization and Retrieval for Cluttered Scenes Xuejian Rong, Chucai Yi, Yingli Tian

2812 Bayesian Supervised Hashing Zihao Hu, Junxuan Chen, Hongtao Lu, Tongzhen Zhang

3562 Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy

Oral 2-2A
1142 Detecting Visual Relationships With Deep Relational Networks Bo Dai, Yuqi Zhang, Dahua Lin

1691 Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe

2955 Network Dissection: Quantifying Interpretability of Deep Visual Representations David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, Antonio Torralba

3856 AGA: Attribute-Guided Augmentation Mandar Dixit, Roland Kwitt, Marc Niethammer, Nuno Vasconcelos
Sunday, July 23, 2017 1300–1430 Kalākaua Ballroom A-B 14 Spotlight 2-2B Analyzing Humans 2
120 A Hierarchical Approach for Generating Descriptive Image Paragraphs Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei

505 Person Re-Identification in the Wild Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian

915 Scalable Person Re-Identification on Supervised Smoothed Manifold Song Bai, Xiang Bai, Qi Tian

945 Binge Watching: Scaling Affordance Learning From Sitcoms Xiaolong Wang, Rohit Girdhar, Abhinav Gupta

1262 Joint Detection and Identification Feature Learning for Person Search Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang

1362 Synthesizing Normalized Faces From Facial Identity Features Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman

2458 Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network Ji Lin, Liangliang Ren, Jiwen Lu, Jianjiang Feng, Jie Zhou

3276 Level Playing Field for Million Scale Face Recognition Aaron Nech, Ira Kemelmacher-Shlizerman

Oral 2-2B
1734 Re-Sign: Re-Aligned End-To-End Sequence Modelling With Deep Recurrent CNN-HMMs Oscar Koller, Sepehr Zargaran, Hermann Ney

1744 Social Scene Understanding: End-To-End Multi-Person Action Localization and Collective Activity Recognition Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese

2626 Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly Hao Jiang, Kristen Grauman

2873 Lip Reading Sentences in the Wild Joon Son Chung, Andrew Senior, Oriol Vinyals, Andrew Zisserman
Sunday, July 23, 2017 1300–1430 Kalākaua Ballroom C 15 Spotlight 2-2C Applications
728 Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection Yuliang Liu, Lianwen Jin

766 ChestX-ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald M. Summers

911 Attentional Push: A Deep Convolutional Network for Augmenting Image Salience With Shared Attention Modeling in Social Scenes Siavash Gorji, James J. Clark

919 Detecting Oriented Text in Natural Images by Linking Segments Baoguang Shi, Xiang Bai, Serge Belongie

967 Learning Video Object Segmentation From Static Images Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung

1579 Seeing Invisible Poses: Estimating 3D Body Pose From Egocentric Video Hao Jiang, Kristen Grauman

1822 Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski

3493 A Joint Speaker-Listener-Reinforcer Model for Referring Expressions Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

Oral 2-2C
788 End-To-End Learning of Driving Models From Large-Scale Video Datasets Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell

1775 Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, Jiashi Feng

2871 MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network Zizhao Zhang, Yuanpu Xie, Fuyong Xing, Mason McGough, Lin Yang
Sunday, July 23, 2017 1430–1630 Kamehameha I 16 Poster 2-2 3D Computer Vision
68 Surface Motion Capture Transfer With Gaussian Process Regression Adnane Boukhayma, Jean-Sébastien Franco, Edmond Boyer

326 Visual-Inertial-Semantic Scene Representation for 3D Object Detection Jingming Dong, Xiaohan Fei, Stefano Soatto

1646 Template-Based Monocular 3D Recovery of Elastic Shapes Using Lagrangian Multipliers Nazim Haouchine, Stephane Cotin

1892 Learning Category-Specific 3D Shape Models From Weakly Labeled 2D Images Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang

2017 Simultaneous Geometric and Radiometric Calibration of a Projector-Camera Pair Marjan Shahpaski, Luis Ricardo Sapaico, Gaspard Chevassus, Sabine Süsstrunk

2027 A Clever Elimination Strategy for Efficient Minimal Solvers Zuzana Kukelova, Joe Kileel, Bernd Sturmfels, Tomas Pajdla

2084 Learning Barycentric Representations of 3D Shapes for Sketch-Based 3D Shape Retrieval Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang

2860 Geodesic Distance Descriptors Gil Shamai, Ron Kimmel

Analyzing Humans in Images
158 Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks Hongsong Wang, Liang Wang

174 Forecasting Human Dynamics From Static Images Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, Jia Deng

477 Re-Ranking Person Re-Identification With k-Reciprocal Encoding Zhun Zhong, Liang Zheng, Donglin Cao, Shaozi Li

550 Deep Sequential Context Networks for Action Prediction Yu Kong, Zhiqiang Tao, Yun Fu

600 Global Context-Aware Attention LSTM Networks for 3D Action Recognition Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot

903 Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, Xiao-Jun Wu

1236 A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou

1309 Multiple People Tracking by Lifted Multicut and Person Re-Identification Siyu Tang, Mykhaylo Andriluka, Bjoern Andres, Bernt Schiele

2022 Towards Accurate Multi-Person Pose Estimation in the Wild George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

Applications
28 Towards a Quality Metric for Dense Light Fields Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafał K. Mantiuk, Karol Myszkowski, Hans-Peter Seidel, Piotr Didyk

1592 Controlling Perceptual Factors in Neural Style Transfer Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, Aaron Hertzmann, Eli Shechtman

Biomedical Image/Video Analysis
2842 Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation Kuan-Lun Tseng, Yen-Liang Lin, Winston Hsu, Chung-Yang Huang

2881 LSTM Self-Supervision for Detailed Behavior Analysis Biagio Brattoli, Uta Büchler, Anna-Sophia Wahl, Martin E. Schwab, Björn Ommer

Computational Photography
2082 A Wide-Field-Of-View Monocentric Light Field Camera Donald G. Dansereau, Glenn Schuster, Joseph Ford, Gordon Wetzstein

Image Motion & Tracking
765 S2F: Slow-To-Fast Interpolator Flow Yanchao Yang, Stefano Soatto

798 CLKN: Cascaded Lucas-Kanade Networks for Image Alignment Che-Han Chang, Chun-Nan Chou, Edward Y. Chang

2386 Multi-Object Tracking With Quadruplet Convolutional Neural Networks Jeany Son, Mooyeol Baek, Minsu Cho, Bohyung Han

Low- & Mid-Level Vision
55 Learning to Detect Salient Objects With Image-Level Supervision Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Baocai Yin, Xiang Ruan

846 From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton van den Hengel, Qinfeng Shi

1175 Co-Occurrence Filter Roy J. Jevnisek, Shai Avidan

1298 Fractal Dimension Invariant Filtering and Its CNN-Based Implementation Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha

1300 Noise-Blind Image Deblurring Meiguang Jin, Stefan Roth, Paolo Favaro

1387 Simultaneous Visual Data Completion and Denoising Based on Tensor Rank and Total Variation Minimization and Its Primal-Dual Splitting Algorithm Tatsuya Yokota, Hidekata Hontani

2146 HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors Vassileios Balntas, Karel Lenc, Andrea Vedaldi, Krystian Mikolajczyk

2245 Hyperspectral Image Super-Resolution via Non-Local Sparse Tensor Factorization Renwei Dian, Leyuan Fang, Shutao Li

2302 Reflection Removal Using Low-Rank Matrix Completion Byeong-Ju Han, Jae-Young Sim

2725 Object Co-Skeletonization With Co-Segmentation Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan

Machine Learning
129 Mining Object Parts From CNNs via Active Question-Answering Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu

224 PolyNet: A Pursuit of Structural Diversity in Very Deep Networks Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin

402 The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel

417 Joint Discriminative Bayesian Dictionary and Classifier Learning Naveed Akhtar, Ajmal Mian, Fatih Porikli

673 Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection Nikolay Savinov, Akihito Seki, Ľubor Ladický, Torsten Sattler, Marc Pollefeys

816 Outlier-Robust Tensor PCA Pan Zhou, Jiashi Feng

888 Learning Adaptive Receptive Fields for Deep Image Parsing Network Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu

1489 Learning an Invariant Hilbert Space for Domain Adaptation Samitha Herath, Mehrtash Harandi, Fatih Porikli

1607 Fixed-Point Factorized Networks Peisong Wang, Jian Cheng

1663 Discriminative Optimization: Theory and Applications to Point Cloud Registration Jayakorn Vongkulbhisal, Fernando De la Torre, João P. Costeira

1727 Online Asymmetric Similarity Learning for Cross-Modal Retrieval Yiling Wu, Shuhui Wang, Qingming Huang

1766 Improving Training of Deep Neural Networks via Singular Value Bounding Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu

2051 S3Pool: Pooling With Stochastic Spatial Sampling Shuangfei Zhai, Hui Wu, Abhishek Kumar, Yu Cheng, Yongxi Lu, Zhongfei Zhang, Rogerio Feris

2173 Sports Field Localization via Deep Structured Models Namdar Homayounfar, Sanja Fidler, Raquel Urtasun

2255 Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation Binghui Chen, Weihong Deng, Junping Du

2451 Switching Convolutional Neural Network for Crowd Counting Deepak Babu Sam, Shiv Surya, R. Venkatesh Babu

2575 Network Sketching: Exploiting Binary Structure in Deep CNNs Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen

2849 Multi-Task Clustering of Human Actions by Sharing Information Xiaoqiang Yan, Shizhe Hu, Yangdong Ye

2951 Soft-Margin Mixture of Regressions Dong Huang, Longfei Han, Fernando De la Torre

3053 Multigrid Neural Architectures Tsung-Wei Ke, Michael Maire, Stella X. Yu

3092 High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li

3107 Deep Quantization: Encoding Convolutional Activations With Deep Generative Model Zhaofan Qiu, Ting Yao, Tao Mei

3112 DOPE: Distributed Optimization for Pairwise Energies Jose Dolz, Ismail Ben Ayed, Christian Desrosiers

3198 Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky

Object Recognition & Scene Understanding
100 Polyhedral Conic Classifiers for Visual Object Detection and Classification Hakan Cevikalp, Bill Triggs

258 Incremental Kernel Null Space Discriminant Analysis for Novelty Detection Juncheng Liu, Zhouhui Lian, Yi Wang, Jianguo Xiao

283 Predicting Ground-Level Scene Layout From Aerial Imagery Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs

863 Deep Feature Flow for Video Recognition Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei

1010 Object-Aware Dense Semantic Correspondence Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen

1050 Semantic Regularisation for Recurrent Image Annotation Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

1633 Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua

1925 Fast-At: Fast Automatic Thumbnail Generation Using Deep Neural Networks Seyed A. Esmaeili, Bharat Singh, Larry S. Davis

1955 Multi-Level Attention Networks for Visual Question Answering Dongfei Yu, Jianlong Fu, Tao Mei, Yong Rui

2052 Generating Descriptions With Grounded and Co-Referenced People Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele

2960 Straight to Shapes: Real-Time Detection of Encoded Shapes Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr

3012 Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search Thanh-Toan Do, Dang-Khoa Le Tan, Trung T. Pham, Ngai-Man Cheung

3202 Improving Facial Attribute Prediction Using Semantic Segmentation Mahdi M. Kalayeh, Boqing Gong, Mubarak Shah

Video Analytics
2252 Learning Cross-Modal Deep Representations for Robust Pedestrian Detection Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, Nicu Sebe

2321 Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Ob