


CVPR 2016: Las Vegas, NV, USA
- 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-8851-1

Oral & Spotlight Session 1-1A
O1-1A: Image Captioning and Question Answering
- Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko, Trevor Darrell: Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. 1-10
- Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan L. Yuille, Kevin Murphy: Generation and Comprehension of Unambiguous Object Descriptions. 11-20
- Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, Alexander J. Smola: Stacked Attention Networks for Image Question Answering. 21-29
- Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han: Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction. 30-38
- Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein: Neural Module Networks. 39-48
S1-1A: Language and Vision
- Scott E. Reed, Zeynep Akata, Honglak Lee, Bernt Schiele: Learning Deep Representations of Fine-Grained Visual Descriptions. 49-58
- Zeynep Akata, Mateusz Malinowski, Mario Fritz, Bernt Schiele: Multi-cue Zero-Shot Learning with Strong Supervision. 59-68
- Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele: Latent Embeddings for Zero-Shot Classification. 69-77
- Roland Kwitt, Sebastian Hegenbart, Marc Niethammer: One-Shot Learning of Scene Locations via Feature Trajectory Transfer. 78-86
- Chuang Gan, Tianbao Yang, Boqing Gong: Learning Attributes Equals Multi-Source Domain Generalization. 87-97
- Carl Vondrick, Hamed Pirsiavash, Antonio Torralba: Anticipating Visual Representations from Unlabeled Video. 98-106
Oral & Spotlight Session 1-1B
O1-1B: Matching and Alignment
- Kwang Moo Yi, Yannick Verdie, Pascal Fua, Vincent Lepetit: Learning to Assign Orientations to Feature Points. 107-116
- Tinghui Zhou, Philipp Krähenbühl, Mathieu Aubry, Qi-Xing Huang, Alexei A. Efros: Learning Dense Correspondence via 3D-Guided Cycle Consistency. 117-126
- Shenlong Wang, Sean Ryan Fanello, Christoph Rhemann, Shahram Izadi, Pushmeet Kohli: The Global Patch Collider. 127-135
- Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony R. Dick, Ian D. Reid: Joint Probabilistic Matching Using m-Best Solutions. 136-145
- Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li: Face Alignment Across Large Poses: A 3D Solution. 146-155
S1-1B: Segmentation and Contour Detection
- Jie Feng, Brian L. Price, Scott Cohen, Shih-Fu Chang: Interactive Segmentation on RGBD Images via Cue Selection. 156-164
- Chen Liu, Pushmeet Kohli, Yasutaka Furukawa: Layered Scene Decomposition via the Occlusion-CRF. 165-173
- Michael Maire, Takuya Narihira, Stella X. Yu: Affinity CNN: Learning Pixel-Centric Pairwise Relations for Figure/Ground Embedding. 174-182
- Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele: Weakly Supervised Object Boundaries. 183-192
- Jimei Yang, Brian L. Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang: Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. 193-202
Poster Session P1-1
- Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony R. Dick, Anton van den Hengel: What Value Do Explicit High Level Concepts Have in Vision to Language Problems? 203-212
- Nati Ofir, Meirav Galun, Boaz Nadler, Ronen Basri: Fast Detection of Curved Edges at Low SNR. 213-221
- Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai: Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs. 222-230
- Yu Liu, Michael S. Lew: Learning Relaxed Deep Supervision for Better Edge Detection. 231-240
- Huan Fu, Chaohui Wang, Dacheng Tao, Michael J. Black: Occlusion Boundary Detection via Deep Exploration of Context. 241-250
- Zizhao Zhang, Fuyong Xing, Xiaoshuang Shi, Lin Yang: SemiContour: A Semi-Supervised Learning Approach for Contour Detection. 251-259
- Saurabh Singh, Derek Hoiem, David A. Forsyth: Learning to Localize Little Landmarks. 260-269
- Lingxi Xie, Liang Zheng, Jingdong Wang, Alan L. Yuille, Qi Tian: InterActive: Inter-Layer Activeness Propagation. 270-279
- Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-Bin Gao, Jianxin Wu, Jianfei Cai: Exploit Bounding Box Annotations for Multi-Label Object Recognition. 280-288
- Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys: TI-POOLING: Transformation-Invariant Pooling for Feature Learning in Convolutional Neural Networks. 289-297
- Edgar Simo-Serra, Hiroshi Ishikawa: Fashion Style in 128 Floats: Joint Ranking and Classification Using Weak Data for Feature Extraction. 298-307
- Yuhui Quan, Chenglong Bao, Hui Ji: Equiangular Kernel Dictionary Learning with Applications to Dynamic Texture Analysis. 308-316
- Yang Gao, Oscar Beijbom, Ning Zhang, Trevor Darrell: Compact Bilinear Pooling. 317-326
- Tsun-Yi Yang, Yen-Yu Lin, Yung-Yu Chuang: Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales. 327-335
- Swarna Kamlam Ravindran, Anurag Mittal: CoMaL: Good Features to Match on Object Boundaries. 336-345
- Yuan-Ting Hu, Yen-Yu Lin: Progressive Feature Matching with Alternate Descriptor Selection and Correspondence Enrichment. 346-354
- Da Chen, Jean-Marie Mirebeau, Laurent D. Cohen: A New Finsler Minimal Path Model with Curvature Penalization for Image Segmentation and Closed Contour Detection. 355-363
- Yuhua Chen, Dengxin Dai, Jordi Pont-Tuset, Luc Van Gool: Scale-Aware Alignment of Hierarchical Image Segmentation. 364-372
- Ning Xu, Brian L. Price, Scott Cohen, Jimei Yang, Thomas S. Huang: Deep Interactive Object Selection. 373-381
- Danna Gurari, Suyog Dutt Jain, Margrit Betke, Kristen Grauman: Pull the Plug? Predicting If Computers or Humans Should Segment Images. 382-391
- Yuka Kihara, Matvey Soloviev, Tsuhan Chen: In the Shadows, Shape Priors Shine: Using Occlusion to Improve Multi-region Segmentation. 392-401
- Loïc Alain Royer, David L. Richmond, Carsten Rother, Bjoern Andres, Dagmar Kainmueller: Convexity Shape Constraints for Image Segmentation. 402-410
- Ertunc Erdil, Sinan Yildirim, Müjdat Çetin, Tolga Tasdizen: MCMC Shape Sampling for Image Segmentation with Nonparametric Shape Priors. 411-419
- Fengyuan Zhu, Guangyong Chen, Pheng-Ann Heng: From Noise Modeling to Blind Image Denoising. 420-429
- Jaesik Park, Yu-Wing Tai, Sudipta N. Sinha, In-So Kweon: Efficient and Robust Color Consistency for Community Photo Collections. 430-438
- Or Lotan, Michal Irani: Needle-Match: Reliable Patch Matching under High Uncertainty. 439-448
- Kuldeep Kulkarni, Suhas Lohit, Pavan K. Turaga, Ronan Kerviche, Amit Ashok: ReconNet: Non-Iterative Reconstruction of Images from Compressively Sensed Measurements. 449-458
- Jin-shan Pan, Zhe Hu, Zhixun Su, Hsin-Ying Lee, Ming-Hsuan Yang: Soft-Segmentation Guided Object Motion Deblurring. 459-468
- Dongliang Cheng, Abdelrahman Kamel, Brian L. Price, Scott Cohen, Michael S. Brown: Two Illuminant Estimation and User Correction Preference. 469-477
- Guanbin Li, Yizhou Yu: Deep Contrast Learning for Salient Object Detection. 478-487
- Seung-Hwan Baek, Inchang Choi, Min H. Kim: Multiview Image Completion with Space Structure Propagation. 488-496
- Long Mai, Hailin Jin, Feng Liu: Composition-Preserving Deep Photo Aesthetics Assessment. 497-506
- Jiansheng Chen, Gaocheng Bai, Shaoheng Liang, Zhengqin Li: Automatic Image Cropping: A Computational Complexity Study. 507-515
- Neil D. B. Bruce, Christopher Catton, Sasa Janjic: A Deeper Look at Saliency: Feature Contrast, Semantics, and Beyond. 516-524
- Calden Wloka, John K. Tsotsos: Spatially Binned ROC: A Comprehensive Saliency Metric. 525-534
- Qiaosong Wang, Wen Zheng, Robinson Piramuthu: GraB: Visual Saliency via Novel Graph Model and Background Priors. 535-543
- Anna Volokitin, Michael Gygli, Xavier Boix: Predicting When Saliency Maps are Accurate and Eye Fixations Consistent. 544-552
- Oriel Frigo, Neus Sabater, Julie Delon, Pierre Hellier: Split and Match: Example-Based Adaptive Patch Sampling for Unsupervised Style Transfer. 553-561
- Lilian Calvet, Pierre Gurdjos, Carsten Griwodz, Simone Gasparini: Detection and Accurate Localization of Circular Fiducials under Highly Challenging Conditions. 562-570
- Luis Herranz, Shuqiang Jiang, Xiangyang Li: Scene Recognition with CNNs: Objects, Scales and Dataset Bias. 571-579
- Nicholas Rhinehart, Kris Makoto Kitani: Learning Action Maps of Large Environments via First-Person Vision. 580-588
- Yingying Zhang, Desen Zhou, Siqin Chen, Shenghua Gao, Yi Ma: Single-Image Crowd Counting via Multi-Column Convolutional Neural Network. 589-597
- Junting Pan, Elisa Sayrol, Xavier Giró-i-Nieto, Kevin McGuinness, Noel E. O'Connor: Shallow and Deep Convolutional Networks for Saliency Prediction. 598-606
- Mohammad Najafi, Sarah Taghavi Namin, Mathieu Salzmann, Lars Petersson: Sample and Filter: Nonparametric Scene Parsing via Efficient Filtering. 607-615
- Saumitro Dasgupta, Kuan Fang, Kevin Chen, Silvio Savarese: DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes. 616-624
- Siyu Zhu, Richard Zanibbi: A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification. 625-632
- Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Zequn Jie, Jiashi Feng, Liang Lin, Shuicheng Yan: Reversible Recursive Instance-Level Object Segmentation. 633-641
- Yao Lu, Xue Bai, Linda G. Shapiro, Jue Wang: Coherent Parametric Contours for Interactive Video Object Segmentation. 642-650
- Yong-Jin Liu, Cheng-Chi Yu, Minjing Yu, Ying He: Manifold SLIC: A Fast Method to Compute Content-Sensitive Superpixels. 651-659
- Gayoung Lee, Yu-Wing Tai, Junmo Kim: Deep Saliency with Encoded Low Level Distance Map and High Level Features. 660-668
- Ziyu Zhang, Sanja Fidler, Raquel Urtasun: Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs. 669-677
- Nian Liu, Junwei Han: DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection. 678-686
- Rong Quan, Junwei Han, Dingwen Zhang, Feiping Nie: Object Co-segmentation via Graph Optimized-Flexible Manifold Ranking. 687-695
- Won-Dong Jang, Chulwoo Lee, Chang-Su Kim: Primary Object Segmentation in Videos via Alternate Convex Optimization of Foreground and Background Distributions. 696-704
- Renjiao Yi, Jue Wang, Ping Tan: Automatic Fence Segmentation in Videos of Dynamic Scenes. 705-713
- Luca Del Pero, Susanna Ricco, Rahul Sukthankar, Vittorio Ferrari: Discovering the Physical Parts of an Articulated Object Class from Multiple Videos. 714-723
- Federico Perazzi, Jordi Pont-Tuset, Brian McWilliams, Luc Van Gool, Markus H. Gross, Alexander Sorkine-Hornung: A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation. 724-732
- Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K. Roy-Chowdhury, Larry S. Davis: Learning Temporal Regularity in Video Sequences. 733-742
- Nicolas Marki, Federico Perazzi, Oliver Wang, Alexander Sorkine-Hornung: Bilateral Space Video Segmentation. 743-751
- Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li: ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering. 752-760
Oral & Spotlight Session 1-2A
O1-2A: Object Recognition and Detection
- Abhinav Shrivastava, Abhinav Gupta, Ross B. Girshick: Training Region-Based Object Detectors with Online Hard Example Mining. 761-769
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun: Deep Residual Learning for Image Recognition. 770-778
- Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, Ali Farhadi: You Only Look Once: Unified, Real-Time Object Detection. 779-788
- Spyros Gidaris, Nikos Komodakis: LocNet: Improving Localization Accuracy for Object Detection. 789-798
- Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Chen Change Loy: Sketch Me That Shoe. 799-807
S1-2A: Object Detection 1
- Shuran Song, Jianxiong Xiao: Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images. 808-816
- Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang: Object Detection from Video Tubelets with Convolutional Neural Networks. 817-825
- Judy Hoffman, Saurabh Gupta, Trevor Darrell: Learning with Side Information through Modality Hallucination. 826-834
- Neelima Chavali, Harsh Agrawal, Aroma Mahendru, Dhruv Batra: Object-Proposal Evaluation Protocol is 'Gameable'. 835-844
- Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun: HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection. 845-853
- Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari: We Don't Need No Bounding-Boxes: Training Object Class Detectors Using Only Human Verification. 854-863
- Wanli Ouyang, Xiaogang Wang, Cong Zhang, Xiaokang Yang: Factors in Finetuning Deep Model for Object Detection with Long-Tail Distribution. 864-873
Oral & Spotlight Session 1-2B
O1-2B: Vision with Alternative Sensors
- Guy Rosman, Daniela Rus, John W. Fisher III: Information-Driven Adaptive Structured-Light Scanners. 874-883
- Patrick Bardow, Andrew J. Davison, Stefan Leutenegger: Simultaneous Optical Flow and Intensity Estimation from an Event Camera. 884-892
- Achuta Kadambi, Jamie Schiel, Ramesh Raskar: Macroscopic Interferometry: Rethinking Depth Estimation with Frequency-Domain Time-of-Flight. 893-902
- Huaijin G. Chen, Suren Jayasuriya, Jiyue Yang, Judy Stephen, Sriram Sivaramakrishnan, Ashok Veeraraghavan, Alyosha C. Molnar: ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks Using Angle Sensitive Pixels. 903-912
- Katherine L. Bouman, Michael D. Johnson, Daniel Zoran, Vincent L. Fish, Sheperd S. Doeleman, William T. Freeman: Computational Imaging for VLBI Image Reconstruction. 913-922
S1-2B: Video Analysis 1
- Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei: You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images. 923-932
- Fanyi Xiao, Yong Jae Lee: Track and Segment: An Iterative Unsupervised Approach for Video Object Proposals. 933-942
- Gao Zhu, Fatih Porikli, Hongdong Li: Beyond Local Search: Tracking Objects Everywhere with Instance-Specific Proposals. 943-951
- Hongkai Yu, Youjie Zhou, Jeff P. Simmons, Craig P. Przybyla, Yuewei Lin, Xiaochuan Fan, Yang Mi, Song Wang: Groupwise Tracking of Crowded Similar-Appearance Targets from Low-Continuity Image Sequences. 952-960
- Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese: Social LSTM: Human Trajectory Prediction in Crowded Spaces. 961-971
- Andrii Maksai, Xinchao Wang, Pascal Fua: What Players do with the Ball: A Physically Constrained Interaction Modeling. 972-981
- Ting Yao, Tao Mei, Yong Rui: Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization. 982-990
Poster Session P1-2
- Bugra Tekin, Artem Rozantsev, Vincent Lepetit, Pascal Fua: Direct Prediction of 3D Body Poses from Motion Compensated Sequences. 991-1000
- Michael Gygli, Yale Song, Liangliang Cao: Video2GIF: Automatic Generation of Animated GIFs from Video. 1001-1009
- Amir Shahroudy, Jun Liu, Tian-Tsong Ng, Gang Wang: NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. 1010-1019
- Bingbing Ni, Xiaokang Yang, Shenghua Gao: Progressively Parsing Interactional Objects for Fine Grained Action Detection. 1020-1028
- Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang: Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning. 1029-1038
- Jingjing Meng, Hongxing Wang, Junsong Yuan, Yap-Peng Tan: From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection. 1039-1048
- Zheng Shou, Dongang Wang, Shih-Fu Chang: Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs. 1049-1058
- Ke Zhang, Wei-Lun Chao, Fei Sha, Kristen Grauman: Summary Transfer: Exemplar-Based Subset Selection for Video Summarization. 1059-1067
- Yeong Jun Koh, Won-Dong Jang, Chang-Su Kim: POD: Discovering Primary Objects in Videos Based on Evolutionary Refinement of Object Recurrence, Background, and Primary Object Models. 1068-1076
- Waqas Sultani, Mubarak Shah: What If We Do Not have Multiple Videos of the Same Action? - Video Action Localization Using Web Images. 1077-1085
- Lu Zhang, Hayley Hung: Beyond F-Formations: Determining Social Involvement in Free Standing Conversing Groups from Static Images. 1086-1095
- Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang: DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. 1096-1104
- Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Rui Wang, Xiaochun Cao: SketchNet: Sketch Classification with Web Images. 1105-1113
- Xiaofan Zhang, Feng Zhou, Yuanqing Lin, Shaoting Zhang: Embedding Label Structures for Fine-Grained Feature Representation. 1114-1123
- Feng Zhou, Yuanqing Lin: Fine-Grained Image Classification by Exploring Bipartite-Graph Labels. 1124-1133
- Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian: Picking Deep Filter Responses for Fine-Grained Image Recognition. 1134-1142
- Han Zhang, Tao Xu, Mohamed Elhoseiny, Xiaolei Huang, Shaoting Zhang, Ahmed M. Elgammal, Dimitris N. Metaxas: SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition. 1143-1152
- Yin Cui, Feng Zhou, Yuanqing Lin, Serge J. Belongie: Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning with Humans in the Loop. 1153-1162
- Yaming Wang, Jonghyun Choi, Vlad I. Morariu, Larry S. Davis: Mining Discriminative Triplets of Patches for Fine-Grained Classification. 1163-1172
- Shaoli Huang, Zhe Xu, Dacheng Tao, Ya Zhang: Part-Stacked CNN for Fine-Grained Visual Categorization. 1173-1182
- Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie Zhou: Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks. 1183-1192
- Kilho Son, Daniel Moreno, James Hays, David B. Cooper: Solving Small-Piece Jigsaw Puzzles by Growing Consensus. 1193-1201
- Zhen Zhang, Qinfeng Shi, Julian J. McAuley, Wei Wei, Yanning Zhang, Anton van den Hengel: Pairwise Matching through Max-Weight Bipartite Belief Propagation. 1202-1210
- Takumi Kobayashi: Structured Feature Similarity with Explicit Feature Map. 1211-1219
- Mor Dar, Yael Moses: Temporal Epipolar Regions. 1220-1228
- Albert Haque, Alexandre Alahi, Li Fei-Fei: Recurrent Attention Models for Depth-Based Person Identification. 1229-1238
- Li Zhang, Tao Xiang, Shaogang Gong: Learning a Discriminative Null Space for Person Re-identification. 1239-1248
- Tong Xiao, Hongsheng Li, Wanli Ouyang, Xiaogang Wang: Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification. 1249-1258
- Shanshan Zhang, Rodrigo Benenson, Mohamed Omran, Jan Hendrik Hosang, Bernt Schiele: How Far are We from Solving Pedestrian Detection? 1259-1267
- Dapeng Chen, Zejian Yuan, Badong Chen, Nanning Zheng: Similarity Learning with Spatial Constraints for Person Re-identification. 1268-1277
- Ying Zhang, Baohua Li, Huchuan Lu, Atshushi Irie, Xiang Ruan: Sample-Specific SVM Learning for Person Re-identification. 1278-1287
- Faqiang Wang, Wangmeng Zuo, Liang Lin, David Zhang, Lei Zhang: Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification. 1288-1296
- Haoxiang Li, Jonathan Brandt, Zhe Lin, Xiaohui Shen, Gang Hua: A Multi-level Contextual Model for Person Recognition in Photo Albums. 1297-1305
- Peixi Peng, Tao Xiang, Yaowei Wang, Massimiliano Pontil, Shaogang Gong, Tiejun Huang, Yonghong Tian: Unsupervised Cross-Dataset Transfer Learning for Person Re-identification. 1306-1315
- Jiale Cao, Yanwei Pang, Xuelong Li: Pedestrian Detection Inspired by Appearance Constancy and Shape Symmetry. 1316-1324
- Niall McLaughlin, Jesús Martínez del Rincón, Paul Miller: Recurrent Convolutional Network for Video-Based Person Re-identification. 1325-1334
- De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, Nanning Zheng: Person Re-identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function. 1335-1344
- Jinjie You, Ancong Wu, Xiang Li, Wei-Shi Zheng: Top-Push Video-Based Person Re-identification. 1345-1353
- Yeong-Jun Cho, Kuk-Jin Yoon: Improving Person Re-identification via Pose-Aware Multi-shot Matching. 1354-1362
- Tetsu Matsukawa, Takahiro Okabe, Einoshin Suzuki, Yoichi Sato: Hierarchical Gaussian Descriptor for Person Re-identification. 1363-1372
- Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu: STCT: Sequentially Training Convolutional Networks for Visual Tracking. 1373-1381
- Juan-Manuel Pérez-Rúa, Tomás Crivelli, Patrick Bouthemy, Patrick Pérez: Determining Occlusions from Space and Time Image Reconstructions. 1382-1391
- Ju Hong Yoon, Chang-Ryeol Lee, Ming-Hsuan Yang, Kuk-Jin Yoon: Online Multi-object Tracking via Structural Constraint Event Aggregation. 1392-1400
- Luca Bertinetto, Jack Valmadre, Stuart Golodetz, Ondrej Miksik, Philip H. S. Torr: Staple: Complementary Learners for Real-Time Tracking. 1401-1409
- Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan: Robust Optical Flow Estimation of Double-Layer Images under Transparency or Reflection. 1410-1419
- Ran Tao, Efstratios Gavves, Arnold W. M. Smeulders: Siamese Instance Search for Tracking. 1420-1429
- Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg: Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking. 1430-1438
- Adel Bibi, Tianzhu Zhang, Bernard Ghanem: 3D Part-Based Sparse Tracker with Automatic Synchronization and Registration. 1439-1448
- Zhen Cui, Shengtao Xiao, Jiashi Feng, Shuicheng Yan: Recurrently Target-Attending Tracking. 1449-1458
- Ferran Diego, Fred A. Hamprecht: Structured Regression Gradient Boosting. 1459-1467
- Maksim Lapin, Matthias Hein, Bernt Schiele: Loss Functions for Top-k Error: Analysis and Insights. 1468-1477
- Valentina Zantedeschi, Rémi Emonet, Marc Sebban: Metric Learning as Convex Combinations of Local Models with Generalization Guarantees. 1478-1486
- Ziming Zhang, Yuting Chen, Venkatesh Saligrama: Efficient Training of Very Deep Neural Networks for Supervised Hashing. 1487-1495
- Saeid Motiian, Marco Piccirilli, Donald A. Adjeroh, Gianfranco Doretto: Information Bottleneck Learning Using Privileged Information for Visual Recognition. 1496-1505
Oral & Spotlight Session 2-1A
O2-1A: Recognition and Parsing in 3D
- Hossein Rahmani, Ajmal S. Mian: 3D Action Recognition from Novel Viewpoints. 1506-1515
- David F. Fouhey, Abhinav Gupta, Andrew Zisserman: 3D Shape Attributes. 1516-1524
- Zhile Ren, Erik B. Sudderth: Three-Dimensional Object Detection and Layout Prediction Using Clouds of Oriented Gradients. 1525-1533
- Iro Armeni, Ozan Sener, Amir R. Zamir, Helen Jiang, Ioannis K. Brilakis, Martin Fischer, Silvio Savarese: 3D Semantic Parsing of Large-Scale Indoor Spaces. 1534-1543
- Lingyu Wei, Qixing Huang, Duygu Ceylan, Etienne Vouga, Hao Li: Dense Human Body Correspondences Using Convolutional Networks. 1544-1553
S2-1A: Recognition Beyond Objects
- Joseph DeGol, Mani Golparvar Fard, Derek Hoiem: Geometry-Informed Material Recognition. 1554-1562
- Abhijit Bendale, Terrance E. Boult: Towards Open Set Deep Networks. 1563-1572
- Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen: What's Wrong with That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution. 1573-1581
- Torsten Sattler, Michal Havlena, Konrad Schindler, Marc Pollefeys: Large-Scale Location Recognition and the Geometric Burstiness Problem. 1582-1590
- Mark Wolff, Robert T. Collins, Yanxi Liu: Regularity-Driven Building Facade Matching between Aerial and Street Views. 1591-1600
- R. T. Pramod, S. P. Arun: Do Computational Models Differ Systematically from Human Object Perception? 1601-1609
Oral & Spotlight Session 2-1B
O2-1B: Image Processing and Restoration
- Timo Hackel, Jan Dirk Wegner, Konrad Schindler: Contour Detection in Unstructured 3D Point Clouds. 1610-1618
- Yin Li, Manohar Paluri, James M. Rehg, Piotr Dollár: Unsupervised Learning of Edges. 1619-1627
- Jin-shan Pan, Deqing Sun, Hanspeter Pfister, Ming-Hsuan Yang: Blind Image Deblurring Using Dark Channel Prior. 1628-1636
- Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee: Deeply-Recursive Convolutional Network for Image Super-Resolution. 1637-1645
- Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee: Accurate Image Super-Resolution Using Very Deep Convolutional Networks. 1646-1654
S2-1B: Image Processing and Restoration
- Nguyen Ho Man Rang, Michael S. Brown: RAW Image Reconstruction Using a Self-Contained sRGB-JPEG Image with Only 64 KB Overhead. 1655-1663
- Kede Ma, Qingbo Wu, Zhou Wang, Zhengfang Duanmu, Hongwei Yong, Hongliang Li, Lei Zhang: Group MAD Competition? A New Methodology to Compare Objective Image Quality Models. 1664-1673
- Dana Berman, Tali Treibitz, Shai Avidan: Non-local Image Dehazing. 1674-1682
- Seonghyeon Nam, Youngbae Hwang, Yasuyuki Matsushita, Seon Joo Kim: A Holistic Approach to Cross-Channel Image Noise Modeling and Its Application to Image Denoising. 1683-1691
- Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu, Shuhang Gu, Wangmeng Zuo, Lei Zhang: Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization. 1692-1700
- Wei-Sheng Lai, Jia-Bin Huang, Zhe Hu, Narendra Ahuja, Ming-Hsuan Yang: A Comparative Study for Single Image Blind Deblurring. 1701-1709
Poster Session P2-1
- Minh Vo, Srinivasa G. Narasimhan

, Yaser Sheikh:
Spatiotemporal Bundle Adjustment for Dynamic 3D Reconstruction. 1710-1718 - Ajad Chhatkuli, Daniel Pizarro

, Toby Collins, Adrien Bartoli
:
Inextensible Non-Rigid Shape-from-Motion by Second-Order Cone Programming. 1719-1727 - Johan Fredriksson, Viktor Larsson, Carl Olsson, Fredrik Kahl:

Optimal Relative Pose with Unknown Correspondences. 1728-1736 - Haifei Huang, Hui Zhang

, Yiu-Ming Cheung
:
Homography Estimation from the Common Self-Polar Triangle of Separate Ellipses. 1737-1744 - Maximilian Diebold, Bernd Jähne, Alexander Gatto:

Heterogeneous Light Fields. 1745-1753 - Anders P. Eriksson, John Bastian, Tat-Jun Chin, Mats Isaksson

:
A Consensus-Based Framework for Distributed Bundle Adjustment. 1754-1762 - Kyungdon Joo, Tae-Hyun Oh

, Junsik Kim, In-So Kweon:
Globally Optimal Manhattan Frame Estimation in Real-Time. 1763-1771 - Kai Han, Kwan-Yee K. Wong

, Dirk Schnieders, Miaomiao Liu
:
Mirror Surface Reconstruction under an Uncalibrated Camera. 1772-1780 - Guibo Luo, Yuesheng Zhu, Zhaotian Li, Liming Zhang

:
A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video. 1781-1789 - Yinqiang Zheng

, Laurent Kneip:
A Direct Least-Squares Solution to the PnP Problem with Unknown Focal Length. 1790-1798 - Zuzana Kukelova

, Jan Heller, Andrew W. Fitzgibbon:
Efficient Intersection of Three Quadrics and Applications in Computer Vision. 1799-1808 - Lior Talker, Yael Moses, Ilan Shimshoni

:
Using Spatial Order to Boost the Elimination of Incorrect Feature Matches. 1809-1817 - Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan

, Michael Felsberg
:
A Probabilistic Framework for Color-Based Point Set Registration. 1818-1826 - Dong Gong

, Mingkui Tan, Yanning Zhang, Anton van den Hengel, Qinfeng Shi
:
Blind Image Deconvolution by Automatic Gradient Activation. 1827-1836 - Eduardo Pérez-Pellitero, Jordi Salvador, Javier Ruiz Hidalgo, Bodo Rosenhahn:

PSyCo: Manifold Span Reduction for Super Resolution. 1837-1845 - Jochen Gast, Anita Sellent, Stefan Roth:

Parametric Object Motion from Blur. 1846-1854 - Zhe Hu, Lu Yuan, Stephen Lin, Ming-Hsuan Yang:

Image Deblurring Using Smartphone Inertial Sensors. 1855-1864 - Radu Timofte

, Rasmus Rothe, Luc Van Gool:
Seven Ways to Improve Example-Based Single Image Super Resolution. 1865-1873 - Wenzhe Shi, Jose Caballero, Ferenc Huszar, Johannes Totz, Andrew P. Aitken, Rob Bishop, Daniel Rueckert, Zehan Wang:

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. 1874-1883 - Xiaojun Chang

, Yaoliang Yu, Yi Yang, Eric P. Xing:
They are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers. 1884-1893 - Minghuang Ma, Haoqi Fan, Kris M. Kitani:

Going Deeper into First-Person Activity Recognition. 1894-1903 - Yang Zhou, Bingbing Ni, Richang Hong, Xiaokang Yang, Qi Tian:

Cascaded Interactional Targeting Network for Egocentric Video Analysis. 1904-1913 - Fabian Caba Heilbron, Juan Carlos Niebles

, Bernard Ghanem:
Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos. 1914-1923 - Basura Fernando, Peter Anderson, Marcus Hutter

, Stephen Gould:
Discriminative Hierarchical Rank Pooling for Activity Recognition. 1924-1932 - Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman:

Convolutional Two-Stream Network Fusion for Video Action Recognition. 1933-1941 - Shugao Ma, Leonid Sigal, Stan Sclaroff:

Learning Activity Progression in LSTMs for Activity Detection and Early Detection. 1942-1950 - Yingwei Li, Weixin Li

, Vijay Mahadevan, Nuno Vasconcelos
:
VLAD3: Encoding Dynamics of Deep Features for Action Recognition. 1951-1960 - Bharat Singh, Tim K. Marks, Michael J. Jones, Oncel Tuzel, Ming Shao:

A Multi-stream Bi-directional Recurrent Neural Network for Fine-Grained Action Detection. 1961-1970 - Mostafa S. Ibrahim, Srikanth Muralidharan, Zhiwei Deng, Arash Vahdat, Greg Mori:

A Hierarchical Deep Temporal Model for Group Activity Recognition. 1971-1980 - Ivan Lillo, Juan Carlos Niebles

, Alvaro Soto:
A Hierarchical Pose-Based Approach to Complex Action Understanding Using Dictionaries of Actionlets and Motion Poselets. 1981-1990 - Wangjiang Zhu, Jie Hu, Gang Sun, Xudong Cao, Yu Qiao

:
A Key Volume Mining Deep Framework for Action Recognition. 1991-1999 - Eng-Jon Ong, Miroslaw Bober

:
Improved Hamming Distance Search Using Variable Length Hashing. 2000-2008 - Jae-Pil Heo, Zhe Lin, Xiaohui Shen, Jonathan Brandt, Sung-Eui Yoon:

Shortlist Selection with Residual-Aware Distance Estimator for K-Nearest Neighbor Search. 2009-2017 - Xiaojuan Wang, Ting Zhang, Guo-Jun Qi

, Jinhui Tang
, Jingdong Wang
:
Supervised Quantization for Similarity Search. 2018-2026 - Patrick Wieschollek, Oliver Wang, Alexander Sorkine-Hornung, Hendrik P. A. Lensch:

Efficient Large-Scale Approximate Nearest Neighbor Search on the GPU. 2027-2035 - Ting Zhang, Jingdong Wang

:
Collaborative Quantization for Cross-Modal Similarity Search. 2036-2045 - Thi Quynh Nhi Tran, Hervé Le Borgne, Michel Crucianu:

Aggregating Image and Text Quantized Correlated Components. 2046-2054 - Artem Babenko, Victor S. Lempitsky:

Efficient Indexing of Billion-Scale Datasets of Deep Descriptors. 2055-2063 - Haomiao Liu, Ruiping Wang, Shiguang Shan

, Xilin Chen:
Deep Supervised Hashing for Fast Image Retrieval. 2064-2072 - Ahmet Iscen, Michael G. Rabbat, Teddy Furon:

Efficient Large-Scale Similarity Search Using Matrix Factorization. 2073-2081 - Theodora Kontogianni

, Markus Mathias, Bastian Leibe
:
Incremental Object Discovery in Time-Varying Image Collections. 2082-2090 - Jia-Bin Huang, Rich Caruana, Andrew Farnsworth

, Steve Kelling, Narendra Ahuja:
Detecting Migrating Birds at Night. 2091-2099 - Ilja Kuzborskij, Fabio Maria Carlucci

, Barbara Caputo:
When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks. 2100-2109 - Zhe Zhu, Dun Liang, Song-Hai Zhang, Xiaolei Huang, Baoli Li, Shi-Min Hu:

Traffic-Sign Detection and Classification in the Wild. 2110-2118 - Yuxing Tang, Josiah Wang, Boyang Gao, Emmanuel Dellandréa, Robert J. Gaizauskas, Liming Chen:

Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer. 2119-2128 - Fan Yang, Wongun Choi, Yuanqing Lin:

Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers. 2129-2137 - Keze Wang, Liang Lin, Wangmeng Zuo, Shuhang Gu, Lei Zhang:

Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection. 2138-2146 - Xiaozhi Chen, Kaustav Kundu, Ziyu Zhang, Huimin Ma, Sanja Fidler, Raquel Urtasun:

Monocular 3D Object Detection for Autonomous Driving. 2147-2156 - Radu Tudor Ionescu, Bogdan Alexe, Marius Leordeanu, Marius Popescu, Dim P. Papadopoulos, Vittorio Ferrari:

How Hard Can It Be? Estimating the Difficulty of Visual Search in an Image. 2157-2166 - Hongye Liu, Yonghong Tian, Yaowei Wang, Lu Pang, Tiejun Huang:

Deep Relative Distance Learning: Tell the Difference between Similar Vehicles. 2167-2175 - Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra M. Bhandarkar, Wojciech Matusik, Antonio Torralba:

Eye Tracking for Everyone. 2176-2184 - Zorah Lähner, Emanuele Rodolà, Frank R. Schmidt, Michael M. Bronstein, Daniel Cremers:

Efficient Globally Optimal 2D-to-3D Deformable Shape Matching. 2185-2193 - Viktoriia Sharmanska, Daniel Hernández-Lobato, José Miguel Hernández-Lobato, Novi Quadrianto:

Ambiguity Helps: Classification with Disagreements in Crowdsourced Annotations. 2194-2202 - Roozbeh Mottaghi, Hannaneh Hajishirzi, Ali Farhadi:

A Task-Oriented Approach for Cost-Sensitive Recognition. 2203-2211 - Sukrit Shankar, Duncan P. Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla:

Refining Architectures of Deep Convolutional Neural Networks. 2212-2220 - Ali Borji, Saeed Izadi, Laurent Itti:

iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning. 2221-2230 - Chen-Yu Lee, Simon Osindero:

Recursive Recurrent Nets with Attention Modeling for OCR in the Wild. 2231-2239 - Venkatesh N. Murthy, Vivek K. Singh, Terrence Chen, R. Manmatha, Dorin Comaniciu:

Deep Decision Network for Multi-class Image Classification. 2240-2248 - Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton van den Hengel:

Less is More: Zero-Shot Learning from Online Textual Documents with Noise Suppression. 2249-2257 - Wen Li, Dengxin Dai, Mingkui Tan, Dong Xu, Luc Van Gool:

Fast Algorithms for Linear and Kernel SVM+. 2258-2266
Oral & Spotlight Session 2-2A
O2-2A: Recognition and Labeling
- Guo-Jun Qi:

Hierarchically Gated Deep Networks for Semantic Segmentation. 2267-2275 - Liang Lin, Guangrun Wang, Rui Zhang, Ruimao Zhang, Xiaodan Liang, Wangmeng Zuo:

Deep Structured Scene Parsing by Learning with Image Descriptions. 2276-2284 - Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu:

CNN-RNN: A Unified Framework for Multi-label Image Classification. 2285-2294 - Jing Wang, Yu Cheng, Rogério Schmidt Feris:

Walk and Learn: Facial Attribute Representation Learning from Egocentric Video and Contextual Data. 2295-2304 - Arik Poznanski, Lior Wolf:

CNN-N-Gram for Handwriting Word Recognition. 2305-2314
S2-2A: Object Detection 2
- Ankush Gupta, Andrea Vedaldi, Andrew Zisserman:

Synthetic Data for Text Localisation in Natural Images. 2315-2324 - Russell Stewart, Mykhaylo Andriluka, Andrew Y. Ng:

End-to-End People Detection in Crowded Scenes. 2325-2333 - Wei-Chih Tu, Shengfeng He, Qingxiong Yang, Shao-Yi Chien:

Real-Time Salient Object Detection with a Minimum Spanning Tree. 2334-2342 - David Feng, Nick Barnes, Shaodi You, Chris McCarthy:

Local Background Enclosure for RGB-D Salient Object Detection. 2343-2350 - Yongxi Lu, Tara Javidi, Svetlana Lazebnik:

Adaptive Object Detection Using Adjacency and Zoom Prediction. 2351-2359 - Arthur Daniel Costea, Sergiu Nedevschi:

Semantic Channels for Fast Pedestrian Detection. 2360-2368 - Mahyar Najibi, Mohammad Rastegari, Larry S. Davis:

G-CNN: An Iterative Grid Based Object Detector. 2369-2377
Oral & Spotlight Session 2-2B
O2-2B: Computational Photography and Faces
- Wei Wang, Zhen Cui, Yan Yan, Jiashi Feng, Shuicheng Yan, Xiangbo Shu, Nicu Sebe:

Recurrent Face Aging. 2378-2386 - Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, Matthias Nießner:

Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. 2387-2395 - Sergey Tulyakov, Xavier Alameda-Pineda, Elisa Ricci, Lijun Yin, Jeffrey F. Cohn, Nicu Sebe:

Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions. 2396-2404 - Andrew Owens, Phillip Isola, Josh H. McDermott, Antonio Torralba, Edward H. Adelson, William T. Freeman:

Visually Indicated Sounds. 2405-2413 - Leon A. Gatys, Alexander S. Ecker, Matthias Bethge:

Image Style Transfer Using Convolutional Neural Networks. 2414-2423
S2-2B: Computational Photography and Biomedical Applications
- Le Hou, Dimitris Samaras, Tahsin M. Kurç, Yi Gao, James E. Davis, Joel H. Saltz:

Patch-Based Convolutional Neural Network for Whole Slide Tissue Image Classification. 2424-2433 - Hossam N. Isack, Olga Veksler, Milan Sonka, Yuri Boykov:

Hedgehog Shape Priors for Multi-Object Segmentation. 2434-2442 - Won Hwa Kim, Hyunwoo J. Kim, Nagesh Adluru, Vikas Singh:

Latent Variable Graphical Model Selection Using Harmonic Analysis: Applications to the Human Connectome Project (HCP). 2443-2451 - Gyeongmin Choe, Srinivasa G. Narasimhan, In-So Kweon:

Simultaneous Estimation of Near IR BRDF and Fine-Scale Surface Geometry. 2452-2460 - Seoung Wug Oh, Michael S. Brown, Marc Pollefeys, Seon Joo Kim:

Do It Yourself Hyperspectral Imaging with Everyday Digital Cameras. 2461-2469 - Joon-Young Lee, Kalyan Sunkavalli, Zhe Lin, Xiaohui Shen, In-So Kweon:

Automatic Content-Aware Color and Tone Stylization. 2470-2478 - Chuan Li, Michael Wand:

Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis. 2479-2486
Poster Session P2-2
- Hao Chen, Xiaojuan Qi, Lequan Yu, Pheng-Ann Heng:

DCAN: Deep Contour-Aware Networks for Accurate Gland Segmentation. 2487-2496 - Hoo-Chang Shin, Kirk Roberts, Le Lu, Dina Demner-Fushman, Jianhua Yao, Ronald M. Summers:

Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation. 2497-2506 - Huu Le, Tat-Jun Chin, David Suter:

Conformal Surface Alignment with Optimal Möbius Search. 2507-2516 - Seong Jae Hwang, Nagesh Adluru, Maxwell D. Collins, Sathya N. Ravi, Barbara B. Bendlin, Sterling C. Johnson, Vikas Singh:

Coupled Harmonic Bases for Longitudinal Characterization of Brain Networks. 2517-2525 - Jae Y. Shin, Nima Tajbakhsh, R. Todd Hurst, Christopher B. Kendall, Jianming Liang:

Automating Carotid Intima-Media Thickness Video Interpretation with Convolutional Neural Networks. 2526-2535 - Deepak Pathak, Philipp Krähenbühl, Jeff Donahue, Trevor Darrell, Alexei A. Efros:

Context Encoders: Feature Learning by Inpainting. 2536-2544 - Chenyi Lei, Dong Liu, Weiping Li, Zheng-Jun Zha, Houqiang Li:

Comparative Deep Learning of Hybrid Representations for Image Recommendations. 2545-2553 - Vadim Lebedev, Victor S. Lempitsky:

Fast ConvNets Using Group-Wise Brain Damage. 2554-2564 - Zeeshan Hayder, Xuming He, Mathieu Salzmann:

Learning to Co-Generate Object Proposals with a Deep Structured Network. 2565-2573 - Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Pascal Frossard:

DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks. 2574-2582 - Calvin Murdock, Zhen Li, Howard Zhou, Tom Duerig:

Blockout: Dynamic Model Selection for Hierarchical Deep Networks. 2583-2591 - Forrest N. Iandola, Matthew W. Moskewicz, Khalid Ashraf, Kurt Keutzer:

FireCaffe: Near-Linear Acceleration of Deep Neural Network Training on Compute Clusters. 2592-2600 - Sarah Rastegar, Mahdieh Soleymani Baghshah, Hamid R. Rabiee, Seyed Mohsen Shojaee:

MDL-CW: A Multimodal Deep Learning Framework with CrossWeights. 2601-2609 - Jörn-Henrik Jacobsen, Jan C. van Gemert, Zhongyu Lou, Arnold W. M. Smeulders:

Structured Receptive Fields in CNNs. 2610-2619 - Suriya Singh, Chetan Arora, C. V. Jawahar:

First Person Action Recognition Using Deep Learned Descriptors. 2620-2628 - Ryo Yonetani, Kris M. Kitani, Yoichi Sato:

Recognizing Micro-Actions and Reactions from Paired Egocentric Videos. 2629-2638 - Chunyu Wang, Yizhou Wang, Alan L. Yuille:

Mining 3D Key-Pose-Motifs for Action Recognition. 2639-2647 - Khurram Soomro, Haroon Idrees, Mubarak Shah:

Predicting the Where and What of Actors and Actions through Online Action Localization. 2648-2657 - Xiaolong Wang, Ali Farhadi, Abhinav Gupta:

Actions ~ Transformations. 2658-2667 - Young Joon Yoo, Kimin Yun, Sangdoo Yun, Jonghee Hong, Hawook Jeong, Jin Young Choi:

Visual Path Prediction in Complex Scenes with Crowded Moving Objects. 2668-2677 - Serena Yeung, Olga Russakovsky, Greg Mori, Li Fei-Fei:

End-to-End Learning of Action Detection from Frame Glimpses in Videos. 2678-2687 - Analí Alfaro, Domingo Mery, Alvaro Soto:

Action Recognition in Video Using Sparse Coding and Relative Features. 2688-2697 - Yang Wang, Minh Hoai:

Improving Human Action Recognition by Non-action Classification. 2698-2707 - Limin Wang, Yu Qiao, Xiaoou Tang, Luc Van Gool:

Actionness Estimation Using Hybrid Fully Convolutional Networks. 2708-2717 - Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:

Real-Time Action Recognition with Enhanced Motion Vector CNNs. 2718-2726 - Joo Ho Lee, Inchang Choi, Min H. Kim:

Laplacian Patch-Based Image Synthesis. 2727-2735 - Yu Li, Robby T. Tan, Xiaojie Guo, Jiangbo Lu, Michael S. Brown:

Rain Streak Removal Using Layer Priors. 2736-2744 - Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi:

Gradient-Domain Image Reconstruction Framework with Intensity-Range and Base-Structure Constraints. 2745-2753 - Jialei Wang, Peder A. Olsen, Andrew R. Conn, Aurélie C. Lozano:

Removing Clouds and Recovering Ground Observations in Satellite Image Sequences via Temporally Contiguous Robust Matrix Completion. 2754-2763 - Zhangyang Wang, Ding Liu, Shiyu Chang, Qing Ling, Yingzhen Yang, Thomas S. Huang:

D3: Deep Dual-Domain Based Fast Restoration of JPEG-Compressed Images. 2764-2772 - Vijay Rengarajan, A. N. Rajagopalan, Rangarajan Aravind:

From Bows to Arrows: Rolling Shutter Rectification of Urban Scenes. 2773-2781 - Xueyang Fu, Delu Zeng, Yue Huang, Xiao-Ping (Steven) Zhang, Xinghao Ding:

A Weighted Variational Model for Simultaneous Reflectance and Illumination Estimation. 2782-2790 - Tsung-Yu Lin, Subhransu Maji:

Visualizing and Understanding Deep Texture Representations. 2791-2799 - Jin-shan Pan, Zhouchen Lin, Zhixun Su, Ming-Hsuan Yang:

Robust Kernel Estimation with Outliers Handling for Image Deblurring. 2800-2808 - Hanwang Zhang, Xindi Shang, Wenzhuo Yang, Huan Xu, Huan-Bo Luan, Tat-Seng Chua:

Online Collaborative Learning for Open-Vocabulary Visual Classifiers. 2809-2817 - Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Zbigniew Wojna:

Rethinking the Inception Architecture for Computer Vision. 2818-2826 - Saurabh Gupta, Judy Hoffman, Jitendra Malik:

Cross Modal Distillation for Supervision Transfer. 2827-2836 - Trung T. Pham, Seyed Hamid Rezatofighi, Ian D. Reid, Tat-Jun Chin:

Efficient Point Process Inference for Large-Scale Object Detection. 2837-2845 - Hakan Bilen, Andrea Vedaldi:

Weakly Supervised Deep Detection Networks. 2846-2854 - Jacob Chan, Jimmy Addison Lee, Kemao Qian:

BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition. 2855-2863 - Suyog Dutt Jain, Kristen Grauman:

Active Image Segmentation Propagation. 2864-2873 - Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross B. Girshick:

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks. 2874-2883 - Gong Cheng, Peicheng Zhou, Junwei Han:

RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection. 2884-2893 - Stefan Mathe, Aleksis Pirinen, Cristian Sminchisescu:

Reinforcement Learning for Visual Object Detection. 2894-2902 - Inbar Huberman, Raanan Fattal:

Detecting Repeating Objects Using Patch Correlation Analysis. 2903-2911 - Sebastian Lapuschkin, Alexander Binder, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek:

Analyzing Classifiers: Fisher Vectors and Deep Neural Networks. 2912-2920 - Bolei Zhou, Aditya Khosla, Àgata Lapedriza, Aude Oliva, Antonio Torralba:

Learning Deep Features for Discriminative Localization. 2921-2929 - Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross B. Girshick:

Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels. 2930-2939 - Lluís Castrejón, Yusuf Aytar, Carl Vondrick, Hamed Pirsiavash, Antonio Torralba:

Learning Aligned Cross-Modal Representations from Weakly Aligned Data. 2940-2949 - Sijia Cai, Lei Zhang, Wangmeng Zuo, Xiangchu Feng:

A Probabilistic Collaborative Representation Based Approach for Pattern Classification. 2950-2959 - Hexiang Hu, Guang-Tong Zhou, Zhiwei Deng, Zicheng Liao, Greg Mori:

Learning Structured Inference Neural Networks with Label Relations. 2960-2968 - Hongyuan Zhu, Jean-Baptiste Weibel, Shijian Lu:

Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition. 2969-2976 - Qiang Li, Maoying Qiao, Wei Bian, Dacheng Tao:

Conditional Graphical Lasso for Multi-label Image Classification. 2977-2986 - Zijun Wei, Minh Hoai:

Region Ranking SVM for Image Classification. 2987-2996 - Carl Vondrick, Deniz Oktay, Hamed Pirsiavash, Antonio Torralba:

Predicting Motivations of Actions by Leveraging Text. 2997-3005 - Jakub Sochor, Adam Herout, Jirí Havel:

BoxCars: 3D Boxes as CNN Input for Improved Fine-Grained Vehicle Recognition. 3006-3015 - Xu Liu, Zilei Wang, Jiashi Feng, Hongsheng Xi:

Highway Vehicle Counting in Compressed Domain. 3016-3024 - Shiyao Huang, Xianghua Ying, Jiangpeng Rong, Zeyu Shang, Hongbin Zha:

Camera Calibration from Periodic Motion of a Pedestrian. 3025-3033
Oral & Spotlight Session 3-1A
O3-1A: Actions and Human Pose
- Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould:

Dynamic Image Networks for Action Recognition. 3034-3042 - Vignesh Ramanathan, Jonathan Huang, Sami Abu-El-Haija, Alexander N. Gorban, Kevin Murphy, Li Fei-Fei:

Detecting Events and Key Actors in Multi-person Videos. 3043-3053 - Behrooz Mahasseni, Sinisa Todorovic:

Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition. 3054-3062 - James Charles, Tomas Pfister, Derek R. Magee, David C. Hogg, Andrew Zisserman:

Personalizing Human Video Pose Estimation. 3063-3072 - Wei Yang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:

End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation. 3073-3082
S3-1A: Activity Recognition
- Chenliang Xu, Jason J. Corso:

Actor-Action Semantic Segmentation with Grouping Process Models. 3083-3092 - Jun Yuan, Bingbing Ni, Xiaokang Yang, Ashraf A. Kassim:

Temporal Action Localization with Pyramid of Score Distribution Features. 3093-3102 - Katsunori Ohnishi, Atsushi Kanehira, Asako Kanezaki, Tatsuya Harada:

Recognizing Activities of Daily Living with a Wrist-Mounted Camera. 3103-3111 - Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal:

Harnessing Object and Scene Semantics for Large-Scale Video Understanding. 3112-3121 - Jinsoo Choi, Tae Hyun Oh, In-So Kweon:

Video-Story Composition via Plot Analysis. 3122-3130 - Alexander Richard, Juergen Gall:

Temporal Action Detection Using a Statistical Language Model. 3131-3140
Oral & Spotlight Session 3-1B
O3-1B: Semantic Segmentation
- Shu Liu, Xiaojuan Qi, Jianping Shi, Hong Zhang, Jiaya Jia:

Multi-scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation. 3141-3149 - Jifeng Dai, Kaiming He, Jian Sun:

Instance-Aware Semantic Segmentation via Multi-task Network Cascades. 3150-3158 - Di Lin, Jifeng Dai, Jiaya Jia, Kaiming He, Jian Sun:

ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation. 3159-3167 - Abhijit Kundu, Vibhav Vineet, Vladlen Koltun:

Feature Space Optimization for Semantic Video Segmentation. 3168-3175 - Maros Blaha, Christoph Vogel, Audrey Richard, Jan Dirk Wegner, Thomas Pock, Konrad Schindler:

Large-Scale Semantic 3D Reconstruction: An Adaptive Multi-resolution Model for Multi-class Volumetric Labeling. 3176-3184
S3-1B: Semantic Parsing and Segmentation
- Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, Shuicheng Yan:

Semantic Object Parsing with Local-Global Long Short-Term Memory. 3185-3193 - Guosheng Lin, Chunhua Shen, Anton van den Hengel, Ian D. Reid:

Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation. 3194-3203 - Seunghoon Hong, Junhyuk Oh, Honglak Lee, Bohyung Han:

Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network. 3204-3212 - Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele:

The Cityscapes Dataset for Semantic Urban Scene Understanding. 3213-3223 - Raviteja Vemulapalli, Oncel Tuzel, Ming-Yu Liu, Rama Chellappa:

Gaussian Conditional Random Field Network for Semantic Segmentation. 3224-3233 - Germán Ros, Laura Sellart, Joanna Materzynska, David Vázquez, Antonio M. López:

The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes. 3234-3243
Poster Session P3-1
- Alex Locher, Michal Perdoch, Luc Van Gool:

Progressive Prioritized Multi-view Stereo. 3244-3252 - Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker:

WarpNet: Weakly Supervised Matching for Single-View Reconstruction. 3253-3261 - Ole Johannsen, Antonin Sulc, Bastian Goldluecke:

What Sparse Light Field Coding Reveals about Scene Structure. 3262-3270 - Hao Wang, Jun Wang, Liang Wang:

Online Reconstruction of Indoor Scenes from RGB-D Streams. 3271-3279 - Ali Osman Ulusoy, Michael J. Black, Andreas Geiger:

Patches, Planes and Probabilities: A Non-Local Prior for Volumetric 3D Reconstruction. 3280-3289 - Ian Schillebeeckx, Robert Pless:

Single Image Camera Calibration with Lenticular Arrays for Augmented Reality. 3290-3298 - Diego Thomas, Rin-Ichiro Taniguchi:

Augmented Blendshapes for Real-Time Simultaneous 3D Head Modeling and Facial Motion Capture. 3299-3308 - Jin Xie, Meng Wang, Yi Fang:

Learned Binary Spectral Shape Descriptor for 3D Shape Correspondence. 3309-3317 - Luca Magri, Andrea Fusiello:

Multiple Models Fitting as a Set Coverage Problem. 3318-3326 - Cédric Verleysen, Christophe De Vleeschouwer:

Piecewise-Planar 3D Approximation from Wide-Baseline Stereo. 3327-3336 - Olivier Saurer, Marc Pollefeys, Gim Hee Lee:

Sparse to Dense 3D Reconstruction from Rolling Shutter Images. 3337-3345 - Matthew Trager, Martial Hebert, Jean Ponce:

Consistency of Silhouettes and Their Duals. 3346-3354 - Cenek Albl, Zuzana Kukelova, Tomás Pajdla:

Rolling Shutter Absolute Pose Problem with Known Vertical Direction. 3355-3363 - Eric Brachmann, Frank Michel, Alexander Krull, Michael Ying Yang, Stefan Gumhold, Carsten Rother:

Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image. 3364-3372 - Andrey Bushnevskiy, Lorenzo Sorgi, Bodo Rosenhahn:

Multicamera Calibration from Visible and Mirrored Epipoles. 3373-3381 - Lazaros Zafeiriou, Epameinondas Antonakos, Stefanos Zafeiriou, Maja Pantic:

Joint Unsupervised Deformable Spatio-Temporal Alignment of Sequences. 3382-3390 - Kaili Zhao, Wen-Sheng Chu, Honggang Zhang:

Deep Region and Multi-label Learning for Facial Action Unit Detection. 3391-3399 - Yue Wu, Qiang Ji:

Constrained Joint Cascade Regression Framework for Simultaneous Facial Action Unit Recognition and Facial Landmark Detection. 3400-3408 - Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang:

Unconstrained Face Alignment via Cascaded Compositional Learning. 3409-3417 - Marcel Piotraschke, Volker Blanz:

Automated 3D Face Reconstruction from Multiple Images Using Quality Measures. 3418-3427 - Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen:

Occlusion-Free Face Alignment: Deep Regression Networks Coupled with De-Corrupt AutoEncoders. 3428-3437 - Zheng Zhang, Jeffrey M. Girard, Yue Wu, Xing Zhang, Peng Liu, Umur A. Ciftci, Shaun J. Canavan, Michael Reale, Andrew Horowitz, Huiyuan Yang, Jeffrey F. Cohn, Qiang Ji, Lijun Yin:

Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis. 3438-3446 - Pei Yu, Jiahuan Zhou, Ying Wu:

Learning Reconstruction-Based Remote Gaze Estimation. 3447-3455 - Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu:

Joint Training of Cascaded CNN for Face Detection. 3456-3465 - Rui Zhao, Quan Gan, Shangfei Wang, Qiang Ji:

Facial Expression Intensity Estimation Using Ordinal Information. 3466-3474 - Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce:

Proposal Flow. 3475-3484 - Chen Sun, Manohar Paluri, Ronan Collobert, Ram Nevatia, Lubomir D. Bourdev:

ProNet: Learning to Propose Object-Specific Boxes for Cascaded Neural Networks. 3485-3493 - Christopher Thomas, Adriana Kovashka:

Seeing Behind the Camera: Identifying the Authorship of a Photograph. 3494-3502 - Shuochen Su, Felix Heide, Robin Swanson, Jonathan Klein, Clara Callenberg, Matthias B. Hullin, Wolfgang Heidrich:

Material Classification Using Raw Time-of-Flight Measurements. 3503-3511 - Dong Li, Jia-Bin Huang, Yali Li, Shengjin Wang, Ming-Hsuan Yang:

Weakly Supervised Object Localization with Progressive Domain Adaptation. 3512-3520 - Roozbeh Mottaghi, Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi:

Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images. 3521-3529 - Ali Harakeh, Daniel C. Asmar, Elie A. Shammas:

Identifying Good Training Data for Self-Supervised Free Space Estimation. 3530-3538 - Hani Altwaijry, Eduard Trulls, James Hays, Pascal Fua, Serge J. Belongie:

Learning to Match Aerial Images with Deep Attentive Architectures. 3539-3547 - Krishna Kumar Singh, Fanyi Xiao, Yong Jae Lee:

Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection. 3548-3556 - Ali Diba, Ali Mohammad Pazandeh, Hamed Pirsiavash, Luc Van Gool:

DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns. 3557-3565 - Hojin Cho, Myung-Chul Sung, Bongjin Jun:

Canny Text Detector: Fast and Robust Scene Text Localization Algorithm. 3566-3573 - Di Hu, Xuelong Li, Xiaoqiang Lu:

Temporal Multimodal Learning in Audiovisual Speech Recognition. 3574-3582 - Andreas Doumanoglou, Rigas Kouskouridas, Sotiris Malassiotis, Tae-Kyun Kim:

Recovering 6D Object Pose and Predicting Next-Best-View in the Crowd. 3583-3592 - Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann:

Robust 3D Hand Pose Estimation in Single Depth Images: From Single-View CNN to Multi-View CNNs. 3593-3601 - Gedas Bertasius, Jianbo Shi, Lorenzo Torresani:

Semantic Segmentation with Boundary Neural Fields. 3602-3610 - Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun:

HD Maps: Fine-Grained Road Segmentation by Parsing Ground and Aerial Images. 3611-3619 - Bing Shuai, Zhen Zuo, Bing Wang, Gang Wang:

DAG-Recurrent Neural Networks for Scene Labeling. 3620-3629 - Baisheng Lai, Xiaojin Gong:

Saliency Guided Dictionary Learning for Weakly-Supervised Image Parsing. 3630-3639 - Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, Alan L. Yuille:

Attention to Scale: Scale-Aware Semantic Image Segmentation. 3640-3649 - Nasim Souly, Mubarak Shah:

Scene Labeling Using Sparse Precision Matrix. 3650-3658 - Ke Li, Bharath Hariharan, Jitendra Malik:

Iterative Instance Segmentation. 3659-3667 - Jason Kuen, Zhenhua Wang, Gang Wang:

Recurrent Attentional Networks for Saliency Detection. 3668-3677 - Guillaume Seguin, Piotr Bojanowski, Rémi Lajugie, Ivan Laptev:

Instance-Level Video Segmentation from Object Tracks. 3678-3687 - Jun Xie, Martin Kiefel, Ming-Ting Sun, Andreas Geiger:

Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer. 3688-3697 - Amir Kolaman, Maxim Lvov, Rami R. Hagege, Hugo Guterman:

Amplitude Modulated Video Camera - Light Separation in Dynamic Scenes. 3698-3706 - Boxin Shi, Zhe Wu, Zhipeng Mo, Dinglong Duan, Sai-Kit Yeung, Ping Tan:

A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo. 3707-3716 - Ting-Chun Wang, Manohar Srikanth, Ravi Ramamoorthi:

Depth from Semi-Calibrated Stereo and Defocus. 3717-3726 - Ying Fu, Yinqiang Zheng, Imari Sato, Yoichi Sato:

Exploiting Spectral-Spatial Correlation for Coded Hyperspectral Image Restoration. 3727-3736 - Julie Chang, Isaac Kauvar, Xuemei Hu, Gordon Wetzstein:

Variable Aperture Light Field Photography: Overcoming the Diffraction-Limited Spatio-Angular Resolution Tradeoff. 3737-3745 - Stefan Heber, Thomas Pock:

Convolutional Networks for Shape from Light Field. 3746-3754 - Rajat Aggarwal, Amrisha Vohra, Anoop M. Namboodiri:

Panoramic Stereo Videos with a Single Camera. 3755-3763 - Mark Sheinin, Yoav Y. Schechner:

The Next Best Underwater View. 3764-3773 - Yoshie Kobayashi, Tetsuro Morimoto, Imari Sato, Yasuhiro Mukaigawa, Takao Tomono, Katsushi Ikeuchi:

Reconstructing Shapes and Appearances of Thin Film Objects Using RGB Images. 3774-3782 - Tomas F. Yago Vicente, Minh Hoai, Dimitris Samaras:

Noisy Label Recovery for Shadow Detection in Unfamiliar Domains. 3783-3792
Oral & Spotlight Session 3-2A
O3-2A: Video Understanding
- Oscar Koller, Hermann Ney, Richard Bowden:

Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled. 3793-3802 - Bo Li, Tianfu Wu, Caiming Xiong, Song-Chun Zhu:

Recognizing Car Fluents from Video. 3803-3812 - Edward Johns, Stefan Leutenegger, Andrew J. Davison:

Pairwise Decomposition of Image Sequences for Active Multi-view Recognition. 3813-3822 - Yixin Zhu, Chenfanfu Jiang, Yibiao Zhao, Demetri Terzopoulos, Song-Chun Zhu:

Inferring Forces and Learning Human Utilities from Videos. 3823-3833 - Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi:

Force from Motion: Decoding Physical Sensation in a First Person Video. 3834-3842
S3-2A: Video Analysis 2
- Pan Ji, Hongdong Li, Mathieu Salzmann, Yiran Zhong:

Robust Multi-Body Feature Tracker: A Segmentation-Free Approach. 3843-3851 - Dinesh Jayaraman, Kristen Grauman:

Slow and Steady Feature Analysis: Higher Order Temporal Coherence in Video. 3852-3861 - Chun-Hao Huang, Benjamin Allain, Jean-Sébastien Franco, Nassir Navab, Slobodan Ilic, Edmond Boyer:

Volumetric 3D Tracking by Detection. 3862-3870 - Shoou-I Yu, Deyu Meng, Wangmeng Zuo, Alexander G. Hauptmann:

The Solution Path Algorithm for Identity-Aware Multi-object Tracking. 3871-3879 - Tianzhu Zhang, Adel Bibi, Bernard Ghanem:

In Defense of Sparse Tracking: Circulant Sparse Tracker. 3880-3888 - Laura Sevilla-Lara, Deqing Sun, Varun Jampani, Michael J. Black:

Optical Flow with Semantic Segmentation and Localized Layers. 3889-3898 - Yi-Hsuan Tsai, Ming-Hsuan Yang, Michael J. Black:

Video Segmentation via Object Flow. 3899-3908
Oral & Spotlight Session 3-2B
O3-2B: Grouping and Optimization Methods
- Marc T. Law, Yaoliang Yu, Matthieu Cord, Eric P. Xing:

Closed-Form Training of Mahalanobis Distance for Supervised Clustering. 3909-3917 - Chong You, Daniel P. Robinson, René Vidal:

Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit. 3918-3927 - Chong You, Chun-Guang Li, Daniel P. Robinson, René Vidal:

Oracle Based Active Set Algorithm for Scalable Elastic Net Subspace Clustering. 3928-3937 - Wen-bing Huang, Fuchun Sun, Le-le Cao, Deli Zhao, Huaping Liu, Mehrtash Harandi:

Sparse Coding and Dictionary Learning with Linear Dynamical Systems. 3938-3947 - Thomas Möllenhoff, Emanuel Laude, Michael Möller, Jan Lellmann, Daniel Cremers:

Sublabel-Accurate Relaxation of Nonconvex Energies. 3948-3956
S3-2B: Statistical Methods and Transfer Learning
- Etai Littwin, Lior Wolf:

The Multiverse Loss for Robust Transfer Learning. 3957-3966 - Viktoriia Sharmanska, Novi Quadrianto:

Learning from the Mistakes of Others: Matching Errors in Cross-Dataset Learning. 3967-3975 - Rudrasis Chakraborty, Dohyung Seo, Baba C. Vemuri:

An Efficient Exact-PGA Algorithm for Constant Curvature Manifolds. 3976-3984 - Samuel Rota Bulò, Peter Kontschieder:

Online Learning with Bayesian Classification Trees. 3985-3993 - Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert:

Cross-Stitch Networks for Multi-task Learning. 3994-4003 - Hyun Oh Song, Yu Xiang, Stefanie Jegelka, Silvio Savarese:

Deep Metric Learning via Lifted Structured Feature Embedding. 4004-4012 - Andrew Lavin, Scott Gray:

Fast Algorithms for Convolutional Neural Networks. 4013-4021
Poster Session P3-2
- Ang Li, Dapeng Chen, Yuanliu Liu, Zejian Yuan:

Coordinating Multiple Disparity Proposals for Stereo Computation. 4022-4030 - Chi Zhang, Zhiwei Li, Rui Cai, Hongyang Chao, Yong Rui:

Joint Multiview Segmentation and Localization of RGB-D Images Using Depth-Induced Silhouette Consistency. 4031-4039 - Nikolaus Mayer, Eddy Ilg, Philip Häusser, Philipp Fischer, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox:

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation. 4040-4048 - Wei Feng, Fei-Peng Tian, Qian Zhang, Jizhou Sun:

6D Dynamic Camera Relocalization from Single Reference Image. 4049-4057 - René Ranftl, Vibhav Vineet, Qifeng Chen, Vladlen Koltun:

Dense Monocular Depth Estimation in Complex Dynamic Scenes. 4058-4066 - Christian Mostegel, Markus Rumpler, Friedrich Fraundorfer, Horst Bischof:

Using Self-Contradiction to Learn Confidence Measures in Stereo Vision. 4067-4076 - Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla:

Understanding Real World Indoor Scenes with Synthetic Data. 4077-4085 - Hae-Gon Jeon, Joon-Young Lee, Sunghoon Im, Hyowon Ha, In-So Kweon:

Stereo Matching with Color and Monochrome Cameras in Low-Light Conditions. 4086-4094 - Gil Ben-Artzi, Yoni Kasten, Shmuel Peleg, Michael Werman:

Camera Calibration from Dynamic Silhouettes Using Motion Barcodes. 4095-4103 - Johannes L. Schönberger, Jan-Michael Frahm:

Structure-from-Motion Revisited. 4104-4113 - Wencheng Wang, Tianhao Gao:

Constructing Canonical Regions for Fast and Effective View Selection. 4114-4122 - Chen Kong, Simon Lucey:

Prior-Less Compressible Structure from Motion. 4123-4131 - Yuchao Dai, Hongdong Li, Laurent Kneip:

Rolling Shutter Camera Relative Pose: Generalized Epipolar Geometry. 4132-4140 - Marco Crocco, Cosimo Rubino, Alessio Del Bue:

Structure from Motion with Objects. 4141-4149 - Ayan Sinha, Chiho Choi, Karthik Ramani:

DeepHand: Robust Hand Pose Estimation by Completing a Matrix Imputed with Deep Features. 4150-4158 - Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai:

Multi-oriented Text Detection with Fully Convolutional Networks. 4159-4167 - Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai:

Robust Scene Text Recognition with Automatic Rectification. 4168-4176 - George Trigeorgis, Patrick Snape, Mihalis A. Nicolaou, Epameinondas Antonakos, Stefanos Zafeiriou:

Mnemonic Descent Method: A Recurrent Process Applied for End-to-End Face Alignment. 4177-4187 - Amin Jourabloo, Xiaoming Liu:

Large-Pose Face Alignment via CNN-Based Dense 3D Model Fitting. 4188-4196 - Joseph Roth, Yiying Tong, Xiaoming Liu:

Adaptive 3D Face Reconstruction from Unconstrained Photo Collections. 4197-4206 - Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, Jan Kautz:

Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks. 4207-4215 - Hyung Jin Chang, Tobias Fischer, Maxime Petit, Martina Zambelli, Yiannis Demiris:

Kinematic Structure Correspondences via Hypergraph Matching. 4216-4225 - Binod Bhattarai, Gaurav Sharma, Frédéric Jurie:

CP-mtML: Coupled Projection Multi-Task Metric Learning for Large Scale Face Retrieval. 4226-4235 - David Gadot, Lior Wolf:

PatchBatch: A Batch Augmented Loss for Optical Flow. 4236-4245 - Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato:

Joint Recovery of Dense Correspondence and Cosegmentation in Two Images. 4246-4255 - Yuanlu Xu, Xiaobai Liu, Yang Liu, Song-Chun Zhu:

Multi-view People Tracking via Hierarchical Trajectory Composition. 4256-4265 - Jifeng Ning, Jimei Yang, Shaojie Jiang, Lei Zhang, Ming-Hsuan Yang:
Object Tracking via Dual Linear Structured SVM and Explicit Feature Map. 4266-4274 - Taiki Sekii:

Robust, Real-Time 3D Tracking of Multiple Objects with Similar Appearances. 4275-4283 - Yedid Hoshen, Shmuel Peleg:

An Egocentric Look at Video Photographer Identity. 4284-4292 - Hyeonseob Nam, Bohyung Han:

Learning Multi-domain Convolutional Neural Networks for Visual Tracking. 4293-4302 - Yuankai Qi, Shengping Zhang, Lei Qin, Hongxun Yao, Qingming Huang, Jongwoo Lim, Ming-Hsuan Yang:
Hedged Deep Tracking. 4303-4311 - Si Liu, Tianzhu Zhang, Xiaochun Cao, Changsheng Xu:

Structural Correlation Filter for Robust Visual Tracking. 4312-4320 - Jongwon Choi, Hyung Jin Chang, Jiyeoup Jeong, Yiannis Demiris, Jin Young Choi:

Visual Tracking Using Attention-Modulated Disintegration and Integration. 4321-4330 - Vikas Dhiman, Quoc-Huy Tran, Jason J. Corso, Manmohan Chandraker:
A Continuous Occlusion Model for Road Scene Understanding. 4331-4339 - Adrien Gaidon, Qiao Wang, Yohann Cabon, Eleonora Vig:

Virtual Worlds as Proxy for Multi-object Tracking Analysis. 4340-4349 - Keisuke Midorikawa, Toshihiko Yamasaki, Kiyoharu Aizawa:

Uncalibrated Photometric Stereo by Stepwise Optimization Using Principal Components of Isotropic BRDFs. 4350-4358 - Yvain Quéau, Roberto Mecca, Jean-Denis Durou:

Unbiased Photometric Stereo for Colored Surfaces: A Variational Approach. 4359-4368 - Yiming Qian, Minglun Gong, Yee-Hong Yang:

3D Reconstruction of Transparent Objects with Position-Normal Consistency. 4369-4377 - Roy Or-El, Rom Hershkovitz, Aaron Wetzler, Guy Rosman, Alfred M. Bruckstein, Ron Kimmel:

Real-Time Depth Refinement for Specular Objects. 4378-4386 - Kenichiro Tanaka, Yasuhiro Mukaigawa, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi:

Recovering Transparent Shape from Time-of-Flight Distortion. 4387-4395 - Williem, In Kyu Park:
Robust Light Field Depth Estimation for Noisy Scene with Occlusion. 4396-4404 - Nianyi Li, Haiting Lin, Bilin Sun, Mingyuan Zhou, Jingyi Yu:

Rotational Crossed-Slit Light Fields. 4405-4413 - Fabrizio Natola, Valsamis Ntouskos, Fiora Pirri, Marta Sanzari:

Single Image Object Modeling Based on BRDF and r-Surfaces Learning. 4414-4423 - Monami Banerjee, Rudrasis Chakraborty, Edward Ofori, Michael S. Okun, David E. Vaillancourt, Baba C. Vemuri:

A Nonlinear Regression Technique for Manifold Valued Data with Applications to Medical Image Analysis. 4424-4432 - Qilong Wang, Peihua Li, Wangmeng Zuo, Lei Zhang:
RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian with Application to Material Recognition. 4433-4441 - Nikolaos Karianakis, Jingming Dong, Stefano Soatto:

An Empirical Evaluation of Current Convolutional Architectures' Ability to Manage Nuisance Location and Scale Variability. 4442-4451 - Varun Jampani, Martin Kiefel, Peter V. Gehler:

Learning Sparse High Dimensional Filters: Image Filtering, Dense CRFs and Bilateral Neural Networks. 4452-4461 - Fujiao Ju, Yanfeng Sun, Junbin Gao, Simeng Liu, Yongli Hu, Baocai Yin:
Mixture of Bilateral-Projection Two-Dimensional Probabilistic Principal Component Analysis. 4462-4470 - Raviteja Vemulapalli, Rama Chellappa:

Rolling Rotations for Recognizing Human Actions from 3D Skeletal Data. 4471-4479 - Stephan Zheng, Yang Song, Thomas Leung, Ian J. Goodfellow:

Improving the Robustness of Deep Neural Networks via Stability Training. 4480-4488 - Chao Xing, Xin Geng, Hui Xue:

Logistic Boosting Regression for Label Distribution Learning. 4489-4497 - Xikang Zhang, Yin Wang, Mengran Gou, Mario Sznaier, Octavia I. Camps:

Efficient Temporal Sequence Comparison and Classification Using Gram Matrix Embeddings on a Riemannian Manifold. 4498-4507 - Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Efstratios Gavves, Tinne Tuytelaars:
Deep Reflectance Maps. 4508-4516 - Qingxiong Yang:

Semantic Filtering. 4517-4526 - Amir M. Rahimi, Raphael Ruschel, B. S. Manjunath:

UAV Sensor Fusion with Latent-Dynamic Conditional Random Fields in Coronal Plane Estimation. 4527-4534 - Elena Stumm, Christopher Mei, Simon Lacroix, Juan I. Nieto, Marco Hutter, Roland Siegwart:

Robust Visual Place Recognition with Graph Kernels. 4535-4544 - Liang-Chieh Chen, Jonathan T. Barron, George Papandreou, Kevin Murphy, Alan L. Yuille:
Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform. 4545-4554
Oral & Spotlight Session 4-1A
O4-1A: Image & Video Captioning and Descriptions
- Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell:
Natural Language Object Retrieval. 4555-4564 - Justin Johnson, Andrej Karpathy, Li Fei-Fei:

DenseCap: Fully Convolutional Localization Networks for Dense Captioning. 4565-4574 - Jean-Baptiste Alayrac, Piotr Bojanowski, Nishant Agrawal, Josef Sivic, Ivan Laptev, Simon Lacoste-Julien:

Unsupervised Learning from Narrated Instruction Videos. 4575-4583 - Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu:

Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks. 4584-4593 - Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui:
Jointly Modeling Embedding and Translation to Bridge Video and Language. 4594-4602
S4-1A: High Level Semantics
- Arjun Chandrasekaran, Ashwin K. Vijayakumar, Stanislaw Antol, Mohit Bansal, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh:

We are Humor Beings: Understanding and Predicting Visual Humor. 4603-4612 - Kevin J. Shih, Saurabh Singh, Derek Hoiem:

Where to Look: Focus Regions for Visual Question Answering. 4613-4621 - Qi Wu, Peng Wang, Chunhua Shen, Anthony R. Dick, Anton van den Hengel:

Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources. 4622-4630 - Makarand Tapaswi, Yukun Zhu, Rainer Stiefelhagen, Antonio Torralba, Raquel Urtasun, Sanja Fidler:

MovieQA: Understanding Stories in Movies through Question-Answering. 4631-4640 - Yuncheng Li, Yale Song, Liangliang Cao, Joel R. Tetreault, Larry Goldberg, Alejandro Jaimes, Jiebo Luo:

TGIF: A New Dataset and Benchmark on Animated GIF Description. 4641-4650 - Quanzeng You, Hailin Jin, Zhaowen Wang, Chen Fang, Jiebo Luo:
Image Captioning with Semantic Attention. 4651-4659
Oral & Spotlight Session 4-1B
O4-1B: Non-rigid Reconstruction and Motion Analysis
- Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton:

Temporally Coherent 4D Reconstruction of Complex Dynamic Scenes. 4660-4669 - Minsik Lee, Jungchan Cho, Songhwai Oh:

Consensus of Non-rigid Reconstructions. 4670-4678 - Shaifali Parashar, Daniel Pizarro, Adrien Bartoli:
Isometric Non-rigid Shape-from-Motion in Linear Time. 4679-4687 - Jianhui Chen, Hoang Minh Le, Peter Carr, Yisong Yue, James J. Little:

Learning Online Smooth Predictors for Realtime Camera Planning Using Recurrent Decision Trees. 4688-4696 - Hyun Soo Park, Jyh-Jing Hwang, Yedong Niu, Jianbo Shi:

Egocentric Future Localization. 4697-4705 - Qifeng Chen, Vladlen Koltun:
Full Flow: Optical Flow Estimation By Global Optimization over Regular Grids. 4706-4714
S4-1B: Human Pose Estimation
- Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Structured Feature Learning for Pose Estimation. 4715-4723 - Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh:

Convolutional Pose Machines. 4724-4732 - João Carreira, Pulkit Agrawal, Katerina Fragkiadaki, Jitendra Malik:

Human Pose Estimation with Iterative Error Feedback. 4733-4742
Poster Session P4-1
- Thibaut Durand, Nicolas Thome, Matthieu Cord:

WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks. 4743-4752 - Lingxi Xie, Jingdong Wang, Zhen Wei, Meng Wang, Qi Tian:
DisturbLabel: Regularizing CNN on the Loss Layer. 4753-4762 - Leslie N. Smith, Emily M. Hand, Timothy Doster:

Gradual DropIn of Layers to Train Very Deep Neural Networks. 4763-4771 - Zhiwei Deng, Arash Vahdat, Hexiang Hu, Greg Mori:

Structure Inference Machines: Recurrent Neural Networks for Analyzing Relations in Group Activity Recognition. 4772-4781 - Nadav Cohen, Or Sharir, Amnon Shashua:

Deep SimNets. 4782-4791 - Zhangyang Wang, Shiyu Chang, Yingzhen Yang, Ding Liu, Thomas S. Huang:
Studying Very Low Resolution Recognition Using Deep Networks. 4792-4800 - Raviteja Vemulapalli, Oncel Tuzel, Ming-Yu Liu:

Deep Gaussian Conditional Random Field Network: A Model-Based Deep Network for Discriminative Denoising. 4801-4809 - Yufei Wang, Zhe Lin, Xiaohui Shen, Radomír Mech, Gavin S. P. Miller, Garrison W. Cottrell:
Event-Specific Image Importance. 4810-4819 - Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng:

Quantized Convolutional Neural Networks for Mobile Devices. 4820-4828 - Alexey Dosovitskiy, Thomas Brox:

Inverting Visual Representations with Convolutional Networks. 4829-4837 - Iacopo Masi, Stephen Rawls, Gérard G. Medioni, Prem Natarajan:
Pose-Aware Face Recognition in the Wild. 4838-4846 - Meina Kan, Shiguang Shan, Xilin Chen:

Multi-view Deep Network for Cross-View Classification. 4847-4855 - Yi Sun, Xiaogang Wang, Xiaoou Tang:

Sparsifying Neural Network Connections for Face Recognition. 4856-4864 - Qingxiang Feng, Yicong Zhou, Rushi Lan:

Pairwise Linear Regression Classification for Image Set Retrieval. 4865-4872 - Ira Kemelmacher-Shlizerman, Steven M. Seitz, Daniel Miller, Evan Brossard:

The MegaFace Benchmark: 1 Million Faces for Recognition at Scale. 4873-4882 - Ognjen Arandjelovic:

Learnt Quasi-Transitive Similarity for Retrieval from Large Collections of Faces. 4883-4892 - Yandong Wen, Zhifeng Li, Yu Qiao:

Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition. 4893-4901 - Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic:
Copula Ordinal Regression for Joint Estimation of Facial Action Unit Intensity. 4902-4910 - Timo Bolkart, Stefanie Wuhrer:

A Robust Multilinear Model Learning Framework for 3D Faces. 4911-4919 - Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua:

Ordinal Regression with Multiple Output CNN for Age Estimation. 4920-4928 - Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter V. Gehler, Bernt Schiele:
DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation. 4929-4937 - Suha Kwak, Minsu Cho, Ivan Laptev:

Thin-Slicing for Pose: Learning to Understand Pose without Explicit Pose Estimation. 4938-4947 - Hashim Yasin, Umar Iqbal, Björn Krüger, Andreas Weber, Juergen Gall:

A Dual-Source Approach for 3D Pose Estimation from a Single Image. 4948-4956 - Markus Oberweger, Gernot Riegler, Paul Wohlhart, Vincent Lepetit:

Efficiently Creating 3D Training Data for Fine Hand Pose Estimation. 4957-4965 - Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Konstantinos G. Derpanis, Kostas Daniilidis:

Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video. 4966-4975 - Kushal Kafle, Christopher Kanan:

Answer-Type Prediction for Visual Question Answering. 4976-4984 - Satwik Kottur, Ramakrishna Vedantam, José M. F. Moura, Devi Parikh:

VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes. 4985-4994 - Yuke Zhu, Oliver Groth, Michael S. Bernstein, Li Fei-Fei:
Visual7W: Grounded Question Answering in Images. 4995-5004 - Liwei Wang, Yin Li, Svetlana Lazebnik:

Learning Deep Structure-Preserving Image-Text Embeddings. 5005-5013 - Peng Zhang, Yash Goyal, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:

Yin and Yang: Balancing and Answering Binary Visual Questions. 5014-5022 - Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, Longin Jan Latecki:

GIFT: A Real-Time and Scalable 3D Shape Search Engine. 5023-5032 - Chao Zhang, William A. P. Smith, Arnaud Dessein, Nick E. Pears, Hang Dai:

Functional Faces: Groupwise Dense Correspondence Using Functional Maps. 5033-5041 - Girum G. Demisse, Djamila Aouada, Björn E. Ottersten:

Similarity Metric for Curved Shapes in Euclidean Space. 5042-5050 - Jie Shi, Wen Zhang, Yalin Wang:
Shape Analysis with Hyperbolic Wasserstein Distance. 5051-5061 - Xinchu Shi, Haibin Ling, Weiming Hu, Junliang Xing, Yanning Zhang:

Tensor Power Iteration for Multi-graph Matching. 5062-5070 - Yongxin Yang, Timothy M. Hospedales:

Multivariate Regression on the Grassmannian for Predicting Novel Domains. 5071-5080 - Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang:

Learning Cross-Domain Landmarks for Heterogeneous Domain Adaptation. 5081-5090 - Diego Marcos, Raffay Hamid, Devis Tuia:

Geospatial Correspondences for Multimodal Registration. 5091-5100 - Yue Wu, Qiang Ji:

Constrained Deep Transfer Feature Learning and Its Applications. 5101-5109 - George Trigeorgis, Mihalis A. Nicolaou, Stefanos Zafeiriou, Björn W. Schuller:

Deep Canonical Time Warping. 5110-5118 - Xianglong Liu, Xinjie Fan, Cheng Deng, Zhujin Li, Hao Su, Dacheng Tao:
Multilinear Hyperplane Hashing. 5119-5127 - Olivier Canévet, François Fleuret:

Large Scale Hard Sample Mining with Monte Carlo Tree Search. 5128-5137 - Atsushi Kanehira, Tatsuya Harada:

Multi-label Ranking from Positive and Unlabeled Data. 5138-5146 - Jianwei Yang, Devi Parikh, Dhruv Batra:

Joint Unsupervised Learning of Deep Representations and Image Clusters. 5147-5156 - Ming Yin, Yi Guo, Junbin Gao, Zhaoshui He, Shengli Xie:

Kernel Sparse Subspace Clustering on Symmetric Positive Definite Manifolds. 5157-5164 - Christopher Funk, Yanxi Liu:
Symmetry reCAPTCHA. 5165-5174 - Chen Huang, Chen Change Loy, Xiaoou Tang:

Unsupervised Learning of Discriminative Attributes and Visual Representations. 5175-5184 - Mehrtash Tafazzoli Harandi, Mathieu Salzmann, Fatih Porikli:

When VLAD Met Hilbert. 5185-5194 - Ha Quang Minh, Marco San-Biagio, Loris Bazzani, Vittorio Murino:
Approximate Log-Hilbert-Schmidt Distances between Covariance Operators for Image Classification. 5195-5203 - Yongfang Cheng, Yin Wang, Mario Sznaier, Octavia I. Camps:

Subspace Clustering with Priors via Sparse Quadratically Constrained Quadratic Programming. 5204-5212 - Xiai Chen, Zhi Han, Yao Wang, Qian Zhao, Deyu Meng, Yandong Tang:

Robust Tensor Factorization with Unknown Noise. 5213-5221 - Yusuke Mukuta, Tatsuya Harada:

Kernel Approximation via Empirical Orthogonal Decomposition for Unsupervised Feature Learning. 5222-5230 - Agata Mosinska-Domanska, Raphael Sznitman, Przemyslaw Glowacki, Pascal Fua:

Active Learning for Delineation of Curvilinear Structures. 5231-5239 - Xavier Alameda-Pineda, Elisa Ricci, Yan Yan, Nicu Sebe:

Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion. 5240-5248 - Canyi Lu, Jiashi Feng, Yudong Chen, Wei Liu, Zhouchen Lin, Shuicheng Yan:
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization. 5249-5257 - Soheil Kolouri, Yang Zou, Gustavo K. Rohde:

Sliced Wasserstein Kernels for Probability Distributions. 5258-5267 - Xian Wei, Hao Shen, Martin Kleinsteuber:

Trace Quotient Meets Sparsity: A Method for Learning Low Dimensional Image Representations. 5268-5277 - Hisham Cholakkal, Jubin Johnson, Deepu Rajan:

Backtracking ScSPM Image Classifier for Weakly Supervised Top-Down Saliency. 5278-5287 - Jun Xu, Tao Mei, Ting Yao, Yong Rui:
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language. 5288-5296
Oral & Spotlight Session 4-2A
O4-2A: Learning and CNN Architectures
- Relja Arandjelovic, Petr Gronát, Akihiko Torii, Tomás Pajdla, Josef Sivic:

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition. 5297-5307 - Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena:

Structural-RNN: Deep Learning on Spatio-Temporal Graphs. 5308-5317 - Yong-Deok Kim, Taewoong Jang, Bohyung Han, Seungjin Choi:

Learning to Select Pre-Trained Deep Representations with Bayesian Evidence Framework. 5318-5326 - Soravit Changpinyo, Wei-Lun Chao, Boqing Gong, Fei Sha:

Synthesized Classifiers for Zero-Shot Learning. 5327-5336 - Yanwei Fu, Leonid Sigal:
Semi-supervised Vocabulary-Informed Learning. 5337-5346
S4-2A: Learning and Optimization
- Zhuwen Li, Shuoguang Yang, Loong-Fah Cheong, Kim-Chuan Toh:

Simultaneous Clustering and Model Selection for Tensor Affinities. 5347-5355 - Jinglin Xu, Junwei Han, Feiping Nie:
Discriminatively Embedded K-Means for Multi-view Clustering. 5356-5364 - Ishant Shanu, Chetan Arora, Parag Singla:

Min Norm Point Algorithm for Higher Order MRF-MAP Inference. 5365-5374 - Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang:

Learning Deep Representation for Imbalanced Classification. 5375-5384 - Vijay Kumar B. G, Gustavo Carneiro, Ian D. Reid:

Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions. 5385-5394 - Piotr Koniusz, Anoop Cherian:

Sparse Coding for Third-Order Super-Symmetric Tensor Descriptors with Application to Texture Recognition. 5395-5403 - Jen-Hao Rick Chang, Aswin C. Sankaranarayanan, B. V. K. Vijaya Kumar:
Random Features for Sparse Signal Classification. 5404-5412
Oral & Spotlight Session 4-2B
O4-2B: 3D Shape Reconstruction
- Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, In-So Kweon:
High-Quality Depth from Uncalibrated Small Motion Clip. 5413-5421 - Hao Yang, Hui Zhang:

Efficient 3D Room Shape Recovery from a Single Panorama. 5422-5430 - Michael Firman, Oisin Mac Aodha, Simon J. Julier, Gabriel J. Brostow:

Structured Prediction of Unobserved Voxels from a Single Depth Image. 5431-5440 - Sean Ryan Fanello, Christoph Rhemann, Vladimir Tankovich, Adarsh Kowdle, Sergio Orts-Escolano, David Kim, Shahram Izadi:

HyperDepth: Learning Depth from Structured Light without Matching. 5441-5450 - Ting-Chun Wang, Manmohan Chandraker, Alexei A. Efros, Ravi Ramamoorthi:
SVBRDF-Invariant Shape and Reflectance Estimation from Light-Field Cameras. 5451-5459
S4-2B: 3D Reconstruction
- Nikolay Savinov, Christian Häne, Lubor Ladicky, Marc Pollefeys:

Semantic 3D Reconstruction with Continuous Regularization and Ray Potentials Using a Visibility Consistency Constraint. 5460-5469 - Carolina Raposo, João P. Barreto:
Theory and Practice of Structure-From-Motion Using Affine Correspondences. 5470-5478 - Silvano Galliani, Konrad Schindler:

Just Look at the Image: Viewpoint-Specific Surface Normal Prediction for Improved Multi-View Reconstruction. 5479-5487 - Filip Radenovic, Johannes L. Schönberger, Dinghuang Ji, Jan-Michael Frahm, Ondrej Chum, Jiri Matas:
From Dusk Till Dawn: Modeling in the Dark. 5488-5496 - Benjamin Eckart, Kihwan Kim, Alejandro J. Troccoli, Alonzo Kelly, Jan Kautz:

Accelerated Generative Models for 3D Point Cloud Data. 5497-5505 - Anirban Roy, Sinisa Todorovic:

Monocular Depth Estimation Using Neural Regression Forest. 5506-5514 - John Flynn, Ivan Neulander, James Philbin, Noah Snavely:

Deep Stereo: Learning to Predict New Views from the World's Imagery. 5515-5524
Oral & Spotlight Session 4-3A
O4-3A: Face, Gesture, & Situation Recognition: Algorithms and Datasets
- Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang:

WIDER FACE: A Face Detection Benchmark. 5525-5533 - Mark Yatskar, Luke Zettlemoyer, Ali Farhadi:

Situation Recognition: Visual Semantic Role Labeling for Image Understanding. 5534-5542
S4-3A: People and Faces
- James Booth, Anastasios Roussos, Stefanos Zafeiriou, Allan Ponniah, David J. Dunaway:
A 3D Morphable Model Learnt from 10,000 Faces. 5543-5552 - Rasmus Rothe, Radu Timofte, Luc Van Gool:
Some Like It Hot - Visual Guidance for Preference Prediction. 5553-5561 - Carlos Fabian Benitez-Quiroz, Ramprakash Srinivasan, Aleix M. Martínez:

EmotioNet: An Accurate, Real-Time Algorithm for the Automatic Annotation of a Million Facial Expressions in the Wild. 5562-5570 - Shuxin Ouyang, Timothy M. Hospedales, Yi-Zhe Song, Xueming Li:
ForgetMeNot: Memory-Aware Forensic Facial Sketch Matching. 5571-5579 - Karan Sikka, Gaurav Sharma, Marian Stewart Bartlett:

LOMo: Latent Ordinal Model for Facial Analysis in Videos. 5580-5589 - Dipan K. Pal, Felix Juefei-Xu, Marios Savvides:

Discriminative Invariant Kernel Features: A Bells-and-Whistles-Free Approach to Unsupervised Face Recognition and Pose Estimation. 5590-5599 - Peiyun Hu, Deva Ramanan:

Bottom-Up and Top-Down Reasoning with Hierarchical Rectified Gaussians. 5600-5609 - David Joseph Tan, Thomas J. Cashman, Jonathan Taylor, Andrew W. Fitzgibbon, Daniel Tarlow, Sameh Khamis, Shahram Izadi, Jamie Shotton:
Fits Like a Glove: Rapid and Reliable Hand Shape Personalization. 5610-5619 - Jing Shao, Chen Change Loy, Kai Kang, Xiaogang Wang:

Slicing Convolutional Neural Network for Crowd Video Understanding. 5620-5628
Spotlight Session 4-3B
S4-3B: 3D, Stereo, Matching, and Saliency Estimation
- Florian Bernard, Peter Gemmar, Frank Hertel, Jorge M. Gonçalves, Johan Thunberg:

Linear Shape Deformation Models with Local Support Using Graph-Based Structured Matrix Factorisation. 5629-5638 - Jayakorn Vongkulbhisal, Ricardo Silveira Cabral, Fernando De la Torre, João Paulo Costeira:
Motion from Structure (MfS): Searching for 3D Objects in Cluttered Point Trajectories. 5639-5647 - Charles Ruizhongtai Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas:

Volumetric and Multi-view CNNs for Object Classification on 3D Data. 5648-5656 - Menghua Zhai, Scott Workman, Nathan Jacobs:

Detecting Vanishing Points Using Global Image Context in a Non-Manhattan World. 5657-5665 - Chunyuan Li, Andrew Stevens, Changyou Chen, Yunchen Pu, Zhe Gan, Lawrence Carin:
Learning Weight Uncertainty with Stochastic Gradient MCMC for Shape Classification. 5666-5675 - Duc Thanh Nguyen, Binh-Son Hua, Minh-Khoi Tran, Quang-Hieu Pham, Sai-Kit Yeung:

A Field Model for Repairing 3D Shapes. 5676-5684 - Dylan Campbell, Lars Petersson:
GOGMA: Globally-Optimal Gaussian Mixture Alignment. 5685-5694 - Wenjie Luo, Alexander G. Schwing, Raquel Urtasun:

Efficient Deep Learning for Stereo Matching. 5695-5703 - Yinlin Hu, Rui Song, Yunsong Li:

Efficient Coarse-to-Fine Patch Match for Large Displacement Optical Flow. 5704-5712 - Ben Harwood, Tom Drummond:

FANNG: Fast Approximate Nearest Neighbour Graphs. 5713-5722 - Shengfeng He, Rynson W. H. Lau:
Exemplar-Driven Top-Down Saliency Detection via Deep Association. 5723-5732 - Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian L. Price, Radomír Mech:

Unconstrained Salient Object Detection via Proposal Subset Optimization. 5733-5742 - Sina Honari, Jason Yosinski, Pascal Vincent, Christopher J. Pal:

Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation. 5743-5752 - Saumya Jetley, Naila Murray, Eleonora Vig:

End-to-End Saliency Mapping via Probability Distribution Prediction. 5753-5761
Poster Session P4-2
- Shaojing Fan, Tian-Tsong Ng, Bryan L. Koenig, Ming Jiang, Qi Zhao:

A Paradigm for Building Generalized Models of Human Image Perception through Data Fusion. 5762-5771 - Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui:

Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines. 5772-5780 - Srinivas S. S. Kruthiventi, Vennela Gudisa, Jaley H. Dholakiya, R. Venkatesh Babu:
Saliency Unified: A Deep Architecture for Simultaneous Eye Fixation Prediction and Salient Object Segmentation. 5781-5790 - Yuxiang Zhou, Epameinondas Antonakos, Joan Alabort-i-Medina, Anastasios Roussos, Stefanos Zafeiriou:
Estimating Correspondences of Deformable Objects "In-the-Wild". 5791-5801 - Vladislav Golyanik, Sk Aziz Ali, Didier Stricker:

Gravitational Approach for Point Set Registration. 5802-5810 - Gang Wang, Zhicheng Wang, Yufei Chen, Qiangqiang Zhou, Weidong Zhao:
Context-Aware Gaussian Fields for Non-rigid Point Set Registration. 5811-5819 - Magnus Oskarsson, Kenneth Batstone, Kalle Åström:

Trust No One: Low Rank Matrix Factorization Using Hierarchical RANSAC. 5820-5829 - Chen Wang, Ramin Zabih:

Relaxation-Based Preprocessing Techniques for Markov Random Field Inference. 5830-5838 - Yuhui Quan, Yong Xu, Yuping Sun, Yan Huang, Hui Ji:
Sparse Coding for Classification via Discrimination Ensemble. 5839-5847 - Pierre Baqué, Timur M. Bagautdinov, François Fleuret, Pascal Fua:

Principled Parallel Mean-Field Inference for Discrete Random Fields. 5848-5857 - Tat-Jun Chin, Yang Heng Kee, Anders P. Eriksson, Frank Neumann:
Guaranteed Outlier Removal with Mixed Integer Linear Programs. 5858-5866 - Thalaiyasingam Ajanthan, Richard I. Hartley, Mathieu Salzmann:

Memory Efficient Max Flow for Multi-label Submodular MRFs. 5867-5876 - Mingkui Tan, Shijie Xiao, Junbin Gao, Dong Xu, Anton van den Hengel, Qinfeng Shi:
Proximal Riemannian Pursuit for Large-Scale Trace-Norm Minimization. 5877-5886 - Erik Bylow, Carl Olsson, Fredrik Kahl, Mikael G. Nilsson:

Minimizing the Maximal Rank. 5887-5895 - Caglayan Dicle, Burak Yilmaz, Octavia I. Camps, Mario Sznaier:

Solving Temporal Puzzles. 5896-5905 - Sohil Shah, Tom Goldstein, Christoph Studer:

Estimating Sparse Signals with Smooth Support via Convex Programming and Block Sparsity. 5906-5915 - Na Qi, Yunhui Shi, Xiaoyan Sun, Baocai Yin:

TenSR: Multi-dimensional Tensor Sparse Representation. 5916-5925 - Florian Jug, Evgeny Levinkov, Corinna Blasse, Eugene W. Myers, Bjoern Andres:
Moral Lineage Tracing. 5926-5935 - Behrooz Nasihatkon, Frida Fejne, Fredrik Kahl:

Globally Optimal Rigid Intensity Based Registration: A Fast Fourier Domain Approach. 5936-5944 - Haichuan Yang, Yijun Huang, Lam Tran, Ji Liu, Shuai Huang:

On Benefits of Selection Diversity via Bilevel Exclusive Sparsity. 5945-5954 - Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian D. Reid:

Fast Training of Triplet-Based Deep Binary Embedding Networks. 5955-5964 - Aayush Bansal, Bryan C. Russell, Abhinav Gupta:

Marr Revisited: 2D-3D Alignment via Surface Normal Prediction. 5965-5974 - Ziad Al-Halah, Makarand Tapaswi, Rainer Stiefelhagen:

Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning. 5975-5984 - Yang Zhang, Boqing Gong, Mubarak Shah:

Fast Zero-Shot Image Tagging. 5985-5994 - Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham:

Modality and Component Aware Feature Fusion for RGB-D Scene Classification. 5995-6004 - Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li:
PPP: Joint Pointwise and Pairwise Image Label Prediction. 6005-6013 - Jan Dirk Wegner, Steve Branson, David Hall, Konrad Schindler, Pietro Perona:

Cataloging Public Objects Using Aerial and Street-Level Images - Urban Trees. 6014-6023 - Francisco Massa, Bryan C. Russell, Mathieu Aubry:

Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views. 6024-6033 - Ziming Zhang, Venkatesh Saligrama:
Zero-Shot Learning via Joint Latent Similarity Embedding. 6034-6042 - Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li:

CRAFT Objects from Images. 6043-6051
