


default search action
WACV 2018: Lake Tahoe, NV, USA
- 2018 IEEE Winter Conference on Applications of Computer Vision, WACV 2018, Lake Tahoe, NV, USA, March 12-15, 2018. IEEE Computer Society 2018, ISBN 978-1-5386-4886-5

Oral 1A: Faces / Biometrics
- Victoria Fernández Abrevaya, Stefanie Wuhrer, Edmond Boyer:

Multilinear Autoencoder for 3D Face Model Learning. 1-9 - Farnaz Abtahi, Tony Ro, Wei Li, Zhigang Zhu

:
Emotion Analysis Using Audio/Video, EMG and EEG: A Dataset and Comparison Study. 10-19 - Sandipan Banerjee, Joel Brogan

, Janez Krizaj
, Aparna Bharati
, Brandon RichardWebster, Vitomir Struc, Patrick J. Flynn, Walter J. Scheirer:
To Frontalize or Not to Frontalize: Do We Really Need Elaborate Pre-processing to Improve Face Recognition? 20-29 - Benjamin S. Riggan, Nathaniel J. Short, Shuowen Hu:

Thermal to Visible Synthesis of Face Images Using Multiple Regions. 30-38 - KangGeon Kim

, Zhenheng Yang, Iacopo Masi, Ramakant Nevatia, Gérard G. Medioni:
Face and Body Association for Video-Based Face Recognition. 39-48 - Chun-Hsiao Yeh, Herng-Hua Chang:

Face Liveness Detection Based on Perceptual Image Quality Assessment Features with Multi-scale Analysis. 49-56 - Ajit Puthenputhussery, Qingfeng Liu, Chengjun Liu:

Multiple Anthropological Fisher Kernel Framework and Its Application to Kinship Verification. 57-65 - Carlos Arango Duque, Olivier Alata, Rémi Emonet, Anne-Claire Legrand, Hubert Konik:

Micro-Expression Spotting Using the Riesz Pyramid. 66-74 - Francisco Madrigal, Frédéric Lerasle, André Monin:

3D Head Pose Estimation Enhanced Through SURF-Based Key-Frames. 75-83 - Emily M. Hand, Carlos Domingo Castillo, Rama Chellappa:

Predicting Facial Attributes in Video Using Temporal Coherence and Motion-Attention. 84-92 - Jonathan N. Gois, Eduardo A. B. da Silva, Carla L. Pagliari

, Marcelo M. Perez:
Fusion of Infrared and Visible-Light Videos Using Motion-Compensated Temporal Sub-Band Decompositions. 93-101 - Fatemeh Shiri, Fatih Porikli

, Richard Hartley, Piotr Koniusz
:
Identity-Preserving Face Recovery from Portraits. 102-111 - Yilin Wang, Suhang Wang

, Guojun Qi
, Jiliang Tang, Baoxin Li:
Weakly Supervised Facial Attribute Manipulation via Deep Adversarial Network. 112-121 - Pouya Samangouei

, Rama Chellappa, Mahyar Najibi, Larry S. Davis:
Face-MagNet: Magnifying Feature Maps to Detect Small Faces. 122-130 - Chunchun Li, Manuel Günther

, Terrance E. Boult:
ECLIPSE: Ensembles of Centroids Leveraging Iteratively Processed Spatial Eclipse Clustering. 131-140 - Marek Kowalski, Zbigniew Nasarzewski

, Grzegorz Galinski
, Piotr Garbat
:
HoloFace: Augmenting Human-to-Human Interactions on HoloLens. 141-149 - Abhishek Jha

, Vinay P. Namboodiri
, C. V. Jawahar
:
Word Spotting in Silent Lip Videos. 150-159 - Deepali Aneja, Bindita Chaudhuri, Alex Colburn

, Gary Faigin, Linda G. Shapiro, Barbara Mones:
Learning to Generate 3D Stylized Character Expressions from Humans. 160-169
Oral 1B: Vision for X / Industrial / Documents
- Yipin Zhou, Yale Song, Tamara L. Berg:

Image2GIF: Generating Cinemagraphs Using Recurrent Deep Q-Networks. 170-178 - Siddharth Srivastava, Gaurav Sharma, Brejesh Lall:

Large Scale Novel Object Discovery in 3D. 179-188 - Nitin Agarwal, Nicola J. Ferrier

, Mark Hereld:
Towards Automated Transcription of Label Text from Pinned Insect Collections. 189-198 - Bo Chang, Qiong Zhang

, Shenyi Pan
, Lili Meng:
Generating Handwritten Chinese Characters Using CycleGAN. 199-207 - Gil Levi, Pinhas Nisnevich, Adiel Ben-Shalom, Nachum Dershowitz, Lior Wolf:

A Method for Segmentation, Matching and Alignment of Dead Sea Scrolls. 208-217 - Noam Mor, Lior Wolf:

Confidence Prediction for Lexicon-Free OCR. 218-225 - Naoaki Kondo, Minoru Harada, Yuji Takagi:

Efficient Training for Automatic Defect Classification by Image Augmentation. 226-233 - Wei-Ta Chu

, Kai-Chia Ho, Ali Borji:
Visual Weather Temperature Prediction. 234-241 - Siyang Qin, Peng Ren, Seongdo Kim, Roberto Manduchi:

Robust and Accurate Text Stroke Segmentation. 242-250 - Sajith Rajapaksa, Mark G. Eramian, Hema Sudhakar Duddu, Menglu Wang, Steve Shirtliffe, Seungbum Ryu, Anique Josuttes, Ti Zhang, Sally Vail, Curtis Pozniak, Isobel Parkin

:
Classification of Crop Lodging with Gray Level Co-occurrence Matrix. 251-258 - Svati Dhamija, Terrance E. Boult:

Automated Action Units Vs. Expert Raters: Face off. 259-268 - Pongsate Tangseng, Kota Yamaguchi

, Takayuki Okatani:
Recommending Outfits from Personal Closet. 269-277 - Andrew D. Gilliam, Thomas B. Pollard, Andrew Neff, Yi Dong, Scott Sorensen, Robert Wagner, Selene Chew, Todd V. Rovito, Joseph L. Mundy:

SatTel: A Framework for Commercial Satellite Imagery Exploitation. 278-286 - Jianhui Chen, Fangrui Zhu, James J. Little:

A Two-Point Method for PTZ Camera Calibration in Sports. 287-295 - Anurag Ghosh

, Suriya Singh, C. V. Jawahar
:
Towards Structured Analysis of Broadcast Badminton Videos. 296-304 - Rahul Anand Sharma, Bharath Bhat, Vineet Gandhi, C. V. Jawahar

:
Automated Top View Registration of Broadcast Football Videos. 305-313 - Ivan F. Rodriguez, Rémi Mégret

, Edgar Acuña
, José L. Agosto-Rivera
, Tugrul Giray
:
Recognition of Pollen-Bearing Bees from Video Using Convolutional Neural Network. 314-322 - Shubhra Aich, Anique Josuttes, Ilya Ovsyannikov, Keegan Strueby, Imran Ahmed, Hema Sudhakar Duddu, Curtis Pozniak, Steve Shirtliffe, Ian Stavness:

DeepWheat: Estimating Phenotypic Traits from Crop Images with Deep Learning. 323-332 - Sachin Mehta

, Amar P. Azad, Saneem A. Chemmengath, Vikas Raykar, Shivkumar Kalyanaraman:
DeepSolarEye: Power Loss Prediction and Weakly Supervised Soiling Localization via Fully Convolutional Networks for Solar Panels. 333-342
Oral 1C: Action / Pose / Biometrics
- Jiawei He, Zhiwei Deng, Mostafa S. Ibrahim, Greg Mori:

Generic Tubelet Proposals for Action Localization. 343-351 - Sangwoo Cho, Hassan Foroosh:

A Temporal Sequence Learning for Action Recognition and Prediction. 352-361 - Xin Li

, Mooi Choo Chuah:
ReHAR: Robust and Efficient Human Activity Recognition. 362-371 - Ashish Mishra, Vinay Kumar Verma, M. Shiva Krishna Reddy, Arulkumar Subramaniam, Piyush Rai, Anurag Mittal:

A Generative Approach to Zero-Shot and Few-Shot Action Recognition. 372-380 - Yu-Wei Chao, Yunfan Liu

, Xieyang Liu, Huayi Zeng, Jia Deng:
Learning to Detect Human-Object Interactions. 381-389 - Gaurav Mishra, Saurabh Saini

, Kiran Varanasi, P. J. Narayanan:
Human Shape Capture and Tracking at Home. 390-399 - Mohit Sharma, Dragan Ahmetovic

, László A. Jeni, Kris M. Kitani:
Recognizing Visual Signatures of Spontaneous Head Gestures. 400-408 - Aakarsh Malhotra

, Richa Singh
, Mayank Vatsa
, Vishal M. Patel:
Person Authentication Using Head Images. 409-417 - Srenivas Varadarajan, Parual Datta, Omesh Tickoo:

A Greedy Part Assignment Algorithm for Real-Time Multi-person 2D Pose Estimation. 418-426 - Jianhui Chen, Lili Meng, James J. Little:

Camera Selection for Broadcasting Soccer Games. 427-435 - Paschalis Panteleris, Iason Oikonomidis, Antonis A. Argyros

:
Using a Single RGB Frame for Real Time 3D Hand Pose Estimation in the Wild. 436-445 - Moritz Einfalt, Dan Zecha, Rainer Lienhart

:
Activity-Conditioned Continuous Human Pose Estimation for Performance Analysis of Athletes Using the Example of Swimming. 446-455 - Ammar Qammaz, Damien Michel, Antonis A. Argyros

:
A Hybrid Method for 3D Pose Estimation of Personalized Human Body Models. 456-465 - Kuan Fang, Yu Xiang, Xiaocheng Li, Silvio Savarese:

Recurrent Autoregressive Networks for Online Multi-object Tracking. 466-475 - Rahul Sharma

, Tanaya Guha, Gaurav Sharma:
Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking. 476-484 - Chaofeng Chen, Xiao Tan, Kwan-Yee K. Wong

:
Face Sketch Synthesis with Style Transfer Using Pyramid Column Feature. 485-493 - Peng Zhang, Qiang Wu

, Jingsong Xu
, Jian Zhang
:
Long-Term Person Re-identification Using True Motion from Videos. 494-502 - Daksha Yadav

, Naman Kohli, Mayank Vatsa
, Richa Singh
, Afzel Noore:
Iris Presentation Attack via Textured Contact Lens in Unconstrained Environment. 503-511
Oral 1D: Medical / Vehicles / Multimedia
- Zhengqin Li, Zak Murez, David J. Kriegman, Ravi Ramamoorthi, Manmohan Chandraker:

Learning to See Through Turbulent Water. 512-520 - Weilian Song, Scott Workman, Armin Hadzic, Xu Zhang, Eric Green, Mei Chen, Reginald R. Souleyrette

, Nathan Jacobs
:
FARSA: Fully Automated Roadway Safety Assessment. 521-529 - Puneet Gupta

, Brojeshwar Bhowmick, Arpan Pal
:
Robust Adaptive Heart-Rate Monitoring Using Face Videos. 530-538 - Huangjing Lin

, Hao Chen
, Qi Dou
, Liansheng Wang, Jing Qin
, Pheng-Ann Heng
:
ScanNet: A Fast and Dense Scanning Framework for Metastastic Breast Cancer Detection from Whole-Slide Image. 539-546 - Vanya V. Valindria

, Nick Pawlowski, Martin Rajchl, Ioannis Lavdas, Eric O. Aboagye
, Andrea G. Rockall
, Daniel Rueckert, Ben Glocker:
Multi-modal Learning from Unpaired Images: Application to Multi-organ Segmentation in CT and MRI. 547-556 - Amrita Saha, Megha Nawhal, Mitesh M. Khapra, Vikas C. Raykar:

Learning Disentangled Multimodal Representations for the Fashion Domain. 557-566 - Niki Martinel

, Gian Luca Foresti, Christian Micheloni
:
Wide-Slice Residual Networks for Food Recognition. 567-576 - Varduhi Yeghiazaryan, Irina Voiculescu:

Path Reducing Watershed for the GPU. 577-585 - Amy Tabb

, Keith E. Duncan, Christopher N. Topp:
Segmenting Root Systems in X-Ray Computed Tomography Images Using Level Sets. 586-595 - Tatsuya Ishihara, Kris M. Kitani, Chieko Asakawa, Michitaka Hirose:

Deep Radio-Visual Localization. 596-605 - Jonas Heylen, Seppe Iven, Bert De Brabandere, José Oramas M.

, Luc Van Gool, Tinne Tuytelaars
:
From Pixels to Actions: Learning to Drive a Car with Deep Neural Networks. 606-615 - Arno Solin

, Santiago Cortés Reina, Esa Rahtu
, Juho Kannala:
PIVO: Probabilistic Inertial-Visual Odometry for Occlusion-Robust Navigation. 616-625 - Kun Nie, Lars Wilko Sommer, Arne Schumann, Jürgen Beyerer:

Semantic Labeling Based Vehicle Detection in Aerial Imagery. 626-634 - Lars Wilko Sommer, Arne Schumann, Tobias Schuchert, Jürgen Beyerer:

Multi Feature Deconvolutional Faster R-CNN for Precise Vehicle Detection in Aerial Imagery. 635-642 - Suvam Patra, Pranjal Maheshwari, Shashank Yadav, Subhashis Banerjee, Chetan Arora:

A Joint 3D-2D Based Method for Free Space Detection on Roads. 643-652 - Yi Zhou, Ling Shao

:
Vehicle Re-Identification by Adversarial Bi-Directional LSTM Network. 653-662 - Sachin Mehta

, Ezgi Mercan, Jamen Bartlett, Donald L. Weaver, Joann G. Elmore
, Linda G. Shapiro:
Learning to Segment Breast Biopsy Whole Slide Images. 663-672 - Wentao Zhu, Chaochun Liu, Wei Fan, Xiaohui Xie:

DeepLung: Deep 3D Dual Path Nets for Automated Pulmonary Nodule Detection and Classification. 673-681 - Ligong Han, Robert F. Murphy, Deva Ramanan

:
Learning Generative Models of Tissue Organization with Supervised GANs. 682-690 - Amy Jin, Serena Yeung

, Jeffrey Jopling
, Jonathan Krause, Dan Azagury, Arnold Milstein, Li Fei-Fei:
Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks. 691-699
Oral 2A: Machine Learning for Vision 1
- Zhifei Zhang, Yang Song, Hairong Qi:

Decoupled Learning for Conditional Adversarial Networks. 700-708 - Qiangui Huang, Shaohua Kevin Zhou, Suya You, Ulrich Neumann:

Learning to Prune Filters in Convolutional Neural Networks. 709-718 - Irad Peleg, Lior Wolf:

Structured GANs. 719-728 - Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould:

Neural Algebra of Classifiers. 729-737 - Jeffery Kinnison, Nathaniel Kremer-Herman, Douglas Thain

, Walter J. Scheirer:
SHADHO: Massively Scalable Hardware-Aware Distributed Hyperparameter Optimization. 738-747 - Nicolai Wojke, Alex Bewley:

Deep Cosine Metric Learning for Person Re-identification. 748-756 - Bodi Yuan, Jianyu Chen, Weidong Zhang, Hung-Shuo Tai, Sara McMains:

Iterative Cross Learning on Noisy Labels. 757-765 - Xi Hang Cao, Zoran Obradovic, Kyungnam Kim:

A Simple yet Effective Model for Zero-Shot Learning. 766-774 - Ziyin Wang, Sepehr Farhand, Gavriil Tsechpenakis:

Fading Affect Bias: Improving the Trade-off Between Accuracy and Efficiency in Feature Clustering. 775-783 - Patrick Follmann, Tobias Böttger:

A Rotationally-Invariant Convolution Module by Feature Map Back-Rotation. 784-792 - Dahun Kim, Donghyeon Cho, Donggeun Yoo, In So Kweon:

Learning Image Representations by Completing Damaged Jigsaw Puzzles. 793-802 - Andras Rozsa, Manuel Günther

, Terrance E. Boult:
Towards Robust Deep Neural Networks with BANG. 803-811 - Dinesh Khandelwal, Parag Singla, Chetan Arora:

Learning Higher Order Potentials for MRFs. 812-820 - Ameya Prabhu, Vishal Batchu, Rohit Gajawada, Sri Aurobindo Munagala, Anoop M. Namboodiri:

Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory. 821-829 - Ameya Prabhu, Vishal Batchu, Sri Aurobindo Munagala, Rohit Gajawada, Anoop M. Namboodiri:

Distribution-Aware Binarization of Neural Networks for Sketch Recognition. 830-838 - Aditya Chattopadhyay, Anirban Sarkar, Prantik Howlader, Vineeth N. Balasubramanian

:
Grad-CAM++: Generalized Gradient-Based Visual Explanations for Deep Convolutional Networks. 839-847 - Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran

:
Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks. 848-857
Oral 2B: 3D / Geometry
- Andrey Kurenkov, Jingwei Ji, Animesh Garg

, Viraj Mehta, JunYoung Gwak, Christopher B. Choy, Silvio Savarese:
DeformNet: Free-Form Deformation Network for 3D Shape Reconstruction from a Single Image. 858-866 - Andrea Bignoli, Andrea Romanoni, Matteo Matteucci:

Multi-view Stereo 3D Edge Reconstruction. 867-875 - Prashant Domadiya, Pratik Shah, Suman K. Mitra:

Vector Graph Representation for Deformation Transfer Using Poisson Interpolation. 876-884 - Menandro Roxas

, Takeshi Oishi
:
Real-Time Simultaneous 3D Reconstruction and Optical Flow Estimation. 885-893 - Rui Zhu, Chaoyang Wang, Chen-Hsuan Lin, Ziyan Wang, Simon Lucey

:
Object-Centric Photometric Bundle Adjustment with Deep Shape Prior. 894-902 - Pulak Purkait, Christopher Zach:

Minimal Solvers for Monocular Rolling Shutter Compensation Under Ackermann Motion. 903-911 - Nico Marniok, Bastian Goldluecke:

Real-Time Variational Range Image Fusion and Visualization for Large-Scale Scenes Using GPU Hash Tables. 912-920 - Michika Maruyama, Satoshi Tabata, Yoshihiro Watanabe:

Multi-pattern Embedded Phase Shifting Using a High-Speed Projector for Fast and Accurate Dynamic 3D Measurement. 921-929 - Hao Song, Mark G. Eramian, Emil Hallin, Blanche Leyeza, Paul G. Arnison, Ronald Rogge:

Robust and User Friendly 3D Re-Construction of Neutron Tomographic Images. 930-938 - Ayesha Siddiqua, Guoliang Fan:

Supervised Deep-Autoencoder for Depth Image-Based 3D Model Retrieval. 939-946 - Yu Cao, Haishu Tan, Fuqiang Zhou:

Minimal Non-Linear Camera Pose Estimation Method Using Lines for SLAM Application. 947-954 - Rafael Alves Roberto

, Joao Paulo Silva do Monte Lima
, Hideaki Uchiyama, Clemens Arth
, Veronica Teichrieb
, Rin-Ichiro Taniguchi, Dieter Schmalstieg:
Incremental Structural Modeling Based on Geometric and Statistical Analyses. 955-963 - David Guera

, Fengqing Zhu, Sri Kalyan Yarlagadda, Stefano Tubaro, Paolo Bestagini
, Edward J. Delp
:
Reliability Map Estimation for CNN-Based Camera Model Attribution. 964-973 - Lokender Tiwari, Saket Anand:

DGSAC: Density Guided Sampling and Consensus. 974-982 - Tavi Halperin

, Michael Werman:
An Epipolar Line from a Single Pixel. 983-991 - Dominik Van Opdenbosch, Tamay Aykut

, Nicolas Alt, Eckehard G. Steinbach
:
Efficient Map Compression for Collaborative Visual SLAM. 992-1000 - Fangwei Zhong, Sheng Wang, Ziqi Zhang, China Chen, Yizhou Wang:

Detect-SLAM: Making Object Detection and SLAM Mutually Beneficial. 1001-1010 - Andreas Kuhn, Lukas Roth, Jan-Michael Frahm, Helmut Mayer

:
Improvement of Extrinsic Parameters from a Single Stereo Pair. 1011-1019
Oral 2C: Tracking / Detection
- Anupam Sobti, Chetan Arora, M. Balakrishnan:

Object Detection in Real-Time Systems: Going Beyond Precision. 1020-1028 - Manal Al Ghamdi, Yoshihiko Gotoh:

Graph-Based Correlated Topic Model for Trajectory Clustering in Crowded Videos. 1029-1037 - Dinghuang Ji, Zheng Wei, Enrique Dunn

, Jan-Michael Frahm:
Dynamic Visual Sequence Prediction with Motion Flow Networks. 1038-1046 - Litu Rout

, Sidhartha, Deepak Mishra
, Rama Krishna Sai Subrahmanyam Gorthi:
Rotation Adaptive Visual Object Tracking with Motion Consistency. 1047-1055 - René Schuster, Oliver Wasenmüller, Georg Kuschk, Christian Bailer, Didier Stricker

:
SceneFlowFields: Dense Interpolation of Sparse Scene Flow Correspondences. 1056-1065 - Rémi Trichet, François Brémond:

LBP Channels for Pedestrian Detection. 1066-1074 - Jason R. Parham, Charles V. Stewart, Jonathan P. Crall, Daniel I. Rubenstein, Jason Holmberg, Tanya Y. Berger-Wolf

:
An Animal Detection Pipeline for Identification. 1075-1083 - Xinshuo Weng, Shangxuan Wu, Fares Beainy, Kris M. Kitani:

Rotational Rectification Network: Enabling Pedestrian Detection for Mobile Vision. 1084-1092 - Sanghyun Woo, Soonmin Hwang

, In So Kweon:
StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection. 1093-1102 - Heng Yu, Eshed Ohn-Bar, Donghyun Yoo, Kris M. Kitani:

SmartPartNet: Part-Informed Person Detection for Body-Worn Smartphones. 1103-1112 - Lu Zhang, Miaojing Shi, Qiaobo Chen:

Crowd Counting via Scale-Adaptive Convolutional Neural Network. 1113-1121 - Tharindu Fernando

, Simon Denman, Sridha Sridharan, Clinton Fookes
:
Tracking by Prediction: A Deep Generative Model for Mutli-person Localisation and Tracking. 1122-1132 - Burak Uzkent, YoungWoo Seo:

EnKCF: Ensemble of Kernelized Correlation Filters for High-Speed Object Tracking. 1133-1141 - Jaepung An, Jaehyun Lee, Jiman Jeong, Insung Ihm:

Tracking an RGB-D Camera on Mobile Devices Using an Improved Frame-to-Frame Pose Estimation Method. 1142-1150 - Greg Olmschenk, Hao Tang, Zhigang Zhu

:
Crowd Counting with Minimal Data Using Generative Adversarial Networks for Multiple Target Regression. 1151-1159 - Oytun Ulutan, Benjamin S. Riggan, Nasser M. Nasrabadi, B. S. Manjunath:

An Order Preserving Bilinear Model for Person Detection in Multi-Modal Data. 1160-1169 - Hyemin Lee, Daijin Kim:

Salient Region-Based Online Object Tracking. 1170-1177 - Irtiza Hasan, Francesco Setti

, Theodore Tsesmelis, Alessio Del Bue
, Marco Cristani, Fabio Galasso:
"Seeing is Believing": Pedestrian Trajectory Forecasting Using Visual Frustum of Attention. 1178-1185 - Hao Xue

, Du Q. Huynh
, Mark Reynolds
:
SS-LSTM: A Hierarchical LSTM Model for Pedestrian Trajectory Prediction. 1186-1194
Oral 2D: Machine Learning for Vision 2
- Wenling Shang, Kihyuk Sohn, Yuandong Tian:

Channel-Recurrent Autoencoding for Image Modeling. 1195-1204 - Liang-Yan Gui, Liangke Gui, Yu-Xiong Wang, Louis-Philippe Morency, José M. F. Moura:

Factorized Convolutional Networks: Unsupervised Fine-Tuning for Image Clustering. 1205-1214 - Yifan Ding

, Liqiang Wang, Deliang Fan, Boqing Gong:
A Semi-Supervised Two-Stage Approach to Learning from Noisy Labels. 1215-1224 - Min Wang, Baoyuan Liu, Hassan Foroosh:

Look-Up Table Unit Activation Function for Deep Convolutional Neural Networks. 1225-1233 - Francisco Rodolfo Barbosa-Anda, Frédéric Lerasle, Cyril Briand, Alhayat Ali Mekonnen:

Soft-Cascade Learning with Explicit Computation Time Considerations. 1234-1243 - Karim Ahmed, Lorenzo Torresani:

BranchConnect: Image Categorization with Learned Branch Connections. 1244-1253 - Shangzhen Luan, Baochang Zhang, Siyue Zhou, Chen Chen, Jungong Han, Wankou Yang, Jianzhuang Liu:

Gabor Convolutional Networks. 1254-1262 - Jan Svoboda, Thomas J. Cashman, Andrew W. Fitzgibbon:

QRkit: Sparse, Composable QR Decompositions for Efficient and Stable Solutions to Problems in Computer Vision. 1263-1272 - Mingze Xu, Chenyou Fan, John D. Paden, Geoffrey C. Fox, David J. Crandall

:
Multi-task Spatiotemporal Neural Networks for Structured Surface Reconstruction. 1273-1282 - Liangfu Chen, Zeng Yang, Jianjun Ma, Zheng Luo:

Driving Scene Perception Network: Real-Time Joint Detection, Depth Estimation and Semantic Segmentation. 1283-1291 - Amena Khatun

, Simon Denman, Sridha Sridharan, Clinton Fookes
:
A Deep Four-Stream Siamese Convolutional Neural Network with Joint Verification and Identification Loss for Person Re-Detection. 1292-1301 - Ashkan Panahi, Xiao Bian, Hamid Krim

, Liyi Dai:
Robust Subspace Clustering by Bi-Sparsity Pursuit: Guarantees and Sequential Algorithm. 1302-1311 - Salman H. Khan, Munawar Hayat

, Nick Barnes
:
Adversarial Training of Variational Auto-Encoders for High Fidelity Image Generation. 1312-1320 - Zehua Fu, Mohsen Ardabilian Fard:

Learning Confidence Measures by Multi-modal Convolutional Neural Networks. 1321-1330 - Domen Racki, Dejan Tomazevic, Danijel Skocaj:

A Compact Convolutional Neural Network for Textured Surface Anomaly Detection. 1331-1339 - Garrett B. Goh, Charles Siegel, Abhinav Vishnu, Nathan O. Hodas, Nathan A. Baker:

How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions? 1340-1349 - Duc Minh Vo, Trung-Nghia Le

, Akihiro Sugimoto:
Balancing Content and Style with Two-Stream FCNs for Style Transfer. 1350-1358
Oral 3A: Segmentation / Saliency / Super-Resolution
- Louis Lettry

, Kenneth Vanhoey
, Luc Van Gool:
DARN: A Deep Adversarial Residual Network for Intrinsic Image Decomposition. 1359-1367 - Roey Mechrez, Eli Shechtman, Lihi Zelnik-Manor:

Saliency Driven Image Manipulation. 1368-1376 - Liyuan Pan, Yuchao Dai, Miaomiao Liu

, Fatih Porikli
:
Depth Map Completion by Jointly Exploiting Blurry Color Images and Sparse Depth Maps. 1377-1386 - Ning Yu, Xiaohui Shen, Zhe Lin, Radomír Mech, Connelly Barnes:

Learning to Detect Multiple Photographic Defects. 1387-1396 - Akshay Dudhane, Subrahmanyam Murala:

C^2MSNet: A Novel Approach for Single Image Haze Removal. 1397-1404 - Chetan Arora, Vivek Kwatra:

Stabilizing First Person 360 Degree Videos. 1405-1413 - Marc Bosch, Christopher M. Gifford, Pedro A. Rodriguez:

Super-Resolution for Overhead Imagery Using DenseNets and Adversarial Learning. 1414-1422 - Haoyu Ren, Mostafa El-Khamy

, Jungwon Lee:
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution. 1423-1431 - Mahyar Najibi, Fan Yang, Qiaosong Wang

, Robinson Piramuthu:
Towards the Success Rate of One: Real-Time Unconstrained Salient Object Detection. 1432-1441 - Ryuhei Hamaguchi

, Aito Fujita, Keisuke Nemoto, Tomoyuki Imaizumi, Shuhei Hikosaka:
Effective Use of Dilated Convolutions for Segmenting Small Object Instances in Remote Sensing Imagery. 1442-1450 - Panqu Wang, Pengfei Chen, Ye Yuan, Ding Liu

, Zehua Huang, Xiaodi Hou, Garrison W. Cottrell
:
Understanding Convolution for Semantic Segmentation. 1451-1460 - Linwei Ye, Zhi Liu

, Yang Wang:
Learning Semantic Segmentation with Diverse Supervision. 1461-1469 - Fariba Zohrizadeh, Mohsen Kheirandishfard, Farhad Kamangar:

Image Segmentation Using Sparse Subset Selection. 1470-1479 - Elena Garces

, Erik Reinhard
:
Light-Field Surface Color Segmentation with an Application to Intrinsic Decomposition. 1480-1488 - Qin Huang, Chunyang Xia, Siyang Li, Ye Wang, Yuhang Song, C.-C. Jay Kuo

:
Unsupervised Clustering Guided Semantic Segmentation. 1489-1498 - Ishan Nigam, Chen Huang, Deva Ramanan

:
Ensemble Knowledge Transfer for Semantic Segmentation. 1499-1508 - Mai Lan Ha, Gianni Franchi

, Michael Möller, Andreas Kolb
, Volker Blanz:
Segmentation and Shape Extraction from Convolutional Neural Networks. 1509-1518 - Fuwen Tan

, Crispin Bernier, Benjamin Cohen, Vicente Ordonez
, Connelly Barnes:
Where and Who? Automatic Semantic-Aware Person Composition. 1519-1528 - Prakhar Gupta, Shubh Gupta, Ajaykrishnan Jayagopal, Sourav Pal, Ritwik Sinha:

Saliency Prediction for Mobile User Interfaces. 1529-1538 - Tharindu Fernando

, Simon Denman, Sridha Sridharan, Clinton Fookes
:
Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks. 1539-1548
Oral 3B: Action Recognition / Surveillance / Language
- Roeland De Geest, Tinne Tuytelaars

:
Modeling Temporal Structure with LSTM for Online Action Detection. 1549-1557 - Effrosyni Mavroudi, Divya Bhaskara, Shahin Sefati, Haider Ali, René Vidal:

End-to-End Fine-Grained Action Segmentation and Recognition Using Conditional Random Field Models and Discriminative Sparse Coding. 1558-1567 - Liyue Shen, Serena Yeung

, Judy Hoffman
, Greg Mori, Li Fei-Fei:
Scaling Human-Object Interaction Recognition Through Zero-Shot Learning. 1568-1576 - Hongtao Yang, Xuming He, Fatih Porikli

:
Instance-Aware Detailed Action Labeling in Videos. 1577-1586 - Joe Yue-Hei Ng, Larry S. Davis:

Temporal Difference Networks for Video Action Recognition. 1587-1596 - Rosaura G. Vidal

, Sreya Banerjee
, Klemen Grm, Vitomir Struc, Walter J. Scheirer:
UG^2: A Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition. 1597-1606 - Mingze Xu, Aidean Sharghi

, Xin Chen
, David J. Crandall
:
Fully-Coupled Two-Stream Spatiotemporal Networks for Extremely Low Resolution Action Recognition. 1607-1615 - Joe Yue-Hei Ng, Jonghyun Choi

, Jan Neumann, Larry S. Davis:
ActionFlowNet: Learning Motion Representation for Action Recognition. 1616-1624 - Sovan Biswas, Juergen Gall:

Structural Recurrent Neural Network (SRNN) for Group Activity Analysis. 1625-1632 - Jawad Tayyub, Majd Hawasly

, David C. Hogg
, Anthony G. Cohn:
Learning Hierarchical Models of Complex Daily Activities from Annotated Videos. 1633-1641 - Ruichi Yu, Hongcheng Wang, Larry S. Davis:

ReMotENet: Efficient Relevant Motion Event Detection for Large-Scale Home Surveillance Videos. 1642-1651 - Murray Evans

, Steffi L. Colyer
, Darren P. Cosker
, Aki Salo:
Foot Contact Timings and Step Length for Sprint Training. 1652-1660 - Nadia Robertini, Florian Bernard, Weipeng Xu, Christian Theobalt

:
Illumination-Invariant Robust Multiview 3D Human Motion Capture. 1661-1670 - Kenan E. Ak, Joo-Hwee Lim, Jo Yew Tham, Ashraf A. Kassim:

Efficient Multi-attribute Similarity Learning Towards Attribute-Based Fashion Search. 1671-1679 - Kai Xu, Fengbo Ren

:
CSVideoNet: A Real-Time End-to-End Learning Framework for High-Frame-Rate Video Compressive Sensing. 1680-1688 - Mahdyar Ravanbakhsh, Moin Nabi, Hossein Mousavi, Enver Sangineto

, Nicu Sebe
:
Plug-and-Play CNN for Crowd Motion Analysis: An Application in Abnormal Event Detection. 1689-1698 - Dong-Jin Kim, Jinsoo Choi, Tae-Hyun Oh

, Youngjin Yoon, In So Kweon:
Disjoint Multi-task Learning Between Heterogeneous Human-Centric Tasks. 1699-1708 - Zongjian Zhang, Qiang Wu

, Yang Wang
, Fang Chen
:
Fine-Grained and Semantic-Guided Visual Attention for Image Captioning. 1709-1717 - Jinsoo Choi, Tae-Hyun Oh

, In So Kweon:
Contextually Customized Video Summaries Via Natural Language. 1718-1726
Oral 3C: Vision and Learning, Languages, Applications
- Guangyu Zhong, Yi-Hsuan Tsai, Sifei Liu

, Zhixun Su
, Ming-Hsuan Yang:
Learning Video-Story Composition via Recurrent Neural Network. 1727-1735 - Liu Liu, Hairong Qi:

Discriminative Cross-View Binary Representation Learning. 1736-1744 - Oriane Siméoni, Ahmet Iscen, Giorgos Tolias

, Yannis Avrithis, Ondrej Chum:
Unsupervised Object Discovery for Instance Recognition. 1745-1754 - Ashraf Siddique, Seungkyu Lee:

Video Inpainting for Arbitrary Foreground Object Removal. 1755-1763 - Michal Kucer, David W. Messinger:

Aesthetic Inference for Smart Mobile Devices. 1764-1773 - Shangwen Li, Chen Chen, Yuzhuo Ren, C.-C. Jay Kuo

:
Improving Object Classification Performance via Confusing Categories Study. 1774-1783 - Wei Xiang, Dong-Qing Zhang, Heather Yu, Vassilis Athitsos

:
Context-Aware Single-Shot Detector. 1784-1793 - Jingyan Wang, Olga Russakovsky

, Deva Ramanan
:
The More You Look, the More You See: Towards General Object Understanding Through Recursive Refinement. 1794-1803 - Xuebin Qin, Shida He, Zichen Zhang, Masood Dehghan

, Martin Jägersand:
ByLabel: A Boundary Based Semi-Automatic Image Annotation Tool. 1804-1813 - Mikyas T. Desta, Larry Chen, Tomasz Kornuta

:
Object-Based Reasoning in VQA. 1814-1823 - Ajit Puthenputhussery, Qingfeng Liu, Hao Liu, Chengjun Liu:

Generative and Discriminative Sparse Coding for Image Classification Applications. 1824-1832 - Shayok Chakraborty:

Distributed Active Learning for Image Recognition. 1833-1841 - Ke Wang, Mohit Bansal, Jan-Michael Frahm:

Retweet Wars: Tweet Popularity Prediction via Dynamic Multimodal Regression. 1842-1851 - Handong Zhao, Quanfu Fan, Dan Gutfreund, Yun Fu:

Semantically Guided Visual Question Answering. 1852-1860 - Arun Balajee Vasudevan, Dengxin Dai:

Object Referring in Visual Scene with Spoken Language. 1861-1870 - Jonatas Wehrmann

, Mauricio A. Lopes, Martin D. Móre, Rodrigo C. Barros
:
Fast Self-Attentive Multimodal Retrieval. 1871-1878 - Tianlang Chen, Chenliang Xu, Jiebo Luo

:
Improving Text-Based Person Search by Spatial Matching and Adaptive Threshold. 1879-1887 - Zhe Wang, Xiaoyi Liu, Limin Wang, Yu Qiao, Xiaohui Xie, Charless C. Fowlkes

:
Structured Triplet Learning with POS-Tag Guided Attention for Visual Question Answering. 1888-1896
Oral 3D: Features / Detection / Shape / Non-RGB
- Adil M. Ahmad

, Daniel Lemmond, Terrance E. Boult:
Chainlets: A New Descriptor for Detection and Recognition. 1897-1906 - Yue Wu

, Wael Abd-Almageed
, Prem Natarajan:
Image Copy-Move Forgery Detection via an End-to-End Deep Neural Network. 1907-1915 - Di Qi, Joshua Arfin, Mengxue Zhang, Tushar Mathew, Robert Pless, Brendan Juba:

Anomaly Explanation Using Metadata. 1916-1924 - Kripasindhu Sarkar, Kiran Varanasi, Didier Stricker

:
3D Shape Processing by Convolutional Denoising Autoencoders on Local Patches. 1925-1934 - Amy Tabb

, Henry Medeiros
:
Fast and Robust Curve Skeletonization for Real-World Elongated Objects. 1935-1943 - Arulkumar Subramaniam, Prashanth Balasubramanian, Anurag Mittal:

NCC-Net: Normalized Cross Correlation Based Deep Matcher with Robustness to Illumination Variations. 1944-1953 - Sanjay Ghosh

, Naveen Tripathi
:
Guided Filtering of Hyperspectral Images. 1954-1962 - Jungjun Kim, Hwasup Lim, Sang Chul Ahn, Seungkyu Lee:

RGBD Camera Based Material Recognition via Surface Roughness Estimation. 1963-1971 - Miguel Domínguez, Rohan Dhamdhere

, Atir Petkar, Saloni Jain, Shagan Sah, Raymond W. Ptucha:
General-Purpose Deep Point Cloud Feature Extractor. 1972-1981 - Xingchao Peng, Kate Saenko

:
Synthetic to Real Adaptation with Generative Correlation Alignment Networks. 1982-1991 - Jan Kallwies, Hans-Joachim Wuensche:

Effective Combination of Vertical and Horizontal Stereo Vision. 1992-2000 - Youye Xie, Gongguo Tang, William A. Hoff:

Chess Piece Recognition Using Oriented Chamfer Matching with a Comparison to CNN. 2001-2009 - Tamay Aykut

, Christoph Burgmair, Mojtaba Karimi, Jingyi Xu, Eckehard G. Steinbach
:
Delay Compensation for Actuated Stereoscopic 360 Degree Telepresence Systems with Probabilistic Head Motion Prediction. 2010-2018 - Johannes Kunzel, Thomas Werner, Peter Eisert, Jan Waschnewski:

Automatic Analysis of Sewer Pipes Based on Unrolled Monocular Fisheye Images. 2019-2027 - Jilliam María Díaz Barros, Bruno Mirbach, Frederic Garcia

, Kiran Varanasi, Didier Stricker
:
Fusion of Keypoint Tracking and Facial Landmark Detection for Real-Time Head Pose Estimation. 2028-2037 - Niluthpol Chowdhury Mithun

, Cody Simons, Robert Casey, Stefan Hilligardt, Amit K. Roy-Chowdhury:
Learning Long-Term Invariant Features for Vision-Based Localization. 2038-2047 - Katharina Schwarz, Patrick Wieschollek, Hendrik P. A. Lensch:

Will People Like Your Image? Learning the Aesthetic Space. 2048-2057 - Kaili Wang, Yu-Hui Huang

, José Oramas M.
, Luc Van Gool, Tinne Tuytelaars
:
An Analysis of Human-Centered Geolocation. 2058-2066

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














