-
Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images
- Mateusz Malinowski, Marcus Rohrbach, Mario Fritz
-
Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
- Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
-
Learning Query and Image Similarities With Ranking Canonical Correlation Analysis
- Wah Ngo
- Learning to See by Moving
- Pulkit Agrawal, Joao Carreira, Jitendra Malik
- scene recognition, object recognition, visual odometry, keypoint matching -- representation (feature) learning
- Convolutional Channel Features
- Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li
- pedestrian detection, face detection, edge detection, object proposal generation -- representation learning
- Local Convolutional Features With Unsupervised Training for Image Retrieval
- Mattis Paulin, Matthijs Douze, Zaid Harchaoui, Julien Mairal, Florent Perronin, Cordelia Schmid
- patch descriptor learning, image retrieval
- Discriminative Learning of Deep Convolutional Feature Point Descriptors
- Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer
- patch-level feature learning
- SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [Paper]
- Xun Huang, Chengyao Shen, Xavier Boix, Qi Zhao
- saliency detection
- Deep Networks for Image Super-Resolution With Sparse Prior
- Zhaowen Wang, Ding Liu, Jianchao Yang, Wei Han, Thomas Huang
- SR
- Learning Ordinal Relationships for Mid-Level Vision
- Daniel Zoran, Phillip Isola, Dilip Krishnan, William T. Freeman
- intrinsic image decomposition, depth from single image
- Deep Colorization
- Zezhou Cheng, Qingxiong Yang, Bin Sheng
- image colorization
- High-for-Low and Low-for-High: Efficient Boundary Detection From Deep Object Features and its Applications to High-Level Vision
- Gedas Bertasius, Jianbo Shi, Lorenzo Torresani
- boundary detection, semantic boundary labeling, semantic segmentation
- Video Super-Resolution via Deep Draft-Ensemble Learning
- Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia
- SR
- Compression Artifacts Reduction by a Deep Convolutional Network
- Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang
- JPEG artifact reduction
- Semantic Pose Using Deep Networks Trained on Synthetic RGB-D
- Jeremie Papon, Markus Schoeler
- indoor scene understanding from rgb-d
- Learning Informative Edge Maps for Indoor Scene Layout Prediction
- Arun Mallya, Svetlana Lazebnik
- edge map prediction, indoor scene layout prediction
- Multi-View Convolutional Neural Networks for 3D Shape Recognition
- Hang Su, Subhransu Maji, Evangelos Kalogerakis, Erik Learned-Miller
- 3d shape classification and retrieval, 3d shape descriptor
- Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images
- Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother
- 6d pose estimation
- A Deep Visual Correspondence Embedding Model for Stereo Matching Costs [KITTI-submission]
- Zhuoyuan Chen, Xun Sun, Liang Wang, Yinan Yu, Chang Huang
- Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation
- Xin Lu, Zhe Lin, Xiaohui Shen, Radomír Měch, James Z. Wang
- image style recognition, aesthetic quality categorization, image quality estimation
- Improving Image Classification With Location Context
- Kevin Tang, Manohar Paluri, Li Fei-Fei, Rob Fergus, Lubomir Bourdev
- image(scene) classification
- HICO: A Benchmark for Recognizing Human-Object Interactions in Images
- Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, Jia Deng
- benchmark paper
- Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
- ImageNet Classification
- Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network
- Junshi Huang, Rogerio S. Feris, Qiang Chen, Shuicheng Yan
- clothing detection/retrieval
- Contextual Action Recognition With R*CNN
- Georgia Gkioxari, Ross Girshick, Jitendra Malik
- action recognition
- What Makes an Object Memorable?
- Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, Bernard Ghanem
- understanding the memorability of objects in images
- MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition
- Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham
- Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN Model
- Spyros Gidaris, Nikos Komodakis
- Neural Activation Constellations: Unsupervised Part Model Discovery With Convolutional Networks
- Marcel Simon, Erik Rodner
- Multi-Scale Recognition With DAG-CNNs
- Songfan Yang, Deva Ramanan
- Im2Calories: Towards an Automated Mobile Vision Food Diary
- Austin Meyers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin P. Murphy
- Aggregating Local Deep Features for Image Retrieval
- Artem Babenko, Victor Lempitsky
- Learning Deep Object Detectors From 3D Models
- Xingchao Peng, Baochen Sun, Karim Ali, Kate Saenko
- Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification
- Ruobing Wu, Baoyuan Wang, Wenping Wang, Yizhou Yu
- Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval
- Gaurav Sharma, Bernt Schiele
- image retrieval
- Semantic Image Segmentation via Deep Parsing Network
- Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen-Change Loy, Xiaoou Tang
- Human Parsing With Contextualized Convolutional Neural Network
- Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan
- Holistically-Nested Edge Detection
- Saining Xie, Zhuowen Tu
- Learning Image Representations Tied to Ego-Motion
- Dinesh Jayaraman, Kristen Grauman
- representation learning
- Unsupervised Visual Representation Learning by Context Prediction
- Carl Doersch, Abhinav Gupta, Alexei A. Efros
- Webly Supervised Learning of Convolutional Networks
- Xinlei Chen, Abhinav Gupta
-
Fast R-CNN, Ross Girshick
-
Bilinear CNN Models for Fine-Grained Visual Recognition
- Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji
- Deep Neural Decision Forests
- Peter Kontschieder, Madalina Fiterau, Antonio Criminisi, Samuel Rota Bulò
- Deep Fried Convnets
- Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang
- Semantic Component Analysis
- Calvin Murdock, Fernando De la Torre
- Learning Discriminative Reconstructions for Unsupervised Outlier Removal -Yan Xia, Xudong Cao, Fang Wen, Gang Hua, Jian Sun
- Learning Deconvolution Network for Semantic Segmentation
- Hyeonwoo Noh, Seunghoon Hong, Bohyung Han
- Conditional Random Fields as Recurrent Neural Networks
- Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr
- Boosting Object Proposals: From Pascal to COCO
- Jordi Pont-Tuset, Luc Van Gool
- Joint Object and Part Segmentation Using Deep Learned Potentials
- Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille
- BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies
- Jiangping Wang, Kai Ma, Vivek Kumar Singh, Thomas Huang, Terrence Chen
- Contour Guided Hierarchical Model for Shape Matching
- Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng
- Robust Image Segmentation Using Contour-Guided Color Palettes
- Xiang Fu, Chien-Yi Wang, Chen Chen, Changhu Wang, C.-C. Jay Kuo
- BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
- Jifeng Dai, Kaiming He, Jian Sun
- Detection and Segmentation of 2D Curved Reflection Symmetric Structures
- Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos
- Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories
- Mihai Marian Puscas, Enver Sangineto, Dubravko Culibrk, Nicu Sebe
- Compositional Hierarchical Representation of Shape Manifolds for Classification of Non-Manifold Shapes
- Mete Ozay, Umit Rusen Aktas, Jeremy L. Wyatt, Aleš Leonardis
- Learning to Combine Mid-Level Cues for Object Proposal Generation
- Tom Lee, Sanja Fidler, Sven Dickinson
- Enhancing Road Maps by Parsing Aerial Images Around the World
- Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun
- StereoSnakes: Contour Based Consistent Object Extraction For Stereo Images
- Ran Ju, Tongwei Ren, Gangshan Wu
- Semantic Segmentation of RGBD Images With Mutex Constraints
- Zhuo Deng, Sinisa Todorovic, Longin Jan Latecki
- Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation
- George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, Alan L. Yuille
- Parsimonious Labeling
- Puneet K. Dokania, M. Pawan Kumar
- Constrained Convolutional Neural Networks for Weakly Supervised Segmentation
- Deepak Pathak, Philipp Krähenbühl, Trevor Darrell
- Convolutional Sparse Coding for Image Super-Resolution
- Shuhang Gu, Wangmeng Zuo, Qi Xie, Deyu Meng, Xiangchu Feng, Lei Zhang
- Depth-Based Hand Pose Estimation: Data, Methods, and Challenges
- James S. Supančič III, Grégory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
- Learning Deep Representation With Large-Scale Attributes
- Wanli Ouyang, Hongyang Li, Xingyu Zeng, Xiaogang Wang
- Deep Learning Strong Parts for Pedestrian Detection
- Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang
- Flowing ConvNets for Human Pose Estimation in Videos
- Tomas Pfister, James Charles, Andrew Zisserman
- BubbLeNet: Foveated Imaging for Visual Discovery
- Kevin Matzen, Noah Snavely
- Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions
- Sven Bambach, Stefan Lee, David J. Crandall, Chen Yu
- Relaxing From Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging
- Jianlong Fu, Yue Wu, Tao Mei, Jinqiao Wang, Hanqing Lu, Yong Rui
- Visual Phrases for Exemplar Face Detection
- Vijay Kumar, Anoop Namboodiri, C. V. Jawahar
- Spatial Semantic Regularisation for Large Scale Object Detection
- Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell
- Human Pose Estimation in Videos
- Dong Zhang, Mubarak Shah
- Contour Box: Rejecting Object Proposals Without Explicit Closed Contours
- Cewu Lu, Shu Liu, Jiaya Jia, Chi-Keung Tang
-
Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo, Runze Zhang, Shiwei Li, Tian Fang, Siyu Zhu, Long Quan
-
Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition, Torsten Sattler, Michal Havlena, Filip Radenović, Konrad Schindler, Marc Pollefeys
-
Globally Optimal 2D-3D Registration From Points or Lines Without Correspondences, Mark Brown, David Windridge, Jean-Yves Guillemaut
-
Semantically-Aware Aerial Reconstruction From Multi-Modal Data
- Randi Cabezas, Julian Straub, John W. Fisher III
- Exploiting Object Similarity in 3D Reconstruction
- Chen Zhou, Fatma Güney, Yizhou Wang, Andreas Geiger
- You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans
- Hang Chu, Dong Ki Kim, Tsuhan Chen
- The Likelihood-Ratio Test and Efficient Robust Estimation
- Andrea Cohen, Christopher Zach
- Real-Time Pose Estimation Piggybacked on Object Detection
- Roman Juránek, Adam Herout, Markéta Dubská, Pavel Zemčík
- Understanding and Predicting Image Memorability at a Large Scale
- Aditya Khosla, Akhil S. Raju, Antonio Torralba, Aude Oliva
- Multiple Granularity Descriptors for Fine-Grained Categorization
- Dequan Wang, Zhiqiang Shen, Jie Shao, Wei Zhang, Xiangyang Xue, Zheng Zhang
- Guiding the Long-Short Term Memory Model for Image Caption Generation
- Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars
- Just Noticeable Differences in Visual Attributes
- Aron Yu, Kristen Grauman
- VQA: Visual Question Answering
- Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
- Localize Me Anywhere, Anytime: A Multi-Task Point-Retrieval Approach
- Guoyu Lu, Yan Yan, Li Ren, Jingkuan Song, Nicu Sebe, Chandra Kambhamettu
- Dense Optical Flow Prediction From a Static Image
- Jacob Walker, Abhinav Gupta, Martial Hebert
- Visual Madlibs: Fill in the Blank Description Generation and Question Answering
- Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg
- Actions and Attributes From Wholes and Parts
- Georgia Gkioxari, Ross Girshick, Jitendra Malik
- DeepBox: Learning Objectness With Convolutional Networks
- Weicheng Kuo, Bharath Hariharan, Jitendra Malik
- Active Object Localization With Deep Reinforcement Learning
- Juan C. Caicedo, Svetlana Lazebnik
- Scene-Domain Active Part Models for Object Representation
- Zhou Ren, Chaohui Wang, Alan L. Yuille
- A Unified Multiplicative Framework for Attribute Learning
- Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen
- Contractive Rectifier Networks for Nonlinear Maximum Margin Classification
- Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel
- Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization
- Zhe Xu, Shaoli Huang, Ya Zhang, Dacheng Tao
- Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images
- Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, Alan L. Yuille
- Learning Common Sense Through Visual Abstraction
- Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh
- Domain Generalization for Object Recognition With Multi-Task Autoencoders
- Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi
- Square Localization for Efficient and Accurate Object Detection
- Cewu Lu, Yongyi Lu, Hao Chen, Chi-Keung Tang
- Box Aggregation for Proposal Decimation: Last Mile of Object Detection
- Shu Liu, Cewu Lu, Jiaya Jia
- DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
- Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc Van Gool
- Semantic Segmentation With Object Clique Potential
- Xiaojuan Qi, Jianping Shi, Shu Liu, Renjie Liao, Jiaya Jia
- Automatic Concept Discovery From Parallel Text and Visual Corpora
- Chen Sun, Chuang Gan, Ram Nevatia
- Monocular Object Instance Segmentation and Depth Ordering With CNNs
- Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun
- Multimodal Convolutional Neural Networks for Matching Image and Sentence
- Lin Ma, Zhengdong Lu, Lifeng Shang, Hang Li
- Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
- Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik
- Predicting Depth, Surface Normals and Semantic Labels With a Common Multi-Scale Convolutional Architecture
- David Eigen, Rob Fergus
- AttentionNet: Aggregating Weak Directions for Accurate Object Detection
- Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony S. Paek, In So Kweon
- Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images
- Yoshitaka Ushiku, Masataka Yamaguchi, Yusuke Mukuta, Tatsuya Harada
-
3D-Assisted Feature Synthesis for Novel Views of an Object, Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas
-
Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views, Hao Su, Charles R. Qi, Yangyan Li, Leonidas J. Guibas
- DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
- Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao
- Active Transfer Learning With Zero-Shot Priors: Reusing Past Datasets for Future Tasks
- Efstratios Gavves, Thomas Mensink, Tatiana Tommasi, Cees G. M. Snoek, Tinne Tuytelaars
- HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition
- Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis DeCoste, Wei Di, Yizhou Yu
- Learning The Structure of Deep Convolutional Networks
- Jiashi Feng, Trevor Darrell
- FlowNet: Learning Optical Flow With Convolutional Networks
- Alexey Dosovitskiy, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox
- Unsupervised Learning of Visual Representations Using Videos
- Xiaolong Wang, Abhinav Gupta
- A Nonparametric Bayesian Approach Toward Stacked Convolutional Independent Component Analysis
- Sotirios P. Chatzis, Dimitrios Kosmopoulos
- Robust Optimization for Deep Regression
- Vasileios Belagiannis, Christian Rupprecht, Gustavo Carneiro, Nassir Navab
- Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation
- Sijin Li, Weichen Zhang, Antoni B. Chan
- An Exploration of Parameter Redundancy in Deep Networks With Circulant Projections
- Yu Cheng, Felix X. Yu, Rogerio S. Feris, Sanjiv Kumar, Alok Choudhary, Shi-Fu Chang
- Understanding Deep Features With Computer-Generated Imagery
- Mathieu Aubry, Bryan C. Russell
- Context-Aware CNNs for Person Head Detection
- Tuan-Hung Vu, Anton Osokin, Ivan Laptev
- Highly-Expressive Spaces of Well-Behaved Transformations: Keeping It Simple
- Oren Freifeld, Søren Hauberg, Kayhan Batmanghelich, John W. Fisher III
- PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
- Alex Kendall, Matthew Grimes, Roberto Cipolla
- Predicting Multiple Structured Visual Interpretations
- Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J. Andrew Bagnell
- Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks
- Chunshui Cao, Xianming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang
- Matrix Backpropagation for Deep Networks With Structured Layers
- Catalin Ionescu, Orestis Vantzos, Cristian Sminchisescu
- Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition
- Heechul Jung, Sihaeng Lee, Junho Yim, Sunjeong Park, Junmo Kim
- Direct Intrinsics: Learning Albedo-Shading Decomposition by Convolutional Regression
- Takuya Narihira, Michael Maire, Stella X. Yu
- Face Flow
- Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou
- Hierarchical Convolutional Features for Visual Tracking
- Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang
- Online Object Tracking With Proposal Selection
- Yang Hua, Karteek Alahari, Cordelia Schmid
- Understanding and Diagnosing Visual Tracking Systems
- Naiyan Wang, Jianping Shi, Dit-Yan Yeung, Jiaya Jia
- Visual Tracking With Fully Convolutional Networks
- Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu
- Multiple Feature Fusion via Weighted Entropy for Visual Tracking
- Lin Ma, Jiwen Lu, Jianjiang Feng, Jie Zhou
- Pedestrian Travel Time Estimation in Crowded Scenes
- Shuai Yi, Hongsheng Li, Xiaogang Wang
- Learning to Track for Spatio-Temporal Action Localization
- Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid
- Unsupervised Object Discovery and Tracking in Video Collections
- Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid
- Car That Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models
- Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena
- P-CNN: Pose-Based CNN Features for Action Recognition
- Guilhem Chéron, Ivan Laptev, Cordelia Schmid
- Fully Connected Object Proposals for Video Segmentation
- Federico Perazzi, Oliver Wang, Markus Gross, Alexander Sorkine-Hornung
- Video Segmentation With Just a Few Strokes
- Naveen Shankar Nagaraja, Frank R. Schmidt, Thomas Brox
- Actionness-Assisted Recognition of Actions
- Ye Luo, Loong-Fah Cheong, An Tran
- RGB-W: When Vision Meets Wireless
- Alexandre Alahi, Albert Haque, Li Fei-Fei
- Simultaneous Foreground Detection and Classification With Hybrid Features -Jaemyun Kim, Adín Ramírez Rivera, Byungyong Ryu, Oksam Chae
- Training a Feedback Loop for Hand Pose Estimation
- Markus Oberweger, Paul Wohlhart, Vincent Lepetit
- Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose
- Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton
- Where to Buy It: Matching Street Clothing Photos in Online Shops
- M. Hadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg
- Multi-Task Recurrent Neural Network for Immediacy Prediction
- Xiao Chu, Wanli Ouyang, Wei Yang, Xiaogang Wang
- Learning Complexity-Aware Cascades for Deep Pedestrian Detection
- Zhaowei Cai, Mohammad Saberian, Nuno Vasconcelos
-
TransCut: Transparent Object Segmentation From a Light-Field Image, Yichao Xu, Hajime Nagahara, Atsushi Shimada, Rin-ichiro Taniguchi
-
Learning Data-Driven Reflectance Priors for Intrinsic Image Decomposition, Tinghui Zhou, Philipp Krähenbühl, Alexei A. Efros
-
Intrinsic Depth: Improving Depth Transfer With Intrinsic Images
- Naejin Kong, Michael J. Black
-
Selective Encoding for Recognizing Unreliably Localized Faces, Ang Li, Vlad Morariu, Larry S. Davis
-
Confidence Preserving Machine for Facial Action Unit Detection, Jiabei Zeng, Wen-Sheng Chu, Fernando De la Torre, Jeffrey F. Cohn, Zhang Xiong
-
Learning Social Relation Traits From Face Images
- Zhanpeng Zhang, Ping Luo, Chen-Change Loy, Xiaoou Tang
- Robust Heart Rate Measurement From Video Using Select Random Patches
- Antony Lam, Yoshinori Kuno
-
Robust Facial Landmark Detection Under Significant Head Poses and Occlusion, Yue Wu, Qiang Ji
-
Conditional Convolutional Neural Network for Modality-Aware Face Recognition
- Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim
- From Facial Parts Responses to Face Detection: A Deep Learning Approach
- Shuo Yang, Ping Luo, Chen-Change Loy, Xiaoou Tang
-
Pose-Invariant 3D Face Alignment, Amin Jourabloo, Xiaoming Liu
-
From Emotions to Action Units With Hidden and Semi-Hidden-Task Learning
- Adrià Ruiz, Joost Van de Weijer, Xavier Binefa
- Deep Learning Face Attributes in the Wild
- Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang
-
Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification, Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao
-
Regressing a 3D Face Shape From a Single Image, Sergey Tulyakov, Nicu Sebe
-
A Spatio-Temporal Appearance Representation for Viceo-Based Pedestrian Re-Identification, Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang
-
Discriminative Pose-Free Descriptors for Face and Object Matching
- Soubhik Sanyal, Sivaram Prasad Mudunuri, Soma Biswas
- Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation
- Meina Kan, Shiguang Shan, Xilin Chen
- Person Recognition in Personal Photo Collections
- Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele
- Learning to Predict Saliency on Face Images
- Mai Xu, Yun Ren, Zulin Wang
-
Group Membership Prediction, Ziming Zhang, Yuting Chen, Venkatesh Saligrama
-
Robust RGB-D Odometry Using Point and Line Features, Yan Lu, Dezhen Song
-
Learning a Discriminative Model for the Perception of Realism in Composite Images, Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros
-
What Makes Tom Hanks Look Like Tom Hanks, Supasorn Suwajanakorn, Steven M. Seitz, Ira Kemelmacher-Shlizerman
-
Personalized Age Progression With Aging Dictionary
- Xiangbo Shu, Jinhui Tang, Hanjiang Lai, Luoqi Liu, Shuicheng Yan
- FaceDirector: Continuous Control of Facial Performance in Video
- Charles Malleson, Jean-Charles Bazin, Oliver Wang, Derek Bradley, Thabo Beeler, Adrian Hilton, Alexander Sorkine-Hornung
- Synthesizing Illumination Mosaics From Internet Photo-Collections
- Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm
- Hot or Not: Exploring Correlations Between Appearance and Temperature, Daniel Glasner, Pascal Fua, Todd Zickler, Lihi Zelnik-Manor
- Dense Semantic Correspondence Where Every Pixel is a Classifier, Hilton Bristow, Jack Valmadre, Simon Lucey
- Differential Recurrent Neural Networks for Action Recognition
- Vivek Veeriah, Naifan Zhuang, Guo-Jun Qi
- Simultaneous Deep Transfer Across Domains and Tasks
- Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko
-
Low Dimensional Explicit Feature Maps, Ondřej Chum
-
Unsupervised Learning of Spatiotemporally Coherent Metrics
- Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun
- Multi-Label Cross-Modal Retrieval
- Viresh Ranjan, Nikhil Rasiwasia, C. V. Jawahar
- Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data
- Tzu Ming Harry Hsu, Wei Yu Chen, Cheng-An Hou, Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang
- Geometry-Aware Deep Transform
- Jiaji Huang, Qiang Qiu, Robert Calderbank, Guillermo Sapiro
- Zero-Shot Learning via Semantic Similarity Embedding
- Ziming Zhang, Venkatesh Saligrama
- Multi-View Domain Generalization for Visual Recognition
- Li Niu, Wen Li, Dong Xu
- Infinite Feature Selection
- Giorgio Roffo, Simone Melzi, Marco Cristani
- Semi-Supervised Zero-Shot Classification With Label Representation Learning
- Xin Li, Yuhong Guo, Dale Schuurmans
- Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions
- Jimmy Lei Ba, Kevin Swersky, Sanja Fidler, Ruslan salakhutdinov
- Structured Feature Selection
- Tian Gao, Ziheng Wang, Qiang Ji
- Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning
- Yan Huang, Wei Wang, Liang Wang
- Learning Image and User Features for Recommendation in Social Networks
- Xue Geng, Hanwang Zhang, Jingwen Bian, Tat-Seng Chua
- Dual-Feature Warping-Based Motion Model Estimation
- Shiwei Li, Lu Yuan, Jian Sun, Long Quan
- An Adaptive Data Representation for Robust Point-Set Registration and Merging
- Dylan Campbell, Lars Petersson
- Learning Spatially Regularized Correlation Filters for Visual Tracking
- Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg
-
SpeDo: 6 DOF Ego-Motion Sensor Using Speckle Defocus Imaging, Kensei Jo, Mohit Gupta, Shree K. Nayar
-
Recurrent Network Models for Human Dynamics
- Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik
- Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment
- Huijun Di, Qingxuan Shi, Feng Lv, Ming Qin, Yao Lu
- Minimizing Human Effort in Interactive Tracking by Incremental Learning of Model Parameters
- Arridhana Ciptadi, James M. Rehg
- A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images
- Alberto Crivellaro, Mahdi Rad, Yannick Verdie, Kwang Moo Yi, Pascal Fua, Vincent Lepetit
- Linearization to Nonlinear Learning for Visual Tracking
- Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli
- Self-Occlusions and Disocclusions in Causal Video Object Segmentation
- Yanchao Yang, Ganesh Sundaramoorthi, Stefano Soatto
- Large Displacement 3D Scene Flow With Occlusion Reasoning
- Andrei Zanfir, Cristian Sminchisescu
- Category-Blind Human Action Recognition: A Practical Recognition System
- Wenbo Li, Longyin Wen, Mooi Choo Chuah, Siwei Lyu
- Weakly-Supervised Alignment of Video With Text
- Piotr Bojanowski, Rémi Lajugie, Edouard Grave, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid
- Learning Temporal Embeddings for Complex Video Analysis
- Vignesh Ramanathan, Kevin Tang, Greg Mori, Li Fei-Fei
- Unsupervised Semantic Parsing of Video Collections
- Ozan Sener, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
- Learning Spatiotemporal Features With 3D Convolutional Networks
- Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri
- Temporal Perception and Prediction in Ego-Centric Video
- Yipin Zhou, Tamara L. Berg
- Describing Videos by Exploiting Temporal Structure
- Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville
- Storyline Representation of Egocentric Videos With an Applications to Story-Based Search
- Bo Xiong, Gunhee Kim, Leonid Sigal
- Sequence to Sequence – Video to Text
- Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko
- Action Recognition by Hierarchical Mid-Level Action Elements
- Tian Lan, Yuke Zhu, Amir Roshan Zamir, Silvio Savarese
- Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks
- Lin Sun, Kui Jia, Dit-Yan Yeung, Bertram E. Shi
- Love Thy Neighbors: Image Annotation by Exploiting Image Metadata
- Justin Johnson, Lamberto Ballan, Li Fei-Fei
- Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders
- Huan Yang, Baoyuan Wang, Stephen Lin, David Wipf, Minyi Guo, Baining Guo
- Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos
- Elisa Ricci, Jagannadan Varadarajan, Ramanathan Subramanian, Samuel Rota Bulò, Narendra Ahuja, Oswald Lanz
-
Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!, Bilge Soran, Ali Farhadi, Linda Shapiro
-
Partial Person Re-Identification, Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jianhuang Lai, Shaogang Gong
-
Multiple Hypothesis Tracking Revisited, Chanho Kim, Fuxin Li, Arridhana Ciptadi, James M. Rehg
-
Learning to Track: Online Multi-Object Tracking by Decision Making, Yu Xiang, Alexandre Alahi, Silvio Savarese