Skip to content

Instantly share code, notes, and snippets.

@myungsub
Last active May 17, 2017 10:23
Show Gist options
  • Save myungsub/c99ea6a60320d06d6812 to your computer and use it in GitHub Desktop.
Save myungsub/c99ea6a60320d06d6812 to your computer and use it in GitHub Desktop.
upload candidates to awesome-deep-vision

Vision & Language

  • Ask Your Neurons: A Neural-Based Approach to Answering Questions About Images

    • Mateusz Malinowski, Marcus Rohrbach, Mario Fritz
  • Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books

    • Yukun Zhu, Ryan Kiros, Rich Zemel, Ruslan Salakhutdinov, Raquel Urtasun, Antonio Torralba, Sanja Fidler
  • Learning Query and Image Similarities With Ranking Canonical Correlation Analysis

    • Wah Ngo

Recognition, Low-Level Vision, and Biomedical Image Analysis

  1. Learning to See by Moving
  • Pulkit Agrawal, Joao Carreira, Jitendra Malik
  • scene recognition, object recognition, visual odometry, keypoint matching -- representation (feature) learning
  1. Convolutional Channel Features
  • Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li
  • pedestrian detection, face detection, edge detection, object proposal generation -- representation learning
  1. Local Convolutional Features With Unsupervised Training for Image Retrieval
  • Mattis Paulin, Matthijs Douze, Zaid Harchaoui, Julien Mairal, Florent Perronin, Cordelia Schmid
  • patch descriptor learning, image retrieval
  1. Discriminative Learning of Deep Convolutional Feature Point Descriptors
  • Edgar Simo-Serra, Eduard Trulls, Luis Ferraz, Iasonas Kokkinos, Pascal Fua, Francesc Moreno-Noguer
  • patch-level feature learning
  1. SALICON: Reducing the Semantic Gap in Saliency Prediction by Adapting Deep Neural Networks [Paper]
  • Xun Huang, Chengyao Shen, Xavier Boix, Qi Zhao
  • saliency detection
  1. Deep Networks for Image Super-Resolution With Sparse Prior
  • Zhaowen Wang, Ding Liu, Jianchao Yang, Wei Han, Thomas Huang
  • SR
  1. Learning Ordinal Relationships for Mid-Level Vision
  • Daniel Zoran, Phillip Isola, Dilip Krishnan, William T. Freeman
  • intrinsic image decomposition, depth from single image
  1. Deep Colorization
  • Zezhou Cheng, Qingxiong Yang, Bin Sheng
  • image colorization
  1. High-for-Low and Low-for-High: Efficient Boundary Detection From Deep Object Features and its Applications to High-Level Vision
  • Gedas Bertasius, Jianbo Shi, Lorenzo Torresani
  • boundary detection, semantic boundary labeling, semantic segmentation
  1. Video Super-Resolution via Deep Draft-Ensemble Learning
  • Renjie Liao, Xin Tao, Ruiyu Li, Ziyang Ma, Jiaya Jia
  • SR
  1. Compression Artifacts Reduction by a Deep Convolutional Network
  • Chao Dong, Yubin Deng, Chen Change Loy, Xiaoou Tang
  • JPEG artifact reduction

Recognition and 3D Computer Vision

  1. Semantic Pose Using Deep Networks Trained on Synthetic RGB-D
  • Jeremie Papon, Markus Schoeler
  • indoor scene understanding from rgb-d
  1. Learning Informative Edge Maps for Indoor Scene Layout Prediction
  • Arun Mallya, Svetlana Lazebnik
  • edge map prediction, indoor scene layout prediction
  1. Multi-View Convolutional Neural Networks for 3D Shape Recognition
  • Hang Su, Subhransu Maji, Evangelos Kalogerakis, Erik Learned-Miller
  • 3d shape classification and retrieval, 3d shape descriptor
  1. Learning Analysis-by-Synthesis for 6D Pose Estimation in RGB-D Images
  • Alexander Krull, Eric Brachmann, Frank Michel, Michael Ying Yang, Stefan Gumhold, Carsten Rother
  • 6d pose estimation
  1. A Deep Visual Correspondence Embedding Model for Stereo Matching Costs [KITTI-submission]
  • Zhuoyuan Chen, Xun Sun, Liang Wang, Yinan Yu, Chang Huang
  1. Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation
  • Xin Lu, Zhe Lin, Xiaohui Shen, Radomír Měch, James Z. Wang
  • image style recognition, aesthetic quality categorization, image quality estimation
  1. Improving Image Classification With Location Context
  • Kevin Tang, Manohar Paluri, Li Fei-Fei, Rob Fergus, Lubomir Bourdev
  • image(scene) classification
  1. HICO: A Benchmark for Recognizing Human-Object Interactions in Images
  • Yu-Wei Chao, Zhan Wang, Yugeng He, Jiaxuan Wang, Jia Deng
  • benchmark paper
  1. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
  • Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun
  • ImageNet Classification
  1. Cross-Domain Image Retrieval With a Dual Attribute-Aware Ranking Network
  • Junshi Huang, Rogerio S. Feris, Qiang Chen, Shuicheng Yan
  • clothing detection/retrieval
  1. Contextual Action Recognition With R*CNN
  • Georgia Gkioxari, Ross Girshick, Jitendra Malik
  • action recognition
  1. What Makes an Object Memorable?
  • Rachit Dubey, Joshua Peterson, Aditya Khosla, Ming-Hsuan Yang, Bernard Ghanem
  • understanding the memorability of objects in images
  1. MMSS: Multi-Modal Sharable and Specific Feature Learning for RGB-D Object Recognition
  • Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham
  1. Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN Model
  • Spyros Gidaris, Nikos Komodakis
  1. Neural Activation Constellations: Unsupervised Part Model Discovery With Convolutional Networks
  • Marcel Simon, Erik Rodner
  1. Multi-Scale Recognition With DAG-CNNs
  • Songfan Yang, Deva Ramanan
  1. Im2Calories: Towards an Automated Mobile Vision Food Diary
  • Austin Meyers, Nick Johnston, Vivek Rathod, Anoop Korattikara, Alex Gorban, Nathan Silberman, Sergio Guadarrama, George Papandreou, Jonathan Huang, Kevin P. Murphy
  1. Aggregating Local Deep Features for Image Retrieval
  • Artem Babenko, Victor Lempitsky
  1. Learning Deep Object Detectors From 3D Models
  • Xingchao Peng, Baochen Sun, Karim Ali, Kate Saenko
  1. Harvesting Discriminative Meta Objects With Deep CNN Features for Scene Classification
  • Ruobing Wu, Baoyuan Wang, Wenping Wang, Yizhou Yu
  1. Scalable Nonlinear Embeddings for Semantic Category-Based Image Retrieval
  • Gaurav Sharma, Bernt Schiele
  • image retrieval

Segmentation, Edges and Saliency

  1. Semantic Image Segmentation via Deep Parsing Network
  • Ziwei Liu, Xiaoxiao Li, Ping Luo, Chen-Change Loy, Xiaoou Tang
  1. Human Parsing With Contextualized Convolutional Neural Network
  • Xiaodan Liang, Chunyan Xu, Xiaohui Shen, Jianchao Yang, Si Liu, Jinhui Tang, Liang Lin, Shuicheng Yan
  1. Holistically-Nested Edge Detection
  • Saining Xie, Zhuowen Tu

Learning Representations & Attributes

  1. Learning Image Representations Tied to Ego-Motion
  • Dinesh Jayaraman, Kristen Grauman
  • representation learning
  1. Unsupervised Visual Representation Learning by Context Prediction
  • Carl Doersch, Abhinav Gupta, Alexei A. Efros
  1. Webly Supervised Learning of Convolutional Networks
  • Xinlei Chen, Abhinav Gupta
  1. Fast R-CNN, Ross Girshick

  2. Bilinear CNN Models for Fine-Grained Visual Recognition

  • Tsung-Yu Lin, Aruni RoyChowdhury, Subhransu Maji

Statistical Methods & Learning

  1. Deep Neural Decision Forests
  • Peter Kontschieder, Madalina Fiterau, Antonio Criminisi, Samuel Rota Bulò
  1. Deep Fried Convnets
  • Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang
  1. Semantic Component Analysis
  • Calvin Murdock, Fernando De la Torre
  1. Learning Discriminative Reconstructions for Unsupervised Outlier Removal -Yan Xia, Xudong Cao, Fang Wen, Gang Hua, Jian Sun

Optimization, Segmentation, and Recognition

  1. Learning Deconvolution Network for Semantic Segmentation
  • Hyeonwoo Noh, Seunghoon Hong, Bohyung Han
  1. Conditional Random Fields as Recurrent Neural Networks
  • Shuai Zheng, Sadeep Jayasumana, Bernardino Romera-Paredes, Vibhav Vineet, Zhizhong Su, Dalong Du, Chang Huang, Philip H. S. Torr
  1. Boosting Object Proposals: From Pascal to COCO
  • Jordi Pont-Tuset, Luc Van Gool
  1. Joint Object and Part Segmentation Using Deep Learned Potentials
  • Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan L. Yuille
  1. BodyPrint: Pose Invariant 3D Shape Matching of Human Bodies
  • Jiangping Wang, Kai Ma, Vivek Kumar Singh, Thomas Huang, Terrence Chen
  1. Contour Guided Hierarchical Model for Shape Matching
  • Yuanqi Su, Yuehu Liu, Bonan Cuan, Nanning Zheng
  1. Robust Image Segmentation Using Contour-Guided Color Palettes
  • Xiang Fu, Chien-Yi Wang, Chen Chen, Changhu Wang, C.-C. Jay Kuo
  1. BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation
  • Jifeng Dai, Kaiming He, Jian Sun
  1. Detection and Segmentation of 2D Curved Reflection Symmetric Structures
  • Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos
  1. Unsupervised Tube Extraction Using Transductive Learning and Dense Trajectories
  • Mihai Marian Puscas, Enver Sangineto, Dubravko Culibrk, Nicu Sebe
  1. Compositional Hierarchical Representation of Shape Manifolds for Classification of Non-Manifold Shapes
  • Mete Ozay, Umit Rusen Aktas, Jeremy L. Wyatt, Aleš Leonardis
  1. Learning to Combine Mid-Level Cues for Object Proposal Generation
  • Tom Lee, Sanja Fidler, Sven Dickinson
  1. Enhancing Road Maps by Parsing Aerial Images Around the World
  • Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun
  1. StereoSnakes: Contour Based Consistent Object Extraction For Stereo Images
  • Ran Ju, Tongwei Ren, Gangshan Wu
  1. Semantic Segmentation of RGBD Images With Mutex Constraints
  • Zhuo Deng, Sinisa Todorovic, Longin Jan Latecki
  1. Weakly- and Semi-Supervised Learning of a Deep Convolutional Network for Semantic Image Segmentation
  • George Papandreou, Liang-Chieh Chen, Kevin P. Murphy, Alan L. Yuille
  1. Parsimonious Labeling
  • Puneet K. Dokania, M. Pawan Kumar
  1. Constrained Convolutional Neural Networks for Weakly Supervised Segmentation
  • Deepak Pathak, Philipp Krähenbühl, Trevor Darrell
  1. Convolutional Sparse Coding for Image Super-Resolution
  • Shuhang Gu, Wangmeng Zuo, Qi Xie, Deyu Meng, Xiangchu Feng, Lei Zhang
  1. Depth-Based Hand Pose Estimation: Data, Methods, and Challenges
  • James S. Supančič III, Grégory Rogez, Yi Yang, Jamie Shotton, Deva Ramanan
  1. Learning Deep Representation With Large-Scale Attributes
  • Wanli Ouyang, Hongyang Li, Xingyu Zeng, Xiaogang Wang
  1. Deep Learning Strong Parts for Pedestrian Detection
  • Yonglong Tian, Ping Luo, Xiaogang Wang, Xiaoou Tang
  1. Flowing ConvNets for Human Pose Estimation in Videos
  • Tomas Pfister, James Charles, Andrew Zisserman
  1. BubbLeNet: Foveated Imaging for Visual Discovery
  • Kevin Matzen, Noah Snavely
  1. Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions
  • Sven Bambach, Stefan Lee, David J. Crandall, Chen Yu
  1. Relaxing From Vocabulary: Robust Weakly-Supervised Deep Learning for Vocabulary-Free Image Tagging
  • Jianlong Fu, Yue Wu, Tao Mei, Jinqiao Wang, Hanqing Lu, Yong Rui
  1. Visual Phrases for Exemplar Face Detection
  • Vijay Kumar, Anoop Namboodiri, C. V. Jawahar
  1. Spatial Semantic Regularisation for Large Scale Object Detection
  • Damian Mrowca, Marcus Rohrbach, Judy Hoffman, Ronghang Hu, Kate Saenko, Trevor Darrell
  1. Human Pose Estimation in Videos
  • Dong Zhang, Mubarak Shah
  1. Contour Box: Rejecting Object Proposals Without Explicit Closed Contours
  • Cewu Lu, Shu Liu, Jiaya Jia, Chi-Keung Tang

Recognition and 3D CV

  1. Joint Camera Clustering and Surface Segmentation for Large-Scale Multi-View Stereo, Runze Zhang, Shiwei Li, Tian Fang, Siyu Zhu, Long Quan

  2. Hyperpoints and Fine Vocabularies for Large-Scale Location Recognition, Torsten Sattler, Michal Havlena, Filip Radenović, Konrad Schindler, Marc Pollefeys

  3. Globally Optimal 2D-3D Registration From Points or Lines Without Correspondences, Mark Brown, David Windridge, Jean-Yves Guillemaut

  4. Semantically-Aware Aerial Reconstruction From Multi-Modal Data

  • Randi Cabezas, Julian Straub, John W. Fisher III
  1. Exploiting Object Similarity in 3D Reconstruction
  • Chen Zhou, Fatma Güney, Yizhou Wang, Andreas Geiger
  1. You Are Here: Mimicking the Human Thinking Process in Reading Floor-Plans
  • Hang Chu, Dong Ki Kim, Tsuhan Chen
  1. The Likelihood-Ratio Test and Efficient Robust Estimation
  • Andrea Cohen, Christopher Zach
  1. Real-Time Pose Estimation Piggybacked on Object Detection
  • Roman Juránek, Adam Herout, Markéta Dubská, Pavel Zemčík
  1. Understanding and Predicting Image Memorability at a Large Scale
  • Aditya Khosla, Akhil S. Raju, Antonio Torralba, Aude Oliva
  1. Multiple Granularity Descriptors for Fine-Grained Categorization
  • Dequan Wang, Zhiqiang Shen, Jie Shao, Wei Zhang, Xiangyang Xue, Zheng Zhang
  1. Guiding the Long-Short Term Memory Model for Image Caption Generation
  • Xu Jia, Efstratios Gavves, Basura Fernando, Tinne Tuytelaars
  1. Just Noticeable Differences in Visual Attributes
  • Aron Yu, Kristen Grauman
  1. VQA: Visual Question Answering
  • Stanislaw Antol, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh
  1. Localize Me Anywhere, Anytime: A Multi-Task Point-Retrieval Approach
  • Guoyu Lu, Yan Yan, Li Ren, Jingkuan Song, Nicu Sebe, Chandra Kambhamettu
  1. Dense Optical Flow Prediction From a Static Image
  • Jacob Walker, Abhinav Gupta, Martial Hebert
  1. Visual Madlibs: Fill in the Blank Description Generation and Question Answering
  • Licheng Yu, Eunbyung Park, Alexander C. Berg, Tamara L. Berg
  1. Actions and Attributes From Wholes and Parts
  • Georgia Gkioxari, Ross Girshick, Jitendra Malik
  1. DeepBox: Learning Objectness With Convolutional Networks
  • Weicheng Kuo, Bharath Hariharan, Jitendra Malik
  1. Active Object Localization With Deep Reinforcement Learning
  • Juan C. Caicedo, Svetlana Lazebnik
  1. Scene-Domain Active Part Models for Object Representation
  • Zhou Ren, Chaohui Wang, Alan L. Yuille
  1. A Unified Multiplicative Framework for Attribute Learning
  • Kongming Liang, Hong Chang, Shiguang Shan, Xilin Chen
  1. Contractive Rectifier Networks for Nonlinear Maximum Margin Classification
  • Senjian An, Munawar Hayat, Salman H. Khan, Mohammed Bennamoun, Farid Boussaid, Ferdous Sohel
  1. Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization
  • Zhe Xu, Shaoli Huang, Ya Zhang, Dacheng Tao
  1. Learning Like a Child: Fast Novel Visual Concept Learning From Sentence Descriptions of Images
  • Junhua Mao, Xu Wei, Yi Yang, Jiang Wang, Zhiheng Huang, Alan L. Yuille
  1. Learning Common Sense Through Visual Abstraction
  • Ramakrishna Vedantam, Xiao Lin, Tanmay Batra, C. Lawrence Zitnick, Devi Parikh
  1. Domain Generalization for Object Recognition With Multi-Task Autoencoders
  • Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi
  1. Square Localization for Efficient and Accurate Object Detection
  • Cewu Lu, Yongyi Lu, Hao Chen, Chi-Keung Tang
  1. Box Aggregation for Proposal Decimation: Last Mile of Object Detection
  • Shu Liu, Cewu Lu, Jiaya Jia
  1. DeepProposal: Hunting Objects by Cascading Deep Convolutional Layers
  • Amir Ghodrati, Ali Diba, Marco Pedersoli, Tinne Tuytelaars, Luc Van Gool
  1. Semantic Segmentation With Object Clique Potential
  • Xiaojuan Qi, Jianping Shi, Shu Liu, Renjie Liao, Jiaya Jia
  1. Automatic Concept Discovery From Parallel Text and Visual Corpora
  • Chen Sun, Chuang Gan, Ram Nevatia
  1. Monocular Object Instance Segmentation and Depth Ordering With CNNs
  • Ziyu Zhang, Alexander G. Schwing, Sanja Fidler, Raquel Urtasun
  1. Multimodal Convolutional Neural Networks for Matching Image and Sentence
  • Lin Ma, Zhengdong Lu, Lifeng Shang, Hang Li
  1. Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models
  • Bryan A. Plummer, Liwei Wang, Chris M. Cervantes, Juan C. Caicedo, Julia Hockenmaier, Svetlana Lazebnik
  1. Predicting Depth, Surface Normals and Semantic Labels With a Common Multi-Scale Convolutional Architecture
  • David Eigen, Rob Fergus
  1. AttentionNet: Aggregating Weak Directions for Accurate Object Detection
  • Donggeun Yoo, Sunggyun Park, Joon-Young Lee, Anthony S. Paek, In So Kweon
  1. Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images
  • Yoshitaka Ushiku, Masataka Yamaguchi, Yusuke Mukuta, Tatsuya Harada

Representations for Recognition & Localization

  1. 3D-Assisted Feature Synthesis for Novel Views of an Object, Hao Su, Fan Wang, Eric Yi, Leonidas J. Guibas

  2. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained With Rendered 3D Model Views, Hao Su, Charles R. Qi, Yangyan Li, Leonidas J. Guibas

Statistical Methods & Learning, Motion & Tracking, and Video Analysis

  1. DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
  • Chenyi Chen, Ari Seff, Alain Kornhauser, Jianxiong Xiao
  1. Active Transfer Learning With Zero-Shot Priors: Reusing Past Datasets for Future Tasks
  • Efstratios Gavves, Thomas Mensink, Tatiana Tommasi, Cees G. M. Snoek, Tinne Tuytelaars
  1. HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition
  • Zhicheng Yan, Hao Zhang, Robinson Piramuthu, Vignesh Jagadeesh, Dennis DeCoste, Wei Di, Yizhou Yu
  1. Learning The Structure of Deep Convolutional Networks
  • Jiashi Feng, Trevor Darrell
  1. FlowNet: Learning Optical Flow With Convolutional Networks
  • Alexey Dosovitskiy, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox
  1. Unsupervised Learning of Visual Representations Using Videos
  • Xiaolong Wang, Abhinav Gupta
  1. A Nonparametric Bayesian Approach Toward Stacked Convolutional Independent Component Analysis
  • Sotirios P. Chatzis, Dimitrios Kosmopoulos
  1. Robust Optimization for Deep Regression
  • Vasileios Belagiannis, Christian Rupprecht, Gustavo Carneiro, Nassir Navab
  1. Maximum-Margin Structured Learning With Deep Networks for 3D Human Pose Estimation
  • Sijin Li, Weichen Zhang, Antoni B. Chan
  1. An Exploration of Parameter Redundancy in Deep Networks With Circulant Projections
  • Yu Cheng, Felix X. Yu, Rogerio S. Feris, Sanjiv Kumar, Alok Choudhary, Shi-Fu Chang
  1. Understanding Deep Features With Computer-Generated Imagery
  • Mathieu Aubry, Bryan C. Russell
  1. Context-Aware CNNs for Person Head Detection
  • Tuan-Hung Vu, Anton Osokin, Ivan Laptev
  1. Highly-Expressive Spaces of Well-Behaved Transformations: Keeping It Simple
  • Oren Freifeld, Søren Hauberg, Kayhan Batmanghelich, John W. Fisher III
  1. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization
  • Alex Kendall, Matthew Grimes, Roberto Cipolla
  1. Predicting Multiple Structured Visual Interpretations
  • Debadeepta Dey, Varun Ramakrishna, Martial Hebert, J. Andrew Bagnell
  1. Look and Think Twice: Capturing Top-Down Visual Attention With Feedback Convolutional Neural Networks
  • Chunshui Cao, Xianming Liu, Yi Yang, Yinan Yu, Jiang Wang, Zilei Wang, Yongzhen Huang, Liang Wang, Chang Huang, Wei Xu, Deva Ramanan, Thomas S. Huang
  1. Matrix Backpropagation for Deep Networks With Structured Layers
  • Catalin Ionescu, Orestis Vantzos, Cristian Sminchisescu
  1. Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition
  • Heechul Jung, Sihaeng Lee, Junho Yim, Sunjeong Park, Junmo Kim
  1. Direct Intrinsics: Learning Albedo-Shading Decomposition by Convolutional Regression
  • Takuya Narihira, Michael Maire, Stella X. Yu
  1. Face Flow
  • Patrick Snape, Anastasios Roussos, Yannis Panagakis, Stefanos Zafeiriou
  1. Hierarchical Convolutional Features for Visual Tracking
  • Chao Ma, Jia-Bin Huang, Xiaokang Yang, Ming-Hsuan Yang
  1. Online Object Tracking With Proposal Selection
  • Yang Hua, Karteek Alahari, Cordelia Schmid
  1. Understanding and Diagnosing Visual Tracking Systems
  • Naiyan Wang, Jianping Shi, Dit-Yan Yeung, Jiaya Jia
  1. Visual Tracking With Fully Convolutional Networks
  • Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu
  1. Multiple Feature Fusion via Weighted Entropy for Visual Tracking
  • Lin Ma, Jiwen Lu, Jianjiang Feng, Jie Zhou
  1. Pedestrian Travel Time Estimation in Crowded Scenes
  • Shuai Yi, Hongsheng Li, Xiaogang Wang
  1. Learning to Track for Spatio-Temporal Action Localization
  • Philippe Weinzaepfel, Zaid Harchaoui, Cordelia Schmid
  1. Unsupervised Object Discovery and Tracking in Video Collections
  • Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid
  1. Car That Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models
  • Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena
  1. P-CNN: Pose-Based CNN Features for Action Recognition
  • Guilhem Chéron, Ivan Laptev, Cordelia Schmid
  1. Fully Connected Object Proposals for Video Segmentation
  • Federico Perazzi, Oliver Wang, Markus Gross, Alexander Sorkine-Hornung
  1. Video Segmentation With Just a Few Strokes
  • Naveen Shankar Nagaraja, Frank R. Schmidt, Thomas Brox
  1. Actionness-Assisted Recognition of Actions
  • Ye Luo, Loong-Fah Cheong, An Tran
  1. RGB-W: When Vision Meets Wireless
  • Alexandre Alahi, Albert Haque, Li Fei-Fei
  1. Simultaneous Foreground Detection and Classification With Hybrid Features -Jaemyun Kim, Adín Ramírez Rivera, Byungyong Ryu, Oksam Chae

Vision & People

  1. Training a Feedback Loop for Hand Pose Estimation
  • Markus Oberweger, Paul Wohlhart, Vincent Lepetit
  1. Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose
  • Danhang Tang, Jonathan Taylor, Pushmeet Kohli, Cem Keskin, Tae-Kyun Kim, Jamie Shotton
  1. Where to Buy It: Matching Street Clothing Photos in Online Shops
  • M. Hadi Kiapour, Xufeng Han, Svetlana Lazebnik, Alexander C. Berg, Tamara L. Berg
  1. Multi-Task Recurrent Neural Network for Immediacy Prediction
  • Xiao Chu, Wanli Ouyang, Wei Yang, Xiaogang Wang
  1. Learning Complexity-Aware Cascades for Deep Pedestrian Detection
  • Zhaowei Cai, Mohammad Saberian, Nuno Vasconcelos

Computational Photography, Face & Gesture, and Vision for X

  1. TransCut: Transparent Object Segmentation From a Light-Field Image, Yichao Xu, Hajime Nagahara, Atsushi Shimada, Rin-ichiro Taniguchi

  2. Learning Data-Driven Reflectance Priors for Intrinsic Image Decomposition, Tinghui Zhou, Philipp Krähenbühl, Alexei A. Efros

  3. Intrinsic Depth: Improving Depth Transfer With Intrinsic Images

  • Naejin Kong, Michael J. Black
  1. Selective Encoding for Recognizing Unreliably Localized Faces, Ang Li, Vlad Morariu, Larry S. Davis

  2. Confidence Preserving Machine for Facial Action Unit Detection, Jiabei Zeng, Wen-Sheng Chu, Fernando De la Torre, Jeffrey F. Cohn, Zhang Xiong

  3. Learning Social Relation Traits From Face Images

  • Zhanpeng Zhang, Ping Luo, Chen-Change Loy, Xiaoou Tang
  1. Robust Heart Rate Measurement From Video Using Select Random Patches
  • Antony Lam, Yoshinori Kuno
  1. Robust Facial Landmark Detection Under Significant Head Poses and Occlusion, Yue Wu, Qiang Ji

  2. Conditional Convolutional Neural Network for Modality-Aware Face Recognition

  • Chao Xiong, Xiaowei Zhao, Danhang Tang, Karlekar Jayashree, Shuicheng Yan, Tae-Kyun Kim
  1. From Facial Parts Responses to Face Detection: A Deep Learning Approach
  • Shuo Yang, Ping Luo, Chen-Change Loy, Xiaoou Tang
  1. Pose-Invariant 3D Face Alignment, Amin Jourabloo, Xiaoming Liu

  2. From Emotions to Action Units With Hidden and Semi-Hidden-Task Learning

  • Adrià Ruiz, Joost Van de Weijer, Xavier Binefa
  1. Deep Learning Face Attributes in the Wild
  • Ziwei Liu, Ping Luo, Xiaogang Wang, Xiaoou Tang
  1. Multi-Task Learning With Low Rank Attribute Embedding for Person Re-Identification, Chi Su, Fan Yang, Shiliang Zhang, Qi Tian, Larry S. Davis, Wen Gao

  2. Regressing a 3D Face Shape From a Single Image, Sergey Tulyakov, Nicu Sebe

  3. A Spatio-Temporal Appearance Representation for Viceo-Based Pedestrian Re-Identification, Kan Liu, Bingpeng Ma, Wei Zhang, Rui Huang

  4. Discriminative Pose-Free Descriptors for Face and Object Matching

  • Soubhik Sanyal, Sivaram Prasad Mudunuri, Soma Biswas
  1. Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation
  • Meina Kan, Shiguang Shan, Xilin Chen
  1. Person Recognition in Personal Photo Collections
  • Seong Joon Oh, Rodrigo Benenson, Mario Fritz, Bernt Schiele
  1. Learning to Predict Saliency on Face Images
  • Mai Xu, Yun Ren, Zulin Wang
  1. Group Membership Prediction, Ziming Zhang, Yuting Chen, Venkatesh Saligrama

  2. Robust RGB-D Odometry Using Point and Line Features, Yan Lu, Dezhen Song

  3. Learning a Discriminative Model for the Perception of Realism in Composite Images, Jun-Yan Zhu, Philipp Krähenbühl, Eli Shechtman, Alexei A. Efros

  4. What Makes Tom Hanks Look Like Tom Hanks, Supasorn Suwajanakorn, Steven M. Seitz, Ira Kemelmacher-Shlizerman

  5. Personalized Age Progression With Aging Dictionary

  • Xiangbo Shu, Jinhui Tang, Hanjiang Lai, Luoqi Liu, Shuicheng Yan
  1. FaceDirector: Continuous Control of Facial Performance in Video
  • Charles Malleson, Jean-Charles Bazin, Oliver Wang, Derek Bradley, Thabo Beeler, Adrian Hilton, Alexander Sorkine-Hornung
  1. Synthesizing Illumination Mosaics From Internet Photo-Collections
  • Dinghuang Ji, Enrique Dunn, Jan-Michael Frahm
  1. Hot or Not: Exploring Correlations Between Appearance and Temperature, Daniel Glasner, Pascal Fua, Todd Zickler, Lihi Zelnik-Manor

Motion & Correspondence

  1. Dense Semantic Correspondence Where Every Pixel is a Classifier, Hilton Bristow, Jack Valmadre, Simon Lucey

Statiscal Methods & Learning, Motion & Tracking, and Video Analysis II

  1. Differential Recurrent Neural Networks for Action Recognition
  • Vivek Veeriah, Naifan Zhuang, Guo-Jun Qi
  1. Simultaneous Deep Transfer Across Domains and Tasks
  • Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko
  1. Low Dimensional Explicit Feature Maps, Ondřej Chum

  2. Unsupervised Learning of Spatiotemporally Coherent Metrics

  • Ross Goroshin, Joan Bruna, Jonathan Tompson, David Eigen, Yann LeCun
  1. Multi-Label Cross-Modal Retrieval
  • Viresh Ranjan, Nikhil Rasiwasia, C. V. Jawahar
  1. Unsupervised Domain Adaptation With Imbalanced Cross-Domain Data
  • Tzu Ming Harry Hsu, Wei Yu Chen, Cheng-An Hou, Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang
  1. Geometry-Aware Deep Transform
  • Jiaji Huang, Qiang Qiu, Robert Calderbank, Guillermo Sapiro
  1. Zero-Shot Learning via Semantic Similarity Embedding
  • Ziming Zhang, Venkatesh Saligrama
  1. Multi-View Domain Generalization for Visual Recognition
  • Li Niu, Wen Li, Dong Xu
  1. Infinite Feature Selection
  • Giorgio Roffo, Simone Melzi, Marco Cristani
  1. Semi-Supervised Zero-Shot Classification With Label Representation Learning
  • Xin Li, Yuhong Guo, Dale Schuurmans
  1. Predicting Deep Zero-Shot Convolutional Neural Networks Using Textual Descriptions
  • Jimmy Lei Ba, Kevin Swersky, Sanja Fidler, Ruslan salakhutdinov
  1. Structured Feature Selection
  • Tian Gao, Ziheng Wang, Qiang Ji
  1. Conditional High-Order Boltzmann Machine: A Supervised Learning Model for Relation Learning
  • Yan Huang, Wei Wang, Liang Wang
  1. Learning Image and User Features for Recommendation in Social Networks
  • Xue Geng, Hanwang Zhang, Jingwen Bian, Tat-Seng Chua
  1. Dual-Feature Warping-Based Motion Model Estimation
  • Shiwei Li, Lu Yuan, Jian Sun, Long Quan
  1. An Adaptive Data Representation for Robust Point-Set Registration and Merging
  • Dylan Campbell, Lars Petersson
  1. Learning Spatially Regularized Correlation Filters for Visual Tracking
  • Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg
  1. SpeDo: 6 DOF Ego-Motion Sensor Using Speckle Defocus Imaging, Kensei Jo, Mohit Gupta, Shree K. Nayar

  2. Recurrent Network Models for Human Dynamics

  • Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik
  1. Contour Flow: Middle-Level Motion Estimation by Combining Motion Segmentation and Contour Alignment
  • Huijun Di, Qingxuan Shi, Feng Lv, Ming Qin, Yao Lu
  1. Minimizing Human Effort in Interactive Tracking by Incremental Learning of Model Parameters
  • Arridhana Ciptadi, James M. Rehg
  1. A Novel Representation of Parts for Accurate 3D Object Detection and Tracking in Monocular Images
  • Alberto Crivellaro, Mahdi Rad, Yannick Verdie, Kwang Moo Yi, Pascal Fua, Vincent Lepetit
  1. Linearization to Nonlinear Learning for Visual Tracking
  • Bo Ma, Hongwei Hu, Jianbing Shen, Yuping Zhang, Fatih Porikli
  1. Self-Occlusions and Disocclusions in Causal Video Object Segmentation
  • Yanchao Yang, Ganesh Sundaramoorthi, Stefano Soatto
  1. Large Displacement 3D Scene Flow With Occlusion Reasoning
  • Andrei Zanfir, Cristian Sminchisescu
  1. Category-Blind Human Action Recognition: A Practical Recognition System
  • Wenbo Li, Longyin Wen, Mooi Choo Chuah, Siwei Lyu
  1. Weakly-Supervised Alignment of Video With Text
  • Piotr Bojanowski, Rémi Lajugie, Edouard Grave, Francis Bach, Ivan Laptev, Jean Ponce, Cordelia Schmid
  1. Learning Temporal Embeddings for Complex Video Analysis
  • Vignesh Ramanathan, Kevin Tang, Greg Mori, Li Fei-Fei
  1. Unsupervised Semantic Parsing of Video Collections
  • Ozan Sener, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena
  1. Learning Spatiotemporal Features With 3D Convolutional Networks
  • Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, Manohar Paluri
  1. Temporal Perception and Prediction in Ego-Centric Video
  • Yipin Zhou, Tamara L. Berg
  1. Describing Videos by Exploiting Temporal Structure
  • Li Yao, Atousa Torabi, Kyunghyun Cho, Nicolas Ballas, Christopher Pal, Hugo Larochelle, Aaron Courville
  1. Storyline Representation of Egocentric Videos With an Applications to Story-Based Search
  • Bo Xiong, Gunhee Kim, Leonid Sigal
  1. Sequence to Sequence – Video to Text
  • Subhashini Venugopalan, Marcus Rohrbach, Jeffrey Donahue, Raymond Mooney, Trevor Darrell, Kate Saenko
  1. Action Recognition by Hierarchical Mid-Level Action Elements
  • Tian Lan, Yuke Zhu, Amir Roshan Zamir, Silvio Savarese
  1. Human Action Recognition Using Factorized Spatio-Temporal Convolutional Networks
  • Lin Sun, Kui Jia, Dit-Yan Yeung, Bertram E. Shi
  1. Love Thy Neighbors: Image Annotation by Exploiting Image Metadata
  • Justin Johnson, Lamberto Ballan, Li Fei-Fei
  1. Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-Encoders
  • Huan Yang, Baoyuan Wang, Stephen Lin, David Wipf, Minyi Guo, Baining Guo

Video -- Actions, Surveillance & Tracking

  1. Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations From Surveillance Videos
  • Elisa Ricci, Jagannadan Varadarajan, Ramanathan Subramanian, Samuel Rota Bulò, Narendra Ahuja, Oswald Lanz
  1. Generating Notifications for Missing Actions: Don't Forget to Turn the Lights Off!, Bilge Soran, Ali Farhadi, Linda Shapiro

  2. Partial Person Re-Identification, Wei-Shi Zheng, Xiang Li, Tao Xiang, Shengcai Liao, Jianhuang Lai, Shaogang Gong

  3. Multiple Hypothesis Tracking Revisited, Chanho Kim, Fuxin Li, Arridhana Ciptadi, James M. Rehg

  4. Learning to Track: Online Multi-Object Tracking by Decision Making, Yu Xiang, Alexandre Alahi, Silvio Savarese

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment