| 2016 | A Computational Approach to Finding Facial Patterns of a Babyface. Zi-Yi Ke, Mei-Chen Yeh |
| 2016 | A Quality Adaptive Multimodal Affect Recognition System for User-Centric Multimedia Indexing. Rishabh Gupta, Mojtaba Khomami Abadi, Jesús Alejandro Cárdenes Cabré, Fabio Morreale, Tiago H. Falk, Nicu Sebe |
| 2016 | A Short Survey of Recent Advances in Graph Matching. Junchi Yan, Xu-Cheng Yin, Weiyao Lin, Cheng Deng, Hongyuan Zha, Xiaokang Yang |
| 2016 | ACD: Action Concept Discovery from Image-Sentence Corpora. Jiyang Gao, Chen Sun, Ram Nevatia |
| 2016 | Accurate Aggregation of Local Features by using K-sparse Autoencoder for 3D Model Retrieval. Takahiko Furuya, Ryutarou Ohbuchi |
| 2016 | Action Recognition by Learning Deep Multi-Granular Spatio-Temporal Video Representation. Qing Li, Zhaofan Qiu, Ting Yao, Tao Mei, Yong Rui, Jiebo Luo |
| 2016 | Adding Chinese Captions to Images. Xirong Li, Weiyu Lan, Jianfeng Dong, Hailong Liu |
| 2016 | An Automated End-To-End Pipeline for Fine-Grained Video Annotation using Deep Neural Networks. Baptist Vandersmissen, Lucas Sterckx, Thomas Demeester, Azarakhsh Jalalvand, Wesley De Neve, Rik Van de Walle |
| 2016 | Audiovisual Summarization of Lectures and Meetings Using a Segment Similarity Graph. Chidansh Amitkumar Bhatt, Andrei Popescu-Belis, Matthew Cooper |
| 2016 | Automatic Identification of Sports Video Highlights using Viewer Interest Features. Prithwi Raj Chakraborty, Dian Tjondronegoro, Ligang Zhang, Vinod Chandran |
| 2016 | Bags of Local Convolutional Features for Scalable Instance Search. Eva Mohedano, Kevin McGuinness, Noel E. O'Connor, Amaia Salvador, Ferran Marqués, Xavier Giró-i-Nieto |
| 2016 | Bidirectional Joint Representation Learning with Symmetrical Deep Neural Networks for Multimodal and Crossmodal Applications. Vedran Vukotic, Christian Raymond, Guillaume Gravier |
| 2016 | CNN-based Style Vector for Style Image Retrieval. Shin Matsuo, Keiji Yanai |
| 2016 | Combining Holistic and Part-based Deep Representations for Computational Painting Categorization. Rao Muhammad Anwer, Fahad Shahbaz Khan, Joost van de Weijer, Jorma Laaksonen |
| 2016 | Complura: Exploring and Leveraging a Large-scale Multilingual Visual Sentiment Ontology. Hongyi Liu, Brendan Jou, Tao Chen, Mercan Topkara, Nikolaos Pappas, Miriam Redi, Shih-Fu Chang |
| 2016 | Constrained Local Enhancement of Semantic Features by Content-Based Sparsity. Youssef Tamaazousti, Hervé Le Borgne, Adrian Popescu |
| 2016 | Correlation Autoencoder Hashing for Supervised Cross-Modal Search. Yue Cao, Mingsheng Long, Jianmin Wang, Han Zhu |
| 2016 | Discriminant Cross-modal Hashing. Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen |
| 2016 | Diverse Concept-Level Features for Multi-Object Classification. Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot |
| 2016 | Diverse Yet Efficient Retrieval using Locality Sensitive Hashing. Vidyadhar Rao, Prateek Jain, C. V. Jawahar |
| 2016 | Emotion Recognition from EEG Signals Enhanced by User's Profile. Tanfang Chen, Shangfei Wang, Zhen Gao, Chongliang Wu |
| 2016 | Event Detection with Zero Example: Select the Right and Suppress the Wrong Concepts. Yi-Jie Lu, Hao Zhang, Maaike de Boer, Chong-Wah Ngo |
| 2016 | Facial Landmark Detection and Tracking for Facial Behavior Analysis. Yue Wu |
| 2016 | Foreground Object Sensing for Saliency Detection. Hengliang Zhu, Bin Sheng, Xiao Lin, Yangyang Hao, Lizhuang Ma |
| 2016 | GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring. Wenjing Ma, Liangliang Cao, Lei Yu, Guoping Long, Yucheng Li |
| 2016 | Homemade TS-Net for Automatic Face Recognition. Shilun Lin, Zhicheng Zhao, Fei Su |
| 2016 | Human's Scene Sketch Understanding. Yuxiang Ye, Yijuan Lu, Hao Jiang |
| 2016 | Image Annotation using Multi-scale Hypergraph Heat Diffusion Framework. Venkatesh N. Murthy, Avinash Sharma, Visesh Chari, R. Manmatha |
| 2016 | Incremental Learning for Fine-Grained Image Recognition. Liangliang Cao, Jenhao Hsiao, Paloma de Juan, Yuncheng Li, Bart Thomee |
| 2016 | Interactive Multimodal Learning on 100 Million Images. Jan Zahálka, Stevan Rudinac, Björn Þór Jónsson, Dennis C. Koelma, Marcel Worring |
| 2016 | Introducing Concept And Syntax Transition Networks for Image Captioning. Philipp Blandfort, Tushar Karayil, Damian Borth, Andreas Dengel |
| 2016 | Item-Based Video Recommendation: An Hybrid Approach considering Human Factors. Andrea Ferracani, Daniele Pezzatini, Marco Bertini, Alberto Del Bimbo |
| 2016 | Large-Scale E-Commerce Image Retrieval with Top-Weighted Convolutional Neural Networks. Shichao Zhao, Youjiang Xu, Yahong Han |
| 2016 | Learning Music Embedding with Metadata for Context Aware Recommendation. Dongjing Wang, Shuiguang Deng, Xin Zhang, Guandong Xu |
| 2016 | Learning for Traffic State Estimation on Large Scale of Incomplete Data. Yiyang Yao, Yingjie Xia, Zhenyu Shan, Zhengguang Liu |
| 2016 | MVC: A Dataset for View-Invariant Clothing Retrieval and Attribute Prediction. Kuan-Hsien Liu, Ting-Yen Chen, Chu-Song Chen |
| 2016 | Matching User Photos to Online Products with Robust Deep Features. Xi Wang, Zhenfeng Sun, Wenqiang Zhang, Yu Zhou, Yu-Gang Jiang |
| 2016 | Mirroring Facial Expressions: Evidence from Visual Analysis of Dyadic Interactions. Yuchi Huang, Saad M. Khan |
| 2016 | Mouse Activity as an Indicator of Interestingness in Video. Gloria Zen, Paloma de Juan, Yale Song, Alejandro Jaimes |
| 2016 | Multilingual Visual Sentiment Concept Matching. Nikolaos Pappas, Miriam Redi, Mercan Topkara, Brendan Jou, Hongyi Liu, Tao Chen, Shih-Fu Chang |
| 2016 | Multimodal Analysis of User-Generated Content in Support of Social Media Applications. Rajiv Ratn Shah |
| 2016 | Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition. Shiqing Zhang, Shiliang Zhang, Tiejun Huang, Wen Gao |
| 2016 | Multimodal Event Detection and Summarization in Large Scale Image Collections. Manos Schinas, Symeon Papadopoulos, Georgios Petkos, Yiannis Kompatsiaris, Pericles A. Mitkas |
| 2016 | Multimodal Visual Pattern Mining with Convolutional Neural Networks. Hongzhi Li |
| 2016 | New Frontiers of Large Scale Multimedia Information Retrieval. Shih-Fu Chang |
| 2016 | Object-aware Deep Network for Commodity Image Retrieval. Zhiwei Fang, Jing Liu, Yuhang Wang, Yong Li, Hang Song, Jinhui Tang, Hanqing Lu |
| 2016 | On the "Face of Things". Ranran Feng, Balakrishnan Prabhakaran |
| 2016 | On the Effects of Spam Filtering and Incremental Learning for Web-Supervised Visual Concept Classification. Matthias Springstein, Ralph Ewerth |
| 2016 | Personalized Privacy-aware Image Classification. Eleftherios Spyromitros Xioufis, Symeon Papadopoulos, Adrian Popescu, Yiannis Kompatsiaris |
| 2016 | Personalized Retrieval and Browsing of Classical Music and Supporting Multimedia Material. Marko Tkalcic, Markus Schedl, Cynthia C. S. Liem, Mark S. Melenhorst |
| 2016 | Pooling Objects for Recognizing Scenes without Examples. Svetlana Kordumova, Thomas Mensink, Cees G. M. Snoek |
| 2016 | Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, ICMR 2016, New York, New York, USA, June 6-9, 2016. John R. Kender, John R. Smith, Jiebo Luo, Susanne Boll, Winston H. Hsu |
| 2016 | Rank Diffusion for Context-Based Image Retrieval. Daniel Carlos Guimarães Pedronette, Ricardo da Silva Torres |
| 2016 | Recurrent Support Vector Machines for Audio-Based Multimedia Event Detection. Yun Wang, Florian Metze |
| 2016 | Region Trajectories for Video Semantic Concept Detection. Yuancheng Ye, Xuejian Rong, Xiaodong Yang, Yingli Tian |
| 2016 | Regional Subspace Projection Coding for Image Retrieval. Mingmin Zhen, Wenmin Wang, Ronggang Wang |
| 2016 | Retrieval of Multimedia Objects by Fusing Multiple Modalities. Ilias Gialampoukidis, Anastasia Moumtzidou, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2016 | SSD Technology Enables Dynamic Maintenance of Persistent High-Dimensional Indexes. Björn Þór Jónsson, Laurent Amsaleg, Herwig Lejsek |
| 2016 | Scaling Group Testing Similarity Search. Ahmet Iscen, Laurent Amsaleg, Teddy Furon |
| 2016 | Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features. Lorenzo Baraldi, Costantino Grana, Rita Cucchiara |
| 2016 | Searching for Audio by Sketching Mental Images of Sound: A Brave New Idea for Audio Retrieval in Creative Music Production. Peter Knees, Kristina Andersen |
| 2016 | Semantic Binary Codes. Sravanthi Bondugula, Larry S. Davis |
| 2016 | Semi-supervised Identification of Rarely Appearing Persons in Video by Correcting Weak Labels. Eric Müller, Christian Otto, Ralph Ewerth |
| 2016 | SentiCart: Cartography and Geo-contextualization for Multilingual Visual Sentiment. Brendan Jou, Margaret Yuying Qian, Shih-Fu Chang |
| 2016 | Sequential Correspondence Hierarchical Dirichlet Processes for Video Data Analysis. Jianfei Xue, Koji Eguchi |
| 2016 | Serendipity-driven Celebrity Video Hyperlinking. Shujun Yang, Lei Pang, Chong-Wah Ngo, Benoit Huet |
| 2016 | Situation Recognition from Multimodal Data. Vivek K. Singh, Siripen Pongpaichet, Ramesh C. Jain |
| 2016 | Spatially Localized Visual Dictionary Learning. Valentin Leveau, Alexis Joly, Olivier Buisson, Patrick Valduriez |
| 2016 | The ImageNet Shuffle: Reorganized Pre-training for Video Event Detection. Pascal Mettes, Dennis C. Koelma, Cees G. M. Snoek |
| 2016 | The LFM-1b Dataset for Music Retrieval and Recommendation. Markus Schedl |
| 2016 | The Science and Detection of Tilting. Xingjie Wei, Jussi Palomäki, Jeff Yan, Peter Robinson |
| 2016 | The Social Picture. Sebastiano Battiato, Giovanni Maria Farinella, Filippo Luigi Maria Milotta, Alessandro Ortis, Luca Addesso, Antonino Casella, Valeria D'Amico, Giovanni Torrisi |
| 2016 | Using Photos as Micro-Reports of Events. Siripen Pongpaichet, Mengfan Tang, Laleh Jalali, Ramesh C. Jain |
| 2016 | Video Description Generation using Audio and Visual Cues. Qin Jin, Junwei Liang |
| 2016 | Video Emotion Recognition with Transferred Deep Feature Encodings. Baohan Xu, Yanwei Fu, Yu-Gang Jiang, Boyang Li, Leonid Sigal |
| 2016 | Vinereactor: Crowdsourced Spontaneous Facial Expression Data. Edward Kim, Shruthika Vangala |
| 2016 | Watching What and How Politicians Discuss Various Topics: A Large-Scale Video Analytics UI. Emily Song, Joseph G. Ellis, Hongzhi Li, Shih-Fu Chang |
| 2016 | Web Video Popularity Prediction using Sentiment and Content Visual Features. Giulia Fontanini, Marco Bertini, Alberto Del Bimbo |
| 2016 | Xplore-M-Ego: Contextual Media Retrieval Using Natural Language Queries. Sreyasi Nag Chowdhury, Mateusz Malinowski, Andreas Bulling, Mario Fritz |