| 2016 | A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation. Lorenzo Baraldi, Costantino Grana, Alberto Messina, Rita Cucchiara |
| 2016 | A Compact Binary Aggregated Descriptor via Dual Selection for Visual Search. Yuwei Wu, Zhe Wang, Junsong Yuan, Ling-Yu Duan |
| 2016 | A Deeply-Supervised Deconvolutional Network for Horizon Line Detection. Lorenzo Porzi, Samuel Rota Bulò, Elisa Ricci |
| 2016 | A Digital World to Thrive In: How the Internet of Things Can Make the "Invisible Hand" Work. Dirk Helbing |
| 2016 | A Discriminative and Compact Audio Representation for Event Detection. Liping Jing, Bo Liu, Jaeyoung Choi, Adam Janin, Julia Bernd, Michael W. Mahoney, Gerald Friedland |
| 2016 | A Domain Robust Approach For Image Dataset Construction. Yazhou Yao, Xian-Sheng Hua, Fumin Shen, Jian Zhang, Zhenmin Tang |
| 2016 | A Fast 3D Retrieval Algorithm via Class-Statistic and Pair-Constraint Model. Zan Gao, Deyu Wang, Hua Zhang, Yanbing Xue, Guangping Xu |
| 2016 | A Fast Cattle Recognition System using Smart devices. Santosh Kumar, Sanjay Kumar Singh, Tanima Dutta, Hari Prabhat Gupta |
| 2016 | A Live Face Swapper. Shengtao Xiao, Luoqi Liu, Xuecheng Nie, Jiashi Feng, Ashraf A. Kassim, Shuicheng Yan |
| 2016 | A Multi-Video Browser for Endoscopic Videos on Tablets. Marco A. Hudelist, Sabrina Kletz, Klaus Schoeffmann |
| 2016 | A Multimodal Gamified Platform for Real-Time User Feedback in Sports Performance. David S. Monaghan, Freddie Honohan, Amin Ahmadi, Troy McDaniel, Ramin Tadayon, Ajay Karpur, Kieran Moran, Noel E. O'Connor, Sethuraman Panchanathan |
| 2016 | A New Tool for Collaborative Video Search via Content-based Retrieval and Visual Inspection. Wolfgang Hürst, Algernon Ip Vai Ching, Marco A. Hudelist, Manfred Jürgen Primus, Klaus Schoeffmann, Christian Beecks |
| 2016 | A Novel Shadow-Free Feature Extractor for Real-Time Road Detection. Zhenqiang Ying, Ge Li, Xianghao Zang, Ronggang Wang, Wenmin Wang |
| 2016 | A Perceptual Quality Metric for Videos Distorted by Spatially Correlated Noise. Chao Chen, Mohammad Izadi, Anil C. Kokaram |
| 2016 | A Platform for Building New Human-Computer Interface Systems that Support Online Automatic Recognition of Audio-Gestural Commands. Nikolaos Kardaris, Isidoros Rodomagoulakis, Vassilis Pitsikalis, Antonis Arvanitakis, Petros Maragos |
| 2016 | A Pragmatically Designed Adaptive and Web-compliant Object-based Video Streaming Methodology: Implementation and Subjective Evaluation. Maarten Wijnants, Gustavo Rovelo, Peter Quax, Wim Lamotte |
| 2016 | A Robust Distance with Correlated Metric Learning for Multi-Instance Multi-Label Data. Yashaswi Verma, C. V. Jawahar |
| 2016 | A Supervised Approach for Text Illustration. Harsh Jhamtani, Shubham Varma, Midhun Gundapuneni, Siddhartha Kumar Dutta |
| 2016 | A Tablet Annotation Tool for Endoscopic Videos. Marco A. Hudelist, Sabrina Kletz, Klaus Schoeffmann |
| 2016 | AKSDA-MSVM: A GPU-accelerated Multiclass Learning Framework for Multimedia. Stavros Arestis-Chartampilas, Nikolaos Gkalelis, Vasileios Mezaris |
| 2016 | Abnormal Event Discovery in User Generated Photos. Xiaoshan Yang, Tianzhu Zhang, Changsheng Xu |
| 2016 | Academic Coupled Dictionary Learning for Sketch-based Image Retrieval. Dan Xu, Xavier Alameda-Pineda, Jingkuan Song, Elisa Ricci, Nicu Sebe |
| 2016 | Accelerating Convolutional Neural Networks for Mobile Applications. Peisong Wang, Jian Cheng |
| 2016 | Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks. Pichao Wang, Zhaoyang Li, Yonghong Hou, Wanqing Li |
| 2016 | Action Recognition Using Local Consistent Group Sparse Coding with Spatio-Temporal Structure. Yi Tian, Qiuqi Ruan, Gaoyun An, Yun Fu |
| 2016 | Ad Recommendation for Sponsored Search Engine via Composite Long-Short Term Memory. Dejiang Kong, Fei Wu, Siliang Tang, Yueting Zhuang |
| 2016 | Adaptation of Word Vectors using Tree Structure for Visual Semantics. Nakamasa Inoue, Koichi Shinoda |
| 2016 | Adaptive Bitrate Selection for Video Encoding with Reduced Block Artifacts. Min-Kook Choi, Hyun-Gyu Lee, Minseok Song, Sang-Chul Lee |
| 2016 | Adaptive Visual Feedback Generation for Facial Expression Improvement with Multi-task Deep Neural Networks. Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino |
| 2016 | Affective Contextual Mobile Recommender System. Chao Wu, Jia Jia, Wenwu Zhu, Xu Chen, Bowen Yang, Yaoxue Zhang |
| 2016 | Alone versus In-a-group: A Comparative Analysis of Facial Affect Recognition. Wenxuan Mou, Hatice Gunes, Ioannis Patras |
| 2016 | AltMM 2016: 1st International Workshop on Multimedia Alternate Realities. Teresa Chambel, Rene Kaiser, Omar Niamut, Wei Tsang Ooi, Judith A. Redi |
| 2016 | An Intention-Aware Interactive System for Mobile Video Browsing. Jun He, Hanwang Zhang, Ling Shen, Richang Hong, Tat-Seng Chua |
| 2016 | Analyzing Structural Characteristics of Object Category Representations From Their Semantic-part Distributions. Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu |
| 2016 | Analyzing and Predicting GIF Interestingness. Michael Gygli, Mohammad Soleymani |
| 2016 | AntiLoiter: A Loitering Discovery System for Longtime Videos across Multiple Surveillance Cameras. Jianquan Liu, Shoji Nishimura, Takuya Araki |
| 2016 | Application-Layer Rate-Adaptive Multicast Video Streaming over 802.11 for Mobile Devices. Raheeb Muzaffar, Evsen Yanmaz, Christian Bettstetter, Andrea Cavallaro |
| 2016 | Are Safer Looking Neighborhoods More Lively?: A Multimodal Investigation into Urban Life. Marco De Nadai, Radu-Laurentiu Vieriu, Gloria Zen, Stefan Dragicevic, Nikhil Naik, Michele Caraviello, César Augusto Hidalgo, Nicu Sebe, Bruno Lepri |
| 2016 | Artist-based Classification via Deep Learning with Multi-scale Weighted Pooling. Kevin Alfianto Jangtjik, Mei-Chen Yeh, Kai-Lung Hua |
| 2016 | Assessing 3D Scan Quality Through Paired-comparisons Psychophysics. Jacob Thorn, Rodrigo Pizarro, Bernhard Spanlang, Pablo Bermell-Garcia, Mar González-Franco |
| 2016 | Attention-based LSTM with Semantic Consistency for Videos Captioning. Zhao Guo, Lianli Gao, Jingkuan Song, Xing Xu, Jie Shao, Heng Tao Shen |
| 2016 | Audio Event Detection using Weakly Labeled Data. Anurag Kumar, Bhiksha Raj |
| 2016 | Automatic Music Video Generation Based on Emotion-Oriented Pseudo Song Prediction and Matching. Jen-Chun Lin, Wen-Li Wei, Hsin-Min Wang |
| 2016 | Automatic Reflection Removal using Gradient Intensity and Motion Cues. Chao Sun, Shuaicheng Liu, Taotao Yang, Bing Zeng, Zhengning Wang, Guanghui Liu |
| 2016 | Barrista: Caffe Well-Served. Christoph Lassner, Daniel Kappler, Martin Kiefel, Peter V. Gehler |
| 2016 | Beauty eMakeup: A Deep Makeup Transfer System. Xinyu Ou, Si Liu, Xiaochun Cao, Hefei Ling |
| 2016 | Bidirectional Long-Short Term Memory for Video Description. Yi Bin, Yang Yang, Fumin Shen, Xing Xu, Heng Tao Shen |
| 2016 | Binary Optimized Hashing. Qi Dai, Jianguo Li, Jingdong Wang, Yu-Gang Jiang |
| 2016 | Boosting Video Description Generation by Explicitly Translating from Frame-Level Captions. Yuan Liu, Zhongchao Shi |
| 2016 | CNN vs. SIFT for Image Retrieval: Alternative or Complementary? Ke Yan, Yaowei Wang, Dawei Liang, Tiejun Huang, Yonghong Tian |
| 2016 | CNNdroid: GPU-Accelerated Execution of Trained Deep Convolutional Neural Networks on Android. Seyyed Salar Latifi Oskouei, Hossein Golestani, Matin Hashemi, Soheil Ghiasi |
| 2016 | Capped Lp-Norm Graph Embedding for Photo Clustering. Mengfan Tang, Feiping Nie, Ramesh C. Jain |
| 2016 | ConTagNet: Exploiting User Context for Image Tag Recommendation. Yogesh Singh Rawat, Mohan S. Kankanhalli |
| 2016 | Context-aware Geometric Object Reconstruction for Mobile Education. Jinxin Zheng, Yongtao Wang, Zhi Tang |
| 2016 | Context-aware Image Tweet Modelling and Recommendation. Tao Chen, Xiangnan He, Min-Yen Kan |
| 2016 | Contextual Enrichment of Remote-Sensed Events with Social Media Streams. Benjamin Bischke, Damian Borth, Christian Schulze, Andreas Dengel |
| 2016 | Cross-batch Reference Learning for Deep Classification and Retrieval. Huei-Fang Yang, Kevin Lin, Chu-Song Chen |
| 2016 | Cross-modal Retrieval by Real Label Partial Least Squares. Jianfeng He, Bingpeng Ma, Shuhui Wang, Yugui Liu, Qingming Huang |
| 2016 | Cross-modal Retrieval with Label Completion. Xing Xu, Fumin Shen, Yang Yang, Heng Tao Shen, Li He, Jingkuan Song |
| 2016 | CrowdNet: A Deep Convolutional Network for Dense Crowd Counting. Lokesh Boominathan, Srinivas S. S. Kruthiventi, R. Venkatesh Babu |
| 2016 | Crowdsourcing Biodiversity Monitoring: How Sharing your Photo Stream can Sustain our Planet. Alexis Joly, Hervé Goëau, Julien Champ, Samuel Dufour-Kowalski, Henning Müller, Pierre Bonnet |
| 2016 | DASH2M: Exploring HTTP/2 for Internet Streaming to Mobile Devices. Mengbai Xiao, Viswanathan Swaminathan, Sheng Wei, Songqing Chen |
| 2016 | DRIVING: Distributed Scheduling for Video Streaming in Vehicular Wi-Fi Systems. Xi Chen, Lei Rao, Qiao Xiang, Xue Liu, Fan Bai |
| 2016 | Data Aesthetics: The Ethics and Aesthetics of Big Data Gathering seen from the Artists Eye. Lucas Evers, Frank Nack |
| 2016 | Deep Bi-directional Cross-triplet Embedding for Cross-Domain Clothing Retrieval. Shuhui Jiang, Yue Wu, Yun Fu |
| 2016 | Deep CTR Prediction in Display Advertising. Junxuan Chen, Baigui Sun, Hao Li, Hongtao Lu, Xian-Sheng Hua |
| 2016 | Deep Convolutional Neural Network with Independent Softmax for Large Scale Face Recognition. Yue Wu, Jun Li, Yu Kong, Yun Fu |
| 2016 | Deep Correlation Features for Image Style Classification. Wei-Ta Chu, Yi-Ling Wu |
| 2016 | Deep Cross Residual Learning for Multitask Visual Recognition. Brendan Jou, Shih-Fu Chang |
| 2016 | Deep Learning for Image Memorability Prediction: the Emotional Bias. Yoann Baveye, Romain Cohendet, Matthieu Perreira Da Silva, Patrick Le Callet |
| 2016 | Deep Multi-task Learning with Label Correlation Constraint for Video Concept Detection. Foteini Markatopoulou, Vasileios Mezaris, Ioannis Patras |
| 2016 | Deep Representation for Abnormal Event Detection in Crowded Scenes. Yachuang Feng, Yuan Yuan, Xiaoqiang Lu |
| 2016 | Deep-based Ingredient Recognition for Cooking Recipe Retrieval. Jingjing Chen, Chong-Wah Ngo |
| 2016 | DeepSketch2Image: Deep Convolutional Neural Networks for Partial Sketch Recognition and Image Retrieval. Omar Seddati, Stéphane Dupont, Saïd Mahmoudi |
| 2016 | Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection. Youbao Tang, Xiangqian Wu, Wei Bu |
| 2016 | Demand-adaptive Clothing Image Retrieval Using Hybrid Topic Model. Zhengzhong Zhou, Jingjin Zhou, Liqing Zhang |
| 2016 | Describing Videos using Multi-modal Fusion. Qin Jin, Jia Chen, Shizhe Chen, Yifan Xiong, Alexander G. Hauptmann |
| 2016 | Detecting Arbitrary Oriented Text in the Wild with a Visual Attention Model. Wenyi Huang, Dafang He, Xiao Yang, Zihan Zhou, Daniel Kifer, C. Lee Giles |
| 2016 | Detecting Sarcasm in Multimodal Social Platforms. Rossano Schifanella, Paloma de Juan, Joel R. Tetreault, Liangliang Cao |
| 2016 | Detecting Violence in Video using Subclasses. Xirong Li, Yujia Huo, Qin Jin, Jieping Xu |
| 2016 | Dictionary Learning Based Hashing for Cross-Modal Retrieval. Xin-Shun Xu |
| 2016 | Discriminative Paired Dictionary Learning for Visual Recognition. Hui-Hung Wang, Yi-ling Chen, Chen-Kuo Chiang |
| 2016 | Do Textual Descriptions Help Action Recognition? Matteo Bruni, Tiberio Uricchio, Lorenzo Seidenari, Alberto Del Bimbo |
| 2016 | Dynamic Resource Provisioning with QoS Guarantee for Video Transcoding in Online Video Sharing Service. Guanyu Gao, Yonggang Wen, Cédric Westphal |
| 2016 | Early Embedding and Late Reranking for Video Captioning. Jianfeng Dong, Xirong Li, Weiyu Lan, Yujia Huo, Cees G. M. Snoek |
| 2016 | Efficient Digital Holographic Image Reconstruction on Mobile Devices. Chung-Hua Chu |
| 2016 | Efficient Mobile Implementation of A CNN-based Object Recognition System. Keiji Yanai, Ryosuke Tanno, Koichi Okamoto |
| 2016 | Emerging Topics in Learning from Noisy and Missing Data. Xavier Alameda-Pineda, Timothy M. Hospedales, Elisa Ricci, Nicu Sebe, Xiaogang Wang |
| 2016 | Emotion in Context: Deep Semantic Feature Fusion for Video Emotion Recognition. Chen Chen, Zuxuan Wu, Yu-Gang Jiang |
| 2016 | Enabling My Robot To Play Pictionary: Recurrent Neural Networks For Sketch Recognition. Ravi Kiran Sarvadevabhatla, Jogendra Kundu, R. Venkatesh Babu |
| 2016 | Ensemble of Sparse Cross-Modal Metrics for Heterogeneous Face Recognition. Jing Huo, Yang Gao, Yinghuan Shi, Wanqi Yang, Hujun Yin |
| 2016 | Event Localization in Music Auto-tagging. Jen-Yu Liu, Yi-Hsuan Yang |
| 2016 | Event Specific Multimodal Pattern Mining for Knowledge Base Construction. Hongzhi Li, Joseph G. Ellis, Heng Ji, Shih-Fu Chang |
| 2016 | Experience Individualization on Online TV Platforms through Persona-based Account Decomposition. Payal Bajaj, Sumit Shekhar |
| 2016 | Exploiting Hierarchical Activations of Neural Network for Image Retrieval. Ying Li, Xiangwei Kong, Liang Zheng, Qi Tian |
| 2016 | Exploiting Objects with LSTMs for Video Categorization. Yongqing Sun, Zuxuan Wu, Xi Wang, Hiroyuki Arai, Tetsuya Kinebuchi, Yu-Gang Jiang |
| 2016 | Exploration of Large Image Corpuses in Virtual Reality. Sanket Khanwalkar, Shonali Balakrishna, Ramesh C. Jain |
| 2016 | Face Recognition via Active Annotation and Learning. Hao Ye, Weiyuan Shao, Hong Wang, Jianqi Ma, Li Wang, Yingbin Zheng, Xiangyang Xue |
| 2016 | Facial Age Estimation Using Robust Label Distribution. Ke Chen, Joni-Kristian Kämäräinen, Zhaoxiang Zhang |
| 2016 | Facial Expression Recognition with Deep two-view Support Vector Machine. Chongliang Wu, Shangfei Wang, Bowen Pan, Huaping Chen |
| 2016 | Families in the Wild (FIW): Large-Scale Kinship Image Database and Benchmarks. Joseph P. Robinson, Ming Shao, Yue Wu, Yun Fu |
| 2016 | Fast Supervised LDA for Discovering Micro-Events in Large-Scale Video Datasets. Angelos Katharopoulos, Despoina Paschalidou, Christos Diou, Anastasios Delopoulos |
| 2016 | First Person View Video Summarization Subject to the User Needs. Ana Garcia del Molino |
| 2016 | First-Person Shooter Game for Virtual Reality Headset with Advanced Multi-Agent Intelligent System. Ilya Makarov, Mikhail Tokmakov, Pavel Polyakov, Peter Zyuzin, Maxim Martynov, Oleg Konoplya, George Kuznetsov, Ivan Guschenko-Cheverda, Maxim Uriev, Ivan Mokeev, Olga Gerasimova, Lada Tokmakova, Alexey Kosmachev |
| 2016 | Frame Untangling for Unobtrusive Display-Camera Visible Light Communication. Xiao Shu, Xiaolin Wu |
| 2016 | Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation. Rakshith Shetty, Jorma Laaksonen |
| 2016 | From Seed Discovery to Deep Reconstruction: Predicting Saliency in Crowd via Deep Networks. Yanhao Zhang, Lei Qin, Qingming Huang, Kuiyuan Yang, Jun Zhang, Hongxun Yao |
| 2016 | Frustratingly Easy Cross-Modal Hashing. Dekui Ma, Jian Liang, Xiangwei Kong, Ran He |
| 2016 | Generating Affective Captions using Concept And Syntax Transition Networks. Tushar Karayil, Philipp Blandfort, Damian Borth, Andreas Dengel |
| 2016 | Generating Diverse Image Datasets with Limited Labeling. Niluthpol Chowdhury Mithun, Rameswar Panda, Amit K. Roy-Chowdhury |
| 2016 | GeoTracks: Adaptive Music for Everyday Journeys. Chris Greenhalgh, Adrian Hazzard, Sean McGrath, Steve Benford |
| 2016 | Geospatial Multimedia Data for Situation Recognition. Mengfan Tang |
| 2016 | Global Consistent Shape Correspondence for Efficient and Effective Active Shape Models. Meng Wang, Yi Fang |
| 2016 | HEVC-compliant Tile-based Streaming of Panoramic Video for Virtual Reality Applications. Alireza Zare, Alireza Aminlou, Miska M. Hannuksela, Moncef Gabbouj |
| 2016 | High-speed Depth Stream Generation from a Hybrid Camera. Xinxin Zuo, Sen Wang, Jiangbin Zheng, Ruigang Yang |
| 2016 | History Rhyme: Searching Historic Events by Multimedia Knowledge. Yifan Xiong, Jia Chen, Qin Jin, Chao Zhang |
| 2016 | How Cosmopolitan Are Emojis?: Exploring Emojis Usage and Meaning over Different Languages with Distributional Semantics. Francesco Barbieri, Germán Kruszewski, Francesco Ronzano, Horacio Saggion |
| 2016 | Human Pose Estimation from Depth Images via Inference Embedded Multi-task Learning. Keze Wang, Shengfu Zhai, Hui Cheng, Xiaodan Liang, Liang Lin |
| 2016 | Hypervideo Production Using Crowdsourced Youtube Videos. Stefan John, Christian Handschigl, Britta Meixner, Michael Granitzer |
| 2016 | INRS Audiovisual Quality Dataset. Edip Demirbilek, Jean-Charles Grégoire |
| 2016 | Image Captioning with Deep Bidirectional LSTMs. Cheng Wang, Haojin Yang, Christian Bartz, Christoph Meinel |
| 2016 | Image Captioning with both Object and Scene Information. Xiangyang Li, Xinhang Song, Luis Herranz, Yaohui Zhu, Shuqiang Jiang |
| 2016 | Image Emotion Computing. Sicheng Zhao |
| 2016 | Image2Text: A Multimodal Image Captioner. Chang Liu, Changhu Wang, Fuchun Sun, Yong Rui |
| 2016 | Improved Dense Trajectory with Cross Streams. Katsunori Ohnishi, Masatoshi Hidaka, Tatsuya Harada |
| 2016 | Improving Speaker Diarization of TV Series using Talking-Face Detection and Clustering. Hervé Bredin, Gregory Gelly |
| 2016 | InnerView: Learning Place Ambiance from Social Media Images. Darshan Santani, Rui Hu, Daniel Gatica-Perez |
| 2016 | Intelli-Wrench: Smart Navigation Tool for Mechanical Assembly and Maintenance. Toru Takahashi, Yuta Kudo, Rui Ishiyama |
| 2016 | Interactive Image Search for Clothing Recommendation. Zhengzhong Zhou, Yifei Xu, Jingjin Zhou, Liqing Zhang |
| 2016 | Jockey Time: Making Video Playback to Enhance Emotional Effect. Kyeong-Ah Jeong, Hyeon-Jeong Suk |
| 2016 | Joint Graph Learning and Video Segmentation via Multiple Cues and Topology Calibration. Jingkuan Song, Lianli Gao, Mihai Marian Puscas, Feiping Nie, Fumin Shen, Nicu Sebe |
| 2016 | Joint Image and Text Representation for Aesthetics Analysis. Ye Zhou, Xin Lu, Junping Zhang, James Z. Wang |
| 2016 | Joint Image-Text Representation by Gaussian Visual-Semantic Embedding. Zhou Ren, Hailin Jin, Zhe Lin, Chen Fang, Alan L. Yuille |
| 2016 | Key Color Generation for Affective Multimedia Production: An Initial Method and Its Application. Eunjin Kim, Hyeon-Jeong Suk |
| 2016 | Kurento: The WebRTC Modular Media Server. Luis López-Fernández, Miguel Paris Diaz, Santiago Carot, Boni García, Micael Gallego, Francisco Gortázar, Raul Benitez Mejias, Jose A. Santos, David Fernández, Radu Tom Vlad, Iván Gracia, Francisco Javier Lopez |
| 2016 | Kvazaar: Open-Source HEVC/H.265 Encoder. Marko Viitanen, Ari Koivula, Ari Lemmetti, Arttu Ylä-Outinen, Jarno Vanne, Timo D. Hämäläinen |
| 2016 | LIME: A Method for Low-light IMage Enhancement. Xiaojie Guo |
| 2016 | LSOD: Local Sparse Orthogonal Descriptor for Image Matching. Yiru Zhao, Yaoyi Li, Zhiwen Shao, Hongtao Lu |
| 2016 | LTA 2016: The First Workshop on Lifelogging Tools and Applications. Cathal Gurrin, Xavier Giró-i-Nieto, Petia Radeva, Mariella Dimiccoli, Håvard D. Johansen, Hideo Joho, Vivek K. Singh |
| 2016 | Label Tree Embeddings for Acoustic Scene Classification. Huy Phan, Lars Hertel, Marco Maaß, Philipp Koch, Alfred Mertins |
| 2016 | Learning Multimodal Temporal Representation for Dubbing Detection in Broadcast Media. Nam Do-Hoang Le, Jean-Marc Odobez |
| 2016 | Learning Music Emotion Primitives via Supervised Dynamic Clustering. Yang Liu, Yan Liu, Xiang Zhang, Gong Chen, Kejun Zhang |
| 2016 | Learning a Multi-class Discriminative Dictionary with Nonredundancy Constraints for Visual Classification. Zhao Liu, Yuwei Wu, Junsong Yuan, Yap-Peng Tan |
| 2016 | Learning to Make Better Mistakes: Semantics-aware Visual Food Recognition. Hui Wu, Michele Merler, Rosario Uceda-Sosa, John R. Smith |
| 2016 | Leveraging Contextual Cues for Generating Basketball Highlights. Vinay Bettadapura, Caroline Pantofaru, Irfan A. Essa |
| 2016 | Leveraging ICN for Secure Content Distribution in IP Networks. Syed Obaid Amin, Qingji Zheng, Ravishankar Ravindran, Guoqiang Wang |
| 2016 | LightNet: A Versatile, Standalone Matlab-based Environment for Deep Learning. Chengxi Ye, Chen Zhao, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos |
| 2016 | Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing. Min Wang, Wengang Zhou, Qi Tian, Zheng-Jun Zha, Houqiang Li |
| 2016 | Local Diffusion Map Signature for Symmetry-aware Non-rigid Shape Correspondence. Meng Wang, Yi Fang |
| 2016 | Locality-preserving K-SVD Based Joint Dictionary and Classifier Learning for Object Recognition. Yuan-Shan Lee, Chien-Yao Wang, Seksan Mathulaprangsan, Jia Hao Zhao, Jia-Ching Wang |
| 2016 | Location-Independent WiFi Action Recognition via Vision-based Methods. Jen-Yin Chang, Kuan-Ying Lee, Yu-Lin Wei, Kate Ching-Ju Lin, Winston H. Hsu |
| 2016 | Looking Good With Flickr Faves: Gaussian Processes for Finding Difference Makers in Personality Impressions. Xiaoyu Xiong, Maurizio Filippone, Alessandro Vinciarelli |
| 2016 | MARIM: Mobile Augmented Reality for Interactive Manuals. Tam V. Nguyen, Dorothy Tan, Bilal Mirza, Jose Sepulveda |
| 2016 | MP3DG-PCC, Open Source Software Framework for Implementation and Evaluation of Point Cloud Compression. Rufael Mekuria, Pablo César |
| 2016 | Magic Mirror: A Virtual Fashion Consultant. Yejun Liu, Jia Jia, Jingtian Fu, Yihui Ma, Jie Huang, Zijian Tong |
| 2016 | MatchDR: Image Correspondence by Leveraging Distance Ratio Constraint. Rui Wang, Dong Liang, Wei Zhang, Xiaochun Cao |
| 2016 | Mental Visual Indexing: Towards Fast Video Browsing. Richang Hong, Jun He, Hanwang Zhang, Tat-Seng Chua |
| 2016 | Micro Tells Macro: Predicting the Popularity of Micro-Videos via a Transductive Model. Jingyuan Chen, Xuemeng Song, Liqiang Nie, Xiang Wang, Hanwang Zhang, Tat-Seng Chua |
| 2016 | Micro-Expression Recognition with Expression-State Constrained Spatio-Temporal Feature Representations. Dae Hoe Kim, Wissam J. Baddar, Yong Man Ro |
| 2016 | Modular Parallelization Framework for Multi-Stream Video Processing. Tim Lenertz, Gauthier Lafruit |
| 2016 | Morph: A Fast and Scalable Cloud Transcoding System. Guanyu Gao, Yonggang Wen |
| 2016 | Motion Segmentation using Visual and Bio-mechanical Features. Stefano Alletto, Giuseppe Serra, Rita Cucchiara |
| 2016 | Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection. Jingyuan Chen |
| 2016 | Multi-Protocol Video Delivery with Late Trans-Muxing. Rufael Mekuria, Jelte Fennema, Dirk Griffioen |
| 2016 | Multi-Scale Triplet CNN for Person Re-Identification. Jiawei Liu, Zheng-Jun Zha, Qi Tian, Dong Liu, Ting Yao, Qiang Ling, Tao Mei |
| 2016 | Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification. Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue |
| 2016 | Multi-modal Conditional Attention Fusion for Dimensional Emotion Prediction. Shizhe Chen, Qin Jin |
| 2016 | Multi-modal Multi-view Topic-opinion Mining for Social Event Analysis. Shengsheng Qian, Tianzhu Zhang, Changsheng Xu |
| 2016 | Multi-pose Facial Expression Recognition Using Transformed Dirichlet Process. Feifei Zhang, Qirong Mao, Ming Dong, Yongzhao Zhan |
| 2016 | Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification. Xiaodong Yang, Pavlo Molchanov, Jan Kautz |
| 2016 | Multimedia COMMONS Workshop 2016 (MMCommons 2016): Datasets, Evaluation, and Reproducibility. Bart Thomee, Damian Borth, Julia Bernd |
| 2016 | Multimedia Privacy. Gerald Friedland, Symeon Papadopoulos, Julia Bernd, Yiannis Kompatsiaris |
| 2016 | Multimedia and Medicine: Teammates for Better Disease Detection and Survival. Michael Riegler, Mathias Lux, Carsten Griwodz, Concetto Spampinato, Thomas de Lange, Sigrun Losada Eskeland, Konstantin Pogorelov, Wallapak Tavanapong, Peter Thelin Schmidt, Cathal Gurrin, Dag Johansen, Håvard D. Johansen, Pål Halvorsen |
| 2016 | Multimedia for personal health and health care. Susanne Boll, Kiyoharu Aizawa, Alexia Briassouli, Cathal Gurrin, Laleh Jalali, Jochen Meyer |
| 2016 | Multimedia on the Mountaintop: Using Public Snow Images to Improve Water Systems Operation. Andrea Castelletti, Roman Fedorov, Piero Fraternali, Matteo Giuliani |
| 2016 | Multimodal Interest Level Estimation via Variational Bayesian Mixture of Robust CCA. Yuma Sasaka, Takahiro Ogawa, Miki Haseyama |
| 2016 | Multimodal Learning via Exploring Deep Semantic Similarity. Di Hu, Xiaoqiang Lu, Xuelong Li |
| 2016 | Multimodal Popularity Prediction of Brand-related Social Media Posts. Masoud Mazloom, Robert Rietveld, Stevan Rudinac, Marcel Worring, Willemijn van Dolen |
| 2016 | Multimodal Video Description. Vasili Ramanishka, Abir Das, Dong Huk Park, Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Kate Saenko |
| 2016 | Multimodal-based Multimedia Analysis, Retrieval, and Services in Support of Social Media Applications. Rajiv Ratn Shah |
| 2016 | Multiview Video Super-Resolution via Information Extraction and Merging. Yawei Li, Xiaofeng Li, Zhizhong Fu, Wenli Zhong |
| 2016 | Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search. Botong Wu, Yizhou Wang |
| 2016 | News Archive Exploration Combining Face Detection and Tracking with Network Visual Analytics. Benjamin Renoust, Thanh Duc Ngo, Duy-Dinh Le, Shin'ichi Satoh |
| 2016 | News Program Detection in TV Broadcast Videos. Raghvendra Kannao, Durgaprasad Dandi, Swamy Yellapu, Prithwijit Guha |
| 2016 | Novel Word Embedding and Translation-based Language Modeling for Extractive Speech Summarization. Kuan-Yu Chen, Shih-Hung Liu, Berlin Chen, Hsin-Min Wang, Hsin-Hsi Chen |
| 2016 | Objectness-aware Semantic Segmentation. Yuhang Wang, Jing Liu, Yong Li, Junjie Yan, Hanqing Lu |
| 2016 | On Estimating Air Pollution from Photos Using Convolutional Neural Network. Chao Zhang, Junchi Yan, Changsheng Li, Xiaoguang Rui, Liang Liu, Rongfang Bie |
| 2016 | One Sensor is not Enough: Adapting and Fusing Sensors for the Quality Assessment of User Generated Video. Stefan Wilk, Manisha Luthra, Wolfgang Effelsberg |
| 2016 | Online Weighted Clustering for Real-time Abnormal Event Detection in Video Surveillance. Hanhe Lin, Jeremiah D. Deng, Brendon J. Woodford, Ahmad Shahi |
| 2016 | OpenVQ: A Video Quality Assessment Toolkit. Kristian Skarseth, Henrik Bjørlo, Pål Halvorsen, Michael Riegler, Carsten Griwodz |
| 2016 | Overview of the ACM MultiMedia 2016 International Workshop on Multimedia Assisted Dietary Management. Stavroula G. Mougiakakou, Giovanni Maria Farinella, Keiji Yanai |
| 2016 | PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval. Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian |
| 2016 | Parsimonious Mixed-Effects HodgeRank for Crowdsourced Preference Aggregation. Qianqian Xu, Jiechao Xiong, Xiaochun Cao, Yuan Yao |
| 2016 | Partial Multi-Modal Sparse Coding via Adaptive Similarity Structure Regularization. Zhou Zhao, Hanqing Lu, Deng Cai, Xiaofei He, Yueting Zhuang |
| 2016 | Patterns of Free-form Curation: Visual Thinking with Web Content. Nic Lupfer, Andruid Kerne, Andrew M. Webb, Rhema Linder |
| 2016 | Performance Measurements of Virtual Reality Systems: Quantifying the Timing and Positioning Accuracy. Chun-Ming Chang, Cheng-Hsin Hsu, Chih-Fan Hsu, Kuan-Ta Chen |
| 2016 | Personal Multi-view Viewpoint Recommendation based on Trajectory Distribution of the Viewing Target. Xueting Wang, Kensho Hara, Yu Enokibori, Takatsugu Hirayama, Kenji Mase |
| 2016 | Placing Broadcast News Videos in their Social Media Context Using Hashtags. Joseph G. Ellis, Svebor Karaman, Hongzhi Li, Hong Bin Shim, Shih-Fu Chang |
| 2016 | Play and Rewind: Optimizing Binary Representations of Videos by Self-Supervised Temporal Hashing. Hanwang Zhang, Meng Wang, Richang Hong, Tat-Seng Chua |
| 2016 | PlaylistCreator: An Assisted Approach for Playlist Creation. Ricardo Dias, Daniel Gonçalves, Manuel J. Fonseca |
| 2016 | Predicting Personalized Emotion Perceptions of Social Images. Sicheng Zhao, Hongxun Yao, Yue Gao, Rongrong Ji, Wenlong Xie, Xiaolei Jiang, Tat-Seng Chua |
| 2016 | Predicting and Optimizing Image Compression. Oleksandr Murashko, John Thomson, Hugh Leather |
| 2016 | Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, The Netherlands, October 15-19, 2016. Alan Hanjalic, Cees Snoek, Marcel Worring, Dick C. A. Bulterman, Benoit Huet, Aisling Kelliher, Yiannis Kompatsiaris, Jin Li |
| 2016 | Processing-Aware Privacy-Preserving Photo Sharing over Online Social Networks. Weiwei Sun, Jiantao Zhou, Ran Lyu, Shuyuan Zhu |
| 2016 | Pyo, the Python DSP toolbox. Olivier Bélanger |
| 2016 | QoE Prediction for Enriched Assessment of Individual Video Viewing Experience. Yi Zhu, Alan Hanjalic, Judith A. Redi |
| 2016 | Quartet-net Learning for Visual Instance Retrieval. Jiewei Cao, Zi Huang, Peng Wang, Chao Li, Xiaoshuai Sun, Heng Tao Shen |
| 2016 | Query Adaptive Instance Search using Object Sketches. Sreyasee Das Bhattacharjee, Junsong Yuan, Weixiang Hong, Xiang Ruan |
| 2016 | ReadMe: A Real-Time Recommendation System for Mobile Augmented Reality Ecosystems. Dimitris Chatzopoulos, Pan Hui |
| 2016 | Real-time Wearable Computer Vision System for Improved Museum Experience. Giovanni Taverriti, Stefano Lombini, Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo |
| 2016 | Research Challenges in Developing Multimedia Systems for Managing Emergency Situations. Mengfan Tang, Siripen Pongpaichet, Ramesh C. Jain |
| 2016 | Robust Face Recognition with Deep Multi-View Representation Learning. Jianshu Li, Jian Zhao, Fang Zhao, Hao Liu, Jing Li, Shengmei Shen, Jiashi Feng, Terence Sim |
| 2016 | Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks. Quanzeng You, Liangliang Cao, Hailin Jin, Jiebo Luo |
| 2016 | SDNDASH: Improving QoE of HTTP Adaptive Streaming Using Software Defined Networking. Abdelhak Bentaleb, Ali C. Begen, Roger Zimmermann |
| 2016 | Scalable Compression of Deep Neural Networks. Xing Wang, Jie Liang |
| 2016 | Scalable Multimedia Streaming in Wireless Networks with Device-to-Device Cooperation. Karim Jahed, Sanaa Sharafeddine, Abdallah Moussawi, Abbas Abou Daya, Hassan Dbouk, Saadallah Kassir, Zaher Dawy, Preethi Valsalan, Wael Chérif, Fethi Filali |
| 2016 | Scene Image Synthesis from Natural Sentences Using Hierarchical Syntactic Analysis. Tetsuaki Mano, Hiroaki Yamane, Tatsuya Harada |
| 2016 | SceneTextReg: A Real-Time Video OCR System. Haojin Yang, Cheng Wang, Christian Bartz, Christoph Meinel |
| 2016 | Semantic Description of Timbral Transformations in Music Production. Ryan Stables, Brecht De Man, Sean Enderby, Joshua D. Reiss, György Fazekas, Thomas Wilmering |
| 2016 | Semantic Image Profiling for Historic Events: Linking Images to Phrases. Jia Chen, Qin Jin, Yifan Xiong |
| 2016 | SenseCap: Synchronized Data Collection with Microsoft Kinect2 and LeapMotion. Julian F. P. Kooij |
| 2016 | Sentiment and Emotion Analysis for Social Multimedia: Methodologies and Applications. Quanzeng You |
| 2016 | Server Allocation for Multiplayer Cloud Gaming. Yunhua Deng, Yusen Li, Xueyan Tang, Wentong Cai |
| 2016 | Seventh International Workshop on Human Behavior Understanding (HBU 2016). Mohamed Chetouani, Jeffrey F. Cohn, Albert Ali Salah |
| 2016 | Share-and-Chat: Achieving Human-Level Video Commenting by Search and Multi-View Embedding. Yehao Li, Ting Yao, Tao Mei, Hongyang Chao, Yong Rui |
| 2016 | Shorter-is-Better: Venue Category Estimation from Micro-Video. Jianglong Zhang, Liqiang Nie, Xiang Wang, Xiangnan He, Xianglin Huang, Tat-Seng Chua |
| 2016 | Situation Recognition from Multimodal Data. Vivek K. Singh, Siripen Pongpaichet, Ramesh C. Jain |
| 2016 | Smart Beholder: An Extensible Smart Lens Platform. Chun-Ying Huang, Ching-Ling Fan, Chih-Fan Hsu, Hsin-Yu Chang, Tsung-Han Tsai, Kuan-Ta Chen, Cheng-Hsin Hsu |
| 2016 | Social and Affective Robotics Tutorial. Maja Pantic, Vanessa Evers, Marc Peter Deisenroth, Luis Merino, Björn W. Schuller |
| 2016 | SocialFX: Studying a Crowdsourced Folksonomy of Audio Effects Terms. Taylor Zheng, Prem Seetharaman, Bryan Pardo |
| 2016 | Spatio-Temporal Analysis of Bandwidth Maps for Geo-Predictive Video Streaming in Mobile Environments. Bayan Taani, Roger Zimmermann |
| 2016 | Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition. Jouni Pohjalainen, Fabien Ringeval, Zixing Zhang, Björn W. Schuller |
| 2016 | StressClick: Sensing Stress from Gaze-Click Patterns. Michael Xuelin Huang, Jiajia Li, Grace Ngai, Hong Va Leong |
| 2016 | Summary for AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge. Michel F. Valstar, Jonathan Gratch, Björn W. Schuller, Fabien Ringeval, Roddy Cowie, Maja Pantic |
| 2016 | Super Resolution of the Partial Pixelated Images With Deep Convolutional Neural Network. Haiyi Mao, Yue Wu, Jun Li, Yun Fu |
| 2016 | SuperSelect: An Interactive Superpixel-Based Segmentation Method for Touch Displays. Christoph Korinke, Tim Claudius Stratmann, Tim Laue, Susanne Boll |
| 2016 | SuperStreamer: Enabling Progressive Content Streaming in a Game Engine. Yong Xue Eu, Jermyn Tanu, Justin Jieting Law, Muhammad Hanif B. Ghazali, Shuan Siang Tay, Wei Tsang Ooi, Anand Bhojan |
| 2016 | Supervised Recurrent Hashing for Large Scale Video Retrieval. Yun Gu, Chao Ma, Jie Yang |
| 2016 | SwiDeN: Convolutional Neural Networks For Depiction Invariant Object Recognition. Ravi Kiran Sarvadevabhatla, Shiv Surya, Srinivas S. S. Kruthiventi, R. Venkatesh Babu |
| 2016 | Synchronization among Groups of Spectators for Highlight Detection in Movies. Michal Muszynski, Theodoros Kostoulas, Patrizia Lombardo, Thierry Pun, Guillaume Chanel |
| 2016 | Synthesizing Emerging Images from Photographs. Cheng-Han Yang, Ying-Miao Kuo, Hung-Kuo Chu |
| 2016 | Tamp: A Library for Compact Deep Neural Networks with Structured Matrices. Bingchen Gong, Brendan Jou, Felix X. Yu, Shih-Fu Chang |
| 2016 | Technology & Art in Stimulating Creative Placemaking in Public-Use Spaces. Wendy Ann Mansilla, Andrew Perkis |
| 2016 | The Lifecycle of Geotagged Multimedia Data. Rossano Schifanella, Bart Thomee |
| 2016 | ThePlantGame: Actively Training Human Annotators for Domain-specific Crowdsourcing. Maximilien Servajean, Alexis Joly, Dennis E. Shasha, Julien Champ, Esther Pacitti |
| 2016 | Time Matters: Multi-scale Temporalization of Social Media Popularity. Bo Wu, Wen-Huang Cheng, Yongdong Zhang, Tao Mei |
| 2016 | Towards Ultra-Low-Bitrate Video Conferencing Using Facial Landmarks. Pin-Chun Wang, Ching-Ling Fan, Chun-Ying Huang, Kuan-Ta Chen, Cheng-Hsin Hsu |
| 2016 | Tracking Natural Events through Social Media and Computer Vision. Jingya Wang, Mohammed Korayem, Saúl A. Blanco, David J. Crandall |
| 2016 | Transform-Invariant Convolutional Neural Networks for Image Classification and Search. Xu Shen, Xinmei Tian, Anfeng He, Shaoyan Sun, Dacheng Tao |
| 2016 | Transportation Mode Detection on Mobile Devices Using Recurrent Nets. Toan H. Vu, Le Dung, Jia-Ching Wang |
| 2016 | UnitBox: An Advanced Object Detection Network. Jiahui Yu, Yuning Jiang, Zhangyang Wang, Zhimin Cao, Thomas S. Huang |
| 2016 | User Redirection and Direct Haptics in Virtual Environments. Cristiano Carvalheiro, Rui Nóbrega, Hugo da Silva, Rui Rodrigues |
| 2016 | V3I-STAL: Visual Vehicle-to-Vehicle Interaction via Simultaneous Tracking and Localization. Xiaobai Liu |
| 2016 | Vibrotactile Experiences for Augmented Reality. Wolfgang Hürst, Nina Rosa, Jean-Paul van Bommel |
| 2016 | Video ChatBot: Triggering Live Social Interactions by Automatic Video Commenting. Yehao Li, Ting Yao, Rui Hu, Tao Mei, Yong Rui |
| 2016 | Video Generation Using 3D Convolutional Neural Network. Shohei Yamamoto, Tatsuya Harada |
| 2016 | Video eCommerce: Towards Online Video Advertising. Zhi-Qi Cheng, Yang Liu, Xiao Wu, Xian-Sheng Hua |
| 2016 | Vision and Language Integration Meets Multimedia Fusion: Proceedings of ACM Multimedia 2016 Workshop. Marie-Francine Moens, Katerina Pastra, Kate Saenko, Tinne Tuytelaars |
| 2016 | Visual Analytics for Multimedia: Challenges and Opportunities. Jarke J. van Wijk |
| 2016 | WIMBY: What's in My Backyard? Michael Dorkhom, Alan Woodley, Shlomo Geva, Richi Nayak |
| 2016 | Weakly-Supervised Recognition, Localization, and Explanation of Visual Entities. Pascal Mettes |
| 2016 | Weighted Linear Fusion of Multimodal Data: A Reasonable Baseline? Ognjen Arandjelovic |
| 2016 | What Makes Photo Cultures Different? Miriam Redi, Damon Crockett, Lev Manovich, Simon Osindero |
| 2016 | What Makes a Good Movie Trailer?: Interpretation from Simultaneous EEG and Eyetracker Recording. Sidi Liu, Jinglei Lv, Yimin Hou, Ting Shoemaker, Qinglin Dong, Kaiming Li, Tianming Liu |
| 2016 | Who is where?: Matching People in Video to Wearable Acceleration During Crowded Mingling Events. Laura Cabrera Quiros, Hayley Hung |
| 2016 | WorkCache: Salvaging siloed knowledge. Scott A. Carter, Laurent Denoue, Matthew Cooper |
| 2016 | Zero-Example Multimedia Event Detection and Recounting with Unsupervised Evidence Localization. Yi-Jie Lu |
| 2016 | Zero-Shot Hashing via Transferring Supervised Knowledge. Yang Yang, Yadan Luo, Weilun Chen, Fumin Shen, Jie Shao, Heng Tao Shen |
| 2016 | bBridge: A Big Data Platform for Social Multimedia Analytics. Aleksandr Farseev, Ivan Samborskii, Tat-Seng Chua |
| 2016 | madmom: A New Python Audio and Music Signal Processing Library. Sebastian Böck, Filip Korzeniowski, Jan Schlüter, Florian Krebs, Gerhard Widmer |
| 2016 | n-Dimensional Display Interface. Charles D. Estes |
| 2016 | vitrivr: A Flexible Retrieval Stack Supporting Multiple Query Modes for Searching in Multimedia Collections. Luca Rossetto, Ivan Giangreco, Claudiu Tanase, Heiko Schuldt |