| 2020 | A Coordinated Representation Learning Enhanced Multimodal Machine Translation Approach with Multi-Attention. Yifeng Han, Lin Li, Jianwei Zhang |
| 2020 | A Crowd Analysis Framework for Detecting Violence Scenes. Konstantinos Gkountakos, Konstantinos Ioannidis, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2020 | A Framework for Paper Submission Recommendation System. Dinh V. Cuong, Dac H. Nguyen, Son Huynh, Phong Huynh, Cathal Gurrin, Minh-Son Dao, Duc-Tien Dang-Nguyen, Binh T. Nguyen |
| 2020 | A Lightweight Gated Global Module for Global Context Modeling in Neural Networks. Li Hao, Liping Hou, Yuantao Song, Ke Lu, Jian Xue |
| 2020 | Actor-Critic Sequence Generation for Relative Difference Captioning. Zhengcong Fei |
| 2020 | An Active Learning Framework for Duplicate Detection in SaaS Platforms. Quy H. Nguyen, Dac H. Nguyen, Minh-Son Dao, Duc-Tien Dang-Nguyen, Cathal Gurrin, Binh T. Nguyen |
| 2020 | An Interactive Learning System for Large-Scale Multimedia Analytics. Omar Shahbaz Khan |
| 2020 | An Interactive Multimodal Retrieval System for Memory Assistant and Life Organized Support. Van-Luon Tran, Anh-Vu Mai-Nguyen, Trong-Dat Phan, Anh-Khoa Vo, Minh-Son Dao, Koji Zettsu |
| 2020 | Analysis of the Effect of Dataset Construction Methodology on Transferability of Music Emotion Recognition Models. Sabina Hult, Line Bay Kreiberg, Sami Sebastian Brandt, Björn Þór Jónsson |
| 2020 | Anomaly Detection in Traffic Surveillance Videos with GAN-based Future Frame Prediction. Khac-Tuan Nguyen, Dat-Thanh Dinh, Minh N. Do, Minh-Triet Tran |
| 2020 | Are You Watching Closely? Content-based Retrieval of Hand Gestures. Mahnaz Amiri Parian, Luca Rossetto, Heiko Schuldt, Stéphane Dupont |
| 2020 | At the Speed of Sound: Efficient Audio Scene Classification. Bo Dong, Cristian Lumezanu, Yuncong Chen, Dongjin Song, Takehiko Mizoguchi, Haifeng Chen, Latifur Khan |
| 2020 | Attention Mechanisms, Signal Encodings and Fusion Strategies for Improved Ad-hoc Video Search with Dual Encoding Networks. Damianos Galanopoulos, Vasileios Mezaris |
| 2020 | Automatic Color Scheme Extraction from Movies. Suzi Kim, Sunghee Choi |
| 2020 | Automatic Evaluation of Iconic Image Retrieval based on Colour, Shape, and Texture. Riku Togashi, Sumio Fujita, Tetsuya Sakai |
| 2020 | Automatic Reminiscence Therapy for Dementia. Mariona Caros, Maite Garolera, Petia Radeva, Xavier Giró-i-Nieto |
| 2020 | Automation of Deep Learning - Theory and Practice. Martin Wistuba, Ambrish Rawat, Tejaswini Pedapati |
| 2020 | Beyond Relevance Feedback for Searching and Exploring Large Multimedia Collections. Marcel Worring |
| 2020 | CEA'20: The 12th Workshop on Multimedia for Cooking and Eating Activities. Ichiro Ide, Yoko Yamakata, Atsushi Hashimoto |
| 2020 | Compact Network Training for Person ReID. Hussam Lawen, Avi Ben-Cohen, Matan Protter, Itamar Friedman, Lihi Zelnik-Manor |
| 2020 | Confidence-based Weighted Loss for Multi-label Classification with Missing Labels. Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard |
| 2020 | Continuous Health Interface Event Retrieval. Vaibhav Pandey, Nitish Nag, Ramesh C. Jain |
| 2020 | Continuous ODE-defined Image Features for Adaptive Retrieval. Fabio Carrara, Giuseppe Amato, Fabrizio Falchi, Claudio Gennaro |
| 2020 | DAGC: Employing Dual Attention and Graph Convolution for Point Cloud based Place Recognition. Qi Sun, Hongyan Liu, Jun He, Zhaoxin Fan, Xiaoyong Du |
| 2020 | Deep Adversarial Discrete Hashing for Cross-Modal Retrieval. Cong Bai, Chao Zeng, Qing Ma, Jinglin Zhang, Shengyong Chen |
| 2020 | Deep Discrete Attention Guided Hashing for Face Image Retrieval. Zhi Xiong, Dayan Wu, Wen Gu, Haisu Zhang, Bo Li, Weiping Wang |
| 2020 | Deep Semantic-Alignment Hashing for Unsupervised Cross-Modal Retrieval. Dejie Yang, Dayan Wu, Wanqian Zhang, Haisu Zhang, Bo Li, Weiping Wang |
| 2020 | Detecting, Classifying, and Mapping Retail Storefronts Using Street-level Imagery. Shahin Sharifi Noorian, Sihang Qiu, Achilleas Psyllidis, Alessandro Bozzon, Geert-Jan Houben |
| 2020 | Detection of Semantic Risk Situations in Lifelog Data for Improving Life of Frail People. Thinhinane Yebda, Jenny Benois-Pineau, Marion Pech, Hélène Amièva, Cathal Gurrin |
| 2020 | Efficient Base Class Selection Algorithms for Few-Shot Classification. Takumi Ohkuma, Hideki Nakayama |
| 2020 | EfficientFAN: Deep Knowledge Transfer for Face Alignment. Pengcheng Gao, Ke Lu, Jian Xue |
| 2020 | Emotion Recognition from Galvanic Skin Response Signal Based on Deep Hybrid Neural Networks. Imam Yogie Susanto, Tse-Yu Pan, Chien-Wen Chen, Min-Chun Hu, Wen-Huang Cheng |
| 2020 | Enabling Relevance-Based Exploration of Cataract Videos. Negin Ghamsarian |
| 2020 | Explaining with Counter Visual Attributes and Examples. Sadaf Gulshad, Arnold W. M. Smeulders |
| 2020 | Fake News Detection via Knowledge-driven Multimodal Graph Convolutional Networks. Youze Wang, Shengsheng Qian, Jun Hu, Quan Fang, Changsheng Xu |
| 2020 | Flood Level Prediction via Human Pose Estimation from Social Media Images. Khanh-An C. Quan, Vinh-Tiep Nguyen, Tan-Cong Nguyen, Tam V. Nguyen, Minh-Triet Tran |
| 2020 | Forward and Backward Multimodal NMT for Improved Monolingual and Multilingual Cross-Modal Retrieval. Po-Yao Huang, Xiaojun Chang, Alexander G. Hauptmann, Eduard H. Hovy |
| 2020 | Google Helps YouTube: Learning Few-Shot Video Classification from Historic Tasks and Cross-Domain Sample Transfer. Xinzhe Zhou, Yadong Mu |
| 2020 | HLVU: A New Challenge to Test Deep Understanding of Movies the Way Humans do. Keith Curtis, George Awad, Shahzad Rajput, Ian Soboroff |
| 2020 | Heterogeneous Non-Local Fusion for Multimodal Activity Recognition. Petr Byvshev, Pascal Mettes, Yu Xiao |
| 2020 | Human Object Interaction Detection via Multi-level Conditioned Network. Xu Sun, Xinwen Hu, Tongwei Ren, Gangshan Wu |
| 2020 | ICDAR'20: Intelligent Cross-Data Analysis and Retrieval. Minh-Son Dao, Morten Fjeld, Filip Biljecki, Uraz Yavanoglu, Mianxiong Dong |
| 2020 | Image Retrieval using Multi-scale CNN Features Pooling. Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo |
| 2020 | Image Synthesis from Locally Related Texts. Tianrui Niu, Fangxiang Feng, Lingxuan Li, Xiaojie Wang |
| 2020 | Imageability Estimation using Visual and Language Features. Chihaya Matsuhira, Marc A. Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase |
| 2020 | Incorporating Semantic Knowledge for Visual Lifelog Activity Recognition. Min-Huan Fu, An-Zi Yen, Hen-Hsen Huang, Hsin-Hsi Chen |
| 2020 | Intelligent Task Recognition: Towards Enabling Productivity Assistance in Daily Life. Jonathan Liono, Mohammad Saiedur Rahaman, Flora D. Salim, Yongli Ren, Damiano Spina, Falk Scholer, Johanne R. Trippas, Mark Sanderson, Paul N. Bennett, Ryen W. White |
| 2020 | Interactivity Proposals for Surveillance Videos. Shuo Chen, Pascal Mettes, Tao Hu, Cees G. M. Snoek |
| 2020 | Introduction to the Third Annual Lifelog Search Challenge (LSC'20). Cathal Gurrin, Tu-Khiem Le, Van-Tu Ninh, Duc-Tien Dang-Nguyen, Björn Þór Jónsson, Jakub Lokoc, Wolfgang Hürst, Minh-Triet Tran, Klaus Schöffmann |
| 2020 | Itinerary Planning via Deep Reinforcement Learning. Shengxin Chen, Bo-Hao Chen, Zhaojiong Chen, Yunbing Wu |
| 2020 | Knowledge Enhanced Neural Fashion Trend Forecasting. Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua |
| 2020 | Learning Fine-Grained Similarity Matching Networks for Visual Tracking. Dawei Zhang, Zhonglong Zheng, Xiaowei He, Liu Su, Liyuan Chen |
| 2020 | Learning to Select Elements for Graphic Design. Guolong Wang, Zheng Qin, Junchi Yan, Liu Jiang |
| 2020 | MAENet: Boosting Feature Representation for Cross-Modal Person Re-Identification with Pairwise Supervision. Yongbiao Chen, Sheng Zhang, Zhengwei Qi |
| 2020 | MMArt-ACM'20: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2020. Wei-Ta Chu, Ichiro Ide, Naoko Nitta, Norimichi Tsumura, Toshihiko Yamasaki |
| 2020 | Medical Image Retrieval: Applications and Resources. Henning Müller |
| 2020 | Multi-Attention Multimodal Sentiment Analysis. Taeyong Kim, Bowon Lee |
| 2020 | Multi-Graph Group Collaborative Filtering. Bo Jiang |
| 2020 | Multi-level Recognition on Falls from Activities of Daily Living. Jiawei Li, Shu-Tao Xia, Qianggang Ding |
| 2020 | Multimodal Analytics for Real-world News using Measures of Cross-modal Entity Consistency. Eric Müller-Budack, Jonas Theiner, Sebastian Diering, Maximilian Idahl, Ralph Ewerth |
| 2020 | Music Tower Blocks: Multi-Faceted Exploration Interface for Web-Scale Music Access. Markus Schedl, Michael Mayr, Peter Knees |
| 2020 | Object Detection for Unseen Domains while Reducing Response Time using Knowledge Transfer in Multimedia Event Processing. Asra Aslam |
| 2020 | On Visualizations in the Role of Universal Data Representation. Tomás Skopal |
| 2020 | One Perceptron to Rule Them All: Language, Vision, Audio and Speech. Xavier Giró-i-Nieto |
| 2020 | One Shot Logo Recognition Based on Siamese Neural Networks. Camilo Vargas, Qianni Zhang, Ebroul Izquierdo |
| 2020 | Optimizing Queries over Video via Lightweight Keypoint-based Object Detection. Jiansheng Dong, Jingling Yuan, Lin Li, Xian Zhong, Weiru Liu |
| 2020 | PredNet and Predictive Coding: A Critical Review. Roshan Prakash Rane, Edit Szügyi, Vageesh Saxena, André Ofner, Sebastian Stober |
| 2020 | Proceedings of the 2020 on International Conference on Multimedia Retrieval, ICMR 2020, Dublin, Ireland, June 8-11, 2020. Cathal Gurrin, Björn Þór Jónsson, Noriko Kando, Klaus Schöffmann, Yi-Ping Phoebe Chen, Noel E. O'Connor |
| 2020 | QIK: A System for Large-Scale Image Retrieval on Everyday Scenes With Common Objects. Arun Zachariah, Mohamed Gharibi, Praveen Rao |
| 2020 | Query-controllable Video Summarization. Jia-Hong Huang, Marcel Worring |
| 2020 | Rank-embedded Hashing for Large-scale Image Retrieval. Haiyan Fu, Ying Li, Hengheng Zhang, Jinfeng Liu, Tao Yao |
| 2020 | Reducing Response Time for Multimedia Event Processing using Domain Adaptation. Asra Aslam, Edward Curry |
| 2020 | Salienteye: Maximizing Engagement While Maintaining Artistic Style on Instagram Using Deep Neural Networks. Lili Wang, Ruibo Liu, Soroush Vosoughi |
| 2020 | Search Result Clustering in Collaborative Sound Collections. Xavier Favory, Frederic Font, Xavier Serra |
| 2020 | Semantic Gated Network for Efficient News Representation. Xuxiao Bu, Bingfeng Li, Yaxiong Wang, Jihua Zhu, Xueming Qian, Marco Zhao |
| 2020 | SenseMood: Depression Detection on Social Media. Chenhao Lin, Pengwei Hu, Hui Su, Shaochun Li, Jing Mei, Jie Zhou, Henry Leung |
| 2020 | Sentence-based and Noise-robust Cross-modal Retrieval on Cooking Recipes and Food Images. Zichen Zan, Lin Li, Jianquan Liu, Dong Zhou |
| 2020 | Super-Resolution Coding Defense Against Adversarial Examples. Yanjie Chen, Likun Cai, Wei Cheng, Hao Wang |
| 2020 | System Fusion with Deep Ensembles. Liviu-Daniel Stefan, Mihai Gabriel Constantin, Bogdan Ionescu |
| 2020 | Towards Evaluating and Simulating Keyword Queries for Development of Interactive Known-item Search Systems. Ladislav Peska, Frantisek Mejzlík, Tomás Soucek, Jakub Lokoc |
| 2020 | Trajectory Prediction Network for Future Anticipation of Ships. Pim Dijt, Pascal Mettes |
| 2020 | Urban Movie Map for Walkers: Route View Synthesis using 360° Videos. Naoki Sugimoto, Toru Okubo, Kiyoharu Aizawa |
| 2020 | Urban Object Detection Kit: A System for Collection and Analysis of Street-Level Imagery. Maarten Sukel, Stevan Rudinac, Marcel Worring |
| 2020 | Visible-infrared Person Re-identification via Colorization-based Siamese Generative Adversarial Network. Xian Zhong, Tianyou Lu, Wenxin Huang, Jingling Yuan, Wenxuan Liu, Chia-Wen Lin |
| 2020 | Visual Relations Augmented Cross-modal Retrieval. Yutian Guo, Jingjing Chen, Hao Zhang, Yu-Gang Jiang |
| 2020 | Visual Story Ordering with a Bidirectional Writer. Wei-Rou Lin, Hen-Hsen Huang, Hsin-Hsi Chen |
| 2020 | What Should I Do? Ramesh C. Jain |
| 2020 | YOLO-mini-tiger: Amur Tiger Detection. Runchen Wei, Ning He, Ke Lu |
| 2020 | iCap: Interactive Image Captioning with Predictive Text. Zhengxiong Jia, Xirong Li |
| 2020 | iSparse: Output Informed Sparsification of Neural Network. Yash Garg, K. Selçuk Candan |
| 2020 | surgXplore: Interactive Video Exploration for Endoscopy. Andreas Leibetseder, Klaus Schöffmann |