| 2021 | 2.5D Pose Guided Human Image Generation. Kang Yuan, Sheng Li |
| 2021 | A Beneficial Dual Transformation Approach for Deep Learning Networks Used in Steel Surface Defect Detection. Fityanul Akhyar, Chih-Yang Lin, Gugan S. Kathiresan |
| 2021 | A Denoising Convolutional Neural Network for Self-Supervised Rank Effectiveness Estimation on Image Retrieval. Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette |
| 2021 | A Smart Adversarial Attack on Deep Hashing Based Image Retrieval. Junda Lu, Mingyang Chen, Yifang Sun, Wei Wang, Yi Wang, Xiaochun Yang |
| 2021 | A Tensor Sparse Representation-Based CBMIR System for Computer-Aided Diagnosis of Focal Liver Lesions and its Pilot Trial. Jian Wang, Xian-Hua Han, Lanfen Lin, Hongjie Hu, Yen-Wei Chen |
| 2021 | A Unified-Model via Block Coordinate Descent for Learning the Importance of Filter. Qinghua Li, Xue Zhang, Cuiping Li, Hong Chen |
| 2021 | AWFA-LPD: Adaptive Weight Feature Aggregation for Multi-frame License Plate Detection. Xiaocheng Lu, Yuan Yuan, Qi Wang |
| 2021 | Aligning Visual Prototypes with BERT Embeddings for Few-Shot Learning. Kun Yan, Zied Bouraoui, Ping Wang, Shoaib Jameel, Steven Schockaert |
| 2021 | Automatic Baseball Pitch Overlay. Ting-Hsuan Chou, Wei-Ta Chu |
| 2021 | Bag of Tricks for Building an Accurate and Slim Object Detector for Embedded Applications. Yongkun Du, Zhineng Chen, Caiyan Jia, Xuanya Li, Yu-Gang Jiang |
| 2021 | Body Shape Calculator: Understanding the Type of Body Shapes from Anthropometric Measurements. Shintami Chusnul Hidayati, Yeni Anistyasari |
| 2021 | CEA'21: The 13th Workshop on Multimedia for Cooking and Eating Activities. Yoko Yamakata, Atsushi Hashimoto |
| 2021 | Can Action be Imitated? Learn to Reconstruct and Transfer Human Dynamics from Videos. Yuqian Fu, Yanwei Fu, Yu-Gang Jiang |
| 2021 | Collaborative Representation for Deep Meta Metric Learning. Min Zhu, Weifeng Liu, Kai Zhang, Ye Li, Peng Liu, Baodi Liu |
| 2021 | Combining Adversarial and Reinforcement Learning for Video Thumbnail Selection. Evlampios Apostolidis, Eleni Adamantidou, Vasileios Mezaris, Ioannis Patras |
| 2021 | Contextualized Keyword Representations for Multi-modal Retinal Image Captioning. Jia-Hong Huang, Ting-Wei Wu, Marcel Worring |
| 2021 | Cross-Modal Image-Recipe Retrieval via Intra- and Inter-Modality Hybrid Fusion. Jiao Li, Jialiang Sun, Xing Xu, Wei Yu, Fumin Shen |
| 2021 | Cross-Modal Self-Attention with Multi-Task Pre-Training for Medical Visual Question Answering. Haifan Gong, Guanqi Chen, Sishuo Liu, Yizhou Yu, Guanbin Li |
| 2021 | DANet: Dimension Apart Network for Radar Object Detection. Bo Ju, Wei Yang, Jinrang Jia, Xiaoqing Ye, Qu Chen, Xiao Tan, Hao Sun, Yifeng Shi, Errui Ding |
| 2021 | Dense Scale Network for Crowd Counting. Feng Dai, Hao Liu, Yike Ma, Xi Zhang, Qiang Zhao |
| 2021 | Discrete Tchebichef Transform for Versatile Video Coding. Ka-Hou Chan, Sio Kei Im |
| 2021 | Distractor-Aware Tracker with a Domain-Special Optimized Benchmark for Soccer Player Tracking. Zikai Song, Zhiwen Wan, Wei Yuan, Ying Tang, Junqing Yu, Yi-Ping Phoebe Chen |
| 2021 | Efficient Indexing of 3D Human Motions. Petra Budíková, Jan Sedmidubský, Pavel Zezula |
| 2021 | Efficient Nearest Neighbor Search by Removing Anti-hub. Kimihiro Tanaka, Yusuke Matsui, Shin'ichi Satoh |
| 2021 | Efficient-ROD: Efficient Radar Object Detection based on Densely Connected Residual Network. Chih-Chung Hsu, Chieh Lee, Lin Chen, Min-Kai Hung, Andy Yu-Lun Lin, Xian-Yu Wang |
| 2021 | Embedded YOLO: Faster and Lighter Object Detection. Wen-Kai Wu, Chien-Yu Chen, Jiann-Shu Lee |
| 2021 | Evaluating Contrastive Models for Instance-based Image Retrieval. Tarun Krishna, Kevin McGuinness, Noel E. O'Connor |
| 2021 | Facial Structure Guided GAN for Identity-preserved Face Image De-occlusion. Yiu-ming Cheung, Mengke Li, Rong Zou |
| 2021 | Few-Shot Action Localization without Knowing Boundaries. Ting-Ting Xie, Christos Tzelepis, Fan Fu, Ioannis Patras |
| 2021 | Fire Detection using Transformer Network. Mohammad Shahid, Kai-Lung Hua |
| 2021 | G-CAM: Graph Convolution Network Based Class Activation Mapping for Multi-label Image Recognition. Yangtao Wang, Yanzhao Xie, Yu Liu, Lisheng Fan |
| 2021 | GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph. Cheikh Brahim El Vaigh, Noa Garcia, Benjamin Renoust, Chenhui Chu, Yuta Nakashima, Hajime Nagahara |
| 2021 | GPT2MVS: Generative Pre-trained Transformer-2 for Multi-modal Video Summarization. Jia-Hong Huang, Luka Murn, Marta Mrak, Marcel Worring |
| 2021 | Generative Adversarial Networks with Bi-directional Normalization for Semantic Image Synthesis. Jia Long, Hongtao Lu |
| 2021 | Global Relation-Aware Attention Network for Image-Text Retrieval. Jie Cao, Shengsheng Qian, Huaiwen Zhang, Quan Fang, Changsheng Xu |
| 2021 | HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network. Zifeng Zhuang, Xintao Xiang, Siteng Huang, Donglin Wang |
| 2021 | HPOF: 3D Human Pose Recovery from Monocular Video with Optical Flow. Bin Ji, Chen Yang, Shunyu Yao, Ye Pan |
| 2021 | HSGMP: Heterogeneous Scene Graph Message Passing for Cross-modal Retrieval. Yu Duan, Yun Xiong, Yao Zhang, Yuwei Fu, Yangyong Zhu |
| 2021 | Heterogeneous Side Information-based Iterative Guidance Model for Recommendation. Feifei Dai, Xiaoyan Gu, Zhuo Wang, Mingda Qian, Bo Li, Weiping Wang |
| 2021 | Human Pose Estimation based on Attention Multi-resolution Network. Congcong Zhang, Ning He, Qixiang Sun, Xiaojie Yin, Ke Lu |
| 2021 | ICDAR'21: Intelligent Cross-Data Analysis and Retrieval. Minh-Son Dao, Michael Alexander Riegler, Duc-Tien Dang-Nguyen, Cathal Gurrin, Minh-Triet Tran, Nguyen Thanh Binh |
| 2021 | ICMR '21: International Conference on Multimedia Retrieval, Taipei, Taiwan, August 21-24, 2021 Wen-Huang Cheng, Mohan S. Kankanhalli, Meng Wang, Wei-Ta Chu, Jiaying Liu, Marcel Worring |
| 2021 | IR Questioner: QA-based Interactive Retrieval System. Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama |
| 2021 | Image Retrieval by Hierarchy-aware Deep Hashing Based on Multi-task Learning. Bowen Wang, Liangzhi Li, Yuta Nakashima, Takehiro Yamamoto, Hiroaki Ohshima, Yoshiyuki Shoji, Kenro Aihara, Noriko Kando |
| 2021 | Image-to-Image Transfer Makes Chaos to Order. Sanbi Luo, Tao Guo |
| 2021 | Impact of Interaction Strategies on User Relevance Feedback. Omar Shahbaz Khan, Björn Þór Jónsson, Jan Zahálka, Stevan Rudinac, Marcel Worring |
| 2021 | Introduction to the Fourth Annual Lifelog Search Challenge, LSC'21. Cathal Gurrin, Björn Þór Jónsson, Klaus Schöffmann, Duc-Tien Dang-Nguyen, Jakub Lokoc, Minh-Triet Tran, Wolfgang Hürst, Luca Rossetto, Graham Healy |
| 2021 | Joint Hand-Object Pose Estimation with Differentiably-Learned Physical Contact Point Analysis. Nan Zhuang, Yadong Mu |
| 2021 | Know Yourself and Know Others: Efficient Common Representation Learning for Few-shot Cross-modal Retrieval. Shaoying Wang, Hanjiang Lai, Zhenyu Shi |
| 2021 | Learning Hierarchical Visual-Semantic Representation with Phrase Alignment. Baoming Yan, Qingheng Zhang, Liyu Chen, Lin Wang, Leihao Pei, Jiang Yang, Enyun Yu, Xiaobo Li, Binqiang Zhao |
| 2021 | Learning to Select: A Fully Attentive Approach for Novel Object Captioning. Marco Cagrandi, Marcella Cornia, Matteo Stefanini, Lorenzo Baraldi, Rita Cucchiara |
| 2021 | Leveraging EfficientNet and Contrastive Learning for Accurate Global-scale Location Estimation. Giorgos Kordopatis-Zilos, Panagiotis Galopoulos, Symeon Papadopoulos, Ioannis Kompatsiaris |
| 2021 | Leveraging Two Types of Global Graph for Sequential Fashion Recommendation. Yujuan Ding, Yunshan Ma, Wai Keung Wong, Tat-Seng Chua |
| 2021 | Local-enhanced Interaction for Temporal Moment Localization. Guoqiang Liang, Shiyu Ji, Yanning Zhang |
| 2021 | Look Back Again: Dual Parallel Attention Network for Accurate and Robust Scene Text Recognition. Zilong Fu, Hongtao Xie, Guoqing Jin, Junbo Guo |
| 2021 | M-DFNet: Multi-phase Discriminative Feature Network for Retrieval of Focal Liver Lesions. Yingying Xu, Jing Liu, Lanfen Lin, Hongjie Hu, Ruofeng Tong, Jingsong Li, Yen-Wei Chen |
| 2021 | M2GUDA: Multi-Metrics Graph-Based Unsupervised Domain Adaptation for Cross-Modal Hashing. Chengyuan Zhang, Zhi Zhong, Lei Zhu, Shichao Zhang, Da Cao, Jianfeng Zhang |
| 2021 | MLFont: Few-Shot Chinese Font Generation via Deep Meta-Learning. Xu Chen, Lei Wu, Minggang He, Lei Meng, Xiangxu Meng |
| 2021 | MMArt-ACM'21: International Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia 2021. Min-Chun Hu, Ichiro Ide, Kensuke Tobitani |
| 2021 | MMPT'21: International Joint Workshop on Multi-Modal Pre-Training for Multimedia Understanding. Bei Liu, Jianlong Fu, Shizhe Chen, Qin Jin, Alexander G. Hauptmann, Yong Rui |
| 2021 | MS-SincResNet: Joint Learning of 1D and 2D Kernels Using Multi-scale SincNet and ResNet for Music Genre Classification. Pei-Chun Chang, Yong-Sheng Chen, Chang-Hsing Lee |
| 2021 | MSAV: An Unified Framework for Multi-view Subspace Analysis with View Consistence. Huibing Wang, Guangqi Jiang, Jinjia Peng, Xianping Fu |
| 2021 | MeTILDA: Platform for Melodic Transcription in Language Documentation and Application. Mitchell Lee, Praveena Avula, Min Chen |
| 2021 | Multi-Attention Audio-Visual Fusion Network for Audio Spatialization. Wen Zhang, Jie Shao |
| 2021 | Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval. Xiaoshuai Hao, Yucan Zhou, Dayan Wu, Wanqian Zhang, Bo Li, Weiping Wang |
| 2021 | Multi-Initialization Graph Meta-Learning for Node Classification. Feng Zhao, Donglin Wang, Xintao Xiang |
| 2021 | Multi-scale Dynamic Network for Temporal Action Detection. Yifan Ren, Xing Xu, Fumin Shen, Zheng Wang, Yang Yang, Heng Tao Shen |
| 2021 | NASTER: Non-local Attentional Scene Text Recognizer. Lei Wu, Xueliang Liu, Yanbin Hao, Yunjie Ma, Richang Hong |
| 2021 | NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection. Zekun Luo, Zheng Fang, Sixiao Zheng, Yabiao Wang, Yanwei Fu |
| 2021 | Nested Dense Attention Network for Single Image Super-Resolution. Cheng Qiu, Yirong Yao, Yuntao Du |
| 2021 | Neural Symbolic Representation Learning for Image Captioning. Xiaomei Wang, Lin Ma, Yanwei Fu, Xiangyang Xue |
| 2021 | Object Detection on Embedded Systems for Traffic in Asian Countries. Bao-Hong Lai, Hsun-Ping Hsieh |
| 2021 | Personal Knowledge Base Construction from Multimodal Data. An-Zi Yen, Chia-Chung Chang, Hen-Hsen Huang, Hsin-Hsi Chen |
| 2021 | Question-Guided Semantic Dual-Graph Visual Reasoning with Novel Answers. Xinzhe Zhou, Yadong Mu |
| 2021 | RGB-D Scene Recognition based on Object-Scene Relation and Semantics-Preserving Attention. Yuhui Guo, Xun Liang |
| 2021 | ROD2021 Challenge: A Summary for Radar Object Detection Challenge for Autonomous Driving Applications. Yizhou Wang, Jenq-Neng Hwang, Gaoang Wang, Hui Liu, Kwang-Ju Kim, Hung-Min Hsu, Jiarui Cai, Haotian Zhang, Zhongyu Jiang, Renshu Gu |
| 2021 | Radar Object Detection Using Data Merging, Enhancement and Fusion. Jun Yu, Xinlong Hao, Xinjian Gao, Qiang Sun, Yuyu Liu, Peng Chang, Zhong Zhang, Fang Gao, Feng Shuang |
| 2021 | Reading Scene Text by Fusing Visual Attention with Semantic Representations. Zhiguang Liu, Liangwei Wang, Jian Qiao |
| 2021 | Relation-aware Hierarchical Attention Framework for Video Question Answering. Fangtao Li, Ting Bai, Chenyu Cao, Zihe Liu, Chenghao Yan, Bin Wu |
| 2021 | Reproducibility Companion Paper: Knowledge Enhanced Neural Fashion Trend Forecasting. Yunshan Ma, Yujuan Ding, Xun Yang, Lizi Liao, Wai Keung Wong, Tat-Seng Chua, Jinyoung Moon, Hong-Han Shuai |
| 2021 | SAGN: Semantic Adaptive Graph Network for Skeleton-Based Human Action Recognition. Ziwang Fu, Feng Liu, Jiahao Zhang, Hanyang Wang, Chengyi Yang, Qing Xu, Jiayin Qi, Xiangling Fu, Aimin Zhou |
| 2021 | Scene Text Recognition with Cascade Attention Network. Min Zhang, Meng Ma, Ping Wang |
| 2021 | Scene-aware Learning Network for Radar Object Detection. Zangwei Zheng, Xiangyu Yue, Kurt Keutzer, Alberto L. Sangiovanni-Vincentelli |
| 2021 | Semi-supervised Many-to-many Music Timbre Transfer. Yu-Chen Chang, Wen-Cheng Chen, Min-Chun Hu |
| 2021 | Social Relation Analysis from Videos via Multi-entity Reasoning. Chenghao Yan, Zihe Liu, Fangtao Li, Chenyu Cao, Zheng Wang, Bin Wu |
| 2021 | Spatio-Temporal Activity Detection and Recognition in Untrimmed Surveillance Videos. Konstantinos Gkountakos, Despoina Touska, Konstantinos Ioannidis, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2021 | Squeeze-and-Excitation network-Based Radar Object Detection With Weighted Location Fusion. Pengliang Sun, Xuetong Niu, Pengfei Sun, Kele Xu |
| 2021 | Summary of the 2021 Embedded Deep Learning Object Detection Model Compression Competition for Traffic in Asian Countries. Yu-Shu Ni, Chia-Chi Tsai, Jiun-In Guo, Jenq-Neng Hwang, Bo-Xun Wu, Po-Chi Hu, Ted T. Kuo, Po-Yu Chen, Hsien-Kai Kuo |
| 2021 | TEACH: Attention-Aware Deep Cross-Modal Hashing. Honglei Yao, Yu-Wei Zhan, Zhen-Duo Chen, Xin Luo, Xin-Shun Xu |
| 2021 | Ten Questions in Lifelog Mining and Information Recall. An-Zi Yen, Hen-Hsen Huang, Hsin-Hsi Chen |
| 2021 | Text-Enhanced Attribute-Based Attention for Generalized Zero-Shot Fine-Grained Image Classification. Yan-He Chen, Mei-Chen Yeh |
| 2021 | Text-Guided Visual Feature Refinement for Text-Based Person Search. Liying Gao, Kai Niu, Zehong Ma, Bingliang Jiao, Tonghao Tan, Peng Wang |
| 2021 | Unsupervised Deep Cross-Modal Hashing by Knowledge Distillation for Large-scale Cross-modal Retrieval. Mingyong Li, Hongya Wang |
| 2021 | Unsupervised Video Summarization via Multi-source Features. Hussain Kanafani, Junaid Ahmed Ghauri, Sherzod Hakimov, Ralph Ewerth |
| 2021 | Video Action Retrieval Using Action Recognition Model. Yuko Iinuma, Shin'ichi Satoh |
| 2021 | Visible-infrared Person Re-identification with Human Body Parts Assistance. Huangpeng Dai, Qing Xie, Jiachen Li, Yanchun Ma, Lin Li, Yongjian Liu |
| 2021 | Weakly Supervised Sketch Based Person Search. Lan Yan, Wenbo Zheng, Fei-Yue Wang, Chao Gou |