| 2021 | A Complete End to End Open Source Toolchain for the Versatile Video Coding (VVC) Standard. Adam Wieckowski, Christian Lehmann, Benjamin Bross, Detlev Marpe, Thibaud Biatek, Mickaël Raulet, Jean Le Feuvre |
| 2021 | A Gradient Balancing Approach for Robust Logo Detection. Fuxing Leng |
| 2021 | A Large-Scale Benchmark for Food Image Segmentation. Xiongwei Wu, Xin Fu, Ying Liu, Ee-Peng Lim, Steven C. H. Hoi, Qianru Sun |
| 2021 | A Multi-Domain Adaptive Graph Convolutional Network for EEG-based Emotion Recognition. Rui Li, Yiting Wang, Bao-Liang Lu |
| 2021 | A Multimodal Framework for Video Ads Understanding. Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang |
| 2021 | A Novel Patch Convolutional Neural Network for View-based 3D Model Retrieval. Zan Gao, Yuxiang Shao, Weili Guan, Meng Liu, Zhiyong Cheng, Shengyong Chen |
| 2021 | A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation. Yupan Huang, Bei Liu, Jianlong Fu, Yutong Lu |
| 2021 | A Question Answering System for Unstructured Table Images. Wenyuan Xue, Siqi Cai, Wen Wang, Qingyong Li, Baosheng Yu, Yibing Zhan, Dacheng Tao |
| 2021 | A Simple and Effective Baseline for Robust Logo Detection. Weipeng Xu, Ye Liu, Daquan Lin |
| 2021 | A Solution to Multi-modal Ads Video Tagging Challenge. Hao Wu, Jiajie Wang, Yuanzhe Gu, Peisen Zhao, Zhonglin Zu |
| 2021 | A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing. Xiao Luo, Daqing Wu, Zeyu Ma, Chong Chen, Minghua Deng, Jianqiang Huang, Xian-Sheng Hua |
| 2021 | A Stepwise Matching Method for Multi-modal Image based on Cascaded Network. Jinming Mu, Shuiping Gou, Shasha Mao, Shankui Zheng |
| 2021 | A System for Interactive and Intelligent AD Auxiliary Screening. Sen Yang, Qike Zhao, Lanxin Miao, Min Chen, Lianli Gao, Jingkuan Song, Weidong Le |
| 2021 | A Transformer based Approach for Image Manipulation Chain Detection. Jiaxiang You, Yuanman Li, Jiantao Zhou, Zhongyun Hua, Weiwei Sun, Xia Li |
| 2021 | A Tutorial on AI Music Composition. Xu Tan, Xiaobing Li |
| 2021 | A Virtual Character Generation and Animation System for E-Commerce Live Streaming. Li Hu, Bang Zhang, Peng Zhang, Jinwei Qi, Jian Cao, Daiheng Gao, Haiming Zhao, Xiaoduan Feng, Qi Wang, Lian Zhuo, Pan Pan, Yinghui Xu |
| 2021 | A2W: Context-Aware Recommendation System for Mobile Augmented Reality Web Browser. Kit-Yung Lam, Lik Hang Lee, Pan Hui |
| 2021 | ABPNet: Adaptive Background Modeling for Generalized Few Shot Segmentation. Kaiqi Dong, Wei Yang, Zhenbo Xu, Liusheng Huang, Zhidong Yu |
| 2021 | ADGD'21: 1st Workshop on Synthetic Multimedia - Audiovisual Deepfake Generation and Detection. Stefan Winkler, Weiling Chen, Abhinav Dhall, Pavel Korshunov |
| 2021 | ADVM'21: 1st International Workshop on Adversarial Learning for Multimedia. Aishan Liu, Xinyun Chen, Yingwei Li, Chaowei Xiao, Xun Yang, Xianglong Liu, Dawn Song, Dacheng Tao, Alan L. Yuille, Anima Anandkumar |
| 2021 | AFD-Net: Adaptive Fully-Dual Network for Few-Shot Object Detection. Longyao Liu, Bo Ma, Yulin Zhang, Xin Yi, Haozhi Li |
| 2021 | AFEC: Adaptive Feature Extraction Modules for Learned Image Compression. Yi Ma, Yongqi Zhai, Jiayu Yang, Chunhui Yang, Ronggang Wang |
| 2021 | AI and the Future of Education. James C. Lester |
| 2021 | AI-Lyricist: Generating Music and Vocabulary Constrained Lyrics. Xichu Ma, Ye Wang, Min-Yen Kan, Wee Sun Lee |
| 2021 | AICoacher: A System Framework for Online Realtime Workout Coach. Haocong Ying, Tie Liu, Mingxin Ai, Jiali Ding, Yuanyuan Shang |
| 2021 | AITransfer: Progressive AI-powered Transmission for Real-Time Point Cloud Video Streaming. Yakun Huang, Yuanwei Zhu, Xiuquan Qiao, Zhijie Tan, Boyuan Bai |
| 2021 | AIxFood'21: 3rd Workshop on AIxFood. Ricardo Guerrero, Michael Spranger, Shuqiang Jiang, Chong-Wah Ngo |
| 2021 | AKECP: Adaptive Knowledge Extraction from Feature Maps for Fast and Efficient Channel Pruning. Haonan Zhang, Longjun Liu, Hengyi Zhou, Wenxuan Hou, Hongbin Sun, Nanning Zheng |
| 2021 | AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries. Woo-Sung Choi, Minseok Kim, Marco A. Martínez Ramírez, Jaehwa Chung, Soonyoung Jung |
| 2021 | APF: An Adversarial Privacy-preserving Filter to Protect Portrait Information. Xian Zhao, Jiaming Zhang, Xiaowen Huang |
| 2021 | ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones. Shan An, Guangfu Che, Jinghao Guo, Haogang Zhu, Junjie Ye, Fangru Zhou, Zhaoqi Zhu, Dong Wei, Aishan Liu, Wei Zhang |
| 2021 | ASFD: Automatic and Scalable Face Detector. Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, Zhenyu Zhang, Chengjie Wang, Jilin Li, Xiaoming Huang, Yili Xia |
| 2021 | ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion. Yaqi Xia, Yan Xia, Wei Li, Rui Song, Kailang Cao, Uwe Stilla |
| 2021 | Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience. Wei Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su |
| 2021 | Ada-VSR: Adaptive Video Super-Resolution with Meta-Learning. Akash Gupta, Padmaja Jonnalagedda, Bir Bhanu, Amit K. Roy-Chowdhury |
| 2021 | Adaptive Affinity Loss and Erroneous Pseudo-Label Refinement for Weakly Supervised Semantic Segmentation. Xiangrong Zhang, Zelin Peng, Peng Zhu, Tianyang Zhang, Chen Li, Huiyu Zhou, Licheng Jiao |
| 2021 | Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing. Shubao Liu, Ke-Yue Zhang, Taiping Yao, Mingwei Bi, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma |
| 2021 | AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning. Yihao Huang, Qing Guo, Felix Juefei-Xu, Lei Ma, Weikai Miao, Yang Liu, Geguang Pu |
| 2021 | AdvHash: Set-to-set Targeted Attack on Deep Hashing with One Single Adversarial Patch. Shengshan Hu, Yechao Zhang, Xiaogeng Liu, Leo Yu Zhang, Minghui Li, Hai Jin |
| 2021 | Adversarial Learning with Mask Reconstruction for Text-Guided Image Inpainting. Xingcai Wu, Yucheng Xie, Jiaqi Zeng, Zhenguo Yang, Yi Yu, Qing Li, Wenyin Liu |
| 2021 | Adversarial Pixel Masking: A Defense against Physical Attacks for Pre-trained Object Detectors. Ping-Han Chiang, Chi-Shen Chan, Shan-Hung Wu |
| 2021 | Aesthetic Evaluation and Guidance for Mobile Photography. Hao Lou, Heng Huang, Chaoen Xiao, Xin Jin |
| 2021 | Affective Color Fields: Reimagining Rothkoesque Artwork as an Interactive Companion for Artistic Self-Expression. Aiden Kang, Liang Wang, Ziyu Zhou, Zhe Huang, Robert J. K. Jacob |
| 2021 | AggNet for Self-supervised Monocular Depth Estimation: Go An Aggressive Step Further. Zhi Chen, Xiaoqing Ye, Liang Du, Wei Yang, Liusheng Huang, Xiao Tan, Zhenbo Shi, Fumin Shen, Errui Ding |
| 2021 | Air-Text: Air-Writing and Recognition System. Sun-Kyung Lee, Jong-Hwan Kim |
| 2021 | An Adaptive Iterative Inpainting Method with More Information Exploration. Shengjie Chen, Zhenhua Guo, Bo Yuan |
| 2021 | An EM Framework for Online Incremental Learning of Semantic Segmentation. Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang, Xuming He |
| 2021 | Anchor-free 3D Single Stage Detector with Mask-Guided Attention for Point Cloud. Jiale Li, Hang Dai, Ling Shao, Yong Ding |
| 2021 | Annotation-Efficient Semantic Segmentation with Shape Prior Knowledge. Yuhang Lu |
| 2021 | Annotation-Efficient Untrimmed Video Action Recognition. Yixiong Zou, Shanghang Zhang, Guangyao Chen, Yonghong Tian, Kurt Keutzer, José M. F. Moura |
| 2021 | Anti-Distillation Backdoor Attacks: Backdoors Can Really Survive in Knowledge Distillation. Yunjie Ge, Qian Wang, Baolin Zheng, Xinlu Zhuang, Qi Li, Chao Shen, Cong Wang |
| 2021 | Apercevoir: Bio Internet of Things Interactive System. Youyang Hu, Chiao-Chi Chou, Chia-Wei Li |
| 2021 | Armor: A Benchmark for Meta-evaluation of Artificial Music. Songhe Wang, Zheng Bao, Jingtong E |
| 2021 | ArtScience and the ICECUBE LED Display [ILDm^3]. Mark David Hosale, Robert S. Allison, Jim Madsen, Marcus Gordon |
| 2021 | ArtiVisual: A Platform to Generate and Compare Art. Jardenna Mohazzab, Abe Vos, Jonathan van Westendorp, Lucas Lageweg, Dylan Prins, Aritra Bhowmik |
| 2021 | Assisting News Media Editors with Cohesive Visual Storylines. Gonçalo Marcelino, David Semedo, André Mourão, Saverio G. Blasi, João Magalhães, Marta Mrak |
| 2021 | AsyNCE: Disentangling False-Positives for Weakly-Supervised Video Grounding. Cheng Da, Yanhao Zhang, Yun Zheng, Pan Pan, Yinghui Xu, Chunhong Pan |
| 2021 | Attention-driven Graph Clustering Network. Zhihao Peng, Hui Liu, Yuheng Jia, Junhui Hou |
| 2021 | Attention-guided Temporally Coherent Video Object Matting. Yunke Zhang, Chi Wang, Miaomiao Cui, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Hujun Bao, Qixing Huang, Weiwei Xu |
| 2021 | Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation. Rui Wang, Jian Chen, Gang Yu, Li Sun, Changqian Yu, Changxin Gao, Nong Sang |
| 2021 | Augmenting TV Shows via Uncalibrated Camera Small Motion Tracking in Dynamic Scene. Yizhen Lao, Jie Yang, Xinying Wang, Jianxin Lin, Yu Cao, Shien Song |
| 2021 | Auto-MSFNet: Search Multi-scale Fusion Network for Salient Object Detection. Miao Zhang, TingWei Liu, Yongri Piao, Shunyu Yao, Huchuan Lu |
| 2021 | Automated Multi-Modal Video Editing for Ads Video. Qin Lin, Nuo Pang, Zhiying Hong |
| 2021 | Automated Playtesting with a Cognitive Model of Sensorimotor Coordination. Injung Lee, Hyunchul Kim, Byungjoo Lee |
| 2021 | Automatic Channel Pruning with Hyper-parameter Search and Dynamic Masking. Baopu Li, Yanwen Fan, Zhihong Pan, Yuchen Bian, Gang Zhang |
| 2021 | BAM: Bilateral Activation Mechanism for Image Fusion. Zi-Rong Jin, Liang-Jian Deng, Tian-Jing Zhang, Xiao-Xu Jin |
| 2021 | Better Learning Shot Boundary Detection via Multi-task. Haoxin Zhang, Zhimin Li, Qinglin Lu |
| 2021 | Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA. Gangyan Zeng, Yuan Zhang, Yu Zhou, Xiaomeng Yang |
| 2021 | Block Popularity Prediction for Multimedia Storage Systems Using Spatial-Temporal-Sequential Neural Networks. Yingying Cheng, Fan Zhang, Gang Hu, Yiwen Wang, Hanhui Yang, Gong Zhang, Zhuo Cheng |
| 2021 | Boosting End-to-end Multi-Object Tracking and Person Search via Knowledge Distillation. Wei Zhang, Lingxiao He, Peng Chen, Xingyu Liao, Wu Liu, Qi Li, Zhenan Sun |
| 2021 | Boosting Lightweight Single Image Super-resolution via Joint-distillation. Xiaotong Luo, Qiuyuan Liang, Ding Liu, Yanyun Qu |
| 2021 | Boosting Mobile CNN Inference through Semantic Memory. Yun Li, Chen Zhang, Shihao Han, Li Lyna Zhang, Baoqun Yin, Yunxin Liu, Mengwei Xu |
| 2021 | Bottom-Up and Bidirectional Alignment for Referring Expression Comprehension. Liuwu Li, Yuqi Bu, Yi Cai |
| 2021 | BridgeNet: A Joint Learning Network of Depth Map Super-Resolution and Monocular Depth Estimation. Qi Tang, Runmin Cong, Ronghui Sheng, Lingzhi He, Dan Zhang, Yao Zhao, Sam Kwong |
| 2021 | Bridging the Gap between Low-Light Scenes: Bilevel Learning for Fast Adaptation. Dian Jin, Long Ma, Risheng Liu, Xin Fan |
| 2021 | Build Your Own Bundle - A Neural Combinatorial Optimization Method. Qilin Deng, Kai Wang, Minghao Zhao, Runze Wu, Yu Ding, Zhene Zou, Yue Shang, Jianrong Tao, Changjie Fan |
| 2021 | CAA: Candidate-Aware Aggregation for Temporal Action Detection. Yifan Ren, Xing Xu, Fumin Shen, Yazhou Yao, Huimin Lu |
| 2021 | CALLip: Lipreading using Contrastive and Attribute Learning. Yiyang Huang, Xuefeng Liang, Chaowei Fang |
| 2021 | CARE: Cloudified Android OSes on the Cloud Rendering. Dongjie Tang, Cathy Bao, Yong Yao, Chao Xie, Qiming Shi, Marc Mao, Randy Xu, Linsheng Li, Mohammad R. Haghighat, Zhengwei Qi, Haibing Guan |
| 2021 | CDD: Multi-view Subspace Clustering via Cross-view Diversity Detection. Shudong Huang, Ivor W. Tsang, Zenglin Xu, Jiancheng Lv, Quanhui Liu |
| 2021 | CDP: Towards Optimal Filter Pruning via Class-wise Discriminative Power. Tianshuo Xu, Yuhang Wu, Xiawu Zheng, Teng Xi, Gang Zhang, Errui Ding, Fei Chao, Rongrong Ji |
| 2021 | CG-GAN: Class-Attribute Guided Generative Adversarial Network for Old Photo Restoration. Jixin Liu, Rui Chen, Shipeng An, Heng Zhang |
| 2021 | CLIP4Caption: CLIP for Video Caption. Mingkang Tang, Zhanyu Wang, Zhenhua Liu, Fengyun Rao, Dian Li, Xiu Li |
| 2021 | CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval. Zhijian Hou, Chong-Wah Ngo, Wing Kwong Chan |
| 2021 | CaFGraph: Context-aware Facial Multi-graph Representation for Facial Action Unit Recognition. Yingjie Chen, Diqi Chen, Yizhou Wang, Tao Wang, Yun Liang |
| 2021 | Camera-Agnostic Person Re-Identification via Adversarial Disentangling Learning. Hao Ni, Jingkuan Song, Xiaosu Zhu, Feng Zheng, Lianli Gao |
| 2021 | CanvasEmb: Learning Layout Representation with Large-scale Pre-training for Graphic Design. Yuxi Xie, Danqing Huang, Jinpeng Wang, Chin-Yew Lin |
| 2021 | Capsule-based Object Tracking with Natural Language Specification. Ding Ma, Xiangqian Wu |
| 2021 | Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence. Weidong Chen, Guorong Li, Xinfeng Zhang, Hongyang Yu, Shuhui Wang, Qingming Huang |
| 2021 | CausalRec: Causal Inference for Visual Debiasing in Visually-Aware Recommendation. Ruihong Qiu, Sen Wang, Zhi Chen, Hongzhi Yin, Zi Huang |
| 2021 | ChartPointFlow for Topology-Aware 3D Point Cloud Generation. Takumi Kimura, Takashi Matsubara, Kuniaki Uehara |
| 2021 | Chinese Character Inpainting with Contextual Semantic Constraints. Jiahao Wang, Gang Pan, Di Sun, Jiawan Zhang |
| 2021 | Cluster and Scatter: A Multi-grained Active Semi-supervised Learning Framework for Scalable Person Re-identification. Bingyu Hu, Zheng-Jun Zha, Jiawei Liu, Xierong Zhu, Hongtao Xie |
| 2021 | Co-Transport for Class-Incremental Learning. Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan |
| 2021 | Co-learning: Learning from Noisy Labels with Self-supervision. Cheng Tan, Jun Xia, Lirong Wu, Stan Z. Li |
| 2021 | CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising. Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei |
| 2021 | CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation. Minha Kim, Shahroz Tariq, Simon S. Woo |
| 2021 | Coarse to Fine: Domain Adaptive Crowd Counting via Adversarial Scoring Network. Zhikang Zou, Xiaoye Qu, Pan Zhou, Shuangjie Xu, Xiaoqing Ye, Wenhao Wu, Jin Ye |
| 2021 | Collocation and Try-on Network: Whether an Outfit is Compatible. Na Zheng, Xuemeng Song, Qingying Niu, Xue Dong, Yibing Zhan, Liqiang Nie |
| 2021 | Combining Attention with Flow for Person Image Synthesis. Yurui Ren, Yubo Wu, Thomas H. Li, Shan Liu, Ge Li |
| 2021 | Community Generated VR Painting using Eye Gaze. Mu Mu, Murtada Dohan |
| 2021 | Complementary Factorization towards Outfit Compatibility Modeling. Tianyu Su, Xuemeng Song, Na Zheng, Weili Guan, Yan Li, Liqiang Nie |
| 2021 | Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection. Zhirui Zhao, Changqun Xia, Chenxi Xie, Jia Li |
| 2021 | Conceptual and Syntactical Cross-modal Alignment with Cross-level Consistency for Image-Text Matching. Pengpeng Zeng, Lianli Gao, Xinyu Lyu, Shuaiqi Jing, Jingkuan Song |
| 2021 | Conditional Directed Graph Convolution for 3D Human Pose Estimation. Wenbo Hu, Changgong Zhang, Fangneng Zhan, Lei Zhang, Tien-Tsin Wong |
| 2021 | Consistency-Constancy Bi-Knowledge Learning for Pedestrian Detection in Night Surveillance. Xiao Wang, Zheng Wang, Wu Liu, Xin Xu, Jing Chen, Chia-Wen Lin |
| 2021 | Constrained Graphic Layout Generation via Latent Optimization. Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi |
| 2021 | Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model. Shuangping Huang, Yu Luo, Zhenzhou Zhuang, Jin-Gang Yu, Mengchao He, Yongpan Wang |
| 2021 | Contrastive Disentangled Meta-Learning for Signer-Independent Sign Language Translation. Tao Jin, Zhou Zhao |
| 2021 | Contrastive Learning for Cold-Start Recommendation. Yinwei Wei, Xiang Wang, Qi Li, Liqiang Nie, Yan Li, Xuanping Li, Tat-Seng Chua |
| 2021 | Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection. Xinyang Feng, Dongjin Song, Yuncong Chen, Zhengzhang Chen, Jingchao Ni, Haifeng Chen |
| 2021 | Counterfactual Debiasing Inference for Compositional Action Recognition. Pengzhan Sun, Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan |
| 2021 | Cross Chest Graph for Disease Diagnosis with Structural Relational Reasoning. Gangming Zhao |
| 2021 | Cross Modal Compression: Towards Human-comprehensible Semantic Compression. Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Wen Gao |
| 2021 | Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes. Wenhang Ge, Chunyan Pan, Ancong Wu, Hongwei Zheng, Wei-Shi Zheng |
| 2021 | Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment. Paul Pu Liang, Peter Wu, Liu Ziyin, Louis-Philippe Morency, Ruslan Salakhutdinov |
| 2021 | Cross-Modal Recipe Embeddings by Disentangling Recipe Contents and Dish Styles. Yu Sugiyama, Keiji Yanai |
| 2021 | Cross-View Exocentric to Egocentric Video Synthesis. Gaowen Liu, Hao Tang, Hugo Latapie, Jason J. Corso, Yan Yan |
| 2021 | Cross-View Representation Learning for Multi-View Logo Classification with Information Bottleneck. Jing Wang, Yuanjie Zheng, Jingqi Song, Sujuan Hou |
| 2021 | Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization. Fa-Ting Hong, Jia-Chang Feng, Dan Xu, Ying Shan, Wei-Shi Zheng |
| 2021 | Cross-modal Joint Prediction and Alignment for Composed Query Image Retrieval. Yuchen Yang, Min Wang, Wengang Zhou, Houqiang Li |
| 2021 | Cross-modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Subspace Learning. Ricardo Guerrero, Hai Xuan Pham, Vladimir Pavlovic |
| 2021 | Cross-modal Self-Supervised Learning for Lip Reading: When Contrastive Learning meets Adversarial Training. Changchong Sheng, Matti Pietikäinen, Qi Tian, Li Liu |
| 2021 | Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection. Chen Zhang, Runmin Cong, Qinwei Lin, Lin Ma, Feng Li, Yao Zhao, Sam Kwong |
| 2021 | Curriculum-Based Meta-learning. Ji Zhang, Jingkuan Song, Yazhou Yao, Lianli Gao |
| 2021 | Cut-Thumbnail: A Novel Data Augmentation for Convolutional Neural Network. Tianshu Xie, Xuan Cheng, Xiaomin Wang, Minghui Liu, Jiali Deng, Tao Zhou, Ming Liu |
| 2021 | Cycle-Consistent Inverse GAN for Text-to-Image Synthesis. Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao |
| 2021 | DAWN: Dynamic Adversarial Watermarking of Neural Networks. Sebastian Szyller, Buse Gul Atli, Samuel Marchal, N. Asokan |
| 2021 | DC-GNet: Deep Mesh Relation Capturing Graph Convolution Network for 3D Human Shape Reconstruction. Shihao Zhou, Mengxi Jiang, Shanshan Cai, Yunqi Lei |
| 2021 | DEPA: Self-Supervised Audio Embedding for Depression Detection. Pingyue Zhang, Mengyue Wu, Heinrich Dinkel, Kai Yu |
| 2021 | DFR-Net: A Novel Multi-Task Learning Network for Real-Time Multi-Instrument Segmentation. Yan-Jie Zhou, Shi-Qi Liu, Xiao-Liang Xie, Zeng-Guang Hou |
| 2021 | DLA-Net for FG-SBIR: Dynamic Local Aligned Network for Fine-Grained Sketch-Based Image Retrieval. Jiaqing Xu, Haifeng Sun, Qi Qi, Jingyu Wang, Ce Ge, Lejian Zhang, Jianxin Liao |
| 2021 | DPT: Deformable Patch-based Transformer for Visual Recognition. Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang |
| 2021 | DRDF: Determining the Importance of Different Multimodal Information with Dual-Router Dynamic Framework. Haiwen Hong, Xuan Jin, Yin Zhang, Yunqing Hu, Jingfeng Zhang, Yuan He, Hui Xue |
| 2021 | DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning. Wenhao Wu, Yuxiang Zhao, Yanwu Xu, Xiao Tan, Dongliang He, Zhikang Zou, Jin Ye, Yingying Li, Mingde Yao, Zichao Dong, Yifeng Shi |
| 2021 | DSP: Dual Soft-Paste for Unsupervised Domain Adaptive Semantic Segmentation. Li Gao, Jing Zhang, Lefei Zhang, Dacheng Tao |
| 2021 | DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval. Aichun Zhu, Zijie Wang, Yifeng Li, Xili Wan, Jing Jin, Tian Wang, Fangqiang Hu, Gang Hua |
| 2021 | Data-Free Ensemble Knowledge Distillation for Privacy-conscious Multimedia Model Compression. Zhiwei Hao, Yong Luo, Han Hu, Jianping An, Yonggang Wen |
| 2021 | Database-adaptive Re-ranking for Enhancing Cross-modal Image Retrieval. Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama |
| 2021 | Deadline and Priority-aware Congestion Control for Delay-sensitive Multimedia Streaming. Chao Zhou, Wenjun Wu, Dan Yang, Tianchi Huang, Liang Guo, Bing Yu |
| 2021 | Deconfounded and Explainable Interactive Vision-Language Retrieval of Complex Scenes. Junda Wu, Tong Yu, Shuai Li |
| 2021 | Decoupled IoU Regression for Object Detection. Yan Gao, Qimeng Wang, Xu Tang, Haochen Wang, Fei Ding, Jing Li, Yao Hu |
| 2021 | Deep Clustering based on Bi-Space Association Learning. Hao Huang, Shinjae Yoo, Chenxiao Xu |
| 2021 | Deep Human Dynamics Prior. Qiongjie Cui, Huaijiang Sun, Yue Kong, Xiaoning Sun |
| 2021 | Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter. Cheng Chen, Jiayin Cai, Yao Hu, Xu Tang, Xinggang Wang, Chun Yuan, Xiang Bai, Song Bai |
| 2021 | Deep Learning for Visual Data Compression. Guo Lu, Ren Yang, Shenlong Wang, Shan Liu, Radu Timofte |
| 2021 | Deep Marginal Fisher Analysis based CNN for Image Representation and Classification. Xun Cai, Jiajing Chai, Yanbo Gao, Shuai Li, Bo Zhu |
| 2021 | Deep Neural Network Retrieval. Nan Zhong, Zhenxing Qian, Xinpeng Zhang |
| 2021 | Deep Reasoning Network for Few-shot Semantic Segmentation. Yunzhi Zhuge, Chunhua Shen |
| 2021 | Deep Self-Supervised t-SNE for Multi-modal Subspace Clustering. Qianqian Wang, Wei Xia, Zhiqiang Tao, Quanxue Gao, Xiaochun Cao |
| 2021 | Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment. Yuxing Wang, Yawen Lu, Zhihua Xie, Guoyu Lu |
| 2021 | DeepGame: Efficient Video Encoding for Cloud Gaming. Omar Mossad, Khaled Diab, Ihab Amer, Mohamed Hefeeda |
| 2021 | DehazeFlow: Multi-scale Conditional Flow Network for Single Image Dehazing. Hongyu Li, Jia Li, Dong Zhao, Long Xu |
| 2021 | Delving into Deep Image Prior for Adversarial Defense: A Novel Reconstruction-based Defense Framework. Li Ding, Yongwei Wang, Xin Ding, Kaiwen Yuan, Ping Wang, Hua Huang, Z. Jane Wang |
| 2021 | Demystifying Commercial Video Conferencing Applications. Insoo Lee, Jinsung Lee, Kyunghan Lee, Dirk Grunwald, Sangtae Ha |
| 2021 | Dense Contrastive Visual-Linguistic Pretraining. Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su |
| 2021 | Dense Semantic Contrast for Self-Supervised Visual Representation Learning. Xiaoni Li, Yu Zhou, Yifei Zhang, Aoting Zhang, Wei Wang, Ning Jiang, Haiying Wu, Weiping Wang |
| 2021 | Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection. Wenbo Zhang, Ge-Peng Ji, Zhuo Wang, Keren Fu, Qijun Zhao |
| 2021 | Differentiated Learning for Multi-Modal Domain Adaptation. Jianming Lv, Kaijie Liu, Shengfeng He |
| 2021 | Diffusing the Liveness Cues for Face Anti-spoofing. Sheng Li, Xun Zhu, Guorui Feng, Xinpeng Zhang, Zhenxing Qian |
| 2021 | Digital Human in an Integrated Physical-Digital World (IPhD). Zhengyou Zhang |
| 2021 | Direction Relation Transformer for Image Captioning. Zeliang Song, Xiaofei Zhou, Linhua Dong, Jianlong Tan, Li Guo |
| 2021 | Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations. Guanyue Li, Yi Liu, Xiwen Wei, Yang Zhang, Si Wu, Yong Xu, Hau-San Wong |
| 2021 | Discriminative Latent Semantic Graph for Video Captioning. Yang Bai, Junyan Wang, Yang Long, Bingzhang Hu, Yang Song, Maurice Pagnucco, Yu Guan |
| 2021 | Discriminator-free Generative Adversarial Attack. Shaohao Lu, Yuqiao Xian, Ke Yan, Yi Hu, Xing Sun, Xiaowei Guo, Feiyue Huang, Wei-Shi Zheng |
| 2021 | Disentangle Your Dense Object Detector. Zehui Chen, Chenhongyi Yang, Qiaofei Li, Feng Zhao, Zheng-Jun Zha, Feng Wu |
| 2021 | Disentangled Representation Learning and Enhancement Network for Single Image De-Raining. Guoqing Wang, Changming Sun, Xing Xu, Jingjing Li, Zheng Wang, Zeyu Ma |
| 2021 | Disentangling Hate in Online Memes. Roy Ka-Wei Lee, Rui Cao, Ziqing Fan, Jing Jiang, Wen-Haw Chong |
| 2021 | Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos Understanding. Avijit Shah, Topojoy Biswas, Sathish Ramadoss, Deven Santosh Shah |
| 2021 | Distributed Attention for Grounded Image Captioning. Nenglun Chen, Xingjia Pan, Runnan Chen, Lei Yang, Zhiwen Lin, Yuqiang Ren, Haolei Yuan, Xiaowei Guo, Feiyue Huang, Wenping Wang |
| 2021 | Diverse Image Inpainting with Bidirectional and Autoregressive Transformers. Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jianxiong Pan, Kaiwen Cui, Shijian Lu, Feiying Ma, Xuansong Xie, Chunyan Miao |
| 2021 | Diverse Multimedia Layout Generation with Multi Choice Learning. David D. Nguyen, Surya Nepal, Salil S. Kanhere |
| 2021 | Do We Really Need Frame-by-Frame Annotation Datasets for Object Tracking? Lei Hu, Shaoli Huang, Shilei Wang, Wei Liu, Jifeng Ning |
| 2021 | Do you see what I see?: Large-scale Learning from Multimodal Videos. Cordelia Schmid |
| 2021 | DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction. Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li |
| 2021 | Domain Adaptive Semantic Segmentation without Source Data. Fuming You, Jingjing Li, Lei Zhu, Zhi Chen, Zi Huang |
| 2021 | Domain Generalization via Feature Variation Decorrelation. Chang Liu, Lichen Wang, Kai Li, Yun Fu |
| 2021 | Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax. Peng Lu, Gao Huang, Hangyu Lin, Wenming Yang, Guodong Guo, Yanwei Fu |
| 2021 | Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning. Xinzhi Dong, Chengjiang Long, Wenju Xu, Chunxia Xiao |
| 2021 | Dual Learning Music Composition and Dance Choreography. Shuang Wu, Zhenguang Liu, Shijian Lu, Li Cheng |
| 2021 | Dynamic Knowledge Distillation with Cross-Modality Knowledge Transfer. Guangzhi Wang |
| 2021 | Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting. Qiangqiang Wu, Jia Wan, Antoni B. Chan |
| 2021 | D³Net: Dual-Branch Disturbance Disentangling Network for Facial Expression Recognition. Rongyun Mo, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang |
| 2021 | E2Net: Excitative-Expansile Learning for Weakly Supervised Object Localization. Zhiwei Chen, Liujuan Cao, Yunhang Shen, Feihong Lian, Yongjian Wu, Rongrong Ji |
| 2021 | EVRNet: Efficient Video Restoration on Edge Devices. Sachin Mehta, Amit Kumar, Fitsum A. Reda, Varun Nasery, Vikram Mulukutla, Rakesh Ranjan, Vikas Chandra |
| 2021 | Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices. Xindong Zhang, Hui Zeng, Lei Zhang |
| 2021 | Edit Like A Designer: Modeling Design Workflows for Unaligned Fashion Editing. Qiyu Dai, Shuai Yang, Wenjing Wang, Wei Xiang, Jiaying Liu |
| 2021 | Effective De-identification Generative Adversarial Network for Face Anonymization. Zhenzhong Kuang, Huigui Liu, Jun Yu, Aikui Tian, Lei Wang, Jianping Fan, Noboru Babaguchi |
| 2021 | Efficient Graph Deep Learning in TensorFlow with tf_geometric. Jun Hu, Shengsheng Qian, Quan Fang, Youze Wang, Quan Zhao, Huaiwen Zhang, Changsheng Xu |
| 2021 | Efficient Multi-Modal Fusion with Diversity Analysis. Shuhui Qu, Yan Kang, Janghwan Lee |
| 2021 | Efficient Reinforcement Learning Development with RLzoo. Zihan Ding, Tianyang Yu, Hongming Zhang, Yanhua Huang, Guo Li, Quancheng Guo, Luo Mai, Hao Dong |
| 2021 | Efficient Sparse Attacks on Videos using Reinforcement Learning. Huanqian Yan, Xingxing Wei |
| 2021 | Ego-Deliver: A Large-Scale Dataset For Egocentric Video Analysis. Haonan Qiu, Pan He, Shuchun Liu, Weiyuan Shao, Feiyun Zhang, Jiajun Wang, Liang He, Feng Wang |
| 2021 | Elastic Tactile Simulation Towards Tactile-Visual Perception. Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun, Chang Li |
| 2021 | Embracing the Dark Knowledge: Domain Generalization Using Regularized Knowledge Distillation. Yufei Wang, Haoliang Li, Lap-Pui Chau, Alex C. Kot |
| 2021 | End-to-End Video Object Detection with Spatial-Temporal Transformers. Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang |
| 2021 | End-to-end Boundary Exploration for Weakly-supervised Semantic Segmentation. Jianjun Chen, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yue Hu, Jianlong Tan |
| 2021 | End-to-end Quality of Experience Evaluation for HTTP Adaptive Streaming. Babak Taraghi |
| 2021 | Enhanced Invertible Encoding for Learned Image Compression. Yueqi Xie, Ka Leong Cheng, Qifeng Chen |
| 2021 | Enhancing Knowledge Tracing via Adversarial Training. Xiaopeng Guo, Zhijie Huang, Jie Gao, Mingyu Shang, Maojing Shu, Jun Sun |
| 2021 | Exploiting BERT for Multimodal Target Sentiment Classification through Input Space Translation. Zaid Khan, Yun Fu |
| 2021 | Exploiting Invariance of Mining Facial Landmarks. Jiangming Shi, Zixian Gao, Hao Liu, Zekuan Yu, Fengjun Li |
| 2021 | Exploring Contextual-Aware Representation and Linguistic-Diverse Expression for Visual Dialog. Xiangpeng Li, Lianli Gao, Lei Zhao, Jingkuan Song |
| 2021 | Exploring Gradient Flow Based Saliency for DNN Model Compression. Xinyu Liu, Baopu Li, Zhen Chen, Yixuan Yuan |
| 2021 | Exploring Graph-Structured Semantics for Cross-Modal Retrieval. Lei Zhang, Leiting Chen, Chuan Zhou, Fan Yang, Xin Li |
| 2021 | Exploring Logical Reasoning for Referring Expression Comprehension. Ying Cheng, Ruize Wang, Jiashuo Yu, Rui-Wei Zhao, Yuejie Zhang, Rui Feng |
| 2021 | Exploring Pathologist Knowledge for Automatic Assessment of Breast Cancer Metastases in Whole-slide Image. Liuan Wang, Li Sun, Mingjie Zhang, Huigang Zhang, Ping Wang, Rong Zhou, Jun Sun |
| 2021 | Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers. Wen Wang, Yang Cao, Jing Zhang, Fengxiang He, Zheng-Jun Zha, Yonggang Wen, Dacheng Tao |
| 2021 | Exploring the Quality of GAN Generated Images for Person Re-Identification. Yiqi Jiang, Weihua Chen, Xiuyu Sun, Xiaoyu Shi, Fan Wang, Hao Li |
| 2021 | Extending 6-DoF VR Experience Via Multi-Sphere Images Interpolation. Jisheng Li, Yuze He, Jinghui Jiao, Yubin Hu, Yuxing Han, Jiangtao Wen |
| 2021 | Extracting Useful Knowledge from Noisy Web Images via Data Purification for Fine-Grained Recognition. Chuanyi Zhang, Yazhou Yao, Xing Xu, Jie Shao, Jingkuan Song, Zechao Li, Zhenmin Tang |
| 2021 | FAMGAN: Fine-grained AUs Modulation based Generative Adversarial Network for Micro-Expression Generation. Yifan Xu, Sirui Zhao, Huaying Tang, Xinglong Mao, Tong Xu, Enhong Chen |
| 2021 | FME'21: 1st Workshop on Facial Micro-Expression: Advanced Techniques for Facial Expressions Generation and Spotting. Jingting Li, Moi Hoon Yap, Wen-Huang Cheng, John See, Xiaopeng Hong, Xiaobai Li, Su-Jing Wang |
| 2021 | FOCAS: Practical Video Super Resolution using Foveated Rendering. Lingdong Wang, Mohammad H. Hajiesmaili, Ramesh K. Sitaraman |
| 2021 | FTAFace: Context-enhanced Face Detector with Fine-grained Task Attention. Deyu Wang, Dongchao Wen, Wei Tao, Lingxiao Yin, Tse-Wei Chen, Tadayuki Ito, Kinya Osa, Masami Kato |
| 2021 | Face Hallucination via Split-Attention in Split-Attention Network. Tao Lu, Yuanzhi Wang, Yanduo Zhang, Yu Wang, Wei Liu, Zhongyuan Wang, Junjun Jiang |
| 2021 | Face-based Voice Conversion: Learning the Voice behind a Face. Hsiao-Han Lu, Shao-En Weng, Ya-Fan Yen, Hong-Han Shuai, Wen-Huang Cheng |
| 2021 | FaceX-Zoo: A PyTorch Toolbox for Face Recognition. Jun Wang, Yinglu Liu, Yibo Hu, Hailin Shi, Tao Mei |
| 2021 | Facial Action Unit-based Deep Learning Framework for Spotting Macro- and Micro-expressions in Long Video Sequences. Bo Yang, Jianming Wu, Zhiguang Zhou, Megumi Komiya, Koki Kishimoto, Jianfeng Xu, Keisuke Nonaka, Toshiharu Horiuchi, Satoshi Komorita, Gen Hattori, Sei Naito, Yasuhiro Takishima |
| 2021 | Facial Micro-Expression Generation based on Deep Motion Retargeting and Transfer Learning. Xinqi Fan, Ali Raza Shahid, Hong Yan |
| 2021 | Facial Prior Based First Order Motion Model for Micro-expression Generation. Yi Zhang, Youjun Zhao, Yuhang Wen, Zixuan Tang, Xinhua Xu, Mengyuan Liu |
| 2021 | Fake Gradient: A Security and Privacy Protection Framework for DNN-based Image Classification. Xianglong Feng, Yi Xie, Mengmei Ye, Zhongze Tang, Bo Yuan, Sheng Wei |
| 2021 | FakeTagger: Robust Safeguards against DeepFake Dissemination via Provenance Tracking. Run Wang, Felix Juefei-Xu, Meng Luo, Yang Liu, Lina Wang |
| 2021 | Fast Video Visual Quality and Resolution Improvement using SR-UNet. Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo |
| 2021 | Fast and Accurate Lane Detection via Frequency Domain Learning. Yulin He, Wei Chen, Zhengfa Liang, Dan Chen, Yusong Tan, Xin Luo, Chen Li, Yulan Guo |
| 2021 | Fast and Flexible Human Pose Estimation with HyperPose. Yixiao Guo, Jiawei Liu, Guo Li, Luo Mai, Hao Dong |
| 2021 | Fast, High-Quality Hierarchical Depth-Map Super-Resolution. Yiguo Qiao, Licheng Jiao, Wenbin Li, Christian Richardt, Darren Cosker |
| 2021 | Fast-forwarding, Rewinding, and Path Exploration in Interactive Branched Video Streaming. Albin Vogel, Erik Kronberg, Niklas Carlsson |
| 2021 | Faster-PPN: Towards Real-Time Semantic Segmentation with Dual Mutual Learning for Ultra-High Resolution Images. Bicheng Dai, Kaisheng Wu, Tong Wu, Kai Li, Yanyun Qu, Yuan Xie, Yun Fu |
| 2021 | Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization. Seogkyu Jeon, Kibeom Hong, Pilhyeon Lee, Jewook Lee, Hyeran Byun |
| 2021 | Feedback Network for Mutually Boosted Stereo Image Super-Resolution and Disparity Estimation. Qinyan Dai, Juncheng Li, Qiaosi Yi, Faming Fang, Guixu Zhang |
| 2021 | Few-Shot Multi-Agent Perception. Chenyou Fan, Junjie Hu, Jianwei Huang |
| 2021 | Few-shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning. Jiahao Wang, Yunhong Wang, Sheng Liu, Annan Li |
| 2021 | Few-shot Learning for Multi-Modality Tasks. Jie Chen, Qixiang Ye, Xiaoshan Yang, S. Kevin Zhou, Xiaopeng Hong, Li Zhang |
| 2021 | Few-shot Unsupervised Domain Adaptation with Image-to-Class Sparse Similarity Encoding. Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang |
| 2021 | Fine-Grained Language Identification in Scene Text Images. Yongrui Li, Shilian Wu, Jun Yu, Zengfu Wang |
| 2021 | Fine-grained Cross-modal Alignment Network for Text-Video Retrieval. Ning Han, Jingjing Chen, Guangyi Xiao, Hao Zhang, Yawen Zeng, Hao Chen |
| 2021 | Fingerspelling Recognition in the Wild with Fixed-Query based Visual Attention. Srinivas Kruthiventi S. S, George Jose, Nitya Tandon, Rajesh Roshan Biswal, Aashish Kumar |
| 2021 | Focal and Composed Vision-semantic Modeling for Visual Question Answering. Yudong Han, Yangyang Guo, Jianhua Yin, Meng Liu, Yupeng Hu, Liqiang Nie |
| 2021 | Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies. Xin Jin, Zhonglan Li, Ke Liu, Dongqing Zou, Xiaodong Li, Xingfan Zhu, Ziyin Zhou, Qilong Sun, Qingyu Liu |
| 2021 | FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network. Qiang Hou, Weiqing Min, Jing Wang, Sujuan Hou, Yuanjie Zheng, Shuqiang Jiang |
| 2021 | Former-DFER: Dynamic Facial Expression Recognition Transformer. Zengqun Zhao, Qingshan Liu |
| 2021 | From Image to Imuge: Immunized Image Generation. Qichao Ying, Zhenxing Qian, Hang Zhou, Haisheng Xu, Xinpeng Zhang, Siyi Li |
| 2021 | From Superficial to Deep: Language Bias driven Curriculum Learning for Visual Question Answering. Mingrui Lao, Yanming Guo, Yu Liu, Wei Chen, Nan Pu, Michael S. Lew |
| 2021 | From Synthetic to Real: Image Dehazing Collaborating with Unlabeled Real Data. Ye Liu, Lei Zhu, Shunda Pei, Huazhu Fu, Jing Qin, Qing Zhang, Liang Wan, Wei Feng |
| 2021 | From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder. Jiale Li, Hang Dai, Ling Shao, Yong Ding |
| 2021 | Fully Functional Image Manipulation Using Scene Graphs in A Bounding-Box Free Way. Sitong Su, Lianli Gao, Junchen Zhu, Jie Shao, Jingkuan Song |
| 2021 | Fully Quantized Image Super-Resolution Networks. Hu Wang, Peng Chen, Bohan Zhuang, Chunhua Shen |
| 2021 | GAMnet: Robust Feature Matching via Graph Adversarial-Matching Network. Bo Jiang, Pengfei Sun, Ziyan Zhang, Jin Tang, Bin Luo |
| 2021 | GAN-aided Serial Dependence Study in Medical Image Perception. Zhihang Ren |
| 2021 | GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation. Yongming Wen, Yiquan Fang, Junhao Cai, Kimwa Tung, Hui Cheng |
| 2021 | GCM-Net: Towards Effective Global Context Modeling for Image Inpainting. Huan Zheng, Zhao Zhang, Yang Wang, Zheng Zhang, Mingliang Xu, Yi Yang, Meng Wang |
| 2021 | GCNIllustrator: Illustrating the Effect of Hyperparameters on Graph Convolutional Networks. Ivona Najdenkoska, Jeroen den Boef, Thomas Schneider, Justo van der Werf, Reinier de Ridder, Fajar Fathurrahman, Marcel Worring |
| 2021 | GLM-Net: Global and Local Motion Estimation via Task-Oriented Encoder-Decoder Structure. Yuchen Yang, Ye Xiang, Shuaicheng Liu, Lifang Wu, Boxuan Zhao, Bing Zeng |
| 2021 | Game Theory-driven Rate Control for 360-Degree Video Coding. Tiesong Zhao, Jielian Lin, Yanjie Song, Xu Wang, Yuzhen Niu |
| 2021 | General Approximate Cross Validation for Model Selection: Supervised, Semi-supervised and Pairwise Learning. Bowei Zhu, Yong Liu |
| 2021 | Generally Boosting Few-Shot Learning with HandCrafted Features. Yi Zhang, Sheng Huang, Fengtao Zhou |
| 2021 | Generating Point Cloud from Single Image in The Few Shot Scenario. Yu Lin, Jinghui Guo, Yang Gao, Yi-Fan Li, Zhuoyi Wang, Latifur Khan |
| 2021 | Generative Adversarial Network for Text-to-Face Synthesis and Manipulation. Yutong Zhou |
| 2021 | Get The Best of the Three Worlds: Real-Time Neural Image Compression in a Non-GPU Environment. Zekun Zheng, Xiaodong Wang, Xinye Lin, Shaohe Lv |
| 2021 | Graph Convolutional Multi-modal Hashing for Flexible Multimedia Retrieval. Xu Lu, Lei Zhu, Li Liu, Liqiang Nie, Huaxiang Zhang |
| 2021 | Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings. Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem Wijnberg |
| 2021 | Group-Level Focus of Visual Attention for Improved Next Speaker Prediction. Chris Birmingham, Kalin Stefanov, Maja J. Mataric |
| 2021 | Group-based Distinctive Image Captioning with Memory Attention. Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan |
| 2021 | HANet: Hierarchical Alignment Networks for Video-Text Retrieval. Peng Wu, Xiangteng He, Mingqian Tang, Yiliang Lv, Jing Liu |
| 2021 | HAT: Hierarchical Aggregation Transformers for Person Re-identification. Guowen Zhang, Pingping Zhang, Jinqing Qi, Huchuan Lu |
| 2021 | HDA-Net: Horizontal Deformable Attention Network for Stereo Matching. Qi Zhang, Xuesong Zhang, Baoping Li, Yuzhong Chen, Anlong Ming |
| 2021 | HUMA'21: 2nd International Workshop on Human-centric Multimedia Analysis. Wu Liu, Xinchen Liu, Jingkuan Song, Dingwen Zhang, Wenbing Huang, Junbo Guo, John Smith |
| 2021 | Handling Difficult Labels for Multi-label Image Classification via Uncertainty Distillation. Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan |
| 2021 | Heraclitus's Forest: An Interactive Artwork for Oral History. Lin Wang, Zhonghao Lin, Wei Cai |
| 2021 | HetEmotionNet: Two-Stream Heterogeneous Graph Recurrent Neural Network for Multi-modal Emotion Recognition. Ziyu Jia, Youfang Lin, Jing Wang, Zhiyang Feng, Xiangheng Xie, Caijie Chen |
| 2021 | Heterogeneous Face Recognition with Attention-guided Feature Disentangling. Shanmin Yang, Xiao Yang, Yi Lin, Peng Cheng, Yi Zhang, Jianwei Zhang |
| 2021 | Heterogeneous Feature Fusion and Cross-modal Alignment for Composed Image Retrieval. Gangjian Zhang, Shikui Wei, Huaxin Pang, Yao Zhao |
| 2021 | Heuristic Depth Estimation with Progressive Depth Reconstruction and Confidence-Aware Loss. Jiehua Zhang, Liang Li, Chenggang Yan, Yaoqi Sun, Tao Shen, Jiyong Zhang, Zhan Wang |
| 2021 | Hierarchical Fusion for Practical Ghost-free High Dynamic Range Imaging. Pengfei Xiong, Yu Chen |
| 2021 | Hierarchical Multi-Task Learning for Diagram Question Answering with Multi-Modal Transformer. Zhaoquan Yuan, Xiao Peng, Xiao Wu, Changsheng Xu |
| 2021 | Hierarchical View Predictor: Unsupervised 3D Global Feature Learning through Hierarchical Prediction among Unordered Views. Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker |
| 2021 | How Video Super-Resolution and Frame Interpolation Mutually Benefit. Chengcheng Zhou, Zongqing Lu, Linge Li, Qiangyu Yan, Jing-Hao Xue |
| 2021 | How does Color Constancy Affect Target Recognition and Instance Segmentation? Siyan Xue, Shaobing Gao, Minjie Tan, Zhen He, Liangtian He |
| 2021 | How to Learn a Domain-Adaptive Event Simulator? Daxin Gu, Jia Li, Yu Zhang, Yonghong Tian |
| 2021 | Human Attributes Prediction under Privacy-preserving Conditions. Anshu Singh, Shaojing Fan, Mohan S. Kankanhalli |
| 2021 | Hybrid Network Compression via Meta-Learning. Jianming Ye, Shiliang Zhang, Jingdong Wang |
| 2021 | Hybrid Reasoning Network for Video-based Commonsense Captioning. Weijiang Yu, Jian Liang, Lei Ji, Lu Li, Yuejian Fang, Nong Xiao, Nan Duan |
| 2021 | I Know Your Keyboard Input: A Robust Keystroke Eavesdropper Based-on Acoustic Signals. Jia-Xuan Bai, Bin Liu, Luchuan Song |
| 2021 | I2V-GAN: Unpaired Infrared-to-Visible Video Translation. Shuang Li, Bingfeng Han, Zhenjie Yu, Chi Harold Liu, Kai Chen, Shuigen Wang |
| 2021 | ION: Instance-level Object Navigation. Weijie Li, Xinhang Song, Yubing Bai, Sixian Zhang, Shuqiang Jiang |
| 2021 | Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation. Jingzhi Li, Lutong Han, Ruoyu Chen, Hua Zhang, Bing Han, Lili Wang, Xiaochun Cao |
| 2021 | Identity-aware Graph Memory Network for Action Detection. Jingcheng Ni, Jie Qin, Di Huang |
| 2021 | Image Quality Assessment in the Modern Age. Kede Ma, Yuming Fang |
| 2021 | Image Quality Caption with Attentive and Recurrent Semantic Attractor Network. Wen Yang, Jinjian Wu, Leida Li, Weisheng Dong, Guangming Shi |
| 2021 | Image Re-composition via Regional Content-Style Decoupling. Rong Zhang, Wei Li, Yiqun Zhang, Hong Zhang, Jinhui Yu, Ruigang Yang, Weiwei Xu |
| 2021 | Image Search with Text Feedback by Deep Hierarchical Attention Mutual Information Maximization. Chunbin Gu, Jiajun Bu, Zhen Zhang, Zhi Yu, Dongfang Ma, Wei Wang |
| 2021 | Image Style Transfer with Generative Adversarial Networks. Ru Li |
| 2021 | Imbalanced Source-free Domain Adaptation. Xinhao Li, Jingjing Li, Lei Zhu, Guoqing Wang, Zi Huang |
| 2021 | Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis. Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng |
| 2021 | Imitative Learning for Multi-Person Action Forecasting. Yu-Ke Li, Pin Wang, Mang Ye, Ching-Yao Chan |
| 2021 | Implicit Feature Refinement for Instance Segmentation. Lufan Ma, Tiancai Wang, Bin Dong, Jiangpeng Yan, Xiu Li, Xiangyu Zhang |
| 2021 | Implicit Feedbacks are Not Always Favorable: Iterative Relabeled One-Class Collaborative Filtering against Noisy Interactions. Zitai Wang, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang |
| 2021 | Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues. Peng Qi, Juan Cao, Xirong Li, Huan Liu, Qiang Sheng, Xiaoyue Mi, Qin He, Yongbiao Lv, Chenyang Guo, Yingchao Yu |
| 2021 | Improving Pedestrian Detection from a Long-tailed Domain Perspective. Mengyuan Ding, Shanshan Zhang, Jian Yang |
| 2021 | Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation. Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao |
| 2021 | Improving Weakly Supervised Object Localization via Causal Intervention. Feifei Shao, Yawei Luo, Li Zhang, Lu Ye, Siliang Tang, Yi Yang, Jun Xiao |
| 2021 | Inferring the Importance of Product Appearance with Semi-supervised Multi-modal Enhancement: A Step Towards the Screenless Retailing. Yongshun Gong, Jinfeng Yi, Dongdong Chen, Jian Zhang, Jiayu Zhou, Zhihua Zhou |
| 2021 | Information-Growth Attention Network for Image Super-Resolution. Zhuangzi Li, Ge Li, Thomas H. Li, Shan Liu, Wei Gao |
| 2021 | Informative Class-Conditioned Feature Alignment for Unsupervised Domain Adaptation. Wanxia Deng, Yawen Cui, Zhen Liu, Gangyao Kuang, Dewen Hu, Matti Pietikäinen, Li Liu |
| 2021 | InsPose: Instance-Aware Networks for Single-Stage Multi-Person Pose Estimation. Dahu Shi, Xing Wei, Xiaodong Yu, Wenming Tan, Ye Ren, Shiliang Pu |
| 2021 | Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation. Jiahui Li, Kun Kuang, Lin Li, Long Chen, Songyang Zhang, Jian Shao, Jun Xiao |
| 2021 | Integrating Semantic and Temporal Relationships in Facial Action Unit Detection. Zhihua Li, Xiang Deng, Xiaotian Li, Lijun Yin |
| 2021 | InterBN: Channel Fusion for Adversarial Unsupervised Domain Adaptation. Mengzhu Wang, Wei Wang, Baopu Li, Xiang Zhang, Long Lan, Huibin Tan, Tianyi Liang, Wei Yu, Zhigang Luo |
| 2021 | Interpolation Variable Rate Image Compression. Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Dongyang Li, Hao Li |
| 2021 | Interpreting Super-Resolution CNNs for Sub-Pixel Motion Compensation in Video Coding. Luka Murn, Alan F. Smeaton, Marta Mrak |
| 2021 | Interventional Video Relation Detection. Yicong Li, Xun Yang, Xindi Shang, Tat-Seng Chua |
| 2021 | Intrinsic Temporal Regularization for High-resolution Human Video Synthesis. Lingbo Yang, Zhanning Gao, Siwei Ma, Wen Gao |
| 2021 | Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection. Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li |
| 2021 | Is Visual Context Really Helpful for Knowledge Graph? A Representation Learning Perspective. Meng Wang, Sen Wang, Han Yang, Zheng Zhang, Xi Chen, Guilin Qi |
| 2021 | JDMAN: Joint Discriminative and Mutual Adaptation Networks for Cross-Domain Facial Expression Recognition. Yingjian Li, Yingnan Gao, Bingzhi Chen, Zheng Zhang, Lei Zhu, Guangming Lu |
| 2021 | JPGNet: Joint Predictive Filtering and Generative Network for Image Inpainting. Qing Guo, Xiaoguang Li, Felix Juefei-Xu, Hongkai Yu, Yang Liu, Song Wang |
| 2021 | Joint Implicit Image Function for Guided Depth Super-Resolution. Jiaxiang Tang, Xiaokang Chen, Gang Zeng |
| 2021 | Joint Learning for Relationship and Interaction Analysis in Video with Multimodal Feature Fusion. Beibei Zhang, Fan Yu, Yanxin Gao, Tongwei Ren, Gangshan Wu |
| 2021 | Joint Optimization in Edge-Cloud Continuum for Federated Unsupervised Person Re-identification. Weiming Zhuang, Yonggang Wen, Shuai Zhang |
| 2021 | Joint-teaching: Learning to Refine Knowledge for Resource-constrained Unsupervised Cross-modal Retrieval. Peng-Fei Zhang, Jiasheng Duan, Zi Huang, Hongzhi Yin |
| 2021 | JokerGAN: Memory-Efficient Model for Handwritten Text Generation with Text Line Awareness. Jan Zdenek, Hideki Nakayama |
| 2021 | Kandinsky Mobile: Abstract Art-Inspired Interactive Visualization of Social Discussions on Mobile Devices. Castillo Clarence Fitzgerald Gumtang, Sourav S. Bhowmick |
| 2021 | Keyframe Extraction from Motion Capture Sequences with Graph based Deep Reinforcement Learning. Clinton Mo, Kun Hu, Shaohui Mei, Zebin Chen, Zhiyong Wang |
| 2021 | Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment. Gil Shapira, Noga Levy, Ishay Goldin, Roy Josef Jevnisek |
| 2021 | Knowledge Perceived Multi-modal Pretraining in E-commerce. Yushan Zhu, Huaixiao Zhao, Wen Zhang, Ganqiang Ye, Hui Chen, Ningyu Zhang, Huajun Chen |
| 2021 | Knowledge-Supervised Learning: Knowledge Consensus Constraints for Person Re-Identification. Li Wang, Baoyu Fan, Zhenhua Guo, Yaqian Zhao, Runze Zhang, Rengang Li, Weifeng Gong, Endong Wang |
| 2021 | L2RS: A Learning-to-Rescore Mechanism for Hybrid Speech Recognition. Yuanfeng Song, Di Jiang, Xuefang Zhao, Qian Xu, Raymond Chi-Wing Wong, Lixin Fan, Qiang Yang |
| 2021 | LSSNet: A Two-stream Convolutional Neural Network for Spotting Macro- and Micro-expression in Long Videos. Wang-Wang Yu, Jingwen Jiang, Yong-Jie Li |
| 2021 | LSTC: Boosting Atomic Action Detection with Long-Short-Term Context. Yuxi Li, Boshen Zhang, Jian Li, Yabiao Wang, Weiyao Lin, Chengjie Wang, Jilin Li, Feiyue Huang |
| 2021 | Large-scale Multi-Modality Pretrained Models: Applications and Experiences. Jingren Zhou |
| 2021 | Latent Memory-augmented Graph Transformer for Visual Storytelling. Mengshi Qi, Jie Qin, Di Huang, Zhiqiang Shen, Yi Yang, Jiebo Luo |
| 2021 | Learning Contextual Transformer Network for Image Inpainting. Ye Deng, Siqi Hui, Sanping Zhou, Deyu Meng, Jinjun Wang |
| 2021 | Learning Disentangled Factors from Paired Data in Cross-Modal Retrieval: An Implicit Identifiable VAE Approach. Minyoung Kim, Ricardo Guerrero, Vladimir Pavlovic |
| 2021 | Learning Fine-Grained Motion Embedding for Landscape Animation. Hongwei Xue, Bei Liu, Huan Yang, Jianlong Fu, Houqiang Li, Jiebo Luo |
| 2021 | Learning Hierarchal Channel Attention for Fine-grained Visual Classification. Xiang Guan, Guoqing Wang, Xing Xu, Yi Bin |
| 2021 | Learning Hierarchical Embedding for Video Instance Segmentation. Zheyun Qin, Xiankai Lu, Xiushan Nie, Xiantong Zhen, Yilong Yin |
| 2021 | Learning Human Motion Prediction via Stochastic Differential Equations. Kedi Lyu, Zhenguang Liu, Shuang Wu, Haipeng Chen, Xuhong Zhang, Yuyu Yin |
| 2021 | Learning Kinematic Formulas from Multiple View Videos. Liangchen Song, Sheng Liu, Celong Liu, Zhong Li, Yuqi Ding, Yi Xu, Junsong Yuan |
| 2021 | Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition. Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Yu Guan, Xuming He, Errui Ding |
| 2021 | Learning Multi-context Aware Location Representations from Large-scale Geotagged Images. Yifang Yin, Ying Zhang, Zhenguang Liu, Yuxuan Liang, Sheng Wang, Rajiv Ratn Shah, Roger Zimmermann |
| 2021 | Learning Regularizer for Monocular Depth Estimation with Adversarial Guidance. Guibao Shen, Yingkui Zhang, Jialu Li, Mingqiang Wei, Qiong Wang, Guangyong Chen, Pheng-Ann Heng |
| 2021 | Learning Sample-Specific Policies for Sequential Image Augmentation. Pu Li, Xiaobai Liu, Xiaohui Xie |
| 2021 | Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval. Chen Jiang, Kaiming Huang, Sifeng He, Xudong Yang, Wei Zhang, Xiaobo Zhang, Yuan Cheng, Lei Yang, Qing Wang, Furong Xu, Tan Pan, Wei Chu |
| 2021 | Learning Spatial-angular Fusion for Compressive Light Field Imaging in a Cycle-consistent Framework. Xianqiang Lyu, Zhiyu Zhu, Mantang Guo, Jing Jin, Junhui Hou, Huanqiang Zeng |
| 2021 | Learning Spatio-temporal Representation by Channel Aliasing Video Perception. Yiqi Lin, Jinpeng Wang, Manlin Zhang, Andy J. Ma |
| 2021 | Learning Structure Affinity for Video Depth Estimation. Yuanzhouhan Cao, Yidong Li, Haokui Zhang, Chao Ren, Yifan Liu |
| 2021 | Learning Transferrable and Interpretable Representations for Domain Generalization. Zhekai Du, Jingjing Li, Ke Lu, Lei Zhu, Zi Huang |
| 2021 | Learning Unified Embeddings for Recommendation via Meta-path Semantics. Qianxiu Hao, Qianqian Xu, Zhiyong Yang, Qingming Huang |
| 2021 | Learning What and When to Drop: Adaptive Multimodal and Contextual Dynamics for Emotion Recognition in Conversation. Feiyu Chen, Zhengxiao Sun, Deqiang Ouyang, Xueliang Liu, Jie Shao |
| 2021 | Learning to Compose Stylistic Calligraphy Artwork with Emotions. Shaozu Yuan, Ruixue Liu, Meng Chen, Baoyang Chen, Zhijie Qiu, Xiaodong He |
| 2021 | Learning to Decode Contextual Information for Efficient Contour Detection. Ruoxi Deng, Shengjun Liu, Jinxin Wang, Huibing Wang, Hanli Zhao, Xiaoqin Zhang |
| 2021 | Learning to Understand Traffic Signs. Yunfei Guo, Wei Feng, Fei Yin, Tao Xue, Shuqi Mei, Cheng-Lin Liu |
| 2021 | Legitimate Adversarial Patches: Evading Human Eyes and Detection Models in the Physical World. Jia Tan, Nan Ji, Haidong Xie, Xueshuang Xiang |
| 2021 | Lesion-Inspired Denoising Network: Connecting Medical Image Denoising and Lesion Detection. Kecheng Chen, Kun Long, Yazhou Ren, Jiayu Sun, Xiaorong Pu |
| 2021 | Lifting the Veil of Frequency in Joint Segmentation and Depth Estimation. Tianhao Fu, Yingying Li, Xiaoqing Ye, Xiao Tan, Hao Sun, Fumin Shen, Errui Ding |
| 2021 | LightFEC: Network Adaptive FEC with a Lightweight Deep-Learning Approach. Han Hu, Sheng Cheng, Xinggong Zhang, Zongming Guo |
| 2021 | Linking the Characters: Video-oriented Social Graph Generation via Hierarchical-cumulative GCN. Shiwei Wu, Joya Chen, Tong Xu, Liyi Chen, Lingfei Wu, Yao Hu, Enhong Chen |
| 2021 | Local Graph Convolutional Networks for Cross-Modal Hashing. Yudong Chen, Sen Wang, Jianglin Lu, Zhi Chen, Zheng Zhang, Zi Huang |
| 2021 | Locally Adaptive Structure and Texture Similarity for Image Quality Assessment. Keyan Ding, Yi Liu, Xueyi Zou, Shiqi Wang, Kede Ma |
| 2021 | Long Short-term Convolutional Transformer for No-Reference Video Quality Assessment. Junyong You |
| 2021 | Long-Range Feature Propagating for Natural Image Matting. Qinglin Liu, Haozhe Xie, Shengping Zhang, Bineng Zhong, Rongrong Ji |
| 2021 | Long-tailed Distribution Adaptation. Zhiliang Peng, Wei Huang, Zonghao Guo, Xiaosong Zhang, Jianbin Jiao, Qixiang Ye |
| 2021 | M3TR: Multi-modal Multi-label Recognition with Transformer. Jiawei Zhao, Yifan Zhao, Jia Li |
| 2021 | MBRS: Enhancing Robustness of DNN-based Watermarking by Mini-Batch of Real and Simulated JPEG Compression. Zhaoyang Jia, Han Fang, Weiming Zhang |
| 2021 | MCCN: Multimodal Coordinated Clustering Network for Large-Scale Cross-modal Retrieval. Zhixiong Zeng, Ying Sun, Wenji Mao |
| 2021 | MDMS: Music Data Matching System for Query Variant Retrieval. Rinita Roy, Ruben Mayer, Hans-Arno Jacobsen |
| 2021 | MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification. Yiming Wu, Xintian Wu, Xi Li, Jian Tian |
| 2021 | MHFC: Multi-Head Feature Collaboration for Few-Shot Learning. Shuai Shao, Lei Xing, Yan Wang, Rui Xu, Chunyan Zhao, Yanjiang Wang, Baodi Liu |
| 2021 | MM '21: ACM Multimedia Conference, Virtual Event, China, October 20 - 24, 2021. Heng Tao Shen, Yueting Zhuang, John R. Smith, Yang Yang, Pablo César, Florian Metze, Balakrishnan Prabhakaran |
| 2021 | MM-Flow: Multi-modal Flow Network for Point Cloud Completion. Yiqiang Zhao, Yiyao Zhou, Rui Chen, Bin Hu, Xiding Ai |
| 2021 | MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques. Sihan Chen, Xinxin Zhu, Dongze Hao, Wei Liu, Jiawei Liu, Zijia Zhao, Longteng Guo, Jing Liu |
| 2021 | MMFashion: An Open-Source Toolbox for Visual Fashion Analysis. Xin Liu, Jiancheng Li, Jiaqi Wang, Ziwei Liu |
| 2021 | MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding. Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin |
| 2021 | MMSports'21: 4th International Workshop on Multimedia Content Analysis in Sports. Rainer Lienhart, Thomas B. Moeslund, Hideo Saito |
| 2021 | MS-GraphSIM: Inferring Point Cloud Quality via Multiscale Graph Similarity. Yujie Zhang, Qi Yang, Yiling Xu |
| 2021 | MSO: Multi-Feature Space Joint Optimization Network for RGB-Infrared Person Re-Identification. Yajun Gao, Tengfei Liang, Yi Jin, Xiaoyan Gu, Wu Liu, Yidong Li, Congyan Lang |
| 2021 | MULL'21: First International Workshop on Multimedia Understanding with Less Labeling. Xiu-Shen Wei, Jufeng Yang, Han-Jia Ye, Jian Yang |
| 2021 | MV-TON: Memory-based Video Virtual Try-on network. Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu |
| 2021 | MageAdd: Real-Time Interaction Simulation for Scene Synthesis. Shao-Kui Zhang, Yi-Xiao Li, Yu He, Yong-Liang Yang, Song-Hai Zhang |
| 2021 | Mask and Predict: Multi-step Reasoning for Scene Graph Generation. Hongshuo Tian, Ning Xu, An-An Liu, Chenggang Yan, Zhendong Mao, Quan Zhang, Yongdong Zhang |
| 2021 | Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection. Xugong Qin, Yu Zhou, Youhui Guo, Dayan Wu, Zhihong Tian, Ning Jiang, Hongbin Wang, Weiping Wang |
| 2021 | Memory-Augmented Deep Unfolding Network for Compressive Sensing. Jiechong Song, Bin Chen, Jian Zhang |
| 2021 | Merging Multiple Template Matching Predictions in Intra Coding with Attentive Convolutional Neural Network. Qijun Wang, Guodong Zheng |
| 2021 | MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation. Rishabh Baghel, Abhishek Trivedi, Tejas Ravichandran, Ravi Kiran Sarvadevabhatla |
| 2021 | MeshNet++: A Network with a Face. Vinit Veerendraveer Singh, Shivanand Venkanna Sheshappanavar, Chandra Kambhamettu |
| 2021 | Meta Self-Paced Learning for Cross-Modal Matching. Jiwei Wei, Xing Xu, Zheng Wang, Guoqing Wang |
| 2021 | Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data. Yuqian Fu, Yanwei Fu, Yu-Gang Jiang |
| 2021 | Metaverse for Social Good: A University Campus Prototype. Haihan Duan, Jiaye Li, Sizheng Fan, Zhonghao Lin, Xiao Wu, Wei Cai |
| 2021 | Metric Learning for Anti-Compression Facial Forgery Detection. Shenhao Cao, Qin Zou, Xiuqing Mao, Dengpan Ye, Zhongyuan Wang |
| 2021 | Milliseconds Color Stippling. Lei Ma, Jian Shi, Yanyun Chen |
| 2021 | Mining Latent Structures for Multimedia Recommendation. Jinghao Zhang, Yanqiao Zhu, Qiang Liu, Shu Wu, Shuhui Wang, Liang Wang |
| 2021 | Missing Data Imputation for Solar Yield Prediction using Temporal Multi-Modal Variational Auto-Encoder. Meng Shen, Huaizheng Zhang, Yixin Cao, Fan Yang, Yonggang Wen |
| 2021 | Mitigating Generation Shifts for Generalized Zero-Shot Learning. Zhi Chen, Yadan Luo, Sen Wang, Ruihong Qiu, Jingjing Li, Zi Huang |
| 2021 | Mix-order Attention Networks for Image Restoration. Tao Dai, Yalei Lv, Bin Chen, Zhi Wang, Zexuan Zhu, Shu-Tao Xia |
| 2021 | Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning. Yukun Su, Guosheng Lin, Ruizhou Sun, Yun Hao, Qingyao Wu |
| 2021 | Motion Prediction via Joint Dependency Modeling in Phase Space. Pengxiang Su, Zhenguang Liu, Shuang Wu, Lei Zhu, Yifang Yin, Xuanjing Shen |
| 2021 | Move As You Like: Image Animation in E-Commerce Scenario. Borun Xu, Biao Wang, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan |
| 2021 | MovieREP: A New Movie Reproduction Framework for Film Soundtrack. Ruiqi Wang, Long Ye, Qin Zhang |
| 2021 | MuCAI'21: 2nd ACM Multimedia Workshop on Multimodal Conversational AI. João Magalhães, Alexander G. Hauptmann, Ricardo Gamelas Sousa, Carlos Santiago |
| 2021 | MuSe 2021 Challenge: Multimodal Emotion, Sentiment, Physiological-Emotion, and Stress Detection. Lukas Stappen, Eva-Maria Meßner, Erik Cambria, Guoying Zhao, Björn W. Schuller |
| 2021 | Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning. Xi Zhang, Feifei Zhang, Changsheng Xu |
| 2021 | Multi-Level Visual Representation with Semantic-Reinforced Learning for Video Captioning. Chengbo Dong, Xinru Chen, Aozhu Chen, Fan Hu, Zihan Wang, Xirong Li |
| 2021 | Multi-Modal Multi-Instance Learning for Retinal Disease Recognition. Xirong Li, Yang Zhou, Jie Wang, Hailan Lin, Jianchun Zhao, Dayong Ding, Weihong Yu, Youxin Chen |
| 2021 | Multi-Modal Sarcasm Detection with Interactive In-Modal and Cross-Modal Graphs. Bin Liang, Chenwei Lou, Xiang Li, Lin Gui, Min Yang, Ruifeng Xu |
| 2021 | Multi-Perspective Video Captioning. Yi Bin, Xindi Shang, Bo Peng, Yujuan Ding, Tat-Seng Chua |
| 2021 | Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao |
| 2021 | Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation. Xiaoqi Zhao, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu |
| 2021 | Multi-branch Channel-wise Enhancement Network for Fine-grained Visual Recognition. Guangjun Li, Yongxiong Wang, Fengting Zhu |
| 2021 | Multi-caption Text-to-Face Synthesis: Dataset and Algorithm. Jianxin Sun, Qi Li, Weining Wang, Jian Zhao, Zhenan Sun |
| 2021 | Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation. Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang |
| 2021 | Multi-label Pattern Image Retrieval via Attention Mechanism Driven Graph Convolutional Network. Ying Li, Hongwei Zhou, Yeyu Yin, Jiaquan Gao |
| 2021 | Multi-modal Representation Learning for Video Advertisement Content Structuring. Daya Guo, Zhaoyang Zeng |
| 2021 | Multi-view 3D Smooth Human Pose Estimation based on Heatmap Filtering and Spatio-temporal Information. Zehai Niu, Ke Lu, Jian Xue, Haifeng Ma, Runchen Wei |
| 2021 | Multi-view Clustering via Deep Matrix Factorization and Partition Alignment. Chen Zhang, Siwei Wang, Jiyuan Liu, Sihang Zhou, Pei Zhang, Xinwang Liu, En Zhu, Changwang Zhang |
| 2021 | MultiMediate: Multi-modal Group Behaviour Analysis for Artificial Mediation. Philipp Müller, Michael Dietz, Dominik Schiller, Dominike Thomas, Guanhua Zhang, Patrick Gebhard, Elisabeth André, Andreas Bulling |
| 2021 | MultiModal Language Modelling on Knowledge Graphs for Deep Video Understanding. Vishal Anand, Raksha Ramesh, Boshen Jin, Ziyin Wang, Xiaoxiao Lei, Ching-Yung Lin |
| 2021 | Multifocal Attention-Based Cross-Scale Network for Image De-raining. Zheyu Zhang, Yurui Zhu, Xueyang Fu, Zhiwei Xiong, Zheng-Jun Zha, Feng Wu |
| 2021 | Multimedia Classifiers: Behind the Scenes. Manjunath Iyer |
| 2021 | Multimodal Asymmetric Dual Learning for Unsupervised Eyeglasses Removal. Qing Lin, Bo Yan, Weimin Tan |
| 2021 | Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations. Weili Guan, Haokun Wen, Xuemeng Song, Chung-Hsing Yeh, Xiaojun Chang, Liqiang Nie |
| 2021 | Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding. Haoyu Zhang, Meng Liu, Zan Gao, Xiaoqiang Lei, Yinglong Wang, Liqiang Nie |
| 2021 | Multimodal Entity Linking: A New Dataset and A Baseline. Jingru Gan, Jinchang Luo, Haiwei Wang, Shuhui Wang, Wei He, Qingming Huang |
| 2021 | Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation. Yi Huang, Xiaoshan Yang, Changsheng Xu |
| 2021 | Multimodal Relation Extraction with Efficient Graph Alignment. Changmeng Zheng, Junhao Feng, Ze Fu, Yi Cai, Qing Li, Tao Wang |
| 2021 | Multimodal Video Summarization via Time-Aware Transformers. Xindi Shang, Zehuan Yuan, Anran Wang, Changhu Wang |
| 2021 | Multiple Object Tracking by Trajectory Map Regression with Temporal Priors Embedding. Xingyu Wan, Sanping Zhou, Jinjun Wang, Rongye Meng |
| 2021 | Multiple Objects-Aware Visual Question Generation. Jiayuan Xie, Yi Cai, Qingbao Huang, Tao Wang |
| 2021 | Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation). Yunzhong Hou, Liang Zheng |
| 2021 | MusicBERT: A Self-supervised Learning of Music Representation. Hongyuan Zhu, Ye Niu, Di Fu, Hao Wang |
| 2021 | NJU MCG - Sensetime Team Submission to Pre-training for Video Understanding Challenge Track II. Liwei Jin, Haoyue Cheng, Su Xu, Wayne Wu, Limin Wang |
| 2021 | Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting. Xiaomeng Chu, Jiajun Deng, Yao Li, Zhenxun Yuan, Yanyong Zhang, Jianmin Ji, Yu Zhang |
| 2021 | Neighbor-view Enhanced Model for Vision and Language Navigation. Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan |
| 2021 | Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions. Guoxing Sun, Xin Chen, Yizhang Chen, Anqi Pang, Pei Lin, Yuheng Jiang, Lan Xu, Jingyi Yu, Jingya Wang |
| 2021 | Neural-based Rendering and Application. Peng Dai |
| 2021 | No-Reference Video Quality Assessment with Heterogeneous Knowledge Ensemble. Jinjian Wu, Yongxu Liu, Leida Li, Weisheng Dong, Guangming Shi |
| 2021 | Non-Linear Fusion for Self-Paced Multi-View Clustering. Zongmo Huang, Yazhou Ren, Xiaorong Pu, Lifang He |
| 2021 | Object Point Cloud Classification via Poly-Convolutional Architecture Search. Xuanxiang Lin, Ke Chen, Kui Jia |
| 2021 | Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification. Yike Wu, Bo Zhang, Gang Yu, Weixi Zhang, Bin Wang, Tao Chen, Jiayuan Fan |
| 2021 | Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection. Dong Jing, Shuo Zhang, Runmin Cong, Youfang Lin |
| 2021 | On-demand Action Detection System using Pose Information. Noboru Yoshida, Jianquan Liu |
| 2021 | Once and for All: Self-supervised Multi-modal Co-training on One-billion Videos at Alibaba. Lianghua Huang, Yu Liu, Xiangzeng Zhou, Ansheng You, Ming Li, Bin Wang, Yingya Zhang, Pan Pan, Yinghui Xu |
| 2021 | One-Stage Incomplete Multi-view Clustering via Late Fusion. Yi Zhang, Xinwang Liu, Siwei Wang, Jiyuan Liu, Sisi Dai, En Zhu |
| 2021 | One-Stage Visual Grounding via Semantic-Aware Feature Filter. Jiabo Ye, Xin Lin, Liang He, Dingbang Li, Qin Chen |
| 2021 | One-stage Context and Identity Hallucination Network. Yinglu Liu, Mingcan Xiang, Hailin Shi, Tao Mei |
| 2021 | Open Set Face Anti-Spoofing in Unseen Attacks. Xin Dong, Hao Liu, Weiwei Cai, Pengyuan Lv, Zekuan Yu |
| 2021 | OsGG-Net: One-step Graph Generation Network for Unbiased Head Pose Estimation. Shentong Mo, Xin Miao |
| 2021 | Out-of-distribution Generalization and Its Applications for Multimedia. Xin Wang, Peng Cui, Wenwu Zhu |
| 2021 | Overview of Tencent Multi-modal Ads Video Understanding. Zhenzhi Wang, Zhimin Li, Liyu Wu, Jiangfeng Xiong, Qinglin Lu |
| 2021 | PFFN: Progressive Feature Fusion Network for Lightweight Image Super-Resolution. Dongyang Zhang, Changyu Li, Ning Xie, Guoqing Wang, Jie Shao |
| 2021 | PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition. Zhi Qiao, Yu Zhou, Jin Wei, Wei Wang, Yuan Zhang, Ning Jiang, Hongbin Wang, Weiping Wang |
| 2021 | PRNet: A Progressive Recovery Network for Revealing Perceptually Encrypted Images. Tao Xiang, Ying Yang, Shangwei Guo, Hangcheng Liu, Hantao Liu |
| 2021 | PUGCQ: A Large Scale Dataset for Quality Assessment of Professional User-Generated Content. Guo Li, Baoliang Chen, Lingyu Zhu, Qingwen He, Hongfei Fan, Shiqi Wang |
| 2021 | Pairwise Emotional Relationship Recognition in Drama Videos: Dataset and Benchmark. Xun Gao, Yin Zhao, Jie Zhang, Longjun Cai |
| 2021 | Pairwise VLAD Interaction Network for Video Question Answering. Hui Wang, Dan Guo, Xian-Sheng Hua, Meng Wang |
| 2021 | Parametric Reshaping of Portraits in Videos. Xiangjun Tang, Wenxin Sun, Yong-Liang Yang, Xiaogang Jin |
| 2021 | Pareto Optimality for Fairness-constrained Collaborative Filtering. Qianxiu Hao, Qianqian Xu, Zhiyong Yang, Qingming Huang |
| 2021 | Partial Tubal Nuclear Norm Regularized Multi-view Learning. Yongyong Chen, Shuqin Wang, Chong Peng, Guangming Lu, Yicong Zhou |
| 2021 | Partially Fake it Till you Make It: Mixing Real and Fake Thermal Images for Improved Object Detection. Francesco Bongini, Lorenzo Berlincioni, Marco Bertini, Alberto Del Bimbo |
| 2021 | Perception-Oriented Stereo Image Super-Resolution. Chenxi Ma, Bo Yan, Weimin Tan, Xuhao Jiang |
| 2021 | Perceptual Quality Assessment of Internet Videos. Jiahua Xu, Jing Li, Xingguang Zhou, Wei Zhou, Baichao Wang, Zhibo Chen |
| 2021 | Personality Recognition by Modelling Person-specific Cognitive Processes using Graph Representation. Zilong Shao, Siyang Song, Shashank Jaiswal, Linlin Shen, Michel F. Valstar, Hatice Gunes |
| 2021 | Personalized Multi-modal Video Retrieval on Mobile Devices. Haotian Zhang, Allan D. Jepson, Iqbal Mohomed, Konstantinos G. Derpanis, Ran Zhang, Afsaneh Fazly |
| 2021 | Phoenix: Combining Highest-Profit First Scheduling and Responsive Congestion Control for Delay-sensitive Multimedia Transmission. Haozhe Li |
| 2021 | Pixel-level Intra-domain Adaptation for Semantic Segmentation. Zizheng Yan, Xianggang Yu, Yipeng Qin, Yushuang Wu, Xiaoguang Han, Shuguang Cui |
| 2021 | Pixel-wise Graph Attention Networks for Person Re-identification. Wenyu Zhang, Qing Ding, Jian Hu, Yi Ma, Mingzhe Lu |
| 2021 | Plenoptic Quality Assessment: The JPEG Pleno Experience. António M. G. Pinheiro |
| 2021 | Point Cloud Projection and Multi-Scale Feature Fusion Network Based Blind Quality Assessment for Colored Point Clouds. Wenxu Tao, Gangyi Jiang, Zhidi Jiang, Mei Yu |
| 2021 | Polar Ray: A Single-stage Angle-free Detector for Oriented Object Detection in Aerial Images. Shuai Liu, Lu Zhang, Shuai Hao, Huchuan Lu, You He |
| 2021 | Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification. Kecheng Zheng, Cuiling Lan, Wenjun Zeng, Jiawei Liu, Zhizheng Zhang, Zheng-Jun Zha |
| 2021 | Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification. Zhongxing Ma, Yifan Zhao, Jia Li |
| 2021 | Position-Augmented Transformers with Entity-Aligned Mesh for TextVQA. Xuanyu Zhang, Qing Yang |
| 2021 | Post2Story: Automatically Generating Storylines from Microblogging Platforms. Xujian Zhao, Chongwei Wang, Peiquan Jin, Hui Zhang, Chunming Yang, Bo Li |
| 2021 | Pre-training Graph Transformer with Multimodal Side Information for Recommendation. Yong Liu, Susen Yang, Chenyi Lei, Guoxin Wang, Haihong Tang, Juyong Zhang, Aixin Sun, Chunyan Miao |
| 2021 | Privacy-Preserving Portrait Matting. Jizhizi Li, Sihan Ma, Jing Zhang, Dacheng Tao |
| 2021 | Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training. Yuqing Song, Shizhe Chen, Qin Jin, Wei Luo, Jun Xie, Fei Huang |
| 2021 | Progressive Graph Attention Network for Video Question Answering. Liang Peng, Shuangji Yang, Yi Bin, Guoqing Wang |
| 2021 | Progressive Semantic Matching for Video-Text Retrieval. Hongying Liu, Ruyi Luo, Fanhua Shang, Mantang Niu, Yuanyuan Liu |
| 2021 | Progressive and Selective Fusion Network for High Dynamic Range Imaging. Qian Ye, Jun Xiao, Kin-Man Lam, Takayuki Okatani |
| 2021 | Pseudo Graph Convolutional Network for Vehicle ReID. Wen Qian, Zhiqun He, Silong Peng, Chen Chen, Wei Wu |
| 2021 | PyTorchVideo: A Deep Learning Library for Video Understanding. Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross B. Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, Christoph Feichtenhofer |
| 2021 | Q-Art Code: Generating Scanning-robust Art-style QR Codes by Deformable Convolution. Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu |
| 2021 | QoE Ready to Respond: A QoE-aware MEC Selection Scheme for DASH-based Adaptive Video Streaming to Mobile Users. Wanxin Shi, Qing Li, Ruishan Zhang, Gengbiao Shen, Yong Jiang, Zhenhui Yuan, Gabriel-Miro Muntean |
| 2021 | Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure. Yang Li, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Yue Wang |
| 2021 | Question-controlled Text-aware Image Captioning. Anwen Hu, Shizhe Chen, Qin Jin |
| 2021 | R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks. Yanyuan Qiao, Qi Chen, Chaorui Deng, Ning Ding, Yuankai Qi, Mingkui Tan, Xincheng Ren, Qi Wu |
| 2021 | RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition. Yunqing Hu, Xuan Jin, Yin Zhang, Haiwen Hong, Jingfeng Zhang, Yuan He, Hui Xue |
| 2021 | RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection. Zhuofan Zong, Qianggang Cao, Biao Leng |
| 2021 | ROECS: A Robust Semi-direct Pipeline Towards Online Extrinsics Correction of the Surround-view System. Tianjun Zhang, Brian Nlong Zhao, Ying Shen, Xuan Shao, Lin Zhang, Yicong Zhou |
| 2021 | ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration. Yuhao Cui, Zhou Yu, Chunqi Wang, Zhongzhou Zhao, Ji Zhang, Meng Wang, Jun Yu |
| 2021 | Rate Adaptation and Block Scheduling for Delay-sensitive Multimedia Applications. Dongyuan Su, Laizhong Cui, Lei Zhang, Yanyan Suo, Yan Qiu |
| 2021 | ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image Enhancement. Rongkai Zhang, Lanqing Guo, Siyu Huang, Bihan Wen |
| 2021 | RecipeLog: Recipe Authoring App for Accurate Food Recording. Akihisa Ishino, Yoko Yamakata, Hiroaki Karasawa, Kiyoharu Aizawa |
| 2021 | ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data. Kin Wai Cheuk, Dorien Herremans, Li Su |
| 2021 | Reconstruction: A Motion Driven Interactive Artwork Inspired by Chinese Shadow Puppet. Wenli Jiang, Chong Cao |
| 2021 | Recovering the Unbiased Scene Graphs from the Biased Ones. Meng-Jiun Chiou, Henghui Ding, Hanshu Yan, Changhu Wang, Roger Zimmermann, Jiashi Feng |
| 2021 | Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction. Minyi Zhao, Yi Xu, Shuigeng Zhou |
| 2021 | RecycleNet: An Overlapped Text Instance Recovery Approach. Yiqing Hu, Yan Zheng, Xinghua Jiang, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren, Rongrong Ji |
| 2021 | Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein GAN. Yunan Zhu, Haichuan Ma, Jialun Peng, Dong Liu, Zhiwei Xiong |
| 2021 | Relationship-Preserving Knowledge Distillation for Zero-Shot Sketch Based Image Retrieval. Jialin Tian, Xing Xu, Zheng Wang, Fumin Shen, Xin Liu |
| 2021 | Remember and Reuse: Cross-Task Blind Image Quality Assessment via Relevance-aware Incremental Learning. Rui Ma, Hanxiao Luo, Qingbo Wu, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu |
| 2021 | Reproducibility Companion Paper: Blind Natural Video Quality Prediction via Statistical Temporal Features and Deep Spatial Features. Jari Korhonen, Yicheng Su, Junyong You, Steven Hicks, Cise Midoglu |
| 2021 | Reproducibility Companion Paper: Campus3D: A Photogrammetry Point Cloud Benchmark for Outdoor Scene Hierarchical Understanding. Yuqing Liao, Xinke Li, Zekun Tong, Yabang Zhao, Andrew Lim, Zhenzhong Kuang, Cise Midoglu |
| 2021 | Reproducibility Companion Paper: Describing Subjective Experiment Consistency by p-Value P-P Plot. Jakub Nawala, Lucjan Janowski, Bogdan Cmiel, Krzysztof Rusek, Marc A. Kastner, Jan Zahálka |
| 2021 | Reproducibility Companion Paper: Kalman Filter-Based Head Motion Prediction for Cloud-Based Mixed Reality. Serhan Gül, Sebastian Bosse, Dimitri Podborski, Thomas Schierl, Cornelius Hellge, Marc A. Kastner, Jan Zahálka |
| 2021 | Reproducibility Companion Paper: Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment. Dingquan Li, Tingting Jiang, Ming Jiang, Vajira Lasantha Thambawita, Haoliang Wang |
| 2021 | Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection. Lijian Gao, Qirong Mao, Jingjing Chen, Ming Dong, Ratna Babu Chinnam, Lucile Sassatelli, Miguel Fabián Romero Rondón, Ujjwal Sharma |
| 2021 | Reproducibility Companion Paper: Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework. Li Tao, Xueting Wang, Toshihiko Yamasaki, Jingjing Chen, Steven Hicks |
| 2021 | Reproducibility Companion Paper: Visual Relation of Interest Detection. Fan Yu, Haonan Wang, Tongwei Ren, Jinhui Tang, Gangshan Wu, Jingjing Chen, Zhenzhong Kuang |
| 2021 | Research on Micro-Expression Spotting Method Based on Optical Flow Features. Yuhong He |
| 2021 | Rethinking the Impacts of Overfitting and Feature Quality on Small-scale Video Classification. Xuansheng Wu, Feichi Yang, Tong Zhou, Xinyue Lin |
| 2021 | Retinomorphic Sensing: A Novel Paradigm for Future Multimedia Computing. Zhaodong Kang, Jianing Li, Lin Zhu, Yonghong Tian |
| 2021 | Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition. Yixiong Zou, Shanghang Zhang, Jianpeng Yu, Yonghong Tian, José M. F. Moura |
| 2021 | Robust Logo Detection in E-Commerce Images by Data Augmentation. Hang Chen, Xiao Li, Zefan Wang, Xiaolin Hu |
| 2021 | Robust Real-World Image Super-Resolution against Adversarial Attacks. Jiutao Yue, Haofeng Li, Pengxu Wei, Guanbin Li, Liang Lin |
| 2021 | Robust Shadow Detection by Exploring Effective Shadow Contexts. Xianyong Fang, Xiaohao He, Linbo Wang, Jianbing Shen |
| 2021 | SFE-Net: EEG-based Emotion Recognition with Symmetrical Spatial Feature Extraction. Xiangwen Deng, Junlin Zhu, Shangming Yang |
| 2021 | SI3DP: Source Identification Challenges and Benchmark for Consumer-Level 3D Printer Forensics. Bo Seok Shim, Yoo Seung Shin, Seong-Wook Park, Jong-Uk Hou |
| 2021 | SINGA-Easy: An Easy-to-Use Framework for MultiModal Analysis. Naili Xing, Sai Ho Yeung, Chenghao Cai, Teck Khim Ng, Wei Wang, Kaiyuan Yang, Nan Yang, Meihui Zhang, Gang Chen, Beng Chin Ooi |
| 2021 | SM-SGE: A Self-Supervised Multi-Scale Skeleton Graph Encoding Framework for Person Re-Identification. Haocong Rao, Xiping Hu, Jun Cheng, Bin Hu |
| 2021 | SOGAN: 3D-Aware Shadow and Occlusion Robust GAN for Makeup Transfer. Yueming Lyu, Jing Dong, Bo Peng, Wei Wang, Tieniu Tan |
| 2021 | SRNet: Spatial Relation Network for Efficient Single-stage Instance Segmentation in Videos. Xiaowen Ying, Xin Li, Mooi Choo Chuah |
| 2021 | SSFlow: Style-guided Neural Spline Flows for Face Image Manipulation. Hanbang Liang, Xianxu Hou, Linlin Shen |
| 2021 | SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering. Yifan Zhao, Le Hui, Jin Xie |
| 2021 | SSconv: Explicit Spectral-to-Spatial Convolution for Pansharpening. Yudong Wang, Liang-Jian Deng, Tian-Jing Zhang, Xiao Wu |
| 2021 | STST: Spatial-Temporal Specialized Transformer for Skeleton-based Action Recognition. Yuhan Zhang, Bo Wu, Wen Li, Lixin Duan, Chuang Gan |
| 2021 | SUMAC'21: 3rd Workshop on Structuring and Understanding of Multimedia heritAge Contents. Valérie Gouet-Brunet, Margarita Khokhlova, Ronak Kosti, Li Weng |
| 2021 | SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition. Yue Zhao, Weizhi Nie, An-An Liu, Zan Gao, Yuting Su |
| 2021 | SalS-GAN: Spatially-Adaptive Latent Space in StyleGAN for Real Image Embedding. Lingyun Zhang, Xiuxiu Bai, Yao Gao |
| 2021 | Salient Error Detection based Refinement for Wide-baseline Image Interpolation. Yuan Chang, Yisong Chen, Guoping Wang |
| 2021 | Sand Scope: An Interactive Installation for Revealing the Connection Between Mental Space and Life Space in a Microcosm of the World. Lyn Chao-ling Chen |
| 2021 | Scalable Multi-view Subspace Clustering with Unified Anchors. Mengjing Sun, Pei Zhang, Siwei Wang, Sihang Zhou, Wenxuan Tu, Xinwang Liu, En Zhu, Changjian Wang |
| 2021 | Scene Graph with 3D Information for Change Captioning. Zeming Liao, Qingbao Huang, Yu Liang, Mingyi Fu, Yi Cai, Qing Li |
| 2021 | Scene Text Image Super-Resolution via Parallelly Contextual Attention Network. Cairong Zhao, Shuyang Feng, Brian Nlong Zhao, Zhijun Ding, Jun Wu, Fumin Shen, Heng Tao Shen |
| 2021 | Searching Motion Graphs for Human Motion Synthesis. Chenchen Liu, Yadong Mu |
| 2021 | Searching a Hierarchically Aggregated Fusion Architecture for Fast Multi-Modality Image Fusion. Risheng Liu, Zhu Liu, Jinyuan Liu, Xin Fan |
| 2021 | Seeing is Believing?: Effects of Visualization on Smart Device Privacy Perceptions. Carlos Bermejo Fernandez, Petteri Nurmi, Pan Hui |
| 2021 | Selective Dependency Aggregation for Action Classification. Yi Tan, Yanbin Hao, Xiangnan He, Yinwei Wei, Xun Yang |
| 2021 | Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning. Bi'an Du, Xiang Gao, Wei Hu, Xin Li |
| 2021 | Self-Representation Subspace Clustering for Incomplete Multi-view Data. Jiyuan Liu, Xinwang Liu, Yi Zhang, Pei Zhang, Wenxuan Tu, Siwei Wang, Sihang Zhou, Weixuan Liang, Siqi Wang, Yuexiang Yang |
| 2021 | Self-Supervised Pre-training on the Target Domain for Cross-Domain Person Re-identification. Junyin Zhang, Yongxin Ge, Xinqian Gu, Boyu Hua, Tao Xiang |
| 2021 | Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition. Jingwei Yan, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu |
| 2021 | Self-feature Learning: An Efficient Deep Lightweight Network for Image Super-resolution. Jun Xiao, Qian Ye, Rui Zhao, Kin-Man Lam, Kao Wan |
| 2021 | Self-supervised Consensus Representation Learning for Attributed Graph. Changshu Liu, Liangjian Wen, Zhao Kang, Guangchun Luo, Ling Tian |
| 2021 | Self-supervised Multi-view Multi-Human Association and Tracking. Yiyang Gan, Ruize Han, Liqiang Yin, Wei Feng, Song Wang |
| 2021 | Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors. Lei Wang, Piotr Koniusz |
| 2021 | Semantic Media Conversion: Possibilities and Limits. H. V. Jagadish |
| 2021 | Semantic Scalable Image Compression with Cross-Layer Priors. Hanyue Tu, Li Li, Wengang Zhou, Houqiang Li |
| 2021 | Semantic Tag Augmented XlanV Model for Video Captioning. Yiqing Huang, Hongwei Xue, Jiansheng Chen, Huimin Ma, Hongbing Ma |
| 2021 | Semantic-Guided Relation Propagation Network for Few-shot Action Recognition. Xiao Wang, Weirong Ye, Zhongang Qi, Xun Zhao, Guangge Wang, Ying Shan, Hanzi Wang |
| 2021 | Semantic-aware Transfer with Instance-adaptive Parsing for Crowded Scenes Pose Estimation. Xuanhan Wang, Lianli Gao, Yan Dai, Yixuan Zhou, Jingkuan Song |
| 2021 | Semi-Autoregressive Image Captioning. Xu Yan, Zhengcong Fei, Zekang Li, Shuhui Wang, Qingming Huang, Qi Tian |
| 2021 | Semi-supervised Domain Adaptive Retrieval via Discriminative Hashing Learning. Haifeng Xia, Taotao Jing, Chen Chen, Zhengming Ding |
| 2021 | Semi-supervised Learning via Improved Teacher-Student Network for Robust 3D Reconstruction of Stereo Endoscopic Image. Hongkuan Shi, Zhiwei Wang, Jinxin Lv, Yilang Wang, Peng Zhang, Fei Zhu, Qiang Li |
| 2021 | Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention. Katsuyuki Nakamura, Hiroki Ohashi, Mitsuhiro Okada |
| 2021 | Shadow Detection via Predicting the Confidence Maps of Shadow Detection Methods. Jingwei Liao, Yanli Liu, Guanyu Xing, Housheng Wei, Jueyu Chen, Songhua Xu |
| 2021 | Shape Controllable Virtual Try-on for Underwear Models. Xin Gao, Zhenjiang Liu, Zunlei Feng, Chengji Shen, Kairi Ou, Haihong Tang, Mingli Song |
| 2021 | Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator. Hao Liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren, Rongrong Ji |
| 2021 | Similar Scenes Arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning. Guodun Li, Yuchen Zhai, Zehao Lin, Yin Zhang |
| 2021 | Simplifying Multimodal Emotion Recognition with Single Eye Movement Modality. Xu Yan, Li-Ming Zhao, Bao-Liang Lu |
| 2021 | SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory. Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He |
| 2021 | SimulSLT: End-to-End Simultaneous Sign Language Translation. Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He |
| 2021 | Single Image 3D Object Estimation with Primitive Graph Networks. Qian He, Desen Zhou, Bo Wan, Xuming He |
| 2021 | Situational Anomaly Detection in Multimedia Data under Concept Drift. Pratibha Kumari |
| 2021 | Skeleton-Aware Neural Sign Language Translation. Shiwei Gan, Yafeng Yin, Zhiwei Jiang, Lei Xie, Sanglu Lu |
| 2021 | Skeleton-Contrastive 3D Action Representation Learning. Fida Mohammad Thoker, Hazel Doughty, Cees G. M. Snoek |
| 2021 | SmartEye: An Open Source Framework for Real-Time Video Analytics with Edge-Cloud Collaboration. Xuezhi Wang, Guanyu Gao |
| 2021 | SmartMeeting: Automatic Meeting Transcription and Summarization for In-Person Conversations. Yuanfeng Song, Di Jiang, Xuefang Zhao, Xiaoling Huang, Qian Xu, Raymond Chi-Wing Wong, Qiang Yang |
| 2021 | SmartSales: An AI-Powered Telemarketing Coaching System in FinTech. Yuanfeng Song, Xuefang Zhao, Di Jiang, Xiaoling Huang, Weiwei Zhao, Qian Xu, Raymond Chi-Wing Wong, Qiang Yang |
| 2021 | Social Signals and Multimedia: Past, Present, Future. Hayley Hung, Cathal Gurrin, Martha A. Larson, Hatice Gunes, Fabien Ringeval, Elisabeth André, Louis-Philippe Morency |
| 2021 | Softly: Simulated Empathic Touch between an Agent and a Human. Maxime Grandidier, Fabien Boucaud, Indira Thouvenin, Catherine Pelachaud |
| 2021 | Source Data-free Unsupervised Domain Adaptation for Semantic Segmentation. Mucong Ye, Jing Zhang, Jinpeng Ouyang, Ding Yuan |
| 2021 | Space-Angle Super-Resolution for Multi-View Images. Yuqi Sun, Ri Cheng, Bo Yan, Shili Zhou |
| 2021 | Sparse to Dense Depth Completion using a Generative Adversarial Network with Intelligent Sampling Strategies. Md Fahim Faysal Khan, Nelson Daniel Troncoso Aldas, Abhishek Kumar, Siddharth Advani, Vijaykrishnan Narayanan |
| 2021 | Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition. Ning Wang, Guangming Zhu, Liang Zhang, Peiyi Shen, Hongsheng Li, Cong Hua |
| 2021 | Spatiotemporal Inconsistency Learning for DeepFake Video Detection. Zhihao Gu, Yang Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma |
| 2021 | Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning. Uttaran Bhattacharya, Elizabeth Childs, Nicholas Rewkowski, Dinesh Manocha |
| 2021 | Stacked Semantically-Guided Learning for Image De-distortion. Huiyuan Fu, Changhao Tian, Xin Wang, Huadong Ma |
| 2021 | State-aware Video Procedural Captioning. Taichi Nishimura, Atsushi Hashimoto, Yoshitaka Ushiku, Hirotaka Kameko, Shinsuke Mori |
| 2021 | Stereo Video Super-Resolution via Exploiting View-Temporal Correlations. Ruikang Xu, Zeyu Xiao, Mingde Yao, Yueyi Zhang, Zhiwei Xiong |
| 2021 | StrucTexT: Structured Text Understanding with Multi-Modal Transformers. Yulin Li, Yuxi Qian, Yuechen Yu, Xiameng Qin, Chengquan Zhang, Yan Liu, Kun Yao, Junyu Han, Jingtuo Liu, Errui Ding |
| 2021 | Structure-aware Mathematical Expression Recognition with Sequence-Level Modeling. Minli Li, Peilin Zhao, Yifan Zhang, Shuaicheng Niu, Qingyao Wu, Mingkui Tan |
| 2021 | Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval. Xuri Ge, Fuhai Chen, Joemon M. Jose, Zhilong Ji, Zhongqin Wu, Xiao Liu |
| 2021 | Style-Aware Image Recommendation for Social Media Marketing. Yiwei Zhang, Toshihiko Yamasaki |
| 2021 | SuperFront: From Low-resolution to High-resolution Frontal Face Synthesis. Yu Yin, Joseph P. Robinson, Songyao Jiang, Yue Bai, Can Qin, Yun Fu |
| 2021 | Sync Glass: Virtual Pouring and Toasting Experience with Multimodal Presentation. Yuki Tajima, Toshiharu Horiuchi, Gen Hattori |
| 2021 | Syntropic Counterpoints: Metaphysics of The Machines. Predrag K. Nikolic, Ruiyang Liu, Shengcheng Luo |
| 2021 | TACR-Net: Editing on Deep Video and Voice Portraits. Luchuan Song, Bin Liu, Guojun Yin, Xiaoyi Dong, Yufei Zhang, Jia-Xuan Bai |
| 2021 | TBRA: Tiling and Bitrate Adaptation for Mobile 360-Degree Video Streaming. Lei Zhang, Yanyan Suo, Ximing Wu, Feng Wang, Yuchi Chen, Laizhong Cui, Jiangchuan Liu, Zhong Ming |
| 2021 | TDI TextSpotter: Taking Data Imbalance into Account in Scene Text Spotting. Yu Zhou, Hongtao Xie, Shancheng Fang, Jing Wang, Zhengjun Zha, Yongdong Zhang |
| 2021 | TSA-Net: Tube Self-Attention Network for Action Quality Assessment. Shunli Wang, Dingkang Yang, Peng Zhai, Chixiao Chen, Lihua Zhang |
| 2021 | Target-guided Adaptive Base Class Reweighting for Few-Shot Learning. Jiliang Yan, Deming Zhai, Junjun Jiang, Xianming Liu |
| 2021 | Text as Neural Operator: Image Manipulation by Text Instruction. Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Weilong Yang, Honglak Lee, Irfan Essa |
| 2021 | Text is NOT Enough: Integrating Visual Impressions into Open-domain Dialogue Generation. Lei Shen, Haolan Zhan, Xin Shen, Yonghao Song, Xiaofang Zhao |
| 2021 | Text to Scene: A System of Configurable 3D Indoor Scene Synthesis. Xinyan Yang, Fei Hu, Long Ye |
| 2021 | Text-driven 3D Avatar Animation with Emotional and Expressive Behaviors. Li Hu, Jinwei Qi, Bang Zhang, Pan Pan, Yinghui Xu |
| 2021 | Text2Video: Automatic Video Generation Based on Text Scripts. Yipeng Yu, Zirui Tu, Longyu Lu, Xiao Chen, Hui Zhan, Zixun Sun |
| 2021 | The ACM Multimedia 2021 Meet Deadline Requirements Grand Challenge. Jie Zhang, Junjie Deng, Mowei Wang, Yong Cui, Wei Tsang Ooi, Jiangchuan Liu, Xinyu Zhang, Kai Zheng, Yi Li |
| 2021 | The Next Generation Multimodal Conversational Search and Recommendation. João Magalhães, Tat-Seng Chua, Tao Mei, Alan F. Smeaton |
| 2021 | Theophany: Multimodal Speech Augmentation in Instantaneous Privacy Channels. Abhishek Kumar, Tristan Braud, Lik Hang Lee, Pan Hui |
| 2021 | Token Shift Transformer for Video Classification. Hao Zhang, Yanbin Hao, Chong-Wah Ngo |
| 2021 | Towards Accurate Localization by Instance Search. Yi-Geng Hong, Hui-Chu Xiao, Wan-Lei Zhao |
| 2021 | Towards Adversarial Patch Analysis and Certified Defense against Crowd Counting. Qiming Wu, Zhikang Zou, Pan Zhou, Xiaoqing Ye, Binghui Wang, Ang Li |
| 2021 | Towards Bridging Video and Language by Caption Generation and Sentence Localization. Shaoxiang Chen |
| 2021 | Towards Controllable and Photorealistic Region-wise Image Manipulation. Ansheng You, Chenglin Zhou, Qixuan Zhang, Lan Xu |
| 2021 | Towards Cross-Granularity Few-Shot Learning: Coarse-to-Fine Pseudo-Labeling with Visual-Semantic Meta-Embedding. Jinhai Yang, Hua Yang, Lin Chen |
| 2021 | Towards Fast and High-Quality Sign Language Production. Wencan Huang, Wenwen Pan, Zhou Zhao, Qi Tian |
| 2021 | Towards Multiple Black-boxes Attack via Adversarial Example Generation Network. Mingxing Duan, Kenli Li, Lingxi Xie, Qi Tian, Bin Xiao |
| 2021 | Towards Realistic Visual Dubbing with Heterogeneous Sources. Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma |
| 2021 | Towards Reasoning Ability in Scene Text Visual Question Answering. Qingqing Wang, Liqiang Xiao, Yue Lu, Yaohui Jin, Hao He |
| 2021 | Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal. Lei Zhu, Zhaojing Luo, Wei Wang, Meihui Zhang, Gang Chen, Kaiping Zheng |
| 2021 | Towards Robust Deep Hiding Under Non-Differentiable Distortions for Practical Blind Watermarking. Chaoning Zhang, Adil Karjauv, Philipp Benz, In So Kweon |
| 2021 | Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification. Yukang Zhang, Yan Yan, Yang Lu, Hanzi Wang |
| 2021 | Trajectory is not Enough: Hidden Following Detection. Danni Xu, Ruimin Hu, Zixiang Xiong, Zheng Wang, Linbo Luo, Dengshi Li |
| 2021 | TransFusion: Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding. Di Jin, Zhongang Qi, Yingmin Luo, Ying Shan |
| 2021 | TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding. Dailan He, Yusheng Zhao, Junyu Luo, Tianrui Hui, Shaofei Huang, Aixi Zhang, Si Liu |
| 2021 | Transfer Vision Patterns for Multi-Task Pixel Learning. Xiaoya Zhang, Ling Zhou, Yong Li, Zhen Cui, Jin Xie, Jian Yang |
| 2021 | Transferrable Contrastive Learning for Visual Domain Adaptation. Yang Chen, Yingwei Pan, Yu Wang, Ting Yao, Xinmei Tian, Tao Mei |
| 2021 | Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis. Ziqi Yuan, Wei Li, Hua Xu, Wenmeng Yu |
| 2021 | TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network. Zhengyi Liu, Yuan Wang, Zhengzheng Tu, Yun Xiao, Bin Tang |
| 2021 | Triangle-Reward Reinforcement Learning: A Visual-Linguistic Semantic Alignment for Image Captioning. Weizhi Nie, Jiesi Li, Ning Xu, An-An Liu, Xuanya Li, Yongdong Zhang |
| 2021 | Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing. Teddy Furon, Jingen Liu, Yogesh S. Rawat, Wei Zhang, Qi Zhao |
| 2021 | Trustworthy Multimedia Analysis. Xiaowen Huang, Jiaming Zhang, Yi Zhang, Xian Zhao, Jitao Sang |
| 2021 | TsFPS: An Accurate and Flexible 6DoF Tracking System with Fiducial Platonic Solids. Nan Xiang, Xiaosong Yang, Jian J. Zhang |
| 2021 | Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval. Hui Cui, Lei Zhu, Jingjing Li, Zhiyong Cheng, Zheng Zhang |
| 2021 | Two-stage Visual Cues Enhancement Network for Referring Image Segmentation. Yang Jiao, Zequn Jie, Weixin Luo, Jingjing Chen, Yu-Gang Jiang, Xiaolin Wei, Lin Ma |
| 2021 | UACANet: Uncertainty Augmented Context Attention for Polyp Segmentation. Taehun Kim, Hyemin Lee, Daijin Kim |
| 2021 | Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training. Chenyi Lei, Shixian Luo, Yong Liu, Wanggui He, Jiamang Wang, Guoxin Wang, Haihong Tang, Chunyan Miao, Houqiang Li |
| 2021 | Underwater Species Detection using Channel Sharpening Attention. Lihao Jiang, Yi Wang, Qi Jia, Shengwei Xu, Yu Liu, Xin Fan, Haojie Li, Risheng Liu, Xinwei Xue, Ruili Wang |
| 2021 | UniCon: Unified Context Network for Robust Active Speaker Detection. Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan, Xilin Chen |
| 2021 | Unifying Multimodal Transformer for Bi-directional Image and Text Generation. Yupan Huang, Hongwei Xue, Bei Liu, Yutong Lu |
| 2021 | Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking. Jingxian Sun, Lichao Zhang, Yufei Zha, Abel Gonzalez-Garcia, Peng Zhang, Wei Huang, Yanning Zhang |
| 2021 | Unsupervised Image Deraining: Optimization Model Driven Deep CNN. Changfeng Yu, Yi Chang, Yi Li, Xile Zhao, Luxin Yan |
| 2021 | Unsupervised Portrait Shadow Removal via Generative Priors. Yingqing He, Yazhou Xing, Tianjia Zhang, Qifeng Chen |
| 2021 | Unsupervised Vehicle Search in the Wild: A New Benchmark. Xian Zhong, Shilei Zhao, Xiao Wang, Kui Jiang, Wenxuan Liu, Wenxin Huang, Zheng Wang |
| 2021 | UrbanMM'21: 1st International Workshop on Multimedia Computing for Urban Data. Stevan Rudinac, Alessandro Bozzon, Tat-Seng Chua, Suzanne Little, Daniel Gatica-Perez, Kiyoharu Aizawa |
| 2021 | Using Interaction Data to Predict Engagement with Interactive Media. Jonathan Carlton, Andy Brown, Caroline Jay, John Keane |
| 2021 | Using Motion Histories for Eye Contact Detection in Multiperson Group Conversations. Eugene Yujun Fu, Michael W. Ngai |
| 2021 | VASTile: Viewport Adaptive Scalable 360-Degree Video Frame Tiling. Chamara Madarasingha, Kanchana Thilakarathna |
| 2021 | VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation. Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He |
| 2021 | VQMG: Hierarchical Vector Quantised and Multi-hops Graph Reasoning for Explicit Representation Learning. Lei Li, Chun Yuan |
| 2021 | Vehicle Counting Network with Attention-based Mask Refinement and Spatial-awareness Block Loss. Ji Zhang, Jian-Jun Qiao, Xiao Wu, Wei Li |
| 2021 | VeloCity: Using Voice Assistants for Cyclists to Provide Traffic Reports. Gian-Luca Savino, Jessé Moraes Braga, Johannes Schöning |
| 2021 | ViDA-MAN: Visual Dialog with Digital Humans. Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei Zhang, Xiaodong He, Tao Mei |
| 2021 | VidVRD 2021: The Third Grand Challenge on Video Relation Detection. Wei Ji, Yicong Li, Meng Wei, Xindi Shang, Junbin Xiao, Tongwei Ren, Tat-Seng Chua |
| 2021 | Video Background Music Generation with Controllable Music Transformer. Shangzhe Di, Zeren Jiang, Si Liu, Zhaokai Wang, Leyan Zhu, Zexin He, Hongming Liu, Shuicheng Yan |
| 2021 | Video Coding for Machine. Wen Gao |
| 2021 | Video Relation Detection via Tracklet based Visual Transformer. Kaifeng Gao, Long Chen, Yifeng Huang, Jun Xiao |
| 2021 | Video Representation Learning with Graph Contrastive Augmentation. Jingran Zhang, Xing Xu, Fumin Shen, Yazhou Yao, Jie Shao, Xiaofeng Zhu |
| 2021 | Video Semantic Segmentation via Sparse Temporal Transformer. Jiangtong Li, Wentao Wang, Junjie Chen, Li Niu, Jianlou Si, Chen Qian, Liqing Zhang |
| 2021 | Video Similarity and Alignment Learning on Partial Video Copy Detection. Zhen Han, Xiangteng He, Mingqian Tang, Yiliang Lv |
| 2021 | Video Transformer for Deepfake Detection with Incremental Learning. Sohail Ahmed Khan, Hang Dai |
| 2021 | Video Visual Relation Detection via Iterative Inference. Xindi Shang, Yicong Li, Junbin Xiao, Wei Ji, Tat-Seng Chua |
| 2021 | Video-to-Image Casting: A Flatting Method for Video Analysis. Xu Chen, Chenqiang Gao, Feng Yang, Xiaohan Wang, Yi Yang, Yahong Han |
| 2021 | VideoDiscovery: An Automatic Short-Video Generation System for E-commerce Live-streaming. Yanhao Zhang, Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu |
| 2021 | View-normalized Skeleton Generation for Action Recognition. Qingzhe Pan, Zhifu Zhao, Xuemei Xie, Jianan Li, Yuhan Cao, Guangming Shi |
| 2021 | Viewing from Frequency Domain: A DCT-based Information Enhancement Network for Video Person Re-Identification. Liangchen Liu, Xi Yang, Nannan Wang, Xinbo Gao |
| 2021 | Visible Watermark Removal via Self-calibrated Localization and Background Refinement. Jing Liang, Li Niu, Fengjun Guo, Teng Long, Liqing Zhang |
| 2021 | Vision-guided Music Source Separation via a Fine-grained Cycle-Separation Network. Shuo Ma, Yanli Ji, Xing Xu, Xiaofeng Zhu |
| 2021 | Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval. Zheng Wang, Jingjing Chen, Yu-Gang Jiang |
| 2021 | Visual Language Based Succinct Zero-Shot Object Detection. Ye Zheng, Xi Huang, Li Cui |
| 2021 | VmAP: A Fair Metric for Video Object Detection. Anupam Sobti, Vaibhav Mavi, M. Balakrishnan, Chetan Arora |
| 2021 | VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds. Guanze Liu, Yu Rong, Lu Sheng |
| 2021 | WAB'21: 1st Workshop on Multimodal Product Identification in Livestreaming and WAB Challenge. Yueting Zhuang, Xing Tang, Guilin Wu, Yahong Han, Haihong Tang, Xiaobo Li, Xiaohan Wang, Baoming Yan, Bo Gao, Yi Yang |
| 2021 | WAS-VTON: Warping Architecture Search for Virtual Try-on Network. Zhenyu Xie, Xujie Zhang, Fuwei Zhao, Haoye Dong, Michael C. Kampffmeyer, Haonan Yan, Xiaodan Liang |
| 2021 | WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations. Peidong Liu, Zibin He, Xiyu Yan, Yong Jiang, Shu-Tao Xia, Feng Zheng, Maowei Hu |
| 2021 | WePerson: Learning a Generalized Re-identification Model from All-weather Virtual Data. He Li, Mang Ye, Bo Du |
| 2021 | Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning. Yuan Ji, Xu Jia, Huchuan Lu, Xiang Ruan |
| 2021 | Weakly-Supervised Video Object Grounding via Stable Context Learning. Wei Wang, Junyu Gao, Changsheng Xu |
| 2021 | Weight Evolution: Improving Deep Neural Networks Training through Evolving Inferior Weight Values. Zhenquan Lin, Kailing Guo, Xiaofen Xing, Xiangmin Xu |
| 2021 | Weighted Gaussian Loss based Hamming Hashing. Rong-Cheng Tu, Xian-Ling Mao, Cihang Kong, Zihang Shao, Zelin Li, Wei Wei, Heyan Huang |
| 2021 | When Face Completion Meets Irregular Holes: An Attributes Guided Deep Inpainting Network. Jie Xiao, Dandan Zhan, Haoran Qi, Zhi Jin |
| 2021 | When Video Classification Meets Incremental Classes. Hanbin Zhao, Xin Qin, Shihao Su, Yongjian Fu, Zibo Lin, Xi Li |
| 2021 | Why Do We Click: Visual Impression-aware News Recommendation. Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu |
| 2021 | Windowing Decomposition Convolutional Neural Network for Image Enhancement. Chuanjun Zheng, Daming Shi, Yukun Liu |
| 2021 | Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting. Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla |
| 2021 | X-GGM: Graph Generative Modeling for Out-of-distribution Generalization in Visual Question Answering. Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng |
| 2021 | X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics. Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, Tao Mei |
| 2021 | Yes, "Attention Is All You Need", for Exemplar based Colorization. Wang Yin, Peng Lu, Zhaoran Zhao, Xujun Peng |
| 2021 | Zero-shot Video Emotion Recognition via Multimodal Protagonist-aware Transformer Network. Fan Qi, Xiaoshan Yang, Changsheng Xu |
| 2021 | ZiGAN: Fine-grained Chinese Calligraphy Font Generation via a Few-shot Style Transfer Approach. Qi Wen, Shuang Li, Bingfeng Han, Yi Yuan |
| 2021 | ZoomSense: A Scalable Infrastructure for Augmenting Zoom. Tom Bartindale, Peter Chen, Harrison Marshall, Stanislav Pozdniakov, Dan Richardson |
| 2021 | aBio: Active Bi-Olfactory Display Using Subwoofers for Virtual Reality. Youyang Hu, Yao Fu Jan, Kuan-Wei Tseng, You-Shin Tsai, Hung-Ming Sung, Jin-Yao Lin, Yi-Ping Hung |
| 2021 | iART: A Search Engine for Art-Historical Images to Support Research in the Humanities. Matthias Springstein, Stefanie Schneider, Javad Rahnama, Eyke Hüllermeier, Hubertus Kohle, Ralph Ewerth |
| 2021 | iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering. Liao Wang, Ziyu Wang, Pei Lin, Yuheng Jiang, Xin Suo, Minye Wu, Lan Xu, Jingyi Yu |