ACM Multimedia A*

685 papers

Year Title / Authors
2021A Complete End to End Open Source Toolchain for the Versatile Video Coding (VVC) Standard.
Adam Wieckowski, Christian Lehmann, Benjamin Bross, Detlev Marpe, Thibaud Biatek, Mickaël Raulet, Jean Le Feuvre
2021A Gradient Balancing Approach for Robust Logo Detection.
Fuxing Leng
2021A Large-Scale Benchmark for Food Image Segmentation.
Xiongwei Wu, Xin Fu, Ying Liu, Ee-Peng Lim, Steven C. H. Hoi, Qianru Sun
2021A Multi-Domain Adaptive Graph Convolutional Network for EEG-based Emotion Recognition.
Rui Li, Yiting Wang, Bao-Liang Lu
2021A Multimodal Framework for Video Ads Understanding.
Zejia Weng, Lingchen Meng, Rui Wang, Zuxuan Wu, Yu-Gang Jiang
2021A Novel Patch Convolutional Neural Network for View-based 3D Model Retrieval.
Zan Gao, Yuxiang Shao, Weili Guan, Meng Liu, Zhiyong Cheng, Shengyong Chen
2021A Picture is Worth a Thousand Words: A Unified System for Diverse Captions and Rich Images Generation.
Yupan Huang, Bei Liu, Jianlong Fu, Yutong Lu
2021A Question Answering System for Unstructured Table Images.
Wenyuan Xue, Siqi Cai, Wen Wang, Qingyong Li, Baosheng Yu, Yibing Zhan, Dacheng Tao
2021A Simple and Effective Baseline for Robust Logo Detection.
Weipeng Xu, Ye Liu, Daquan Lin
2021A Solution to Multi-modal Ads Video Tagging Challenge.
Hao Wu, Jiajie Wang, Yuanzhe Gu, Peisen Zhao, Zhonglin Zu
2021A Statistical Approach to Mining Semantic Similarity for Deep Unsupervised Hashing.
Xiao Luo, Daqing Wu, Zeyu Ma, Chong Chen, Minghua Deng, Jianqiang Huang, Xian-Sheng Hua
2021A Stepwise Matching Method for Multi-modal Image based on Cascaded Network.
Jinming Mu, Shuiping Gou, Shasha Mao, Shankui Zheng
2021A System for Interactive and Intelligent AD Auxiliary Screening.
Sen Yang, Qike Zhao, Lanxin Miao, Min Chen, Lianli Gao, Jingkuan Song, Weidong Le
2021A Transformer based Approach for Image Manipulation Chain Detection.
Jiaxiang You, Yuanman Li, Jiantao Zhou, Zhongyun Hua, Weiwei Sun, Xia Li
2021A Tutorial on AI Music Composition.
Xu Tan, Xiaobing Li
2021A Virtual Character Generation and Animation System for E-Commerce Live Streaming.
Li Hu, Bang Zhang, Peng Zhang, Jinwei Qi, Jian Cao, Daiheng Gao, Haiming Zhao, Xiaoduan Feng, Qi Wang, Lian Zhuo, Pan Pan, Yinghui Xu
2021A2W: Context-Aware Recommendation System for Mobile Augmented Reality Web Browser.
Kit-Yung Lam, Lik Hang Lee, Pan Hui
2021ABPNet: Adaptive Background Modeling for Generalized Few Shot Segmentation.
Kaiqi Dong, Wei Yang, Zhenbo Xu, Liusheng Huang, Zhidong Yu
2021ADGD'21: 1st Workshop on Synthetic Multimedia - Audiovisual Deepfake Generation and Detection.
Stefan Winkler, Weiling Chen, Abhinav Dhall, Pavel Korshunov
2021ADVM'21: 1st International Workshop on Adversarial Learning for Multimedia.
Aishan Liu, Xinyun Chen, Yingwei Li, Chaowei Xiao, Xun Yang, Xianglong Liu, Dawn Song, Dacheng Tao, Alan L. Yuille, Anima Anandkumar
2021AFD-Net: Adaptive Fully-Dual Network for Few-Shot Object Detection.
Longyao Liu, Bo Ma, Yulin Zhang, Xin Yi, Haozhi Li
2021AFEC: Adaptive Feature Extraction Modules for Learned Image Compression.
Yi Ma, Yongqi Zhai, Jiayu Yang, Chunhui Yang, Ronggang Wang
2021AI and the Future of Education.
James C. Lester
2021AI-Lyricist: Generating Music and Vocabulary Constrained Lyrics.
Xichu Ma, Ye Wang, Min-Yen Kan, Wee Sun Lee
2021AICoacher: A System Framework for Online Realtime Workout Coach.
Haocong Ying, Tie Liu, Mingxin Ai, Jiali Ding, Yuanyuan Shang
2021AITransfer: Progressive AI-powered Transmission for Real-Time Point Cloud Video Streaming.
Yakun Huang, Yuanwei Zhu, Xiuquan Qiao, Zhijie Tan, Boyuan Bai
2021AIxFood'21: 3rd Workshop on AIxFood.
Ricardo Guerrero, Michael Spranger, Shuqiang Jiang, Chong-Wah Ngo
2021AKECP: Adaptive Knowledge Extraction from Feature Maps for Fast and Efficient Channel Pruning.
Haonan Zhang, Longjun Liu, Hengyi Zhou, Wenxuan Hou, Hongbin Sun, Nanning Zheng
2021AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries.
Woo-Sung Choi, Minseok Kim, Marco A. Martínez Ramírez, Jaehwa Chung, Soonyoung Jung
2021APF: An Adversarial Privacy-preserving Filter to Protect Portrait Information.
Xian Zhao, Jiaming Zhang, Xiaowen Huang
2021ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones.
Shan An, Guangfu Che, Jinghao Guo, Haogang Zhu, Junjie Ye, Fangru Zhou, Zhaoqi Zhu, Dong Wei, Aishan Liu, Wei Zhang
2021ASFD: Automatic and Scalable Face Detector.
Jian Li, Bin Zhang, Yabiao Wang, Ying Tai, Zhenyu Zhang, Chengjie Wang, Jilin Li, Xiaoming Huang, Yili Xia
2021ASFM-Net: Asymmetrical Siamese Feature Matching Network for Point Completion.
Yaqi Xia, Yan Xia, Wei Li, Rui Song, Kailang Cao, Uwe Stilla
2021Actions Speak Louder than Listening: Evaluating Music Style Transfer based on Editing Experience.
Wei Tsung Lu, Meng-Hsuan Wu, Yuh-Ming Chiu, Li Su
2021Ada-VSR: Adaptive Video Super-Resolution with Meta-Learning.
Akash Gupta, Padmaja Jonnalagedda, Bir Bhanu, Amit K. Roy-Chowdhury
2021Adaptive Affinity Loss and Erroneous Pseudo-Label Refinement for Weakly Supervised Semantic Segmentation.
Xiangrong Zhang, Zelin Peng, Peng Zhu, Tianyang Zhang, Chen Li, Huiyu Zhou, Licheng Jiao
2021Adaptive Normalized Representation Learning for Generalizable Face Anti-Spoofing.
Shubao Liu, Ke-Yue Zhang, Taiping Yao, Mingwei Bi, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma
2021AdvFilter: Predictive Perturbation-aware Filtering against Adversarial Attack via Multi-domain Learning.
Yihao Huang, Qing Guo, Felix Juefei-Xu, Lei Ma, Weikai Miao, Yang Liu, Geguang Pu
2021AdvHash: Set-to-set Targeted Attack on Deep Hashing with One Single Adversarial Patch.
Shengshan Hu, Yechao Zhang, Xiaogeng Liu, Leo Yu Zhang, Minghui Li, Hai Jin
2021Adversarial Learning with Mask Reconstruction for Text-Guided Image Inpainting.
Xingcai Wu, Yucheng Xie, Jiaqi Zeng, Zhenguo Yang, Yi Yu, Qing Li, Wenyin Liu
2021Adversarial Pixel Masking: A Defense against Physical Attacks for Pre-trained Object Detectors.
Ping-Han Chiang, Chi-Shen Chan, Shan-Hung Wu
2021Aesthetic Evaluation and Guidance for Mobile Photography.
Hao Lou, Heng Huang, Chaoen Xiao, Xin Jin
2021Affective Color Fields: Reimagining Rothkoesque Artwork as an Interactive Companion for Artistic Self-Expression.
Aiden Kang, Liang Wang, Ziyu Zhou, Zhe Huang, Robert J. K. Jacob
2021AggNet for Self-supervised Monocular Depth Estimation: Go An Aggressive Step Further.
Zhi Chen, Xiaoqing Ye, Liang Du, Wei Yang, Liusheng Huang, Xiao Tan, Zhenbo Shi, Fumin Shen, Errui Ding
2021Air-Text: Air-Writing and Recognition System.
Sun-Kyung Lee, Jong-Hwan Kim
2021An Adaptive Iterative Inpainting Method with More Information Exploration.
Shengjie Chen, Zhenhua Guo, Bo Yuan
2021An EM Framework for Online Incremental Learning of Semantic Segmentation.
Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang, Xuming He
2021Anchor-free 3D Single Stage Detector with Mask-Guided Attention for Point Cloud.
Jiale Li, Hang Dai, Ling Shao, Yong Ding
2021Annotation-Efficient Semantic Segmentation with Shape Prior Knowledge.
Yuhang Lu
2021Annotation-Efficient Untrimmed Video Action Recognition.
Yixiong Zou, Shanghang Zhang, Guangyao Chen, Yonghong Tian, Kurt Keutzer, José M. F. Moura
2021Anti-Distillation Backdoor Attacks: Backdoors Can Really Survive in Knowledge Distillation.
Yunjie Ge, Qian Wang, Baolin Zheng, Xinlu Zhuang, Qi Li, Chao Shen, Cong Wang
2021Apercevoir: Bio Internet of Things Interactive System.
Youyang Hu, Chiao-Chi Chou, Chia-Wei Li
2021Armor: A Benchmark for Meta-evaluation of Artificial Music.
Songhe Wang, Zheng Bao, Jingtong E
2021ArtScience and the ICECUBE LED Display [ILDm^3].
Mark David Hosale, Robert S. Allison, Jim Madsen, Marcus Gordon
2021ArtiVisual: A Platform to Generate and Compare Art.
Jardenna Mohazzab, Abe Vos, Jonathan van Westendorp, Lucas Lageweg, Dylan Prins, Aritra Bhowmik
2021Assisting News Media Editors with Cohesive Visual Storylines.
Gonçalo Marcelino, David Semedo, André Mourão, Saverio G. Blasi, João Magalhães, Marta Mrak
2021AsyNCE: Disentangling False-Positives for Weakly-Supervised Video Grounding.
Cheng Da, Yanhao Zhang, Yun Zheng, Pan Pan, Yinghui Xu, Chunhong Pan
2021Attention-driven Graph Clustering Network.
Zhihao Peng, Hui Liu, Yuheng Jia, Junhui Hou
2021Attention-guided Temporally Coherent Video Object Matting.
Yunke Zhang, Chi Wang, Miaomiao Cui, Peiran Ren, Xuansong Xie, Xian-Sheng Hua, Hujun Bao, Qixing Huang, Weiwei Xu
2021Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation.
Rui Wang, Jian Chen, Gang Yu, Li Sun, Changqian Yu, Changxin Gao, Nong Sang
2021Augmenting TV Shows via Uncalibrated Camera Small Motion Tracking in Dynamic Scene.
Yizhen Lao, Jie Yang, Xinying Wang, Jianxin Lin, Yu Cao, Shien Song
2021Auto-MSFNet: Search Multi-scale Fusion Network for Salient Object Detection.
Miao Zhang, TingWei Liu, Yongri Piao, Shunyu Yao, Huchuan Lu
2021Automated Multi-Modal Video Editing for Ads Video.
Qin Lin, Nuo Pang, Zhiying Hong
2021Automated Playtesting with a Cognitive Model of Sensorimotor Coordination.
Injung Lee, Hyunchul Kim, Byungjoo Lee
2021Automatic Channel Pruning with Hyper-parameter Search and Dynamic Masking.
Baopu Li, Yanwen Fan, Zhihong Pan, Yuchen Bian, Gang Zhang
2021BAM: Bilateral Activation Mechanism for Image Fusion.
Zi-Rong Jin, Liang-Jian Deng, Tian-Jing Zhang, Xiao-Xu Jin
2021Better Learning Shot Boundary Detection via Multi-task.
Haoxin Zhang, Zhimin Li, Qinglin Lu
2021Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA.
Gangyan Zeng, Yuan Zhang, Yu Zhou, Xiaomeng Yang
2021Block Popularity Prediction for Multimedia Storage Systems Using Spatial-Temporal-Sequential Neural Networks.
Yingying Cheng, Fan Zhang, Gang Hu, Yiwen Wang, Hanhui Yang, Gong Zhang, Zhuo Cheng
2021Boosting End-to-end Multi-Object Tracking and Person Search via Knowledge Distillation.
Wei Zhang, Lingxiao He, Peng Chen, Xingyu Liao, Wu Liu, Qi Li, Zhenan Sun
2021Boosting Lightweight Single Image Super-resolution via Joint-distillation.
Xiaotong Luo, Qiuyuan Liang, Ding Liu, Yanyun Qu
2021Boosting Mobile CNN Inference through Semantic Memory.
Yun Li, Chen Zhang, Shihao Han, Li Lyna Zhang, Baoqun Yin, Yunxin Liu, Mengwei Xu
2021Bottom-Up and Bidirectional Alignment for Referring Expression Comprehension.
Liuwu Li, Yuqi Bu, Yi Cai
2021BridgeNet: A Joint Learning Network of Depth Map Super-Resolution and Monocular Depth Estimation.
Qi Tang, Runmin Cong, Ronghui Sheng, Lingzhi He, Dan Zhang, Yao Zhao, Sam Kwong
2021Bridging the Gap between Low-Light Scenes: Bilevel Learning for Fast Adaptation.
Dian Jin, Long Ma, Risheng Liu, Xin Fan
2021Build Your Own Bundle - A Neural Combinatorial Optimization Method.
Qilin Deng, Kai Wang, Minghao Zhao, Runze Wu, Yu Ding, Zhene Zou, Yue Shang, Jianrong Tao, Changjie Fan
2021CAA: Candidate-Aware Aggregation for Temporal Action Detection.
Yifan Ren, Xing Xu, Fumin Shen, Yazhou Yao, Huimin Lu
2021CALLip: Lipreading using Contrastive and Attribute Learning.
Yiyang Huang, Xuefeng Liang, Chaowei Fang
2021CARE: Cloudified Android OSes on the Cloud Rendering.
Dongjie Tang, Cathy Bao, Yong Yao, Chao Xie, Qiming Shi, Marc Mao, Randy Xu, Linsheng Li, Mohammad R. Haghighat, Zhengwei Qi, Haibing Guan
2021CDD: Multi-view Subspace Clustering via Cross-view Diversity Detection.
Shudong Huang, Ivor W. Tsang, Zenglin Xu, Jiancheng Lv, Quanhui Liu
2021CDP: Towards Optimal Filter Pruning via Class-wise Discriminative Power.
Tianshuo Xu, Yuhang Wu, Xiawu Zheng, Teng Xi, Gang Zhang, Errui Ding, Fei Chao, Rongrong Ji
2021CG-GAN: Class-Attribute Guided Generative Adversarial Network for Old Photo Restoration.
Jixin Liu, Rui Chen, Shipeng An, Heng Zhang
2021CLIP4Caption: CLIP for Video Caption.
Mingkang Tang, Zhanyu Wang, Zhenhua Liu, Fengyun Rao, Dian Li, Xiu Li
2021CONQUER: Contextual Query-aware Ranking for Video Corpus Moment Retrieval.
Zhijian Hou, Chong-Wah Ngo, Wing Kwong Chan
2021CaFGraph: Context-aware Facial Multi-graph Representation for Facial Action Unit Recognition.
Yingjie Chen, Diqi Chen, Yizhou Wang, Tao Wang, Yun Liang
2021Camera-Agnostic Person Re-Identification via Adversarial Disentangling Learning.
Hao Ni, Jingkuan Song, Xiaosu Zhu, Feng Zheng, Lianli Gao
2021CanvasEmb: Learning Layout Representation with Large-scale Pre-training for Graphic Design.
Yuxi Xie, Danqing Huang, Jinpeng Wang, Chin-Yew Lin
2021Capsule-based Object Tracking with Natural Language Specification.
Ding Ma, Xiangqian Wu
2021Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence.
Weidong Chen, Guorong Li, Xinfeng Zhang, Hongyang Yu, Shuhui Wang, Qingming Huang
2021CausalRec: Causal Inference for Visual Debiasing in Visually-Aware Recommendation.
Ruihong Qiu, Sen Wang, Zhi Chen, Hongzhi Yin, Zi Huang
2021ChartPointFlow for Topology-Aware 3D Point Cloud Generation.
Takumi Kimura, Takashi Matsubara, Kuniaki Uehara
2021Chinese Character Inpainting with Contextual Semantic Constraints.
Jiahao Wang, Gang Pan, Di Sun, Jiawan Zhang
2021Cluster and Scatter: A Multi-grained Active Semi-supervised Learning Framework for Scalable Person Re-identification.
Bingyu Hu, Zheng-Jun Zha, Jiawei Liu, Xierong Zhu, Hongtao Xie
2021Co-Transport for Class-Incremental Learning.
Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan
2021Co-learning: Learning from Noisy Labels with Self-supervision.
Cheng Tan, Jun Xia, Lirong Wu, Stan Z. Li
2021CoCo-BERT: Improving Video-Language Pre-training with Contrastive Cross-modal Matching and Denoising.
Jianjie Luo, Yehao Li, Yingwei Pan, Ting Yao, Hongyang Chao, Tao Mei
2021CoReD: Generalizing Fake Media Detection with Continual Representation using Distillation.
Minha Kim, Shahroz Tariq, Simon S. Woo
2021Coarse to Fine: Domain Adaptive Crowd Counting via Adversarial Scoring Network.
Zhikang Zou, Xiaoye Qu, Pan Zhou, Shuangjie Xu, Xiaoqing Ye, Wenhao Wu, Jin Ye
2021Collocation and Try-on Network: Whether an Outfit is Compatible.
Na Zheng, Xuemeng Song, Qingying Niu, Xue Dong, Yibing Zhan, Liqiang Nie
2021Combining Attention with Flow for Person Image Synthesis.
Yurui Ren, Yubo Wu, Thomas H. Li, Shan Liu, Ge Li
2021Community Generated VR Painting using Eye Gaze.
Mu Mu, Murtada Dohan
2021Complementary Factorization towards Outfit Compatibility Modeling.
Tianyu Su, Xuemeng Song, Na Zheng, Weili Guan, Yan Li, Liqiang Nie
2021Complementary Trilateral Decoder for Fast and Accurate Salient Object Detection.
Zhirui Zhao, Changqun Xia, Chenxi Xie, Jia Li
2021Conceptual and Syntactical Cross-modal Alignment with Cross-level Consistency for Image-Text Matching.
Pengpeng Zeng, Lianli Gao, Xinyu Lyu, Shuaiqi Jing, Jingkuan Song
2021Conditional Directed Graph Convolution for 3D Human Pose Estimation.
Wenbo Hu, Changgong Zhang, Fangneng Zhan, Lei Zhang, Tien-Tsin Wong
2021Consistency-Constancy Bi-Knowledge Learning for Pedestrian Detection in Night Surveillance.
Xiao Wang, Zheng Wang, Wu Liu, Xin Xu, Jing Chen, Chia-Wen Lin
2021Constrained Graphic Layout Generation via Latent Optimization.
Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi
2021Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model.
Shuangping Huang, Yu Luo, Zhenzhou Zhuang, Jin-Gang Yu, Mengchao He, Yongpan Wang
2021Contrastive Disentangled Meta-Learning for Signer-Independent Sign Language Translation.
Tao Jin, Zhou Zhao
2021Contrastive Learning for Cold-Start Recommendation.
Yinwei Wei, Xiang Wang, Qi Li, Liqiang Nie, Yan Li, Xuanping Li, Tat-Seng Chua
2021Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection.
Xinyang Feng, Dongjin Song, Yuncong Chen, Zhengzhang Chen, Jingchao Ni, Haifeng Chen
2021Counterfactual Debiasing Inference for Compositional Action Recognition.
Pengzhan Sun, Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan
2021Cross Chest Graph for Disease Diagnosis with Structural Relational Reasoning.
Gangming Zhao
2021Cross Modal Compression: Towards Human-comprehensible Semantic Compression.
Jiguo Li, Chuanmin Jia, Xinfeng Zhang, Siwei Ma, Wen Gao
2021Cross-Camera Feature Prediction for Intra-Camera Supervised Person Re-identification across Distant Scenes.
Wenhang Ge, Chunyan Pan, Ancong Wu, Hongwei Zheng, Wei-Shi Zheng
2021Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment.
Paul Pu Liang, Peter Wu, Liu Ziyin, Louis-Philippe Morency, Ruslan Salakhutdinov
2021Cross-Modal Recipe Embeddings by Disentangling Recipe Contents and Dish Styles.
Yu Sugiyama, Keiji Yanai
2021Cross-View Exocentric to Egocentric Video Synthesis.
Gaowen Liu, Hao Tang, Hugo Latapie, Jason J. Corso, Yan Yan
2021Cross-View Representation Learning for Multi-View Logo Classification with Information Bottleneck.
Jing Wang, Yuanjie Zheng, Jingqi Song, Sujuan Hou
2021Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization.
Fa-Ting Hong, Jia-Chang Feng, Dan Xu, Ying Shan, Wei-Shi Zheng
2021Cross-modal Joint Prediction and Alignment for Composed Query Image Retrieval.
Yuchen Yang, Min Wang, Wengang Zhou, Houqiang Li
2021Cross-modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Subspace Learning.
Ricardo Guerrero, Hai Xuan Pham, Vladimir Pavlovic
2021Cross-modal Self-Supervised Learning for Lip Reading: When Contrastive Learning meets Adversarial Training.
Changchong Sheng, Matti Pietikäinen, Qi Tian, Li Liu
2021Cross-modality Discrepant Interaction Network for RGB-D Salient Object Detection.
Chen Zhang, Runmin Cong, Qinwei Lin, Lin Ma, Feng Li, Yao Zhao, Sam Kwong
2021Curriculum-Based Meta-learning.
Ji Zhang, Jingkuan Song, Yazhou Yao, Lianli Gao
2021Cut-Thumbnail: A Novel Data Augmentation for Convolutional Neural Network.
Tianshu Xie, Xuan Cheng, Xiaomin Wang, Minghui Liu, Jiali Deng, Tao Zhou, Ming Liu
2021Cycle-Consistent Inverse GAN for Text-to-Image Synthesis.
Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
2021DAWN: Dynamic Adversarial Watermarking of Neural Networks.
Sebastian Szyller, Buse Gul Atli, Samuel Marchal, N. Asokan
2021DC-GNet: Deep Mesh Relation Capturing Graph Convolution Network for 3D Human Shape Reconstruction.
Shihao Zhou, Mengxi Jiang, Shanshan Cai, Yunqi Lei
2021DEPA: Self-Supervised Audio Embedding for Depression Detection.
Pingyue Zhang, Mengyue Wu, Heinrich Dinkel, Kai Yu
2021DFR-Net: A Novel Multi-Task Learning Network for Real-Time Multi-Instrument Segmentation.
Yan-Jie Zhou, Shi-Qi Liu, Xiao-Liang Xie, Zeng-Guang Hou
2021DLA-Net for FG-SBIR: Dynamic Local Aligned Network for Fine-Grained Sketch-Based Image Retrieval.
Jiaqing Xu, Haifeng Sun, Qi Qi, Jingyu Wang, Ce Ge, Lejian Zhang, Jianxin Liao
2021DPT: Deformable Patch-based Transformer for Visual Recognition.
Zhiyang Chen, Yousong Zhu, Chaoyang Zhao, Guosheng Hu, Wei Zeng, Jinqiao Wang, Ming Tang
2021DRDF: Determining the Importance of Different Multimodal Information with Dual-Router Dynamic Framework.
Haiwen Hong, Xuan Jin, Yin Zhang, Yunqing Hu, Jingfeng Zhang, Yuan He, Hui Xue
2021DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning.
Wenhao Wu, Yuxiang Zhao, Yanwu Xu, Xiao Tan, Dongliang He, Zhikang Zou, Jin Ye, Yingying Li, Mingde Yao, Zichao Dong, Yifeng Shi
2021DSP: Dual Soft-Paste for Unsupervised Domain Adaptive Semantic Segmentation.
Li Gao, Jing Zhang, Lefei Zhang, Dacheng Tao
2021DSSL: Deep Surroundings-person Separation Learning for Text-based Person Retrieval.
Aichun Zhu, Zijie Wang, Yifeng Li, Xili Wan, Jing Jin, Tian Wang, Fangqiang Hu, Gang Hua
2021Data-Free Ensemble Knowledge Distillation for Privacy-conscious Multimedia Model Compression.
Zhiwei Hao, Yong Luo, Han Hu, Jianping An, Yonggang Wen
2021Database-adaptive Re-ranking for Enhancing Cross-modal Image Retrieval.
Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama
2021Deadline and Priority-aware Congestion Control for Delay-sensitive Multimedia Streaming.
Chao Zhou, Wenjun Wu, Dan Yang, Tianchi Huang, Liang Guo, Bing Yu
2021Deconfounded and Explainable Interactive Vision-Language Retrieval of Complex Scenes.
Junda Wu, Tong Yu, Shuai Li
2021Decoupled IoU Regression for Object Detection.
Yan Gao, Qimeng Wang, Xu Tang, Haochen Wang, Fei Ding, Jing Li, Yao Hu
2021Deep Clustering based on Bi-Space Association Learning.
Hao Huang, Shinjae Yoo, Chenxiao Xu
2021Deep Human Dynamics Prior.
Qiongjie Cui, Huaijiang Sun, Yue Kong, Xiaoning Sun
2021Deep Interactive Video Inpainting: An Invisibility Cloak for Harry Potter.
Cheng Chen, Jiayin Cai, Yao Hu, Xu Tang, Xinggang Wang, Chun Yuan, Xiang Bai, Song Bai
2021Deep Learning for Visual Data Compression.
Guo Lu, Ren Yang, Shenlong Wang, Shan Liu, Radu Timofte
2021Deep Marginal Fisher Analysis based CNN for Image Representation and Classification.
Xun Cai, Jiajing Chai, Yanbo Gao, Shuai Li, Bo Zhu
2021Deep Neural Network Retrieval.
Nan Zhong, Zhenxing Qian, Xinpeng Zhang
2021Deep Reasoning Network for Few-shot Semantic Segmentation.
Yunzhi Zhuge, Chunhua Shen
2021Deep Self-Supervised t-SNE for Multi-modal Subspace Clustering.
Qianqian Wang, Wei Xia, Zhiqiang Tao, Quanxue Gao, Xiaochun Cao
2021Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment.
Yuxing Wang, Yawen Lu, Zhihua Xie, Guoyu Lu
2021DeepGame: Efficient Video Encoding for Cloud Gaming.
Omar Mossad, Khaled Diab, Ihab Amer, Mohamed Hefeeda
2021DehazeFlow: Multi-scale Conditional Flow Network for Single Image Dehazing.
Hongyu Li, Jia Li, Dong Zhao, Long Xu
2021Delving into Deep Image Prior for Adversarial Defense: A Novel Reconstruction-based Defense Framework.
Li Ding, Yongwei Wang, Xin Ding, Kaiwen Yuan, Ping Wang, Hua Huang, Z. Jane Wang
2021Demystifying Commercial Video Conferencing Applications.
Insoo Lee, Jinsung Lee, Kyunghan Lee, Dirk Grunwald, Sangtae Ha
2021Dense Contrastive Visual-Linguistic Pretraining.
Lei Shi, Kai Shuang, Shijie Geng, Peng Gao, Zuohui Fu, Gerard de Melo, Yunpeng Chen, Sen Su
2021Dense Semantic Contrast for Self-Supervised Visual Representation Learning.
Xiaoni Li, Yu Zhou, Yifei Zhang, Aoting Zhang, Wei Wang, Ning Jiang, Haiying Wu, Weiping Wang
2021Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection.
Wenbo Zhang, Ge-Peng Ji, Zhuo Wang, Keren Fu, Qijun Zhao
2021Differentiated Learning for Multi-Modal Domain Adaptation.
Jianming Lv, Kaijie Liu, Shengfeng He
2021Diffusing the Liveness Cues for Face Anti-spoofing.
Sheng Li, Xun Zhu, Guorui Feng, Xinpeng Zhang, Zhenxing Qian
2021Digital Human in an Integrated Physical-Digital World (IPhD).
Zhengyou Zhang
2021Direction Relation Transformer for Image Captioning.
Zeliang Song, Xiaofei Zhou, Linhua Dong, Jianlong Tan, Li Guo
2021Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations.
Guanyue Li, Yi Liu, Xiwen Wei, Yang Zhang, Si Wu, Yong Xu, Hau-San Wong
2021Discriminative Latent Semantic Graph for Video Captioning.
Yang Bai, Junyan Wang, Yang Long, Bingzhang Hu, Yang Song, Maurice Pagnucco, Yu Guan
2021Discriminator-free Generative Adversarial Attack.
Shaohao Lu, Yuqiao Xian, Ke Yan, Yi Hu, Xing Sun, Xiaowei Guo, Feiyue Huang, Wei-Shi Zheng
2021Disentangle Your Dense Object Detector.
Zehui Chen, Chenhongyi Yang, Qiaofei Li, Feng Zhao, Zheng-Jun Zha, Feng Wu
2021Disentangled Representation Learning and Enhancement Network for Single Image De-Raining.
Guoqing Wang, Changming Sun, Xing Xu, Jingjing Li, Zheng Wang, Zeyu Ma
2021Disentangling Hate in Online Memes.
Roy Ka-Wei Lee, Rui Cao, Ziqing Fan, Jing Jiang, Wen-Haw Chong
2021Distantly Supervised Semantic Text Detection and Recognition for Broadcast Sports Videos Understanding.
Avijit Shah, Topojoy Biswas, Sathish Ramadoss, Deven Santosh Shah
2021Distributed Attention for Grounded Image Captioning.
Nenglun Chen, Xingjia Pan, Runnan Chen, Lei Yang, Zhiwen Lin, Yuqiang Ren, Haolei Yuan, Xiaowei Guo, Feiyue Huang, Wenping Wang
2021Diverse Image Inpainting with Bidirectional and Autoregressive Transformers.
Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jianxiong Pan, Kaiwen Cui, Shijian Lu, Feiying Ma, Xuansong Xie, Chunyan Miao
2021Diverse Multimedia Layout Generation with Multi Choice Learning.
David D. Nguyen, Surya Nepal, Salil S. Kanhere
2021Do We Really Need Frame-by-Frame Annotation Datasets for Object Tracking?
Lei Hu, Shaoli Huang, Shilei Wang, Wei Liu, Jifeng Ning
2021Do you see what I see?: Large-scale Learning from Multimodal Videos.
Cordelia Schmid
2021DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction.
Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li
2021Domain Adaptive Semantic Segmentation without Source Data.
Fuming You, Jingjing Li, Lei Zhu, Zhi Chen, Zi Huang
2021Domain Generalization via Feature Variation Decorrelation.
Chang Liu, Lichen Wang, Kai Li, Yun Fu
2021Domain-Aware SE Network for Sketch-based Image Retrieval with Multiplicative Euclidean Margin Softmax.
Peng Lu, Gao Huang, Hangyu Lin, Wenming Yang, Guodong Guo, Yanwei Fu
2021Dual Graph Convolutional Networks with Transformer and Curriculum Learning for Image Captioning.
Xinzhi Dong, Chengjiang Long, Wenju Xu, Chunxia Xiao
2021Dual Learning Music Composition and Dance Choreography.
Shuang Wu, Zhenguang Liu, Shijian Lu, Li Cheng
2021Dynamic Knowledge Distillation with Cross-Modality Knowledge Transfer.
Guangzhi Wang
2021Dynamic Momentum Adaptation for Zero-Shot Cross-Domain Crowd Counting.
Qiangqiang Wu, Jia Wan, Antoni B. Chan
2021D³Net: Dual-Branch Disturbance Disentangling Network for Facial Expression Recognition.
Rongyun Mo, Yan Yan, Jing-Hao Xue, Si Chen, Hanzi Wang
2021E2Net: Excitative-Expansile Learning for Weakly Supervised Object Localization.
Zhiwei Chen, Liujuan Cao, Yunhang Shen, Feihong Lian, Yongjian Wu, Rongrong Ji
2021EVRNet: Efficient Video Restoration on Edge Devices.
Sachin Mehta, Amit Kumar, Fitsum A. Reda, Varun Nasery, Vikram Mulukutla, Rakesh Ranjan, Vikas Chandra
2021Edge-oriented Convolution Block for Real-time Super Resolution on Mobile Devices.
Xindong Zhang, Hui Zeng, Lei Zhang
2021Edit Like A Designer: Modeling Design Workflows for Unaligned Fashion Editing.
Qiyu Dai, Shuai Yang, Wenjing Wang, Wei Xiang, Jiaying Liu
2021Effective De-identification Generative Adversarial Network for Face Anonymization.
Zhenzhong Kuang, Huigui Liu, Jun Yu, Aikui Tian, Lei Wang, Jianping Fan, Noboru Babaguchi
2021Efficient Graph Deep Learning in TensorFlow with tf_geometric.
Jun Hu, Shengsheng Qian, Quan Fang, Youze Wang, Quan Zhao, Huaiwen Zhang, Changsheng Xu
2021Efficient Multi-Modal Fusion with Diversity Analysis.
Shuhui Qu, Yan Kang, Janghwan Lee
2021Efficient Reinforcement Learning Development with RLzoo.
Zihan Ding, Tianyang Yu, Hongming Zhang, Yanhua Huang, Guo Li, Quancheng Guo, Luo Mai, Hao Dong
2021Efficient Sparse Attacks on Videos using Reinforcement Learning.
Huanqian Yan, Xingxing Wei
2021Ego-Deliver: A Large-Scale Dataset For Egocentric Video Analysis.
Haonan Qiu, Pan He, Shuchun Liu, Weiyuan Shao, Feiyun Zhang, Jiajun Wang, Liang He, Feng Wang
2021Elastic Tactile Simulation Towards Tactile-Visual Perception.
Yikai Wang, Wenbing Huang, Bin Fang, Fuchun Sun, Chang Li
2021Embracing the Dark Knowledge: Domain Generalization Using Regularized Knowledge Distillation.
Yufei Wang, Haoliang Li, Lap-Pui Chau, Alex C. Kot
2021End-to-End Video Object Detection with Spatial-Temporal Transformers.
Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang
2021End-to-end Boundary Exploration for Weakly-supervised Semantic Segmentation.
Jianjun Chen, Shancheng Fang, Hongtao Xie, Zheng-Jun Zha, Yue Hu, Jianlong Tan
2021End-to-end Quality of Experience Evaluation for HTTP Adaptive Streaming.
Babak Taraghi
2021Enhanced Invertible Encoding for Learned Image Compression.
Yueqi Xie, Ka Leong Cheng, Qifeng Chen
2021Enhancing Knowledge Tracing via Adversarial Training.
Xiaopeng Guo, Zhijie Huang, Jie Gao, Mingyu Shang, Maojing Shu, Jun Sun
2021Exploiting BERT for Multimodal Target Sentiment Classification through Input Space Translation.
Zaid Khan, Yun Fu
2021Exploiting Invariance of Mining Facial Landmarks.
Jiangming Shi, Zixian Gao, Hao Liu, Zekuan Yu, Fengjun Li
2021Exploring Contextual-Aware Representation and Linguistic-Diverse Expression for Visual Dialog.
Xiangpeng Li, Lianli Gao, Lei Zhao, Jingkuan Song
2021Exploring Gradient Flow Based Saliency for DNN Model Compression.
Xinyu Liu, Baopu Li, Zhen Chen, Yixuan Yuan
2021Exploring Graph-Structured Semantics for Cross-Modal Retrieval.
Lei Zhang, Leiting Chen, Chuan Zhou, Fan Yang, Xin Li
2021Exploring Logical Reasoning for Referring Expression Comprehension.
Ying Cheng, Ruize Wang, Jiashuo Yu, Rui-Wei Zhao, Yuejie Zhang, Rui Feng
2021Exploring Pathologist Knowledge for Automatic Assessment of Breast Cancer Metastases in Whole-slide Image.
Liuan Wang, Li Sun, Mingjie Zhang, Huigang Zhang, Ping Wang, Rong Zhou, Jun Sun
2021Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers.
Wen Wang, Yang Cao, Jing Zhang, Fengxiang He, Zheng-Jun Zha, Yonggang Wen, Dacheng Tao
2021Exploring the Quality of GAN Generated Images for Person Re-Identification.
Yiqi Jiang, Weihua Chen, Xiuyu Sun, Xiaoyu Shi, Fan Wang, Hao Li
2021Extending 6-DoF VR Experience Via Multi-Sphere Images Interpolation.
Jisheng Li, Yuze He, Jinghui Jiao, Yubin Hu, Yuxing Han, Jiangtao Wen
2021Extracting Useful Knowledge from Noisy Web Images via Data Purification for Fine-Grained Recognition.
Chuanyi Zhang, Yazhou Yao, Xing Xu, Jie Shao, Jingkuan Song, Zechao Li, Zhenmin Tang
2021FAMGAN: Fine-grained AUs Modulation based Generative Adversarial Network for Micro-Expression Generation.
Yifan Xu, Sirui Zhao, Huaying Tang, Xinglong Mao, Tong Xu, Enhong Chen
2021FME'21: 1st Workshop on Facial Micro-Expression: Advanced Techniques for Facial Expressions Generation and Spotting.
Jingting Li, Moi Hoon Yap, Wen-Huang Cheng, John See, Xiaopeng Hong, Xiaobai Li, Su-Jing Wang
2021FOCAS: Practical Video Super Resolution using Foveated Rendering.
Lingdong Wang, Mohammad H. Hajiesmaili, Ramesh K. Sitaraman
2021FTAFace: Context-enhanced Face Detector with Fine-grained Task Attention.
Deyu Wang, Dongchao Wen, Wei Tao, Lingxiao Yin, Tse-Wei Chen, Tadayuki Ito, Kinya Osa, Masami Kato
2021Face Hallucination via Split-Attention in Split-Attention Network.
Tao Lu, Yuanzhi Wang, Yanduo Zhang, Yu Wang, Wei Liu, Zhongyuan Wang, Junjun Jiang
2021Face-based Voice Conversion: Learning the Voice behind a Face.
Hsiao-Han Lu, Shao-En Weng, Ya-Fan Yen, Hong-Han Shuai, Wen-Huang Cheng
2021FaceX-Zoo: A PyTorch Toolbox for Face Recognition.
Jun Wang, Yinglu Liu, Yibo Hu, Hailin Shi, Tao Mei
2021Facial Action Unit-based Deep Learning Framework for Spotting Macro- and Micro-expressions in Long Video Sequences.
Bo Yang, Jianming Wu, Zhiguang Zhou, Megumi Komiya, Koki Kishimoto, Jianfeng Xu, Keisuke Nonaka, Toshiharu Horiuchi, Satoshi Komorita, Gen Hattori, Sei Naito, Yasuhiro Takishima
2021Facial Micro-Expression Generation based on Deep Motion Retargeting and Transfer Learning.
Xinqi Fan, Ali Raza Shahid, Hong Yan
2021Facial Prior Based First Order Motion Model for Micro-expression Generation.
Yi Zhang, Youjun Zhao, Yuhang Wen, Zixuan Tang, Xinhua Xu, Mengyuan Liu
2021Fake Gradient: A Security and Privacy Protection Framework for DNN-based Image Classification.
Xianglong Feng, Yi Xie, Mengmei Ye, Zhongze Tang, Bo Yuan, Sheng Wei
2021FakeTagger: Robust Safeguards against DeepFake Dissemination via Provenance Tracking.
Run Wang, Felix Juefei-Xu, Meng Luo, Yang Liu, Lina Wang
2021Fast Video Visual Quality and Resolution Improvement using SR-UNet.
Federico Vaccaro, Marco Bertini, Tiberio Uricchio, Alberto Del Bimbo
2021Fast and Accurate Lane Detection via Frequency Domain Learning.
Yulin He, Wei Chen, Zhengfa Liang, Dan Chen, Yusong Tan, Xin Luo, Chen Li, Yulan Guo
2021Fast and Flexible Human Pose Estimation with HyperPose.
Yixiao Guo, Jiawei Liu, Guo Li, Luo Mai, Hao Dong
2021Fast, High-Quality Hierarchical Depth-Map Super-Resolution.
Yiguo Qiao, Licheng Jiao, Wenbin Li, Christian Richardt, Darren Cosker
2021Fast-forwarding, Rewinding, and Path Exploration in Interactive Branched Video Streaming.
Albin Vogel, Erik Kronberg, Niklas Carlsson
2021Faster-PPN: Towards Real-Time Semantic Segmentation with Dual Mutual Learning for Ultra-High Resolution Images.
Bicheng Dai, Kaisheng Wu, Tong Wu, Kai Li, Yanyun Qu, Yuan Xie, Yun Fu
2021Feature Stylization and Domain-aware Contrastive Learning for Domain Generalization.
Seogkyu Jeon, Kibeom Hong, Pilhyeon Lee, Jewook Lee, Hyeran Byun
2021Feedback Network for Mutually Boosted Stereo Image Super-Resolution and Disparity Estimation.
Qinyan Dai, Juncheng Li, Qiaosi Yi, Faming Fang, Guixu Zhang
2021Few-Shot Multi-Agent Perception.
Chenyou Fan, Junjie Hu, Jianwei Huang
2021Few-shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning.
Jiahao Wang, Yunhong Wang, Sheng Liu, Annan Li
2021Few-shot Learning for Multi-Modality Tasks.
Jie Chen, Qixiang Ye, Xiaoshan Yang, S. Kevin Zhou, Xiaopeng Hong, Li Zhang
2021Few-shot Unsupervised Domain Adaptation with Image-to-Class Sparse Similarity Encoding.
Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang
2021Fine-Grained Language Identification in Scene Text Images.
Yongrui Li, Shilian Wu, Jun Yu, Zengfu Wang
2021Fine-grained Cross-modal Alignment Network for Text-Video Retrieval.
Ning Han, Jingjing Chen, Guangyi Xiao, Hao Zhang, Yawen Zeng, Hao Chen
2021Fingerspelling Recognition in the Wild with Fixed-Query based Visual Attention.
Srinivas Kruthiventi S. S, George Jose, Nitya Tandon, Rajesh Roshan Biswal, Aashish Kumar
2021Focal and Composed Vision-semantic Modeling for Visual Question Answering.
Yudong Han, Yangyang Guo, Jianhua Yin, Meng Liu, Yupeng Hu, Liqiang Nie
2021Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies.
Xin Jin, Zhonglan Li, Ke Liu, Dongqing Zou, Xiaodong Li, Xingfan Zhu, Ziyin Zhou, Qilong Sun, Qingyu Liu
2021FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network.
Qiang Hou, Weiqing Min, Jing Wang, Sujuan Hou, Yuanjie Zheng, Shuqiang Jiang
2021Former-DFER: Dynamic Facial Expression Recognition Transformer.
Zengqun Zhao, Qingshan Liu
2021From Image to Imuge: Immunized Image Generation.
Qichao Ying, Zhenxing Qian, Hang Zhou, Haisheng Xu, Xinpeng Zhang, Siyi Li
2021From Superficial to Deep: Language Bias driven Curriculum Learning for Visual Question Answering.
Mingrui Lao, Yanming Guo, Yu Liu, Wei Chen, Nan Pu, Michael S. Lew
2021From Synthetic to Real: Image Dehazing Collaborating with Unlabeled Real Data.
Ye Liu, Lei Zhu, Shunda Pei, Huazhu Fu, Jing Qin, Qing Zhang, Liang Wan, Wei Feng
2021From Voxel to Point: IoU-guided 3D Object Detection for Point Cloud with Voxel-to-Point Decoder.
Jiale Li, Hang Dai, Ling Shao, Yong Ding
2021Fully Functional Image Manipulation Using Scene Graphs in A Bounding-Box Free Way.
Sitong Su, Lianli Gao, Junchen Zhu, Jie Shao, Jingkuan Song
2021Fully Quantized Image Super-Resolution Networks.
Hu Wang, Peng Chen, Bohan Zhuang, Chunhua Shen
2021GAMnet: Robust Feature Matching via Graph Adversarial-Matching Network.
Bo Jiang, Pengfei Sun, Ziyan Zhang, Jin Tang, Bin Luo
2021GAN-aided Serial Dependence Study in Medical Image Perception.
Zhihang Ren
2021GCCN: Geometric Constraint Co-attention Network for 6D Object Pose Estimation.
Yongming Wen, Yiquan Fang, Junhao Cai, Kimwa Tung, Hui Cheng
2021GCM-Net: Towards Effective Global Context Modeling for Image Inpainting.
Huan Zheng, Zhao Zhang, Yang Wang, Zheng Zhang, Mingliang Xu, Yi Yang, Meng Wang
2021GCNIllustrator: Illustrating the Effect of Hyperparameters on Graph Convolutional Networks.
Ivona Najdenkoska, Jeroen den Boef, Thomas Schneider, Justo van der Werf, Reinier de Ridder, Fajar Fathurrahman, Marcel Worring
2021GLM-Net: Global and Local Motion Estimation via Task-Oriented Encoder-Decoder Structure.
Yuchen Yang, Ye Xiang, Shuaicheng Liu, Lifang Wu, Boxuan Zhao, Bing Zeng
2021Game Theory-driven Rate Control for 360-Degree Video Coding.
Tiesong Zhao, Jielian Lin, Yanjie Song, Xu Wang, Yuzhen Niu
2021General Approximate Cross Validation for Model Selection: Supervised, Semi-supervised and Pairwise Learning.
Bowei Zhu, Yong Liu
2021Generally Boosting Few-Shot Learning with HandCrafted Features.
Yi Zhang, Sheng Huang, Fengtao Zhou
2021Generating Point Cloud from Single Image in The Few Shot Scenario.
Yu Lin, Jinghui Guo, Yang Gao, Yi-Fan Li, Zhuoyi Wang, Latifur Khan
2021Generative Adversarial Network for Text-to-Face Synthesis and Manipulation.
Yutong Zhou
2021Get The Best of the Three Worlds: Real-Time Neural Image Compression in a Non-GPU Environment.
Zekun Zheng, Xiaodong Wang, Xinye Lin, Shaohe Lv
2021Graph Convolutional Multi-modal Hashing for Flexible Multimedia Retrieval.
Xu Lu, Lei Zhu, Li Liu, Liqiang Nie, Huaxiang Zhang
2021Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings.
Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem Wijnberg
2021Group-Level Focus of Visual Attention for Improved Next Speaker Prediction.
Chris Birmingham, Kalin Stefanov, Maja J. Mataric
2021Group-based Distinctive Image Captioning with Memory Attention.
Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan
2021HANet: Hierarchical Alignment Networks for Video-Text Retrieval.
Peng Wu, Xiangteng He, Mingqian Tang, Yiliang Lv, Jing Liu
2021HAT: Hierarchical Aggregation Transformers for Person Re-identification.
Guowen Zhang, Pingping Zhang, Jinqing Qi, Huchuan Lu
2021HDA-Net: Horizontal Deformable Attention Network for Stereo Matching.
Qi Zhang, Xuesong Zhang, Baoping Li, Yuzhong Chen, Anlong Ming
2021HUMA'21: 2nd International Workshop on Human-centric Multimedia Analysis.
Wu Liu, Xinchen Liu, Jingkuan Song, Dingwen Zhang, Wenbing Huang, Junbo Guo, John Smith
2021Handling Difficult Labels for Multi-label Image Classification via Uncertainty Distillation.
Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan
2021Heraclitus's Forest: An Interactive Artwork for Oral History.
Lin Wang, Zhonghao Lin, Wei Cai
2021HetEmotionNet: Two-Stream Heterogeneous Graph Recurrent Neural Network for Multi-modal Emotion Recognition.
Ziyu Jia, Youfang Lin, Jing Wang, Zhiyang Feng, Xiangheng Xie, Caijie Chen
2021Heterogeneous Face Recognition with Attention-guided Feature Disentangling.
Shanmin Yang, Xiao Yang, Yi Lin, Peng Cheng, Yi Zhang, Jianwei Zhang
2021Heterogeneous Feature Fusion and Cross-modal Alignment for Composed Image Retrieval.
Gangjian Zhang, Shikui Wei, Huaxin Pang, Yao Zhao
2021Heuristic Depth Estimation with Progressive Depth Reconstruction and Confidence-Aware Loss.
Jiehua Zhang, Liang Li, Chenggang Yan, Yaoqi Sun, Tao Shen, Jiyong Zhang, Zhan Wang
2021Hierarchical Fusion for Practical Ghost-free High Dynamic Range Imaging.
Pengfei Xiong, Yu Chen
2021Hierarchical Multi-Task Learning for Diagram Question Answering with Multi-Modal Transformer.
Zhaoquan Yuan, Xiao Peng, Xiao Wu, Changsheng Xu
2021Hierarchical View Predictor: Unsupervised 3D Global Feature Learning through Hierarchical Prediction among Unordered Views.
Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker
2021How Video Super-Resolution and Frame Interpolation Mutually Benefit.
Chengcheng Zhou, Zongqing Lu, Linge Li, Qiangyu Yan, Jing-Hao Xue
2021How does Color Constancy Affect Target Recognition and Instance Segmentation?
Siyan Xue, Shaobing Gao, Minjie Tan, Zhen He, Liangtian He
2021How to Learn a Domain-Adaptive Event Simulator?
Daxin Gu, Jia Li, Yu Zhang, Yonghong Tian
2021Human Attributes Prediction under Privacy-preserving Conditions.
Anshu Singh, Shaojing Fan, Mohan S. Kankanhalli
2021Hybrid Network Compression via Meta-Learning.
Jianming Ye, Shiliang Zhang, Jingdong Wang
2021Hybrid Reasoning Network for Video-based Commonsense Captioning.
Weijiang Yu, Jian Liang, Lei Ji, Lu Li, Yuejian Fang, Nong Xiao, Nan Duan
2021I Know Your Keyboard Input: A Robust Keystroke Eavesdropper Based-on Acoustic Signals.
Jia-Xuan Bai, Bin Liu, Luchuan Song
2021I2V-GAN: Unpaired Infrared-to-Visible Video Translation.
Shuang Li, Bingfeng Han, Zhenjie Yu, Chi Harold Liu, Kai Chen, Shuigen Wang
2021ION: Instance-level Object Navigation.
Weijie Li, Xinhang Song, Yubing Bai, Sixian Zhang, Shuqiang Jiang
2021Identity-Preserving Face Anonymization via Adaptively Facial Attributes Obfuscation.
Jingzhi Li, Lutong Han, Ruoyu Chen, Hua Zhang, Bing Han, Lili Wang, Xiaochun Cao
2021Identity-aware Graph Memory Network for Action Detection.
Jingcheng Ni, Jie Qin, Di Huang
2021Image Quality Assessment in the Modern Age.
Kede Ma, Yuming Fang
2021Image Quality Caption with Attentive and Recurrent Semantic Attractor Network.
Wen Yang, Jinjian Wu, Leida Li, Weisheng Dong, Guangming Shi
2021Image Re-composition via Regional Content-Style Decoupling.
Rong Zhang, Wei Li, Yiqun Zhang, Hong Zhang, Jinhui Yu, Ruigang Yang, Weiwei Xu
2021Image Search with Text Feedback by Deep Hierarchical Attention Mutual Information Maximization.
Chunbin Gu, Jiajun Bu, Zhen Zhang, Zhi Yu, Dongfang Ma, Wei Wang
2021Image Style Transfer with Generative Adversarial Networks.
Ru Li
2021Imbalanced Source-free Domain Adaptation.
Xinhao Li, Jingjing Li, Lei Zhu, Guoqing Wang, Zi Huang
2021Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis.
Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng
2021Imitative Learning for Multi-Person Action Forecasting.
Yu-Ke Li, Pin Wang, Mang Ye, Ching-Yao Chan
2021Implicit Feature Refinement for Instance Segmentation.
Lufan Ma, Tiancai Wang, Bin Dong, Jiangpeng Yan, Xiu Li, Xiangyu Zhang
2021Implicit Feedbacks are Not Always Favorable: Iterative Relabeled One-Class Collaborative Filtering against Noisy Interactions.
Zitai Wang, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang
2021Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues.
Peng Qi, Juan Cao, Xirong Li, Huan Liu, Qiang Sheng, Xiaoyue Mi, Qin He, Yongbiao Lv, Chenyang Guo, Yingchao Yu
2021Improving Pedestrian Detection from a Long-tailed Domain Perspective.
Mengyuan Ding, Shanshan Zhang, Jian Yang
2021Improving Robustness and Accuracy via Relative Information Encoding in 3D Human Pose Estimation.
Wenkang Shan, Haopeng Lu, Shanshe Wang, Xinfeng Zhang, Wen Gao
2021Improving Weakly Supervised Object Localization via Causal Intervention.
Feifei Shao, Yawei Luo, Li Zhang, Lu Ye, Siliang Tang, Yi Yang, Jun Xiao
2021Inferring the Importance of Product Appearance with Semi-supervised Multi-modal Enhancement: A Step Towards the Screenless Retailing.
Yongshun Gong, Jinfeng Yi, Dongdong Chen, Jian Zhang, Jiayu Zhou, Zhihua Zhou
2021Information-Growth Attention Network for Image Super-Resolution.
Zhuangzi Li, Ge Li, Thomas H. Li, Shan Liu, Wei Gao
2021Informative Class-Conditioned Feature Alignment for Unsupervised Domain Adaptation.
Wanxia Deng, Yawen Cui, Zhen Liu, Gangyao Kuang, Dewen Hu, Matti Pietikäinen, Li Liu
2021InsPose: Instance-Aware Networks for Single-Stage Multi-Person Pose Estimation.
Dahu Shi, Xing Wei, Xiaodong Yu, Wenming Tan, Ye Ren, Shiliang Pu
2021Instance-wise or Class-wise? A Tale of Neighbor Shapley for Concept-based Explanation.
Jiahui Li, Kun Kuang, Lin Li, Long Chen, Songyang Zhang, Jian Shao, Jun Xiao
2021Integrating Semantic and Temporal Relationships in Facial Action Unit Detection.
Zhihua Li, Xiang Deng, Xiaotian Li, Lijun Yin
2021InterBN: Channel Fusion for Adversarial Unsupervised Domain Adaptation.
Mengzhu Wang, Wei Wang, Baopu Li, Xiang Zhang, Long Lan, Huibin Tan, Tianyi Liang, Wei Yu, Zhigang Luo
2021Interpolation Variable Rate Image Compression.
Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Dongyang Li, Hao Li
2021Interpreting Super-Resolution CNNs for Sub-Pixel Motion Compensation in Video Coding.
Luka Murn, Alan F. Smeaton, Marta Mrak
2021Interventional Video Relation Detection.
Yicong Li, Xun Yang, Xindi Shang, Tat-Seng Chua
2021Intrinsic Temporal Regularization for High-resolution Human Video Synthesis.
Lingbo Yang, Zhanning Gao, Siwei Ma, Wen Gao
2021Is Someone Speaking?: Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection.
Ruijie Tao, Zexu Pan, Rohan Kumar Das, Xinyuan Qian, Mike Zheng Shou, Haizhou Li
2021Is Visual Context Really Helpful for Knowledge Graph? A Representation Learning Perspective.
Meng Wang, Sen Wang, Han Yang, Zheng Zhang, Xi Chen, Guilin Qi
2021JDMAN: Joint Discriminative and Mutual Adaptation Networks for Cross-Domain Facial Expression Recognition.
Yingjian Li, Yingnan Gao, Bingzhi Chen, Zheng Zhang, Lei Zhu, Guangming Lu
2021JPGNet: Joint Predictive Filtering and Generative Network for Image Inpainting.
Qing Guo, Xiaoguang Li, Felix Juefei-Xu, Hongkai Yu, Yang Liu, Song Wang
2021Joint Implicit Image Function for Guided Depth Super-Resolution.
Jiaxiang Tang, Xiaokang Chen, Gang Zeng
2021Joint Learning for Relationship and Interaction Analysis in Video with Multimodal Feature Fusion.
Beibei Zhang, Fan Yu, Yanxin Gao, Tongwei Ren, Gangshan Wu
2021Joint Optimization in Edge-Cloud Continuum for Federated Unsupervised Person Re-identification.
Weiming Zhuang, Yonggang Wen, Shuai Zhang
2021Joint-teaching: Learning to Refine Knowledge for Resource-constrained Unsupervised Cross-modal Retrieval.
Peng-Fei Zhang, Jiasheng Duan, Zi Huang, Hongzhi Yin
2021JokerGAN: Memory-Efficient Model for Handwritten Text Generation with Text Line Awareness.
Jan Zdenek, Hideki Nakayama
2021Kandinsky Mobile: Abstract Art-Inspired Interactive Visualization of Social Discussions on Mobile Devices.
Castillo Clarence Fitzgerald Gumtang, Sourav S. Bhowmick
2021Keyframe Extraction from Motion Capture Sequences with Graph based Deep Reinforcement Learning.
Clinton Mo, Kun Hu, Shaohui Mei, Zebin Chen, Zhiyong Wang
2021Knowing When to Quit: Selective Cascaded Regression with Patch Attention for Real-Time Face Alignment.
Gil Shapira, Noga Levy, Ishay Goldin, Roy Josef Jevnisek
2021Knowledge Perceived Multi-modal Pretraining in E-commerce.
Yushan Zhu, Huaixiao Zhao, Wen Zhang, Ganqiang Ye, Hui Chen, Ningyu Zhang, Huajun Chen
2021Knowledge-Supervised Learning: Knowledge Consensus Constraints for Person Re-Identification.
Li Wang, Baoyu Fan, Zhenhua Guo, Yaqian Zhao, Runze Zhang, Rengang Li, Weifeng Gong, Endong Wang
2021L2RS: A Learning-to-Rescore Mechanism for Hybrid Speech Recognition.
Yuanfeng Song, Di Jiang, Xuefang Zhao, Qian Xu, Raymond Chi-Wing Wong, Lixin Fan, Qiang Yang
2021LSSNet: A Two-stream Convolutional Neural Network for Spotting Macro- and Micro-expression in Long Videos.
Wang-Wang Yu, Jingwen Jiang, Yong-Jie Li
2021LSTC: Boosting Atomic Action Detection with Long-Short-Term Context.
Yuxi Li, Boshen Zhang, Jian Li, Yabiao Wang, Weiyao Lin, Chengjie Wang, Jilin Li, Feiyue Huang
2021Large-scale Multi-Modality Pretrained Models: Applications and Experiences.
Jingren Zhou
2021Latent Memory-augmented Graph Transformer for Visual Storytelling.
Mengshi Qi, Jie Qin, Di Huang, Zhiqiang Shen, Yi Yang, Jiebo Luo
2021Learning Contextual Transformer Network for Image Inpainting.
Ye Deng, Siqi Hui, Sanping Zhou, Deyu Meng, Jinjun Wang
2021Learning Disentangled Factors from Paired Data in Cross-Modal Retrieval: An Implicit Identifiable VAE Approach.
Minyoung Kim, Ricardo Guerrero, Vladimir Pavlovic
2021Learning Fine-Grained Motion Embedding for Landscape Animation.
Hongwei Xue, Bei Liu, Huan Yang, Jianlong Fu, Houqiang Li, Jiebo Luo
2021Learning Hierarchal Channel Attention for Fine-grained Visual Classification.
Xiang Guan, Guoqing Wang, Xing Xu, Yi Bin
2021Learning Hierarchical Embedding for Video Instance Segmentation.
Zheyun Qin, Xiankai Lu, Xiushan Nie, Xiantong Zhen, Yilong Yin
2021Learning Human Motion Prediction via Stochastic Differential Equations.
Kedi Lyu, Zhenguang Liu, Shuang Wu, Haipeng Chen, Xuhong Zhang, Yuyu Yin
2021Learning Kinematic Formulas from Multiple View Videos.
Liangchen Song, Sheng Liu, Celong Liu, Zhong Li, Yuqi Ding, Yi Xu, Junsong Yuan
2021Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition.
Tailin Chen, Desen Zhou, Jian Wang, Shidong Wang, Yu Guan, Xuming He, Errui Ding
2021Learning Multi-context Aware Location Representations from Large-scale Geotagged Images.
Yifang Yin, Ying Zhang, Zhenguang Liu, Yuxuan Liang, Sheng Wang, Rajiv Ratn Shah, Roger Zimmermann
2021Learning Regularizer for Monocular Depth Estimation with Adversarial Guidance.
Guibao Shen, Yingkui Zhang, Jialu Li, Mingqiang Wei, Qiong Wang, Guangyong Chen, Pheng-Ann Heng
2021Learning Sample-Specific Policies for Sequential Image Augmentation.
Pu Li, Xiaobai Liu, Xiaohui Xie
2021Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval.
Chen Jiang, Kaiming Huang, Sifeng He, Xudong Yang, Wei Zhang, Xiaobo Zhang, Yuan Cheng, Lei Yang, Qing Wang, Furong Xu, Tan Pan, Wei Chu
2021Learning Spatial-angular Fusion for Compressive Light Field Imaging in a Cycle-consistent Framework.
Xianqiang Lyu, Zhiyu Zhu, Mantang Guo, Jing Jin, Junhui Hou, Huanqiang Zeng
2021Learning Spatio-temporal Representation by Channel Aliasing Video Perception.
Yiqi Lin, Jinpeng Wang, Manlin Zhang, Andy J. Ma
2021Learning Structure Affinity for Video Depth Estimation.
Yuanzhouhan Cao, Yidong Li, Haokui Zhang, Chao Ren, Yifan Liu
2021Learning Transferrable and Interpretable Representations for Domain Generalization.
Zhekai Du, Jingjing Li, Ke Lu, Lei Zhu, Zi Huang
2021Learning Unified Embeddings for Recommendation via Meta-path Semantics.
Qianxiu Hao, Qianqian Xu, Zhiyong Yang, Qingming Huang
2021Learning What and When to Drop: Adaptive Multimodal and Contextual Dynamics for Emotion Recognition in Conversation.
Feiyu Chen, Zhengxiao Sun, Deqiang Ouyang, Xueliang Liu, Jie Shao
2021Learning to Compose Stylistic Calligraphy Artwork with Emotions.
Shaozu Yuan, Ruixue Liu, Meng Chen, Baoyang Chen, Zhijie Qiu, Xiaodong He
2021Learning to Decode Contextual Information for Efficient Contour Detection.
Ruoxi Deng, Shengjun Liu, Jinxin Wang, Huibing Wang, Hanli Zhao, Xiaoqin Zhang
2021Learning to Understand Traffic Signs.
Yunfei Guo, Wei Feng, Fei Yin, Tao Xue, Shuqi Mei, Cheng-Lin Liu
2021Legitimate Adversarial Patches: Evading Human Eyes and Detection Models in the Physical World.
Jia Tan, Nan Ji, Haidong Xie, Xueshuang Xiang
2021Lesion-Inspired Denoising Network: Connecting Medical Image Denoising and Lesion Detection.
Kecheng Chen, Kun Long, Yazhou Ren, Jiayu Sun, Xiaorong Pu
2021Lifting the Veil of Frequency in Joint Segmentation and Depth Estimation.
Tianhao Fu, Yingying Li, Xiaoqing Ye, Xiao Tan, Hao Sun, Fumin Shen, Errui Ding
2021LightFEC: Network Adaptive FEC with a Lightweight Deep-Learning Approach.
Han Hu, Sheng Cheng, Xinggong Zhang, Zongming Guo
2021Linking the Characters: Video-oriented Social Graph Generation via Hierarchical-cumulative GCN.
Shiwei Wu, Joya Chen, Tong Xu, Liyi Chen, Lingfei Wu, Yao Hu, Enhong Chen
2021Local Graph Convolutional Networks for Cross-Modal Hashing.
Yudong Chen, Sen Wang, Jianglin Lu, Zhi Chen, Zheng Zhang, Zi Huang
2021Locally Adaptive Structure and Texture Similarity for Image Quality Assessment.
Keyan Ding, Yi Liu, Xueyi Zou, Shiqi Wang, Kede Ma
2021Long Short-term Convolutional Transformer for No-Reference Video Quality Assessment.
Junyong You
2021Long-Range Feature Propagating for Natural Image Matting.
Qinglin Liu, Haozhe Xie, Shengping Zhang, Bineng Zhong, Rongrong Ji
2021Long-tailed Distribution Adaptation.
Zhiliang Peng, Wei Huang, Zonghao Guo, Xiaosong Zhang, Jianbin Jiao, Qixiang Ye
2021M3TR: Multi-modal Multi-label Recognition with Transformer.
Jiawei Zhao, Yifan Zhao, Jia Li
2021MBRS: Enhancing Robustness of DNN-based Watermarking by Mini-Batch of Real and Simulated JPEG Compression.
Zhaoyang Jia, Han Fang, Weiming Zhang
2021MCCN: Multimodal Coordinated Clustering Network for Large-Scale Cross-modal Retrieval.
Zhixiong Zeng, Ying Sun, Wenji Mao
2021MDMS: Music Data Matching System for Query Variant Retrieval.
Rinita Roy, Ruben Mayer, Hans-Arno Jacobsen
2021MGH: Metadata Guided Hypergraph Modeling for Unsupervised Person Re-identification.
Yiming Wu, Xintian Wu, Xi Li, Jian Tian
2021MHFC: Multi-Head Feature Collaboration for Few-Shot Learning.
Shuai Shao, Lei Xing, Yan Wang, Rui Xu, Chunyan Zhao, Yanjiang Wang, Baodi Liu
2021MM '21: ACM Multimedia Conference, Virtual Event, China, October 20 - 24, 2021
Heng Tao Shen, Yueting Zhuang, John R. Smith, Yang Yang, Pablo César, Florian Metze, Balakrishnan Prabhakaran
2021MM-Flow: Multi-modal Flow Network for Point Cloud Completion.
Yiqiang Zhao, Yiyao Zhou, Rui Chen, Bin Hu, Xiding Ai
2021MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques.
Sihan Chen, Xinxin Zhu, Dongze Hao, Wei Liu, Jiawei Liu, Zijia Zhao, Longteng Guo, Jing Liu
2021MMFashion: An Open-Source Toolbox for Visual Fashion Analysis.
Xin Liu, Jiancheng Li, Jiaqi Wang, Ziwei Liu
2021MMOCR: A Comprehensive Toolbox for Text Detection, Recognition and Understanding.
Zhanghui Kuang, Hongbin Sun, Zhizhong Li, Xiaoyu Yue, Tsui Hin Lin, Jianyong Chen, Huaqiang Wei, Yiqin Zhu, Tong Gao, Wenwei Zhang, Kai Chen, Wayne Zhang, Dahua Lin
2021MMSports'21: 4th International Workshop on Multimedia Content Analysis in Sports.
Rainer Lienhart, Thomas B. Moeslund, Hideo Saito
2021MS-GraphSIM: Inferring Point Cloud Quality via Multiscale Graph Similarity.
Yujie Zhang, Qi Yang, Yiling Xu
2021MSO: Multi-Feature Space Joint Optimization Network for RGB-Infrared Person Re-Identification.
Yajun Gao, Tengfei Liang, Yi Jin, Xiaoyan Gu, Wu Liu, Yidong Li, Congyan Lang
2021MULL'21: First International Workshop on Multimedia Understanding with Less Labeling.
Xiu-Shen Wei, Jufeng Yang, Han-Jia Ye, Jian Yang
2021MV-TON: Memory-based Video Virtual Try-on network.
Xiaojing Zhong, Zhonghua Wu, Taizhe Tan, Guosheng Lin, Qingyao Wu
2021MageAdd: Real-Time Interaction Simulation for Scene Synthesis.
Shao-Kui Zhang, Yi-Xiao Li, Yu He, Yong-Liang Yang, Song-Hai Zhang
2021Mask and Predict: Multi-step Reasoning for Scene Graph Generation.
Hongshuo Tian, Ning Xu, An-An Liu, Chenggang Yan, Zhendong Mao, Quan Zhang, Yongdong Zhang
2021Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection.
Xugong Qin, Yu Zhou, Youhui Guo, Dayan Wu, Zhihong Tian, Ning Jiang, Hongbin Wang, Weiping Wang
2021Memory-Augmented Deep Unfolding Network for Compressive Sensing.
Jiechong Song, Bin Chen, Jian Zhang
2021Merging Multiple Template Matching Predictions in Intra Coding with Attentive Convolutional Neural Network.
Qijun Wang, Guodong Zheng
2021MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation.
Rishabh Baghel, Abhishek Trivedi, Tejas Ravichandran, Ravi Kiran Sarvadevabhatla
2021MeshNet++: A Network with a Face.
Vinit Veerendraveer Singh, Shivanand Venkanna Sheshappanavar, Chandra Kambhamettu
2021Meta Self-Paced Learning for Cross-Modal Matching.
Jiwei Wei, Xing Xu, Zheng Wang, Guoqing Wang
2021Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data.
Yuqian Fu, Yanwei Fu, Yu-Gang Jiang
2021Metaverse for Social Good: A University Campus Prototype.
Haihan Duan, Jiaye Li, Sizheng Fan, Zhonghao Lin, Xiao Wu, Wei Cai
2021Metric Learning for Anti-Compression Facial Forgery Detection.
Shenhao Cao, Qin Zou, Xiuqing Mao, Dengpan Ye, Zhongyuan Wang
2021Milliseconds Color Stippling.
Lei Ma, Jian Shi, Yanyun Chen
2021Mining Latent Structures for Multimedia Recommendation.
Jinghao Zhang, Yanqiao Zhu, Qiang Liu, Shu Wu, Shuhui Wang, Liang Wang
2021Missing Data Imputation for Solar Yield Prediction using Temporal Multi-Modal Variational Auto-Encoder.
Meng Shen, Huaizheng Zhang, Yixin Cao, Fan Yang, Yonggang Wen
2021Mitigating Generation Shifts for Generalized Zero-Shot Learning.
Zhi Chen, Yadan Luo, Sen Wang, Ruihong Qiu, Jingjing Li, Zi Huang
2021Mix-order Attention Networks for Image Restoration.
Tao Dai, Yalei Lv, Bin Chen, Zhi Wang, Zexuan Zhu, Shu-Tao Xia
2021Modeling the Uncertainty for Self-supervised 3D Skeleton Action Representation Learning.
Yukun Su, Guosheng Lin, Ruizhou Sun, Yun Hao, Qingyao Wu
2021Motion Prediction via Joint Dependency Modeling in Phase Space.
Pengxiang Su, Zhenguang Liu, Shuang Wu, Lei Zhu, Yifang Yin, Xuanjing Shen
2021Move As You Like: Image Animation in E-Commerce Scenario.
Borun Xu, Biao Wang, Jiale Tao, Tiezheng Ge, Yuning Jiang, Wen Li, Lixin Duan
2021MovieREP: A New Movie Reproduction Framework for Film Soundtrack.
Ruiqi Wang, Long Ye, Qin Zhang
2021MuCAI'21: 2nd ACM Multimedia Workshop on Multimodal Conversational AI.
João Magalhães, Alexander G. Hauptmann, Ricardo Gamelas Sousa, Carlos Santiago
2021MuSe 2021 Challenge: Multimodal Emotion, Sentiment, Physiological-Emotion, and Stress Detection.
Lukas Stappen, Eva-Maria Meßner, Erik Cambria, Guoying Zhao, Björn W. Schuller
2021Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning.
Xi Zhang, Feifei Zhang, Changsheng Xu
2021Multi-Level Visual Representation with Semantic-Reinforced Learning for Video Captioning.
Chengbo Dong, Xinru Chen, Aozhu Chen, Fan Hu, Zihan Wang, Xirong Li
2021Multi-Modal Multi-Instance Learning for Retinal Disease Recognition.
Xirong Li, Yang Zhou, Jie Wang, Hailan Lin, Jianchun Zhao, Dayong Ding, Weihong Yu, Youxin Chen
2021Multi-Modal Sarcasm Detection with Interactive In-Modal and Cross-Modal Graphs.
Bin Liang, Chenwei Lou, Xiang Li, Lin Gui, Min Yang, Ruifeng Xu
2021Multi-Perspective Video Captioning.
Yi Bin, Xindi Shang, Bo Peng, Yujuan Ding, Tat-Seng Chua
2021Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus.
Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao
2021Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation.
Xiaoqi Zhao, Youwei Pang, Jiaxing Yang, Lihe Zhang, Huchuan Lu
2021Multi-branch Channel-wise Enhancement Network for Fine-grained Visual Recognition.
Guangjun Li, Yongxiong Wang, Fengting Zhu
2021Multi-caption Text-to-Face Synthesis: Dataset and Algorithm.
Jianxin Sun, Qi Li, Weining Wang, Jian Zhao, Zhenan Sun
2021Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation.
Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang
2021Multi-label Pattern Image Retrieval via Attention Mechanism Driven Graph Convolutional Network.
Ying Li, Hongwei Zhou, Yeyu Yin, Jiaquan Gao
2021Multi-modal Representation Learning for Video Advertisement Content Structuring.
Daya Guo, Zhaoyang Zeng
2021Multi-view 3D Smooth Human Pose Estimation based on Heatmap Filtering and Spatio-temporal Information.
Zehai Niu, Ke Lu, Jian Xue, Haifeng Ma, Runchen Wei
2021Multi-view Clustering via Deep Matrix Factorization and Partition Alignment.
Chen Zhang, Siwei Wang, Jiyuan Liu, Sihang Zhou, Pei Zhang, Xinwang Liu, En Zhu, Changwang Zhang
2021MultiMediate: Multi-modal Group Behaviour Analysis for Artificial Mediation.
Philipp Müller, Michael Dietz, Dominik Schiller, Dominike Thomas, Guanhua Zhang, Patrick Gebhard, Elisabeth André, Andreas Bulling
2021MultiModal Language Modelling on Knowledge Graphs for Deep Video Understanding.
Vishal Anand, Raksha Ramesh, Boshen Jin, Ziyin Wang, Xiaoxiao Lei, Ching-Yung Lin
2021Multifocal Attention-Based Cross-Scale Network for Image De-raining.
Zheyu Zhang, Yurui Zhu, Xueyang Fu, Zhiwei Xiong, Zheng-Jun Zha, Feng Wu
2021Multimedia Classifiers: Behind the Scenes.
Manjunath Iyer
2021Multimodal Asymmetric Dual Learning for Unsupervised Eyeglasses Removal.
Qing Lin, Bo Yan, Weimin Tan
2021Multimodal Compatibility Modeling via Exploring the Consistent and Complementary Correlations.
Weili Guan, Haokun Wen, Xuemeng Song, Chung-Hsing Yeh, Xiaojun Chang, Liqiang Nie
2021Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding.
Haoyu Zhang, Meng Liu, Zan Gao, Xiaoqiang Lei, Yinglong Wang, Liqiang Nie
2021Multimodal Entity Linking: A New Dataset and A Baseline.
Jingru Gan, Jinchang Luo, Haiwei Wang, Shuhui Wang, Wei He, Qingming Huang
2021Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation.
Yi Huang, Xiaoshan Yang, Changsheng Xu
2021Multimodal Relation Extraction with Efficient Graph Alignment.
Changmeng Zheng, Junhao Feng, Ze Fu, Yi Cai, Qing Li, Tao Wang
2021Multimodal Video Summarization via Time-Aware Transformers.
Xindi Shang, Zehuan Yuan, Anran Wang, Changhu Wang
2021Multiple Object Tracking by Trajectory Map Regression with Temporal Priors Embedding.
Xingyu Wan, Sanping Zhou, Jinjun Wang, Rongye Meng
2021Multiple Objects-Aware Visual Question Generation.
Jiayuan Xie, Yi Cai, Qingbao Huang, Tao Wang
2021Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation).
Yunzhong Hou, Liang Zheng
2021MusicBERT: A Self-supervised Learning of Music Representation.
Hongyuan Zhu, Ye Niu, Di Fu, Hao Wang
2021NJU MCG - Sensetime Team Submission to Pre-training for Video Understanding Challenge Track II.
Liwei Jin, Haoyue Cheng, Su Xu, Wayne Wu, Limin Wang
2021Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting.
Xiaomeng Chu, Jiajun Deng, Yao Li, Zhenxun Yuan, Yanyong Zhang, Jianmin Ji, Yu Zhang
2021Neighbor-view Enhanced Model for Vision and Language Navigation.
Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan
2021Neural Free-Viewpoint Performance Rendering under Complex Human-object Interactions.
Guoxing Sun, Xin Chen, Yizhang Chen, Anqi Pang, Pei Lin, Yuheng Jiang, Lan Xu, Jingyi Yu, Jingya Wang
2021Neural-based Rendering and Application.
Peng Dai
2021No-Reference Video Quality Assessment with Heterogeneous Knowledge Ensemble.
Jinjian Wu, Yongxu Liu, Leida Li, Weisheng Dong, Guangming Shi
2021Non-Linear Fusion for Self-Paced Multi-View Clustering.
Zongmo Huang, Yazhou Ren, Xiaorong Pu, Lifang He
2021Object Point Cloud Classification via Poly-Convolutional Architecture Search.
Xuanxiang Lin, Ke Chen, Kui Jia
2021Object-aware Long-short-range Spatial Alignment for Few-Shot Fine-Grained Image Classification.
Yike Wu, Bo Zhang, Gang Yu, Weixi Zhang, Bin Wang, Tao Chen, Jiayuan Fan
2021Occlusion-aware Bi-directional Guided Network for Light Field Salient Object Detection.
Dong Jing, Shuo Zhang, Runmin Cong, Youfang Lin
2021On-demand Action Detection System using Pose Information.
Noboru Yoshida, Jianquan Liu
2021Once and for All: Self-supervised Multi-modal Co-training on One-billion Videos at Alibaba.
Lianghua Huang, Yu Liu, Xiangzeng Zhou, Ansheng You, Ming Li, Bin Wang, Yingya Zhang, Pan Pan, Yinghui Xu
2021One-Stage Incomplete Multi-view Clustering via Late Fusion.
Yi Zhang, Xinwang Liu, Siwei Wang, Jiyuan Liu, Sisi Dai, En Zhu
2021One-Stage Visual Grounding via Semantic-Aware Feature Filter.
Jiabo Ye, Xin Lin, Liang He, Dingbang Li, Qin Chen
2021One-stage Context and Identity Hallucination Network.
Yinglu Liu, Mingcan Xiang, Hailin Shi, Tao Mei
2021Open Set Face Anti-Spoofing in Unseen Attacks.
Xin Dong, Hao Liu, Weiwei Cai, Pengyuan Lv, Zekuan Yu
2021OsGG-Net: One-step Graph Generation Network for Unbiased Head Pose Estimation.
Shentong Mo, Xin Miao
2021Out-of-distribution Generalization and Its Applications for Multimedia.
Xin Wang, Peng Cui, Wenwu Zhu
2021Overview of Tencent Multi-modal Ads Video Understanding.
Zhenzhi Wang, Zhimin Li, Liyu Wu, Jiangfeng Xiong, Qinglin Lu
2021PFFN: Progressive Feature Fusion Network for Lightweight Image Super-Resolution.
Dongyang Zhang, Changyu Li, Ning Xie, Guoqing Wang, Jie Shao
2021PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition.
Zhi Qiao, Yu Zhou, Jin Wei, Wei Wang, Yuan Zhang, Ning Jiang, Hongbin Wang, Weiping Wang
2021PRNet: A Progressive Recovery Network for Revealing Perceptually Encrypted Images.
Tao Xiang, Ying Yang, Shangwei Guo, Hangcheng Liu, Hantao Liu
2021PUGCQ: A Large Scale Dataset for Quality Assessment of Professional User-Generated Content.
Guo Li, Baoliang Chen, Lingyu Zhu, Qingwen He, Hongfei Fan, Shiqi Wang
2021Pairwise Emotional Relationship Recognition in Drama Videos: Dataset and Benchmark.
Xun Gao, Yin Zhao, Jie Zhang, Longjun Cai
2021Pairwise VLAD Interaction Network for Video Question Answering.
Hui Wang, Dan Guo, Xian-Sheng Hua, Meng Wang
2021Parametric Reshaping of Portraits in Videos.
Xiangjun Tang, Wenxin Sun, Yong-Liang Yang, Xiaogang Jin
2021Pareto Optimality for Fairness-constrained Collaborative Filtering.
Qianxiu Hao, Qianqian Xu, Zhiyong Yang, Qingming Huang
2021Partial Tubal Nuclear Norm Regularized Multi-view Learning.
Yongyong Chen, Shuqin Wang, Chong Peng, Guangming Lu, Yicong Zhou
2021Partially Fake it Till you Make It: Mixing Real and Fake Thermal Images for Improved Object Detection.
Francesco Bongini, Lorenzo Berlincioni, Marco Bertini, Alberto Del Bimbo
2021Perception-Oriented Stereo Image Super-Resolution.
Chenxi Ma, Bo Yan, Weimin Tan, Xuhao Jiang
2021Perceptual Quality Assessment of Internet Videos.
Jiahua Xu, Jing Li, Xingguang Zhou, Wei Zhou, Baichao Wang, Zhibo Chen
2021Personality Recognition by Modelling Person-specific Cognitive Processes using Graph Representation.
Zilong Shao, Siyang Song, Shashank Jaiswal, Linlin Shen, Michel F. Valstar, Hatice Gunes
2021Personalized Multi-modal Video Retrieval on Mobile Devices.
Haotian Zhang, Allan D. Jepson, Iqbal Mohomed, Konstantinos G. Derpanis, Ran Zhang, Afsaneh Fazly
2021Phoenix: Combining Highest-Profit First Scheduling and Responsive Congestion Control for Delay-sensitive Multimedia Transmission.
Haozhe Li
2021Pixel-level Intra-domain Adaptation for Semantic Segmentation.
Zizheng Yan, Xianggang Yu, Yipeng Qin, Yushuang Wu, Xiaoguang Han, Shuguang Cui
2021Pixel-wise Graph Attention Networks for Person Re-identification.
Wenyu Zhang, Qing Ding, Jian Hu, Yi Ma, Mingzhe Lu
2021Plenoptic Quality Assessment: The JPEG Pleno Experience.
António M. G. Pinheiro
2021Point Cloud Projection and Multi-Scale Feature Fusion Network Based Blind Quality Assessment for Colored Point Clouds.
Wenxu Tao, Gangyi Jiang, Zhidi Jiang, Mei Yu
2021Polar Ray: A Single-stage Angle-free Detector for Oriented Object Detection in Aerial Images.
Shuai Liu, Lu Zhang, Shuai Hao, Huchuan Lu, You He
2021Pose-Guided Feature Learning with Knowledge Distillation for Occluded Person Re-Identification.
Kecheng Zheng, Cuiling Lan, Wenjun Zeng, Jiawei Liu, Zhizheng Zhang, Zheng-Jun Zha
2021Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification.
Zhongxing Ma, Yifan Zhao, Jia Li
2021Position-Augmented Transformers with Entity-Aligned Mesh for TextVQA.
Xuanyu Zhang, Qing Yang
2021Post2Story: Automatically Generating Storylines from Microblogging Platforms.
Xujian Zhao, Chongwei Wang, Peiquan Jin, Hui Zhang, Chunming Yang, Bo Li
2021Pre-training Graph Transformer with Multimodal Side Information for Recommendation.
Yong Liu, Susen Yang, Chenyi Lei, Guoxin Wang, Haihong Tang, Juyong Zhang, Aixin Sun, Chunyan Miao
2021Privacy-Preserving Portrait Matting.
Jizhizi Li, Sihan Ma, Jing Zhang, Dacheng Tao
2021Product-oriented Machine Translation with Cross-modal Cross-lingual Pre-training.
Yuqing Song, Shizhe Chen, Qin Jin, Wei Luo, Jun Xie, Fei Huang
2021Progressive Graph Attention Network for Video Question Answering.
Liang Peng, Shuangji Yang, Yi Bin, Guoqing Wang
2021Progressive Semantic Matching for Video-Text Retrieval.
Hongying Liu, Ruyi Luo, Fanhua Shang, Mantang Niu, Yuanyuan Liu
2021Progressive and Selective Fusion Network for High Dynamic Range Imaging.
Qian Ye, Jun Xiao, Kin-Man Lam, Takayuki Okatani
2021Pseudo Graph Convolutional Network for Vehicle ReID.
Wen Qian, Zhiqun He, Silong Peng, Chen Chen, Wei Wu
2021PyTorchVideo: A Deep Learning Library for Video Understanding.
Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross B. Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, Christoph Feichtenhofer
2021Q-Art Code: Generating Scanning-robust Art-style QR Codes by Deformable Convolution.
Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu
2021QoE Ready to Respond: A QoE-aware MEC Selection Scheme for DASH-based Adaptive Video Streaming to Mobile Users.
Wanxin Shi, Qing Li, Ruishan Zhang, Gengbiao Shen, Yong Jiang, Zhenhui Yuan, Gabriel-Miro Muntean
2021Quality Assessment of End-to-End Learned Image Compression: The Benchmark and Objective Measure.
Yang Li, Shiqi Wang, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Yue Wang
2021Question-controlled Text-aware Image Captioning.
Anwen Hu, Shizhe Chen, Qin Jin
2021R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks.
Yanyuan Qiao, Qi Chen, Chaorui Deng, Ning Ding, Yuankai Qi, Mingkui Tan, Xincheng Ren, Qi Wu
2021RAMS-Trans: Recurrent Attention Multi-scale Transformer for Fine-grained Image Recognition.
Yunqing Hu, Xuan Jin, Yin Zhang, Haiwen Hong, Jingfeng Zhang, Yuan He, Hui Xue
2021RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection.
Zhuofan Zong, Qianggang Cao, Biao Leng
2021ROECS: A Robust Semi-direct Pipeline Towards Online Extrinsics Correction of the Surround-view System.
Tianjun Zhang, Brian Nlong Zhao, Ying Shen, Xuan Shao, Lin Zhang, Yicong Zhou
2021ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration.
Yuhao Cui, Zhou Yu, Chunqi Wang, Zhongzhou Zhao, Ji Zhang, Meng Wang, Jun Yu
2021Rate Adaptation and Block Scheduling for Delay-sensitive Multimedia Applications.
Dongyuan Su, Laizhong Cui, Lei Zhang, Yanyan Suo, Yan Qiu
2021ReLLIE: Deep Reinforcement Learning for Customized Low-Light Image Enhancement.
Rongkai Zhang, Lanqing Guo, Siyu Huang, Bihan Wen
2021RecipeLog: Recipe Authoring App for Accurate Food Recording.
Akihisa Ishino, Yoko Yamakata, Hiroaki Karasawa, Kiyoharu Aizawa
2021ReconVAT: A Semi-Supervised Automatic Music Transcription Framework for Low-Resource Real-World Data.
Kin Wai Cheuk, Dorien Herremans, Li Su
2021Reconstruction: A Motion Driven Interactive Artwork Inspired by Chinese Shadow Puppet.
Wenli Jiang, Chong Cao
2021Recovering the Unbiased Scene Graphs from the Biased Ones.
Meng-Jiun Chiou, Henghui Ding, Hanshu Yan, Changhu Wang, Roger Zimmermann, Jiashi Feng
2021Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction.
Minyi Zhao, Yi Xu, Shuigeng Zhou
2021RecycleNet: An Overlapped Text Instance Recovery Approach.
Yiqing Hu, Yan Zheng, Xinghua Jiang, Hao Liu, Deqiang Jiang, Yinsong Liu, Bo Ren, Rongrong Ji
2021Recycling Discriminator: Towards Opinion-Unaware Image Quality Assessment Using Wasserstein GAN.
Yunan Zhu, Haichuan Ma, Jialun Peng, Dong Liu, Zhiwei Xiong
2021Relationship-Preserving Knowledge Distillation for Zero-Shot Sketch Based Image Retrieval.
Jialin Tian, Xing Xu, Zheng Wang, Fumin Shen, Xin Liu
2021Remember and Reuse: Cross-Task Blind Image Quality Assessment via Relevance-aware Incremental Learning.
Rui Ma, Hanxiao Luo, Qingbo Wu, King Ngi Ngan, Hongliang Li, Fanman Meng, Linfeng Xu
2021Reproducibility Companion Paper: Blind Natural Video Quality Prediction via Statistical Temporal Features and Deep Spatial Features.
Jari Korhonen, Yicheng Su, Junyong You, Steven Hicks, Cise Midoglu
2021Reproducibility Companion Paper: Campus3D: A Photogrammetry Point Cloud Benchmark for Outdoor Scene Hierarchical Understanding.
Yuqing Liao, Xinke Li, Zekun Tong, Yabang Zhao, Andrew Lim, Zhenzhong Kuang, Cise Midoglu
2021Reproducibility Companion Paper: Describing Subjective Experiment Consistency by p-Value P-P Plot.
Jakub Nawala, Lucjan Janowski, Bogdan Cmiel, Krzysztof Rusek, Marc A. Kastner, Jan Zahálka
2021Reproducibility Companion Paper: Kalman Filter-Based Head Motion Prediction for Cloud-Based Mixed Reality.
Serhan Gül, Sebastian Bosse, Dimitri Podborski, Thomas Schierl, Cornelius Hellge, Marc A. Kastner, Jan Zahálka
2021Reproducibility Companion Paper: Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment.
Dingquan Li, Tingting Jiang, Ming Jiang, Vajira Lasantha Thambawita, Haoliang Wang
2021Reproducibility Companion Paper: On Learning Disentangled Representation for Acoustic Event Detection.
Lijian Gao, Qirong Mao, Jingjing Chen, Ming Dong, Ratna Babu Chinnam, Lucile Sassatelli, Miguel Fabián Romero Rondón, Ujjwal Sharma
2021Reproducibility Companion Paper: Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework.
Li Tao, Xueting Wang, Toshihiko Yamasaki, Jingjing Chen, Steven Hicks
2021Reproducibility Companion Paper: Visual Relation of Interest Detection.
Fan Yu, Haonan Wang, Tongwei Ren, Jinhui Tang, Gangshan Wu, Jingjing Chen, Zhenzhong Kuang
2021Research on Micro-Expression Spotting Method Based on Optical Flow Features.
Yuhong He
2021Rethinking the Impacts of Overfitting and Feature Quality on Small-scale Video Classification.
Xuansheng Wu, Feichi Yang, Tong Zhou, Xinyue Lin
2021Retinomorphic Sensing: A Novel Paradigm for Future Multimedia Computing.
Zhaodong Kang, Jianing Li, Lin Zhu, Yonghong Tian
2021Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition.
Yixiong Zou, Shanghang Zhang, Jianpeng Yu, Yonghong Tian, José M. F. Moura
2021Robust Logo Detection in E-Commerce Images by Data Augmentation.
Hang Chen, Xiao Li, Zefan Wang, Xiaolin Hu
2021Robust Real-World Image Super-Resolution against Adversarial Attacks.
Jiutao Yue, Haofeng Li, Pengxu Wei, Guanbin Li, Liang Lin
2021Robust Shadow Detection by Exploring Effective Shadow Contexts.
Xianyong Fang, Xiaohao He, Linbo Wang, Jianbing Shen
2021SFE-Net: EEG-based Emotion Recognition with Symmetrical Spatial Feature Extraction.
Xiangwen Deng, Junlin Zhu, Shangming Yang
2021SI3DP: Source Identification Challenges and Benchmark for Consumer-Level 3D Printer Forensics.
Bo Seok Shim, Yoo Seung Shin, Seong-Wook Park, Jong-Uk Hou
2021SINGA-Easy: An Easy-to-Use Framework for MultiModal Analysis.
Naili Xing, Sai Ho Yeung, Chenghao Cai, Teck Khim Ng, Wei Wang, Kaiyuan Yang, Nan Yang, Meihui Zhang, Gang Chen, Beng Chin Ooi
2021SM-SGE: A Self-Supervised Multi-Scale Skeleton Graph Encoding Framework for Person Re-Identification.
Haocong Rao, Xiping Hu, Jun Cheng, Bin Hu
2021SOGAN: 3D-Aware Shadow and Occlusion Robust GAN for Makeup Transfer.
Yueming Lyu, Jing Dong, Bo Peng, Wei Wang, Tieniu Tan
2021SRNet: Spatial Relation Network for Efficient Single-stage Instance Segmentation in Videos.
Xiaowen Ying, Xin Li, Mooi Choo Chuah
2021SSFlow: Style-guided Neural Spline Flows for Face Image Manipulation.
Hanbang Liang, Xianxu Hou, Linlin Shen
2021SSPU-Net: Self-Supervised Point Cloud Upsampling via Differentiable Rendering.
Yifan Zhao, Le Hui, Jin Xie
2021SSconv: Explicit Spectral-to-Spatial Convolution for Pansharpening.
Yudong Wang, Liang-Jian Deng, Tian-Jing Zhang, Xiao Wu
2021STST: Spatial-Temporal Specialized Transformer for Skeleton-based Action Recognition.
Yuhan Zhang, Bo Wu, Wen Li, Lixin Duan, Chuang Gan
2021SUMAC'21: 3rd Workshop on Structuring and Understanding of Multimedia heritAge Contents.
Valérie Gouet-Brunet, Margarita Khokhlova, Ronak Kosti, Li Weng
2021SVHAN: Sequential View Based Hierarchical Attention Network for 3D Shape Recognition.
Yue Zhao, Weizhi Nie, An-An Liu, Zan Gao, Yuting Su
2021SalS-GAN: Spatially-Adaptive Latent Space in StyleGAN for Real Image Embedding.
Lingyun Zhang, Xiuxiu Bai, Yao Gao
2021Salient Error Detection based Refinement for Wide-baseline Image Interpolation.
Yuan Chang, Yisong Chen, Guoping Wang
2021Sand Scope: An Interactive Installation for Revealing the Connection Between Mental Space and Life Space in a Microcosm of the World.
Lyn Chao-ling Chen
2021Scalable Multi-view Subspace Clustering with Unified Anchors.
Mengjing Sun, Pei Zhang, Siwei Wang, Sihang Zhou, Wenxuan Tu, Xinwang Liu, En Zhu, Changjian Wang
2021Scene Graph with 3D Information for Change Captioning.
Zeming Liao, Qingbao Huang, Yu Liang, Mingyi Fu, Yi Cai, Qing Li
2021Scene Text Image Super-Resolution via Parallelly Contextual Attention Network.
Cairong Zhao, Shuyang Feng, Brian Nlong Zhao, Zhijun Ding, Jun Wu, Fumin Shen, Heng Tao Shen
2021Searching Motion Graphs for Human Motion Synthesis.
Chenchen Liu, Yadong Mu
2021Searching a Hierarchically Aggregated Fusion Architecture for Fast Multi-Modality Image Fusion.
Risheng Liu, Zhu Liu, Jinyuan Liu, Xin Fan
2021Seeing is Believing?: Effects of Visualization on Smart Device Privacy Perceptions.
Carlos Bermejo Fernandez, Petteri Nurmi, Pan Hui
2021Selective Dependency Aggregation for Action Classification.
Yi Tan, Yanbin Hao, Xiangnan He, Yinwei Wei, Xun Yang
2021Self-Contrastive Learning with Hard Negative Sampling for Self-supervised Point Cloud Learning.
Bi'an Du, Xiang Gao, Wei Hu, Xin Li
2021Self-Representation Subspace Clustering for Incomplete Multi-view Data.
Jiyuan Liu, Xinwang Liu, Yi Zhang, Pei Zhang, Wenxuan Tu, Siwei Wang, Sihang Zhou, Weixuan Liang, Siqi Wang, Yuexiang Yang
2021Self-Supervised Pre-training on the Target Domain for Cross-Domain Person Re-identification.
Junyin Zhang, Yongxin Ge, Xinqian Gu, Boyu Hua, Tao Xiang
2021Self-Supervised Regional and Temporal Auxiliary Tasks for Facial Action Unit Recognition.
Jingwei Yan, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu
2021Self-feature Learning: An Efficient Deep Lightweight Network for Image Super-resolution.
Jun Xiao, Qian Ye, Rui Zhao, Kin-Man Lam, Kao Wan
2021Self-supervised Consensus Representation Learning for Attributed Graph.
Changshu Liu, Liangjian Wen, Zhao Kang, Guangchun Luo, Ling Tian
2021Self-supervised Multi-view Multi-Human Association and Tracking.
Yiyang Gan, Ruize Han, Liqiang Yin, Wei Feng, Song Wang
2021Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors.
Lei Wang, Piotr Koniusz
2021Semantic Media Conversion: Possibilities and Limits.
H. V. Jagadish
2021Semantic Scalable Image Compression with Cross-Layer Priors.
Hanyue Tu, Li Li, Wengang Zhou, Houqiang Li
2021Semantic Tag Augmented XlanV Model for Video Captioning.
Yiqing Huang, Hongwei Xue, Jiansheng Chen, Huimin Ma, Hongbing Ma
2021Semantic-Guided Relation Propagation Network for Few-shot Action Recognition.
Xiao Wang, Weirong Ye, Zhongang Qi, Xun Zhao, Guangge Wang, Ying Shan, Hanzi Wang
2021Semantic-aware Transfer with Instance-adaptive Parsing for Crowded Scenes Pose Estimation.
Xuanhan Wang, Lianli Gao, Yan Dai, Yixuan Zhou, Jingkuan Song
2021Semi-Autoregressive Image Captioning.
Xu Yan, Zhengcong Fei, Zekang Li, Shuhui Wang, Qingming Huang, Qi Tian
2021Semi-supervised Domain Adaptive Retrieval via Discriminative Hashing Learning.
Haifeng Xia, Taotao Jing, Chen Chen, Zhengming Ding
2021Semi-supervised Learning via Improved Teacher-Student Network for Robust 3D Reconstruction of Stereo Endoscopic Image.
Hongkuan Shi, Zhiwei Wang, Jinxin Lv, Yilang Wang, Peng Zhang, Fei Zhu, Qiang Li
2021Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention.
Katsuyuki Nakamura, Hiroki Ohashi, Mitsuhiro Okada
2021Shadow Detection via Predicting the Confidence Maps of Shadow Detection Methods.
Jingwei Liao, Yanli Liu, Guanyu Xing, Housheng Wei, Jueyu Chen, Songhua Xu
2021Shape Controllable Virtual Try-on for Underwear Models.
Xin Gao, Zhenjiang Liu, Zunlei Feng, Chengji Shen, Kairi Ou, Haihong Tang, Mingli Song
2021Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator.
Hao Liu, Xin Li, Bing Liu, Deqiang Jiang, Yinsong Liu, Bo Ren, Rongrong Ji
2021Similar Scenes Arouse Similar Emotions: Parallel Data Augmentation for Stylized Image Captioning.
Guodun Li, Yuchen Zhai, Zehao Lin, Yin Zhang
2021Simplifying Multimodal Emotion Recognition with Single Eye Movement Modality.
Xu Yan, Li-Ming Zhao, Bao-Liang Lu
2021SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory.
Zhijie Lin, Zhou Zhao, Haoyuan Li, Jinglin Liu, Meng Zhang, Xingshan Zeng, Xiaofei He
2021SimulSLT: End-to-End Simultaneous Sign Language Translation.
Aoxiong Yin, Zhou Zhao, Jinglin Liu, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He
2021Single Image 3D Object Estimation with Primitive Graph Networks.
Qian He, Desen Zhou, Bo Wan, Xuming He
2021Situational Anomaly Detection in Multimedia Data under Concept Drift.
Pratibha Kumari
2021Skeleton-Aware Neural Sign Language Translation.
Shiwei Gan, Yafeng Yin, Zhiwei Jiang, Lei Xie, Sanglu Lu
2021Skeleton-Contrastive 3D Action Representation Learning.
Fida Mohammad Thoker, Hazel Doughty, Cees G. M. Snoek
2021SmartEye: An Open Source Framework for Real-Time Video Analytics with Edge-Cloud Collaboration.
Xuezhi Wang, Guanyu Gao
2021SmartMeeting: Automatic Meeting Transcription and Summarization for In-Person Conversations.
Yuanfeng Song, Di Jiang, Xuefang Zhao, Xiaoling Huang, Qian Xu, Raymond Chi-Wing Wong, Qiang Yang
2021SmartSales: An AI-Powered Telemarketing Coaching System in FinTech.
Yuanfeng Song, Xuefang Zhao, Di Jiang, Xiaoling Huang, Weiwei Zhao, Qian Xu, Raymond Chi-Wing Wong, Qiang Yang
2021Social Signals and Multimedia: Past, Present, Future.
Hayley Hung, Cathal Gurrin, Martha A. Larson, Hatice Gunes, Fabien Ringeval, Elisabeth André, Louis-Philippe Morency
2021Softly: Simulated Empathic Touch between an Agent and a Human.
Maxime Grandidier, Fabien Boucaud, Indira Thouvenin, Catherine Pelachaud
2021Source Data-free Unsupervised Domain Adaptation for Semantic Segmentation.
Mucong Ye, Jing Zhang, Jinpeng Ouyang, Ding Yuan
2021Space-Angle Super-Resolution for Multi-View Images.
Yuqi Sun, Ri Cheng, Bo Yan, Shili Zhou
2021Sparse to Dense Depth Completion using a Generative Adversarial Network with Intelligent Sampling Strategies.
Md Fahim Faysal Khan, Nelson Daniel Troncoso Aldas, Abhishek Kumar, Siddharth Advani, Vijaykrishnan Narayanan
2021Spatio-Temporal Interaction Graph Parsing Networks for Human-Object Interaction Recognition.
Ning Wang, Guangming Zhu, Liang Zhang, Peiyi Shen, Hongsheng Li, Cong Hua
2021Spatiotemporal Inconsistency Learning for DeepFake Video Detection.
Zhihao Gu, Yang Chen, Taiping Yao, Shouhong Ding, Jilin Li, Feiyue Huang, Lizhuang Ma
2021Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning.
Uttaran Bhattacharya, Elizabeth Childs, Nicholas Rewkowski, Dinesh Manocha
2021Stacked Semantically-Guided Learning for Image De-distortion.
Huiyuan Fu, Changhao Tian, Xin Wang, Huadong Ma
2021State-aware Video Procedural Captioning.
Taichi Nishimura, Atsushi Hashimoto, Yoshitaka Ushiku, Hirotaka Kameko, Shinsuke Mori
2021Stereo Video Super-Resolution via Exploiting View-Temporal Correlations.
Ruikang Xu, Zeyu Xiao, Mingde Yao, Yueyi Zhang, Zhiwei Xiong
2021StrucTexT: Structured Text Understanding with Multi-Modal Transformers.
Yulin Li, Yuxi Qian, Yuechen Yu, Xiameng Qin, Chengquan Zhang, Yan Liu, Kun Yao, Junyu Han, Jingtuo Liu, Errui Ding
2021Structure-aware Mathematical Expression Recognition with Sequence-Level Modeling.
Minli Li, Peilin Zhao, Yifan Zhang, Shuaicheng Niu, Qingyao Wu, Mingkui Tan
2021Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval.
Xuri Ge, Fuhai Chen, Joemon M. Jose, Zhilong Ji, Zhongqin Wu, Xiao Liu
2021Style-Aware Image Recommendation for Social Media Marketing.
Yiwei Zhang, Toshihiko Yamasaki
2021SuperFront: From Low-resolution to High-resolution Frontal Face Synthesis.
Yu Yin, Joseph P. Robinson, Songyao Jiang, Yue Bai, Can Qin, Yun Fu
2021Sync Glass: Virtual Pouring and Toasting Experience with Multimodal Presentation.
Yuki Tajima, Toshiharu Horiuchi, Gen Hattori
2021Syntropic Counterpoints: Metaphysics of The Machines.
Predrag K. Nikolic, Ruiyang Liu, Shengcheng Luo
2021TACR-Net: Editing on Deep Video and Voice Portraits.
Luchuan Song, Bin Liu, Guojun Yin, Xiaoyi Dong, Yufei Zhang, Jia-Xuan Bai
2021TBRA: Tiling and Bitrate Adaptation for Mobile 360-Degree Video Streaming.
Lei Zhang, Yanyan Suo, Ximing Wu, Feng Wang, Yuchi Chen, Laizhong Cui, Jiangchuan Liu, Zhong Ming
2021TDI TextSpotter: Taking Data Imbalance into Account in Scene Text Spotting.
Yu Zhou, Hongtao Xie, Shancheng Fang, Jing Wang, Zhengjun Zha, Yongdong Zhang
2021TSA-Net: Tube Self-Attention Network for Action Quality Assessment.
Shunli Wang, Dingkang Yang, Peng Zhai, Chixiao Chen, Lihua Zhang
2021Target-guided Adaptive Base Class Reweighting for Few-Shot Learning.
Jiliang Yan, Deming Zhai, Junjun Jiang, Xianming Liu
2021Text as Neural Operator: Image Manipulation by Text Instruction.
Tianhao Zhang, Hung-Yu Tseng, Lu Jiang, Weilong Yang, Honglak Lee, Irfan Essa
2021Text is NOT Enough: Integrating Visual Impressions into Open-domain Dialogue Generation.
Lei Shen, Haolan Zhan, Xin Shen, Yonghao Song, Xiaofang Zhao
2021Text to Scene: A System of Configurable 3D Indoor Scene Synthesis.
Xinyan Yang, Fei Hu, Long Ye
2021Text-driven 3D Avatar Animation with Emotional and Expressive Behaviors.
Li Hu, Jinwei Qi, Bang Zhang, Pan Pan, Yinghui Xu
2021Text2Video: Automatic Video Generation Based on Text Scripts.
Yipeng Yu, Zirui Tu, Longyu Lu, Xiao Chen, Hui Zhan, Zixun Sun
2021The ACM Multimedia 2021 Meet Deadline Requirements Grand Challenge.
Jie Zhang, Junjie Deng, Mowei Wang, Yong Cui, Wei Tsang Ooi, Jiangchuan Liu, Xinyu Zhang, Kai Zheng, Yi Li
2021The Next Generation Multimodal Conversational Search and Recommendation.
João Magalhães, Tat-Seng Chua, Tao Mei, Alan F. Smeaton
2021Theophany: Multimodal Speech Augmentation in Instantaneous Privacy Channels.
Abhishek Kumar, Tristan Braud, Lik Hang Lee, Pan Hui
2021Token Shift Transformer for Video Classification.
Hao Zhang, Yanbin Hao, Chong-Wah Ngo
2021Towards Accurate Localization by Instance Search.
Yi-Geng Hong, Hui-Chu Xiao, Wan-Lei Zhao
2021Towards Adversarial Patch Analysis and Certified Defense against Crowd Counting.
Qiming Wu, Zhikang Zou, Pan Zhou, Xiaoqing Ye, Binghui Wang, Ang Li
2021Towards Bridging Video and Language by Caption Generation and Sentence Localization.
Shaoxiang Chen
2021Towards Controllable and Photorealistic Region-wise Image Manipulation.
Ansheng You, Chenglin Zhou, Qixuan Zhang, Lan Xu
2021Towards Cross-Granularity Few-Shot Learning: Coarse-to-Fine Pseudo-Labeling with Visual-Semantic Meta-Embedding.
Jinhai Yang, Hua Yang, Lin Chen
2021Towards Fast and High-Quality Sign Language Production.
Wencan Huang, Wenwen Pan, Zhou Zhao, Qi Tian
2021Towards Multiple Black-boxes Attack via Adversarial Example Generation Network.
Mingxing Duan, Kenli Li, Lingxi Xie, Qi Tian, Bin Xiao
2021Towards Realistic Visual Dubbing with Heterogeneous Sources.
Tianyi Xie, Liucheng Liao, Cheng Bi, Benlai Tang, Xiang Yin, Jianfei Yang, Mingjie Wang, Jiali Yao, Yang Zhang, Zejun Ma
2021Towards Reasoning Ability in Scene Text Visual Question Answering.
Qingqing Wang, Liqiang Xiao, Yue Lu, Yaohui Jin, Hao He
2021Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal.
Lei Zhu, Zhaojing Luo, Wei Wang, Meihui Zhang, Gang Chen, Kaiping Zheng
2021Towards Robust Deep Hiding Under Non-Differentiable Distortions for Practical Blind Watermarking.
Chaoning Zhang, Adil Karjauv, Philipp Benz, In So Kweon
2021Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification.
Yukang Zhang, Yan Yan, Yang Lu, Hanzi Wang
2021Trajectory is not Enough: Hidden Following Detection.
Danni Xu, Ruimin Hu, Zixiang Xiong, Zheng Wang, Linbo Luo, Dengshi Li
2021TransFusion: Multi-Modal Fusion for Video Tag Inference via Translation-based Knowledge Embedding.
Di Jin, Zhongang Qi, Yingmin Luo, Ying Shan
2021TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding.
Dailan He, Yusheng Zhao, Junyu Luo, Tianrui Hui, Shaofei Huang, Aixi Zhang, Si Liu
2021Transfer Vision Patterns for Multi-Task Pixel Learning.
Xiaoya Zhang, Ling Zhou, Yong Li, Zhen Cui, Jin Xie, Jian Yang
2021Transferrable Contrastive Learning for Visual Domain Adaptation.
Yang Chen, Yingwei Pan, Yu Wang, Ting Yao, Xinmei Tian, Tao Mei
2021Transformer-based Feature Reconstruction Network for Robust Multimodal Sentiment Analysis.
Ziqi Yuan, Wei Li, Hua Xu, Wenmeng Yu
2021TriTransNet: RGB-D Salient Object Detection with a Triplet Transformer Embedding Network.
Zhengyi Liu, Yuan Wang, Zhengzheng Tu, Yun Xiao, Bin Tang
2021Triangle-Reward Reinforcement Learning: A Visual-Linguistic Semantic Alignment for Image Captioning.
Weizhi Nie, Jiesi Li, Ning Xu, An-An Liu, Xuanya Li, Yongdong Zhang
2021Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing.
Teddy Furon, Jingen Liu, Yogesh S. Rawat, Wei Zhang, Qi Zhao
2021Trustworthy Multimedia Analysis.
Xiaowen Huang, Jiaming Zhang, Yi Zhang, Xian Zhao, Jitao Sang
2021TsFPS: An Accurate and Flexible 6DoF Tracking System with Fiducial Platonic Solids.
Nan Xiang, Xiaosong Yang, Jian J. Zhang
2021Two-pronged Strategy: Lightweight Augmented Graph Network Hashing for Scalable Image Retrieval.
Hui Cui, Lei Zhu, Jingjing Li, Zhiyong Cheng, Zheng Zhang
2021Two-stage Visual Cues Enhancement Network for Referring Image Segmentation.
Yang Jiao, Zequn Jie, Weixin Luo, Jingjing Chen, Yu-Gang Jiang, Xiaolin Wei, Lin Ma
2021UACANet: Uncertainty Augmented Context Attention for Polyp Segmentation.
Taehun Kim, Hyemin Lee, Daijin Kim
2021Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training.
Chenyi Lei, Shixian Luo, Yong Liu, Wanggui He, Jiamang Wang, Guoxin Wang, Haihong Tang, Chunyan Miao, Houqiang Li
2021Underwater Species Detection using Channel Sharpening Attention.
Lihao Jiang, Yi Wang, Qi Jia, Shengwei Xu, Yu Liu, Xin Fan, Haojie Li, Risheng Liu, Xinwei Xue, Ruili Wang
2021UniCon: Unified Context Network for Robust Active Speaker Detection.
Yuanhang Zhang, Susan Liang, Shuang Yang, Xiao Liu, Zhongqin Wu, Shiguang Shan, Xilin Chen
2021Unifying Multimodal Transformer for Bi-directional Image and Text Generation.
Yupan Huang, Hongwei Xue, Bei Liu, Yutong Lu
2021Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking.
Jingxian Sun, Lichao Zhang, Yufei Zha, Abel Gonzalez-Garcia, Peng Zhang, Wei Huang, Yanning Zhang
2021Unsupervised Image Deraining: Optimization Model Driven Deep CNN.
Changfeng Yu, Yi Chang, Yi Li, Xile Zhao, Luxin Yan
2021Unsupervised Portrait Shadow Removal via Generative Priors.
Yingqing He, Yazhou Xing, Tianjia Zhang, Qifeng Chen
2021Unsupervised Vehicle Search in the Wild: A New Benchmark.
Xian Zhong, Shilei Zhao, Xiao Wang, Kui Jiang, Wenxuan Liu, Wenxin Huang, Zheng Wang
2021UrbanMM'21: 1st International Workshop on Multimedia Computing for Urban Data.
Stevan Rudinac, Alessandro Bozzon, Tat-Seng Chua, Suzanne Little, Daniel Gatica-Perez, Kiyoharu Aizawa
2021Using Interaction Data to Predict Engagement with Interactive Media.
Jonathan Carlton, Andy Brown, Caroline Jay, John Keane
2021Using Motion Histories for Eye Contact Detection in Multiperson Group Conversations.
Eugene Yujun Fu, Michael W. Ngai
2021VASTile: Viewport Adaptive Scalable 360-Degree Video Frame Tiling.
Chamara Madarasingha, Kanchana Thilakarathna
2021VLAD-VSA: Cross-Domain Face Presentation Attack Detection with Vocabulary Separation and Adaptation.
Jiong Wang, Zhou Zhao, Weike Jin, Xinyu Duan, Zhen Lei, Baoxing Huai, Yiling Wu, Xiaofei He
2021VQMG: Hierarchical Vector Quantised and Multi-hops Graph Reasoning for Explicit Representation Learning.
Lei Li, Chun Yuan
2021Vehicle Counting Network with Attention-based Mask Refinement and Spatial-awareness Block Loss.
Ji Zhang, Jian-Jun Qiao, Xiao Wu, Wei Li
2021VeloCity: Using Voice Assistants for Cyclists to Provide Traffic Reports.
Gian-Luca Savino, Jessé Moraes Braga, Johannes Schöning
2021ViDA-MAN: Visual Dialog with Digital Humans.
Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei Zhang, Xiaodong He, Tao Mei
2021VidVRD 2021: The Third Grand Challenge on Video Relation Detection.
Wei Ji, Yicong Li, Meng Wei, Xindi Shang, Junbin Xiao, Tongwei Ren, Tat-Seng Chua
2021Video Background Music Generation with Controllable Music Transformer.
Shangzhe Di, Zeren Jiang, Si Liu, Zhaokai Wang, Leyan Zhu, Zexin He, Hongming Liu, Shuicheng Yan
2021Video Coding for Machine.
Wen Gao
2021Video Relation Detection via Tracklet based Visual Transformer.
Kaifeng Gao, Long Chen, Yifeng Huang, Jun Xiao
2021Video Representation Learning with Graph Contrastive Augmentation.
Jingran Zhang, Xing Xu, Fumin Shen, Yazhou Yao, Jie Shao, Xiaofeng Zhu
2021Video Semantic Segmentation via Sparse Temporal Transformer.
Jiangtong Li, Wentao Wang, Junjie Chen, Li Niu, Jianlou Si, Chen Qian, Liqing Zhang
2021Video Similarity and Alignment Learning on Partial Video Copy Detection.
Zhen Han, Xiangteng He, Mingqian Tang, Yiliang Lv
2021Video Transformer for Deepfake Detection with Incremental Learning.
Sohail Ahmed Khan, Hang Dai
2021Video Visual Relation Detection via Iterative Inference.
Xindi Shang, Yicong Li, Junbin Xiao, Wei Ji, Tat-Seng Chua
2021Video-to-Image Casting: A Flatting Method for Video Analysis.
Xu Chen, Chenqiang Gao, Feng Yang, Xiaohan Wang, Yi Yang, Yahong Han
2021VideoDiscovery: An Automatic Short-Video Generation System for E-commerce Live-streaming.
Yanhao Zhang, Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu
2021View-normalized Skeleton Generation for Action Recognition.
Qingzhe Pan, Zhifu Zhao, Xuemei Xie, Jianan Li, Yuhan Cao, Guangming Shi
2021Viewing from Frequency Domain: A DCT-based Information Enhancement Network for Video Person Re-Identification.
Liangchen Liu, Xi Yang, Nannan Wang, Xinbo Gao
2021Visible Watermark Removal via Self-calibrated Localization and Background Refinement.
Jing Liang, Li Niu, Fengjun Guo, Teng Long, Liqing Zhang
2021Vision-guided Music Source Separation via a Fine-grained Cycle-Separation Network.
Shuo Ma, Yanli Ji, Xing Xu, Xiaofeng Zhu
2021Visual Co-Occurrence Alignment Learning for Weakly-Supervised Video Moment Retrieval.
Zheng Wang, Jingjing Chen, Yu-Gang Jiang
2021Visual Language Based Succinct Zero-Shot Object Detection.
Ye Zheng, Xi Huang, Li Cui
2021VmAP: A Fair Metric for Video Object Detection.
Anupam Sobti, Vaibhav Mavi, M. Balakrishnan, Chetan Arora
2021VoteHMR: Occlusion-Aware Voting Network for Robust 3D Human Mesh Recovery from Partial Point Clouds.
Guanze Liu, Yu Rong, Lu Sheng
2021WAB'21: 1st Workshop on Multimodal Product Identification in Livestreaming and WAB Challenge.
Yueting Zhuang, Xing Tang, Guilin Wu, Yahong Han, Haihong Tang, Xiaobo Li, Xiaohan Wang, Baoming Yan, Bo Gao, Yi Yang
2021WAS-VTON: Warping Architecture Search for Virtual Try-on Network.
Zhenyu Xie, Xujie Zhang, Fuwei Zhao, Haoye Dong, Michael C. Kampffmeyer, Haonan Yan, Xiaodan Liang
2021WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations.
Peidong Liu, Zibin He, Xiyu Yan, Yong Jiang, Shu-Tao Xia, Feng Zheng, Maowei Hu
2021WePerson: Learning a Generalized Re-identification Model from All-weather Virtual Data.
He Li, Mang Ye, Bo Du
2021Weakly-Supervised Temporal Action Localization via Cross-Stream Collaborative Learning.
Yuan Ji, Xu Jia, Huchuan Lu, Xiang Ruan
2021Weakly-Supervised Video Object Grounding via Stable Context Learning.
Wei Wang, Junyu Gao, Changsheng Xu
2021Weight Evolution: Improving Deep Neural Networks Training through Evolving Inferior Weight Values.
Zhenquan Lin, Kailing Guo, Xiaofen Xing, Xiangmin Xu
2021Weighted Gaussian Loss based Hamming Hashing.
Rong-Cheng Tu, Xian-Ling Mao, Cihang Kong, Zihang Shao, Zelin Li, Wei Wei, Heyan Huang
2021When Face Completion Meets Irregular Holes: An Attributes Guided Deep Inpainting Network.
Jie Xiao, Dandan Zhan, Haoran Qi, Zhi Jin
2021When Video Classification Meets Incremental Classes.
Hanbin Zhao, Xin Qin, Shihao Su, Yongjian Fu, Zibo Lin, Xi Li
2021Why Do We Click: Visual Impression-aware News Recommendation.
Jiahao Xun, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Qi Zhang, Jingjie Li, Xiuqiang He, Xiaofei He, Tat-Seng Chua, Fei Wu
2021Windowing Decomposition Convolutional Neural Network for Image Enhancement.
Chuanjun Zheng, Daming Shi, Yukun Liu
2021Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting.
Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla
2021X-GGM: Graph Generative Modeling for Out-of-distribution Generalization in Visual Question Answering.
Jingjing Jiang, Ziyi Liu, Yifan Liu, Zhixiong Nan, Nanning Zheng
2021X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics.
Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, Tao Mei
2021Yes, "Attention Is All You Need", for Exemplar based Colorization.
Wang Yin, Peng Lu, Zhaoran Zhao, Xujun Peng
2021Zero-shot Video Emotion Recognition via Multimodal Protagonist-aware Transformer Network.
Fan Qi, Xiaoshan Yang, Changsheng Xu
2021ZiGAN: Fine-grained Chinese Calligraphy Font Generation via a Few-shot Style Transfer Approach.
Qi Wen, Shuang Li, Bingfeng Han, Yi Yuan
2021ZoomSense: A Scalable Infrastructure for Augmenting Zoom.
Tom Bartindale, Peter Chen, Harrison Marshall, Stanislav Pozdniakov, Dan Richardson
2021aBio: Active Bi-Olfactory Display Using Subwoofers for Virtual Reality.
Youyang Hu, Yao Fu Jan, Kuan-Wei Tseng, You-Shin Tsai, Hung-Ming Sung, Jin-Yao Lin, Yi-Ping Hung
2021iART: A Search Engine for Art-Historical Images to Support Research in the Humanities.
Matthias Springstein, Stefanie Schneider, Javad Rahnama, Eyke Hüllermeier, Hubertus Kohle, Ralph Ewerth
2021iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering.
Liao Wang, Ziyu Wang, Pei Lin, Yuheng Jiang, Xin Suo, Minye Wu, Lan Xu, Jingyi Yu