| 2023 | 360-Degree Panorama Generation from Few Unregistered NFoV Images. Jionghao Wang, Ziyu Chen, Jun Ling, Rong Xie, Li Song |
| 2023 | 360RVW: Fusing Real 360° Videos and Interactive Virtual Worlds. Mizuki Takenawa, Naoki Sugimoto, Leslie Wöhler, Satoshi Ikehata, Kiyoharu Aizawa |
| 2023 | 3D Creation at Your Fingertips: From Text or Image to 3D Assets. Yang Chen, Jingwen Chen, Yingwei Pan, Xinmei Tian, Tao Mei |
| 2023 | 3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models. Haibo Yang, Yang Chen, Yingwei Pan, Ting Yao, Zhineng Chen, Tao Mei |
| 2023 | A Baseline Investigation: Transformer-based Cross-view Baseline for Text-based Person Search. Xianghao Zang, Wei Gao, Ge Li, Han Fang, Chao Ban, Zhongjiang He, Hao Sun |
| 2023 | A Blind Streaming System for Multi-client Online 6-DoF View Touring. Sheng-Ming Tang, Yuan-Chun Sun, Cheng-Hsin Hsu |
| 2023 | A Capture to Registration Framework for Realistic Image Super-Resolution in the Industry Environment. Boyang Wang, Yan Wang, Qing Zhao, Junxiong Lin, Zeng Tao, Pinxue Guo, Zhaoyu Chen, Kaixun Jiang, Shaoqi Yan, Shuyong Gao, Wenqiang Zhang |
| 2023 | A Closer Look at Classifier in Adversarial Domain Generalization. Ye Wang, Junyang Chen, Mengzhu Wang, Hao Li, Wei Wang, Houcheng Su, Zhihui Lai, Wei Wang, Zhenghan Chen |
| 2023 | A Contrastive Learning Framework for Dual-Target Cross-Domain Recommendation. Jinhu Lu, Guohao Sun, Xiu Fang, Jian Yang, Wei He |
| 2023 | A Figure Skating Jumping Dataset for Replay-Guided Action Quality Assessment. Yanchao Liu, Xina Cheng, Takeshi Ikenaga |
| 2023 | A Four-Pronged Defense Against Byzantine Attacks in Federated Learning. Wei Wan, Shengshan Hu, Minghui Li, Jianrong Lu, Longling Zhang, Leo Yu Zhang, Hai Jin |
| 2023 | A Generalized Physical-knowledge-guided Dynamic Model for Underwater Image Enhancement. Pan Mu, Hanning Xu, Zheyuan Liu, Zheng Wang, Sixian Chan, Cong Bai |
| 2023 | A Hardware-efficient Unified Motion Estimation for Video Coding. Xizhong Zhu, Guoqing Xiang, Peng Zhang, Huizhu Jia, Xiaodong Xie |
| 2023 | A Hierarchical Deep Video Understanding Method with Shot-Based Instance Search and Large Language Model. Ruizhe Li, Jiahao Guo, Mingxi Li, Zhengqian Wu, Chao Liang |
| 2023 | A Lightweight Collective-attention Network for Change Detection. Yuchao Feng, Yanyan Shao, Honghui Xu, Jinshan Xu, Jianwei Zheng |
| 2023 | A Method of Micro-Geometric Details Preserving in Surface Reconstruction from Gradient. Wuyuan Xie, Miaohui Wang |
| 2023 | A Model-Agnostic Semantic-Quality Compatible Framework based on Self-Supervised Semantic Decoupling. Xiaoyu Ma, Chenxi Feng, Jiaojiao Wang, Qiang Lin, Suiyu Zhang, Jinchi Zhu, Xiaodiao Chen, Chang Liu, Dingguo Yu |
| 2023 | A Multiple Prediction Mechanisms Ensemble for Complex Remote Sensing Scenes. Qifeng Lin, Luojun Lin, Yuanlong Yu, Gang Fu |
| 2023 | A Multitask Framework for Graffiti-to-Image Translation. Ying Yang, Mulin Chen, Xuelong Li |
| 2023 | A Novel Deep Video Watermarking Framework with Enhanced Robustness to H.264/AVC Compression. Yulin Zhang, Jiangqun Ni, Wenkang Su, Xin Liao |
| 2023 | A Novel Temporal Channel Enhancement and Contextual Excavation Network for Temporal Action Localization. Zan Gao, Xinglei Cui, Yibo Zhao, Tao Zhuo, Weili Guan, Meng Wang |
| 2023 | A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval. Jiancheng Pan, Qing Ma, Cong Bai |
| 2023 | A Reference-free Self-supervised Domain Adaptation Framework for Low-quality Fundus Image Enhancement. Qingshan Hou, Peng Cao, Jiaqi Wang, Xiaoli Liu, Jinzhu Yang, Osmar R. Zaïane |
| 2023 | A Reinforcement Learning-Based Automatic Video Editing Method Using Pre-trained Vision-Language Model. Panwen Hu, Nan Xiao, Feifei Li, Yongquan Chen, Rui Huang |
| 2023 | A Simple Baseline for Open-World Tracking via Self-training. Bingyang Wang, Tanlin Li, Jiannan Wu, Yi Jiang, Huchuan Lu, You He |
| 2023 | A Symbolic Characters Aware Model for Solving Geometry Problems. Maizhen Ning, Qiu-Feng Wang, Kaizhu Huang, Xiaowei Huang |
| 2023 | A Tale of Two Graphs: Freezing and Denoising Graph Structures for Multimodal Recommendation. Xin Zhou, Zhiqi Shen |
| 2023 | A Unified Query-based Paradigm for Camouflaged Instance Segmentation. Bo Dong, Jialun Pei, Rongrong Gao, Tian-Zhu Xiang, Shuo Wang, Huan Xiong |
| 2023 | ACM Multimedia 2023 Grand Challenge Report: Invisible Video Watermark. Jin Chen, Yi Yu, Shien Song, Xinying Wang, Jie Yang, Yifei Xue, Yizhen Lao |
| 2023 | ACQ: Few-shot Backdoor Defense via Activation Clipping and Quantizing. Yulin Jin, Xiaoyu Zhang, Jian Lou, Xiaofeng Chen |
| 2023 | ALA: Naturalness-aware Adversarial Lightness Attack. Yihao Huang, Liangru Sun, Qing Guo, Felix Juefei-Xu, Jiayi Zhu, Jincao Feng, Yang Liu, Geguang Pu |
| 2023 | ALDA: An Adaptive Layout Design Assistant for Diverse Posters throughout the Design Process. Qiuyun Zhang, Bin Guo, Lina Yao, Han Wang, Ying Zhang, Zhiwen Yu |
| 2023 | ALEX: Towards Effective Graph Transfer Learning with Noisy Labels. Jingyang Yuan, Xiao Luo, Yifang Qin, Zhengyang Mao, Wei Ju, Ming Zhang |
| 2023 | AMC-SME '23: 2023 Workshop on Advanced Multimedia Computing for Smart Manufacturing and Engineering. Junxin Chen, Wei Wang, Gwanggil Jeon |
| 2023 | ASTDF-Net: Attention-Based Spatial-Temporal Dual-Stream Fusion Network for EEG-Based Emotion Recognition. Peiliang Gong, Ziyu Jia, Pengpai Wang, Yueying Zhou, Daoqiang Zhang |
| 2023 | ATM: Action Temporality Modeling for Video Question Answering. Junwen Chen, Jie Zhu, Yu Kong |
| 2023 | AbCoRD: Exploiting multimodal generative approach for Aspect-based Complaint and Rationale Detection. Raghav Jain, Apoorva Singh, Vivek Kumar Gangwar, Sriparna Saha |
| 2023 | AcFormer: An Aligned and Compact Transformer for Multimodal Sentiment Analysis. Daoming Zong, Chaoyue Ding, Baoxiang Li, Jiakui Li, Ken Zheng, Qunyan Zhou |
| 2023 | Active CT Reconstruction with a Learned Sampling Policy. Ce Wang, Kun Shang, Haimiao Zhang, Shang Zhao, Dong Liang, S. Kevin Zhou |
| 2023 | Ada-DQA: Adaptive Diverse Quality-aware Feature Acquisition for Video Quality Assessment. Hongbo Liu, Mingda Wu, Kun Yuan, Ming Sun, Yansong Tang, Chuanchuan Zheng, Xing Wen, Xiu Li |
| 2023 | Ada3Diff: Defending against 3D Adversarial Point Clouds via Adaptive Diffusion. Kui Zhang, Hang Zhou, Jie Zhang, Qidong Huang, Weiming Zhang, Nenghai Yu |
| 2023 | AdaBrowse: Adaptive Video Browser for Efficient Continuous Sign Language Recognition. Lianyu Hu, Liqing Gao, Zekang Liu, Chi-Man Pun, Wei Feng |
| 2023 | AdaCLIP: Towards Pragmatic Multimodal Video Retrieval. Zhiming Hu, Angela Ning Ye, Salar Hosseini Khorasgani, Iqbal Mohomed |
| 2023 | Adaptive Contrastive Learning for Learning Robust Representations under Label Noise. Zihao Wang, Weichen Zhang, Weihong Bao, Fei Long, Chun Yuan |
| 2023 | Adaptive Decoupled Pose Knowledge Distillation. Jie Xu, Shanshan Zhang, Jian Yang |
| 2023 | Adaptive Feature Swapping for Unsupervised Domain Adaptation. Junbao Zhuo, Xingyu Zhao, Shuhao Cui, Qingming Huang, Shuhui Wang |
| 2023 | Adaptive Spatio-Temporal Directed Graph Neural Network for Parkinson's Detection using Vertical Ground Reaction Force. Xiaotian Wang, Shuo Liang, Zhifu Zhao, Xinyu Cui, Kai Chen, Xuanhang Xu |
| 2023 | Addressing Scalability for Real-time Multiuser Holo-portation: Introducing and Assessing a Multipoint Control Unit (MCU) for Volumetric Video. Sergi Fernández, Mario Montagud, David Rincón, Juame Moragues, Gianluca Cernigliaro |
| 2023 | AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning. Ziqi Zhou, Shengshan Hu, Minghui Li, Hangtao Zhang, Yechao Zhang, Hai Jin |
| 2023 | Advancing Audio Emotion and Intent Recognition with Large Pre-Trained Models and Bayesian Inference. Dejan Porjazovski, Yaroslav Getman, Tamás Grósz, Mikko Kurimo |
| 2023 | Advancing Video Question Answering with a Multi-modal and Multi-layer Question Enhancement Network. Meng Liu, Fenglei Zhang, Xin Luo, Fan Liu, Yinwei Wei, Liqiang Nie |
| 2023 | Adversarial Attack for Robust Watermark Protection Against Inpainting-based and Blind Watermark Removers. Mingzhi Lyu, Yi Huang, Adams Wai-Kin Kong |
| 2023 | Adversarial Bootstrapped Question Representation Learning for Knowledge Tracing. Jianwen Sun, Fenghua Yu, Sannyuya Liu, Yawei Luo, Ruxia Liang, Xiaoxuan Shen |
| 2023 | Adversarial Training of Deep Neural Networks Guided by Texture and Structural Information. Zhaoxin Wang, Handing Wang, Cong Tian, Yaochu Jin |
| 2023 | AesCLIP: Multi-Attribute Contrastive Learning for Image Aesthetics Assessment. Xiangfei Sheng, Leida Li, Pengfei Chen, Jinjian Wu, Weisheng Dong, Yuzhe Yang, Liwu Xu, Yaqian Li, Guangming Shi |
| 2023 | Aesthetics-Driven Virtual Time-Lapse Photography Generation. Lihua Lu, Hui Wei, Xin Jin, Yihao Zhang, Boyan Dong, Longteng Jiang, Xiaohui Zhang, Ruyang Li, Yaqian Zhao |
| 2023 | AffectFAL: Federated Active Affective Computing with Non-IID Data. Zixin Zhang, Fan Qi, Shuai Li, Changsheng Xu |
| 2023 | Against Opacity: Explainable AI and Large Language Models for Effective Digital Advertising. Qi Yang, Marlo Ongpin, Sergey I. Nikolenko, Alfred Huang, Aleksandr Farseev |
| 2023 | All in One: Exploring Unified Vision-Language Tracking with Multi-Modal Alignment. Chunhui Zhang, Xin Sun, Yiqian Yang, Li Liu, Qiong Liu, Xi Zhou, Yanfeng Wang |
| 2023 | All-in-one Multi-degradation Image Restoration Network via Hierarchical Degradation Representation. Cheng Zhang, Yu Zhu, Qingsen Yan, Jinqiu Sun, Yanning Zhang |
| 2023 | Alleviating Spatial Misalignment and Motion Interference for UAV-based Video Recognition. Gege Shi, Xueyang Fu, Chengzhi Cao, Zheng-Jun Zha |
| 2023 | An Intelligent Learning Approach to Achieve Near-Second Low-Latency Live Video Streaming under Highly Fluctuating Networks. Guanghui Zhang, Ke Liu, Mengbai Xiao, Bingshu Wang, Vaneet Aggarwal |
| 2023 | An Order-Complexity Aesthetic Assessment Model for Aesthetic-aware Music Recommendation. Xin Jin, Wu Zhou, Jinyu Wang, Duo Xu, Yongsen Zheng |
| 2023 | AniPixel: Towards Animatable Pixel-Aligned Human Avatar. Jinlong Fan, Jing Zhang, Zhi Hou, Dacheng Tao |
| 2023 | Answer-Based Entity Extraction and Alignment for Visual Text Question Answering. Jun Yu, Mohan Jing, Weihao Liu, Tongxu Luo, Bingyuan Zhang, Keda Lu, Fangyu Lei, Jianqing Sun, Jiaen Liang |
| 2023 | Attentive Alignment Network for Multispectral Pedestrian Detection. Nuo Chen, Jin Xie, Jing Nie, Jiale Cao, Zhuang Shao, Yanwei Pang |
| 2023 | Attributes Grouping and Mining Hashing for Fine-Grained Image Retrieval. Xin Lu, Shikun Chen, Yichao Cao, Xin Zhou, Xiaobo Lu |
| 2023 | Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics. Chen Liu, Peike Patrick Li, Xingqun Qi, Hu Zhang, Lincheng Li, Dadong Wang, Xin Yu |
| 2023 | Audio-Visual Spatial Integration and Recursive Attention for Robust Sound Source Localization. Sung Jin Um, Dongjin Kim, Jung Uk Kim |
| 2023 | Auditory Attention Decoding with Task-Related Multi-View Contrastive Learning. Xiaoyu Chen, Changde Du, Qiongyi Zhou, Huiguang He |
| 2023 | Augmented Digital Twins for Predictive Automatic Regulation and Fault Alarm in Sewage Plan. Yuhang Zhao, Shanchen Pang, Zhihan Lv, Sheng Miao |
| 2023 | Autistic Spectrum Disorders Diagnose with Graph Neural Networks. Lu Wei, Bin Liu, Jiujun He, Manxue Zhang, Yi Huang |
| 2023 | AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation. Jinpeng Lin, Min Zhou, Ye Ma, Yifan Gao, Chenxi Fei, Yangjian Chen, Zhang Yu, Tiezheng Ge |
| 2023 | Automatic Asymmetric Embedding Cost Learning via Generative Adversarial Networks. Dongxia Huang, Weiqi Luo, Peijia Zheng, Jiwu Huang |
| 2023 | Automatic Audio Augmentation for Requests Sub-Challenge. Yanjie Sun, Kele Xu, Chaorun Liu, Yong Dou, Kun Qian |
| 2023 | Automatic Generation of Commercial Scenes. Shao-Kui Zhang, Jia-Hong Liu, Yike Li, Tianyi Xiong, Ke-Xin Ren, Hongbo Fu, Song-Hai Zhang |
| 2023 | Automatic Human Scene Interaction through Contact Estimation and Motion Adaptation. Mingrui Zhang, Ming Chen, Yan Zhou, Li Chen, Weihua Jian, Pengfei Wan |
| 2023 | Automatic Network Architecture Search for RGB-D Semantic Segmentation. Wenna Wang, Tao Zhuo, Xiuwei Zhang, Mingjun Sun, Hanlin Yin, Yinghui Xing, Yanning Zhang |
| 2023 | Avatar Knowledge Distillation: Self-ensemble Teacher Paradigm with Uncertainty. Yuan Zhang, Weihua Chen, Yichen Lu, Tao Huang, Xiuyu Sun, Jian Cao |
| 2023 | AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion. Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia |
| 2023 | BEAMER: Behavioral Encoder to Generate Multiple Appropriate Facial Reactions. Ximi Hoque, Adamay Mann, Gulshan Sharma, Abhinav Dhall |
| 2023 | BLAT: Bootstrapping Language-Audio Pre-training based on AudioSet Tag-guided Synthetic Data. Xuenan Xu, Zhiling Zhang, Zelin Zhou, Pingyue Zhang, Zeyu Xie, Mengyue Wu, Kenny Q. Zhu |
| 2023 | BMI-Net: A Brain-inspired Multimodal Interaction Network for Image Aesthetic Assessment. Xixi Nie, Bo Hu, Xinbo Gao, Leida Li, Xiaodan Zhang, Bin Xiao |
| 2023 | Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval. Yiwei Ma, Xiaoshuai Sun, Jiayi Ji, Guannan Jiang, Weilin Zhuang, Rongrong Ji |
| 2023 | Benign Shortcut for Debiasing: Fair Visual Recognition via Intervention with Shortcut Features. Yi Zhang, Jitao Sang, Junyang Wang, Dongmei Jiang, Yaowei Wang |
| 2023 | Better Integrating Vision and Semantics for Improving Few-shot Classification. Zhuoling Li, Yong Wang |
| 2023 | Beware of Overcorrection: Scene-induced Commonsense Graph for Scene Graph Generation. Lianggangxu Chen, Jiale Lu, Youqi Song, Changbo Wang, Gaoqi He |
| 2023 | Beyond Domain Gap: Exploiting Subjectivity in Sketch-Based Person Retrieval. Kejun Lin, Zhixiang Wang, Zheng Wang, Yinqiang Zheng, Shin'ichi Satoh |
| 2023 | Beyond First Impressions: Integrating Joint Multi-modal Cues for Comprehensive 3D Representation. Haowei Wang, Jiji Tang, Jiayi Ji, Xiaoshuai Sun, Rongsheng Zhang, Yiwei Ma, Minda Zhao, Lincheng Li, Zeng Zhao, Tangjie Lv, Rongrong Ji |
| 2023 | Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model. Kanzhi Cheng, Wenpo Song, Zheng Ma, Wenhao Zhu, Zixuan Zhu, Jianbing Zhang |
| 2023 | BiFPro: A Bidirectional Facial-data Protection Framework against DeepFake. Honggu Liu, Xiaodan Li, Wenbo Zhou, Han Fang, Paolo Bestagini, Weiming Zhang, Yuefeng Chen, Stefano Tubaro, Nenghai Yu, Yuan He, Hui Xue |
| 2023 | Biased-Predicate Annotation Identification via Unbiased Visual Predicate Representation. Li Li, Chenwei Wang, You Qin, Wei Ji, Renjie Liang |
| 2023 | Bidomain Modeling Paradigm for Pansharpening. Junming Hou, Qi Cao, Ran Ran, Che Liu, Junling Li, Liang-Jian Deng |
| 2023 | Bilevel Generative Learning for Low-Light Vision. Yingchi Liu, Zhu Liu, Long Ma, Jinyuan Liu, Xin Fan, Zhongxuan Luo, Risheng Liu |
| 2023 | Bio-Inspired Audiovisual Multi-Representation Integration via Self-Supervised Learning. Zhaojian Li, Bin Zhao, Yuan Yuan |
| 2023 | Blind Image Super-resolution with Rich Texture-Aware Codebook. Rui Qin, Ming Sun, Fangyuan Zhang, Xing Wen, Bin Wang |
| 2023 | Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided Enhancement. Zhenhua Ning, Zhuotao Tian, Guangming Lu, Wenjie Pei |
| 2023 | BranchClash: A Fully On-Chain Tower Defense Blockchain Game with New Collaboration Mechanism. Hao Wu, Yueyao Li, Yan Zhuang, Xinyao Sun, Wei Cai |
| 2023 | Breaking the Barrier Between Pre-training and Fine-tuning: A Hybrid Prompting Model for Knowledge-Based VQA. Zhongfan Sun, Yongli Hu, Qingqing Gao, Huajie Jiang, Junbin Gao, Yanfeng Sun, Baocai Yin |
| 2023 | Bridging Language and Geometric Primitives for Zero-shot Point Cloud Segmentation. Runnan Chen, Xinge Zhu, Nenglun Chen, Wei Li, Yuexin Ma, Ruigang Yang, Wenping Wang |
| 2023 | Bridging Trustworthiness and Open-World Learning: An Exploratory Neural Approach for Enhancing Interpretability, Generalization, and Robustness. Shide Du, Zihan Fang, Shiyang Lan, Yanchao Tan, Manuel Günther, Shiping Wang, Wenzhong Guo |
| 2023 | Brighten-and-Colorize: A Decoupled Network for Customized Low-Light Image Enhancement. Chenxi Wang, Zhi Jin |
| 2023 | Building Robust Multimodal Sentiment Recognition via a Simple yet Effective Multimodal Transformer. Daoming Zong, Chaoyue Ding, Baoxiang Li, Dinghao Zhou, Jiakui Li, Ken Zheng, Qunyan Zhou |
| 2023 | C2MR: Continual Cross-Modal Retrieval for Streaming Multi-modal Data. Huaiwen Zhang, Yang Yang, Fan Qi, Shengsheng Qian, Changsheng Xu |
| 2023 | CALM: An Enhanced Encoding and Confidence Evaluating Framework for Trustworthy Multi-view Learning. Hai Zhou, Zhe Xue, Ying Liu, Boang Li, Junping Du, MeiYu Liang, Yuankai Qi |
| 2023 | CARIS: Context-Aware Referring Image Segmentation. Sun'ao Liu, Yiheng Zhang, Zhaofan Qiu, Hongtao Xie, Yongdong Zhang, Ting Yao |
| 2023 | CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation. Kexin Li, Zongxin Yang, Lei Chen, Yi Yang, Jun Xiao |
| 2023 | CCMB: A Large-scale Chinese Cross-modal Benchmark. Chunyu Xie, Heng Cai, Jincheng Li, Fanjing Kong, Xiaoyu Wu, Jianfei Song, Henrique Morimitsu, Lin Yao, Dexin Wang, Xiangzheng Zhang, Dawei Leng, Baochang Zhang, Xiangyang Ji, Yafeng Deng |
| 2023 | CFTF: Controllable Fine-grained Text2Face and Its Human-in-the-loop Suspect Portraits Application. Zhanbin Hu, Jianwu Wu, Danyang Gao, Yixu Zhou, Qiang Zhu |
| 2023 | CHAIN: Exploring Global-Local Spatio-Temporal Information for Improved Self-Supervised Video Hashing. Rukai Wei, Yu Liu, Jingkuan Song, Heng Cui, Yanzhao Xie, Ke Zhou |
| 2023 | CLE Diffusion: Controllable Light Enhancement Diffusion Model. YuYang Yin, Dejia Xu, Chuangchuang Tan, Ping Liu, Yao Zhao, Yunchao Wei |
| 2023 | CLG-INet: Coupled Local-Global Interactive Network for Image Restoration. Yuqi Jiang, Chune Zhang, Shuo Jin, Jiao Liu, Jiapeng Wang |
| 2023 | CLIP-Count: Towards Text-Guided Zero-Shot Object Counting. Ruixiang Jiang, Lingbo Liu, Changwen Chen |
| 2023 | CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting. Shaoxiang Guo, Qing Cai, Lin Qi, Junyu Dong |
| 2023 | CMCU-CSS: Enhancing Naturalness via Commonsense-based Multi-modal Context Understanding in Conversational Speech Synthesis. Yayue Deng, Jinlong Xue, Fengping Wang, Yingming Gao, Ya Li |
| 2023 | CONICA: A Contrastive Image Captioning Framework with Robust Similarity Learning. Lin Deng, Yuzhong Zhong, Maoning Wang, Jianwei Zhang |
| 2023 | CONVERT: Contrastive Graph Clustering with Reliable Augmentation. Xihong Yang, Cheng Tan, Yue Liu, Ke Liang, Siwei Wang, Sihang Zhou, Jun Xia, Stan Z. Li, Xinwang Liu, En Zhu |
| 2023 | COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment. Chaoya Jiang, Haiyang Xu, Wei Ye, Qinghao Ye, Chenliang Li, Ming Yan, Bin Bi, Shikun Zhang, Fei Huang, Ji Zhang |
| 2023 | COVES: A Cognitive-Affective Deep Model that Personalizes Math Problem Difficulty in Real Time and Improves Student Engagement with an Online Tutor. Hao Yu, Danielle A. Allessio, Will Lee, William Rebelsky, Frank Sylvia, Tom Murray, John J. Magee, Ivon Arroyo, Beverly P. Woolf, Sarah Adel Bargal, Margrit Betke |
| 2023 | CPLFormer: Cross-scale Prototype Learning Transformer for Image Snow Removal. Sixiang Chen, Tian Ye, Yun Liu, Jinbin Bai, Haoyu Chen, Yunlong Lin, Jun Shi, Erkang Chen |
| 2023 | CPNet: Cartoon Parsing with Pixel and Part Correlation. Jian-Jun Qiao, Jie Zhang, Xiao Wu, Yu-Pei Song, Wei Li |
| 2023 | CPU: Codebook Lookup Transformer with Knowledge Distillation for Point Cloud Upsampling. Weibing Zhao, Haiming Zhang, Chaoda Zheng, Xu Yan, Shuguang Cui, Zhen Li |
| 2023 | CTCP: Cross Transformer and CNN for Pansharpening. Zhao Su, Yong Yang, Shuying Huang, Weiguo Wan, Wei Tu, Hangyuan Lu, Changjie Chen |
| 2023 | CUCL: Codebook for Unsupervised Continual Learning. Chen Cheng, Jingkuan Song, Xiaosu Zhu, Junchen Zhu, Lianli Gao, Hengtao Shen |
| 2023 | Cal-SFDA: Source-Free Domain-adaptive Semantic Segmentation with Differentiable Expected Calibration Error. Zixin Wang, Yadan Luo, Zhi Chen, Sen Wang, Zi Huang |
| 2023 | Calibration-based Dual Prototypical Contrastive Learning Approach for Domain Generalization Semantic Segmentation. Muxin Liao, Shishun Tian, Yuhang Zhang, Guoguang Hua, Wenbin Zou, Xia Li |
| 2023 | Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment. Kun Yuan, Zishang Kong, Chuanchuan Zheng, Ming Sun, Xing Wen |
| 2023 | Cascaded Cross-Modal Transformer for Request and Complaint Detection. Nicolae-Catalin Ristea, Radu Tudor Ionescu |
| 2023 | Category-Level Articulated Object 9D Pose Estimation via Reinforcement Learning. Liu Liu, Jianming Du, Hao Wu, Xun Yang, Zhenguang Liu, Richang Hong, Meng Wang |
| 2023 | Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models. Yinuo Jing, Chunyu Wang, Ruxu Zhang, Kongming Liang, Zhanyu Ma |
| 2023 | Causal Intervention for Sparse-View Gait Recognition. Jilong Wang, Saihui Hou, Yan Huang, Chunshui Cao, Xu Liu, Yongzhen Huang, Liang Wang |
| 2023 | CenterLPS: Segment Instances by Centers for LiDAR Panoptic Segmentation. Jianbiao Mei, Yu Yang, Mengmeng Wang, Zizhang Li, Xiaojun Hou, Jongwon Ra, Laijian Li, Yong Liu |
| 2023 | Cerebrovascular Segmentation in TOF-MRA with Topology Regularization Adversarial Model. Cheng Chen, Yunqing Chen, Shuang Song, Jianan Wang, Huansheng Ning, Ruoxiu Xiao |
| 2023 | CgT-GAN: CLIP-guided Text GAN for Image Captioning. Jiarui Yu, Haoran Li, Yanbin Hao, Bin Zhu, Tong Xu, Xiangnan He |
| 2023 | Chain of Propagation Prompting for Node Classification. Yonghua Zhu, Zhenyun Deng, Yang Chen, Robert Amor, Michael Witbrock |
| 2023 | Chain-of-Look Prompting for Verb-centric Surgical Triplet Recognition in Endoscopic Videos. Nan Xi, Jingjing Meng, Junsong Yuan |
| 2023 | Chaos to Order: A Label Propagation Perspective on Source-Free Domain Adaptation. Chunwei Wu, Guitao Cao, Yan Li, Xidong Xi, Wenming Cao, Hong Wang |
| 2023 | ChinaOpen: A Dataset for Open-world Multimodal Learning. Aozhu Chen, Ziyuan Wang, Chengbo Dong, Kaibin Tian, Ruixiang Zhao, Xun Liang, Zhanhui Kang, Xirong Li |
| 2023 | Class-level Structural Relation Modeling and Smoothing for Visual Representation Learning. Zitan Chen, Zhuang Qi, Xiao Cao, Xiangxian Li, Xiangxu Meng, Lei Meng |
| 2023 | Client-Adaptive Cross-Model Reconstruction Network for Modality-Incomplete Multimodal Federated Learning. Baochen Xiong, Xiaoshan Yang, Yaguang Song, Yaowei Wang, Changsheng Xu |
| 2023 | Clip Fusion with Bi-level Optimization for Human Mesh Reconstruction from Monocular Videos. Peng Wu, Xiankai Lu, Jianbing Shen, Yilong Yin |
| 2023 | Co-Salient Object Detection with Semantic-Level Consensus Extraction and Dispersion. Peiran Xu, Yadong Mu |
| 2023 | CoCa: A Connectivity-Aware Cascade Framework for Histology Gland Segmentation. Yu Bai, Bo Zhang, Zheng Zhang, Wu Liu, Jinwen Li, Xiangyang Gong, Wendong Wang |
| 2023 | CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model. Zhen Ye, Wei Xue, Xu Tan, Jie Chen, Qifeng Liu, Yike Guo |
| 2023 | CoP: Chain-of-Pose for Image Animation in Large Pose Changes. Xiaomeng Fu, Xi Wang, Jin Liu, Shuhui Wang, Jiao Dai, Jizhong Han |
| 2023 | ColSLAM: A Versatile Collaborative SLAM System for Mobile Phones Using Point-Line Features and Map Caching. Wanting Li, Yongcai Wang, Yongyu Guo, Shuo Wang, Yu Shao, Xuewei Bai, Xudong Cai, Qiang Ye, Deying Li |
| 2023 | Collaborative Fraud Detection: How Collaboration Impacts Fraud Detection. Jinzhang Hu, Ruimin Hu, Zheng Wang, Dengshi Li, Junhang Wu, Lingfei Ren, Yilong Zang, Zijun Huang, Mei Wang |
| 2023 | Collaborative Learning of Diverse Experts for Source-free Universal Domain Adaptation. Meng Shen, Yanzuo Lu, Yanxu Hu, Andy J. Ma |
| 2023 | Combating Misinformation in the Era of Generative AI Models. Danni Xu, Shaojing Fan, Mohan S. Kankanhalli |
| 2023 | Combating Online Misinformation Videos: Characterization, Detection, and Future Directions. Yuyan Bu, Qiang Sheng, Juan Cao, Peng Qi, Danding Wang, Jintao Li |
| 2023 | Concerto: Client-server Orchestration for Real-Time Video Analytics. Chaoyang Li, Rui-Xiao Zhang, Tianchi Huang, Lianchen Jia, Lifeng Sun |
| 2023 | Confidence-Aware Contrastive Learning for Semantic Segmentation. Lele Lv, Qing Liu, Shichao Kan, Yixiong Liang |
| 2023 | Consistency-aware Feature Learning for Hierarchical Fine-grained Visual Classification. Rui Wang, Cong Zou, Weizhong Zhang, Zixuan Zhu, Lihua Jing |
| 2023 | Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling. Yu Zhao, Hao Fei, Yixin Cao, Bobo Li, Meishan Zhang, Jianguo Wei, Min Zhang, Tat-Seng Chua |
| 2023 | Context-Aware Talking-Head Video Editing. Songlin Yang, Wei Wang, Jun Ling, Bo Peng, Xu Tan, Jing Dong |
| 2023 | Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation. Guojin Zhong, Jin Yuan, Pan Wang, Kailun Yang, Weili Guan, Zhiyong Li |
| 2023 | Contrastive Intra- and Inter-Modality Generation for Enhancing Incomplete Multimedia Recommendation. Zhenghong Lin, Yanchao Tan, Yunfei Zhan, Weiming Liu, Fan Wang, Chaochao Chen, Shiping Wang, Carl Yang |
| 2023 | Control3D: Towards Controllable Text-to-3D Generation. Yang Chen, Yingwei Pan, Yehao Li, Ting Yao, Tao Mei |
| 2023 | ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors. Jingwen Chen, Yingwei Pan, Ting Yao, Tao Mei |
| 2023 | Controllable Face Sketch-Photo Synthesis with Flexible Generative Priors. Kun Cheng, Mingrui Zhu, Nannan Wang, Guozhang Li, Xiaoyu Wang, Xinbo Gao |
| 2023 | Conversational Composed Retrieval with Iterative Sequence Refinement. Hao Wei, Shuhui Wang, Zhe Xue, Shengbo Chen, Qingming Huang |
| 2023 | Cooperative Colorization: Exploring Latent Cross-Domain Priors for NIR Image Spectrum Translation. Xingxing Yang, Jie Chen, Zaifeng Yang |
| 2023 | Counterfactual Cross-modality Reasoning for Weakly Supervised Video Moment Localization. Zezhong Lv, Bing Su, Ji-Rong Wen |
| 2023 | CropCap: Embedding Visual Cross-Partition Dependency for Image Captioning. Bo Wang, Zhao Zhang, Suiyi Zhao, Haijun Zhang, Richang Hong, Meng Wang |
| 2023 | Cross-Architecture Distillation for Face Recognition. Weisong Zhao, Xiangyu Zhu, Zhixiang He, Xiaoyu Zhang, Zhen Lei |
| 2023 | Cross-Illumination Video Anomaly Detection Benchmark. Dongliang Zhu, Ruimin Hu, Shengli Song, Xiang Guo, Xixi Li, Zheng Wang |
| 2023 | Cross-Lingual Transfer of Large Language Model by Visually-Derived Supervision Toward Low-Resource Languages. Masayasu Muraoka, Bishwaranjan Bhattacharjee, Michele Merler, Graeme Blackwood, Yulong Li, Yang Zhao |
| 2023 | Cross-Modal Graph Attention Network for Entity Alignment. Baogui Xu, Chengjin Xu, Bing Su |
| 2023 | Cross-Modal and Multi-Attribute Face Recognition: A Benchmark. Feng Lin, Kaiqiang Fu, Hao Luo, Ziyue Zhan, Zhibo Wang, Zhenguang Liu, Lorenzo Cavallaro, Kui Ren |
| 2023 | Cross-Silo Prototypical Calibration for Federated Learning with Non-IID Data. Zhuang Qi, Lei Meng, Zitan Chen, Han Hu, Hui Lin, Xiangxu Meng |
| 2023 | Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation. Yiyang Chen, Shanshan Zhao, Changxing Ding, Liyao Tang, Chaoyue Wang, Dacheng Tao |
| 2023 | Cross-modal Contrastive Learning for Multimodal Fake News Detection. Longzheng Wang, Chuang Zhang, Hongbo Xu, Yongxiu Xu, Xiaohan Xu, Siqi Wang |
| 2023 | Cross-modal Unsupervised Domain Adaptation for 3D Semantic Segmentation via Bidirectional Fusion-then-Distillation. Yao Wu, Mingwei Xing, Yachao Zhang, Yuan Xie, Jianping Fan, Zhongchao Shi, Yanyun Qu |
| 2023 | Cross-modal and Cross-medium Adversarial Attack for Audio. Liguo Zhang, Zilin Tian, Yunfei Long, Sizhao Li, Guisheng Yin |
| 2023 | Cross-modality Representation Interactive Learning for Multimodal Sentiment Analysis. Jian Huang, Yanli Ji, Yang Yang, Heng Tao Shen |
| 2023 | Cross-view Resolution and Frame Rate Joint Enhancement for Binocular Video. Panda Pan, Yang Zhao, Yuan Chen, Wei Jia, Zhao Zhang, Ronggang Wang |
| 2023 | Cuing Without Sharing: A Federated Cued Speech Recognition Framework via Mutual Knowledge Distillation. Yuxuan Zhang, Lei Liu, Li Liu |
| 2023 | Cultural Self-Adaptive Multimodal Gesture Generation Based on Multiple Culture Gesture Dataset. Jingyu Wu, Shi Chen, Shuyu Gan, Weijun Li, Changyuan Yang, Lingyun Sun |
| 2023 | Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding. Houlun Chen, Xin Wang, Xiaohan Lan, Hong Chen, Xuguang Duan, Jia Jia, Wenwu Zhu |
| 2023 | DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder. Chenpeng Du, Qi Chen, Tianyu He, Xu Tan, Xie Chen, Kai Yu, Sheng Zhao, Jiang Bian |
| 2023 | DANet: Multi-scale UAV Target Detection with Dynamic Feature Perception and Scale-aware Knowledge Distillation. Houzhang Fang, Zikai Liao, Lu Wang, Qingshan Li, Yi Chang, Luxin Yan, Xuhua Wang |
| 2023 | DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting. Huilin Zhu, Jingling Yuan, Xian Zhong, Zhengwei Yang, Zheng Wang, Shengfeng He |
| 2023 | DAWN: Direction-aware Attention Wavelet Network for Image Deraining. Kui Jiang, Wenxuan Liu, Zheng Wang, Xian Zhong, Junjun Jiang, Chia-Wen Lin |
| 2023 | DCEL: Deep Cross-modal Evidential Learning for Text-Based Person Retrieval. Shenshen Li, Xing Xu, Yang Yang, Fumin Shen, Yijun Mo, Yujie Li, Heng Tao Shen |
| 2023 | DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement Estimation in Conversation. Vu Ngoc Tu, Van Thong Huynh, Hyung-Jeong Yang, Soo-Hyung Kim, Shah Nawaz, Karthik Nandakumar, Muhammad Zaigham Zaheer |
| 2023 | DFIL: Deepfake Incremental Learning by Exploiting Domain-invariant Forgery Clues. Kun Pan, Yifang Yin, Yao Wei, Feng Lin, Zhongjie Ba, Zhenguang Liu, Zhibo Wang, Lorenzo Cavallaro, Kui Ren |
| 2023 | DLFusion: Painting-Depth Augmenting-LiDAR for Multimodal Fusion 3D Object Detection. Junyin Wang, Chenghu Du, Hui Li, Shengwu Xiong |
| 2023 | DPNET: Dynamic Poly-attention Network for Trustworthy Multi-modal Classification. Xin Zou, Chang Tang, Xiao Zheng, Zhenglai Li, Xiao He, Shan An, Xinwang Liu |
| 2023 | DRIN: Dynamic Relation Interactive Network for Multimodal Entity Linking. Shangyu Xing, Fei Zhao, Zhen Wu, Chunhui Li, Jianbing Zhang, Xinyu Dai |
| 2023 | DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via Deformable Template Field. Haowen Wang, Zhipeng Fan, Zhen Zhao, Zhengping Che, Zhiyuan Xu, Dong Liu, Feifei Feng, Yakun Huang, Xiuquan Qiao, Jian Tang |
| 2023 | DUSA: Decoupled Unsupervised Sim2Real Adaptation for Vehicle-to-Everything Collaborative Perception. Xianghao Kong, Wentao Jiang, Jinrang Jia, Yifeng Shi, Runsheng Xu, Si Liu |
| 2023 | Dance with You: The Diversity Controllable Dancer Generation via Diffusion Models. Siyue Yao, Mingjie Sun, Bingliang Li, Fengyu Yang, Junle Wang, Ruimao Zhang |
| 2023 | Dark Knowledge Balance Learning for Unbiased Scene Graph Generation. Zhiqing Chen, Yawei Luo, Jian Shao, Yi Yang, Chunping Wang, Lei Chen, Jun Xiao |
| 2023 | Data Augmentation for Human Behavior Analysis in Multi-Person Conversations. Kun Li, Dan Guo, Guoliang Chen, Feiyang Liu, Meng Wang |
| 2023 | Data-Efficient Masked Video Modeling for Self-supervised Action Recognition. Qiankun Li, Xiaolong Huang, Zhifan Wan, Lanqing Hu, Shuzhe Wu, Jie Zhang, Shiguang Shan, Zengfu Wang |
| 2023 | Data-Scarce Animal Face Alignment via Bi-Directional Cross-Species Knowledge Transfer. Dan Zeng, Shanchuan Hong, Shuiwang Li, Qiaomu Shen, Bo Tang |
| 2023 | DeNoL: A Few-Shot-Sample-Based Decoupling Noise Layer for Cross-channel Watermarking Robustness. Han Fang, Kejiang Chen, Yupeng Qiu, Jiayang Liu, Ke Xu, Chengfang Fang, Weiming Zhang, Ee-Chien Chang |
| 2023 | DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions. Teng Fu, Xiaocong Wang, Haiyang Yu, Ke Niu, Bin Li, Xiangyang Xue |
| 2023 | DealMVC: Dual Contrastive Calibration for Multi-view Clustering. Xihong Yang, Jiaqi Jin, Siwei Wang, Ke Liang, Yue Liu, Yi Wen, Suyuan Liu, Sihang Zhou, Xinwang Liu, En Zhu |
| 2023 | Debunking Free Fusion Myth: Online Multi-view Anomaly Detection with Disentangled Product-of-Experts Modeling. Hao Wang, Zhi-Qi Cheng, Jingdong Sun, Xin Yang, Xiao Wu, Hongyang Chen, Yan Yang |
| 2023 | DecenterNet: Bottom-Up Human Pose Estimation Via Decentralized Pose Representation. Tao Wang, Lei Jin, Zhang Wang, Xiaojin Fan, Yu Cheng, Yinglei Teng, Junliang Xing, Jian Zhao |
| 2023 | Deconfounded Multimodal Learning for Spatio-temporal Video Grounding. Jiawei Wang, Zhanchang Ma, Da Cao, Yuquan Le, Junbin Xiao, Tat-Seng Chua |
| 2023 | Deconfounded Visual Question Generation with Causal Inference. Jiali Chen, Zhenjun Guo, Jiayuan Xie, Yi Cai, Qing Li |
| 2023 | Decoupled Cross-Scale Cross-View Interaction for Stereo Image Enhancement in the Dark. Huan Zheng, Zhao Zhang, Jicong Fan, Richang Hong, Yi Yang, Shuicheng Yan |
| 2023 | Deep Algorithm Unrolling with Registration Embedding for Pansharpening. Tingting Wang, Yongxu Ye, Faming Fang, Guixu Zhang, Ming Xu |
| 2023 | Deep Image Harmonization in Dual Color Spaces. Linfeng Tan, Jiangtong Li, Li Niu, Liqing Zhang |
| 2023 | Deep Multimodal Learning for Information Retrieval. Wei Ji, Yinwei Wei, Zhedong Zheng, Hao Fei, Tat-Seng Chua |
| 2023 | Deep Neural Network Watermarking against Model Extraction Attack. Jingxuan Tan, Nan Zhong, Zhenxing Qian, Xinpeng Zhang, Sheng Li |
| 2023 | Deep Video Understanding with Video-Language Model. Runze Liu, Yaqun Fang, Fan Yu, Ruiqi Tian, Tongwei Ren, Gangshan Wu |
| 2023 | DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision. Hongbin Lin, Bolin Chen, Zhichen Zhang, Jielian Lin, Xu Wang, Tiesong Zhao |
| 2023 | Degeneration-Tuning: Using Scrambled Grid shield Unwanted Concepts from Stable Diffusion. Zixuan Ni, Longhui Wei, Jiacheng Li, Siliang Tang, Yueting Zhuang, Qi Tian |
| 2023 | Dense Object Grounding in 3D Scenes. Wencan Huang, Daizong Liu, Wei Hu |
| 2023 | Depth-Aware Sparse Transformer for Video-Language Learning. Haonan Zhang, Lianli Gao, Pengpeng Zeng, Alan Hanjalic, Heng Tao Shen |
| 2023 | Depth-aided Camouflaged Object Detection. Qingwei Wang, Jinyu Yang, Xiaosheng Yu, Fangyi Wang, Peng Chen, Feng Zheng |
| 2023 | Designing Loving-Kindness Meditation in Virtual Reality for Long-Distance Romantic Relationships. Xian Wang, Xiaoyu Mo, Lik-Hang Lee, Xiaoying Wei, Xiaofu Jin, Mingming Fan, Pan Hui |
| 2023 | Development of an Online Marathon System using Acoustic AR. Yuki Konishi, Panote Siriaraya, Da Li, Katsumi Tanaka, Yukiko Kawai, Shinsuke Nakajima |
| 2023 | DiVa: An Iterative Framework to Harvest More Diverse and Valid Labels from User Comments for Music. Hongru Liang, Jingyao Liu, Yuanxin Xiang, Jiachen Du, Lanjun Zhou, Shushen Pan, Wenqiang Lei |
| 2023 | Diff4Rec: Sequential Recommendation with Curriculum-scheduled Diffusion Augmentation. Zihao Wu, Xin Wang, Hong Chen, Kaidong Li, Yi Han, Lifeng Sun, Wenwu Zhu |
| 2023 | DiffBFR: Bootstrapping Diffusion Model for Blind Face Restoration. Xinmin Qiu, Congying Han, Zicheng Zhang, Bonan Li, Tiande Guo, Xuecheng Nie |
| 2023 | DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation. Qiaosong Qi, Le Zhuo, Aixi Zhang, Yue Liao, Fei Fang, Si Liu, Shuicheng Yan |
| 2023 | Differentially Private Sparse Mapping for Privacy-Preserving Cross Domain Recommendation. Weiming Liu, Xiaolin Zheng, Chaochao Chen, Mengling Hu, Xinting Liao, Fan Wang, Yanchao Tan, Dan Meng, Jun Wang |
| 2023 | Diffused Fourier Network for Video Action Segmentation. Borui Jiang, Yadong Mu |
| 2023 | Diffusion Models in Generative AI. Cem Sazara |
| 2023 | Diffusion-Augmented Depth Prediction with Sparse Annotations. Jiaqi Li, Yiran Wang, Zihao Huang, Jinghong Zheng, Ke Xian, Zhiguo Cao, Jianming Zhang |
| 2023 | Digging into Depth Priors for Outdoor Neural Radiance Fields. Chen Wang, Jiadai Sun, Lina Liu, Chenming Wu, Zhelun Shen, Dayan Wu, Yuchao Dai, Liangjun Zhang |
| 2023 | Digital Twins Fuzzy System Based on Time Series Forecasting Model LFTformer. Jinkang Guo, Zhibo Wan, Zhihan Lv |
| 2023 | Disentangle Propagation and Restoration for Efficient Video Recovery. Cong Huang, Jiahao Li, Lei Chu, Dong Liu, Yan Lu |
| 2023 | Disentangled Representation Learning for Multimedia. Xin Wang, Hong Chen, Wenwu Zhu |
| 2023 | Disentangled Representation Learning with Causality for Unsupervised Domain Adaptation. Shanshan Wang, Yiyang Chen, Zhenwei He, Xun Yang, Mengzhu Wang, Quanzeng You, Xingyi Zhang |
| 2023 | Disentangling Multi-view Representations Beyond Inductive Bias. Guanzhou Ke, Yang Yu, Guoqing Chao, Xiaoli Wang, Chenyang Xu, Shengfeng He |
| 2023 | Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification. Yunyi Xuan, Weijie Chen, Shicai Yang, Di Xie, Luojun Lin, Yueting Zhuang |
| 2023 | Distortion-aware Transformer in 360° Salient Object Detection. Yinjie Zhao, Lichen Zhao, Qian Yu, Lu Sheng, Jing Zhang, Dong Xu |
| 2023 | Distribution Consistency based Fast Anchor Imputation for Incomplete Multi-view Clustering. Xingfeng Li, Yinghui Sun, Quansen Sun, Jia Dai, Zhenwen Ren |
| 2023 | Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR. Zhenyang Li, Yangyang Guo, Kejie Wang, Xiaolin Chen, Liqiang Nie, Mohan S. Kankanhalli |
| 2023 | DocDiff: Document Enhancement via Residual Diffusion Models. Zongyuan Yang, Baolin Liu, Yongping Xiong, Lan Yi, Guibin Wu, Xiaojun Tang, Ziqi Liu, Junjie Zhou, Xing Zhang |
| 2023 | Domain-irrelevant Feature Learning for Generalizable Pan-sharpening. Yunlong Lin, Zhenqi Fu, Ge Meng, Yingying Wang, Yuhang Dong, Linyu Fan, Hedeng Yu, Xinghao Ding |
| 2023 | Double Doodles: Sketching Animation in Immersive Environment With 3+6 DOFs Motion Gestures. Ruizhao Chen, Ye Pan, Zhigang Deng, Lili Wang, Lizhuang Ma |
| 2023 | Double-Fine-Tuning Multi-Objective Vision-and-Language Transformer for Social Media Popularity Prediction. Xiaolu Chen, Weilong Chen, Chenghao Huang, Zhongjian Zhang, Lixin Duan, Yanru Zhang |
| 2023 | Doubly Intention Learning for Cold-start Recommendation with Uncertainty-aware Stochastic Meta Process. Huafeng Liu, Mingjie Zhou, Liping Jing, Michael K. Ng |
| 2023 | Draw2Edit: Mask-Free Sketch-Guided Image Manipulation. Yiwen Xu, Ruoyu Guo, Maurice Pagnucco, Yang Song |
| 2023 | Dropping Pathways Towards Deep Multi-View Graph Subspace Clustering Networks. Zihao Zhang, Qianqian Wang, Zhiqiang Tao, Quanxue Gao, Wei Feng |
| 2023 | DuDoINet: Dual-Domain Implicit Network for Multi-Modality MR Image Arbitrary-scale Super-Resolution. Guangyuan Li, Wei Xing, Lei Zhao, Zehua Lan, Zhanjie Zhang, Jiakai Sun, Haolin Yin, Huaizhong Lin, Zhijie Lin |
| 2023 | Dual Dynamic Proxy Hashing Network for Long-tailed Image Retrieval. Yan Jiang, Hongtao Xie, Lei Zhang, Pandeng Li, Dongming Zhang, Yongdong Zhang |
| 2023 | Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning. Chen Jiang, Hong Liu, Xuzheng Yu, Qing Wang, Yuan Cheng, Jia Xu, Zhongyi Liu, Qingpei Guo, Wei Chu, Ming Yang, Yuan Qi |
| 2023 | Ducho: A Unified Framework for the Extraction of Multimodal Features in Recommendation. Daniele Malitesta, Giuseppe Gassi, Claudio Pomo, Tommaso Di Noia |
| 2023 | Dynamic Compositional Graph Convolutional Network for Efficient Composite Human Motion Prediction. Wanying Zhang, Shen Zhao, Fanyang Meng, Songtao Wu, Mengyuan Liu |
| 2023 | Dynamic Contrastive Learning with Pseudo-samples Intervention for Weakly Supervised Joint Video MR and HD. Shuhan Kong, Liang Li, Beichen Zhang, Wenyu Wang, Bin Jiang, Chenggang Yan, Changhao Xu |
| 2023 | Dynamic Grouped Interaction Network for Low-Light Stereo Image Enhancement. Baiang Li, Huan Zheng, Zhao Zhang, Yang Zhao, Zhongqiu Zhao, Haijun Zhang |
| 2023 | Dynamic Low-Rank Instance Adaptation for Universal Neural Image Compression. Yue Lv, Jinxi Xiang, Jun Zhang, Wenming Yang, Xiao Han, Wei Yang |
| 2023 | Dynamic Triple Reweighting Network for Automatic Femoral Head Necrosis Diagnosis from Computed Tomography. Lingfeng Li, Gangming Zhao, Yizhou Yu, Jinpeng Li |
| 2023 | Dynamic View Synthesis with Spatio-Temporal Feature Warping from Sparse Views. Deqi Li, Shi-Sheng Huang, Tianyu Shen, Hua Huang |
| 2023 | EAT: An Enhancer for Aesthetics-Oriented Transformers. Shuai He, Anlong Ming, Shuntian Zheng, Haobin Zhong, Huadong Ma |
| 2023 | ECENet: Explainable and Context-Enhanced Network for Muti-modal Fact verification. Fanrui Zhang, Jiawei Liu, Qiang Zhang, Esther Sun, Jingyi Xie, Zheng-Jun Zha |
| 2023 | ELFIC: A Learning-based Flexible Image Codec with Rate-Distortion-Complexity Optimization. Zhichen Zhang, Bolin Chen, Hongbin Lin, Jielian Lin, Xu Wang, Tiesong Zhao |
| 2023 | ENTRO: Tackling the Encoding and Networking Trade-off in Offloaded Video Analytics. Seyeon Kim, Kyungmin Bin, Donggyu Yang, Sangtae Ha, Song Chong, Kyunghan Lee |
| 2023 | EasyNet: An Easy Network for 3D Industrial Anomaly Detection. Ruitao Chen, Guoyang Xie, Jiaqi Liu, Jinbao Wang, Ziqi Luo, Jinfan Wang, Feng Zheng |
| 2023 | Echoes: Unsupervised Debiasing via Pseudo-bias Labeling in an Echo Chamber. Rui Hu, Yahan Tu, Jitao Sang |
| 2023 | Edge-Assisted On-Device Model Update for Video Analytics in Adverse Environments. Yuxin Kong, Peng Yang, Yan Cheng |
| 2023 | EditAnything: Empowering Unparalleled Flexibility in Image Editing and Generation. Shanghua Gao, Zhijie Lin, Xingyu Xie, Pan Zhou, Ming-Ming Cheng, Shuicheng Yan |
| 2023 | Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks. Payal Mohapatra, Akash Pandey, Yueyuan Sui, Qi Zhu |
| 2023 | Efficiency-optimized Video Diffusion Models. Zijun Deng, Xiangteng He, Yuxin Peng |
| 2023 | Efficient Bilateral Cross-Modality Cluster Matching for Unsupervised Visible-Infrared Person ReID. De Cheng, Lingfeng He, Nannan Wang, Shizhou Zhang, Zhen Wang, Xinbo Gao |
| 2023 | Efficient Hierarchical Multi-view Fusion Transformer for 3D Human Pose Estimation. Kangkang Zhou, Lijun Zhang, Feng Lu, Xiang-Dong Zhou, Yu Shi |
| 2023 | Efficient Labelling of Affective Video Datasets via Few-Shot & Multi-Task Contrastive Learning. Ravikiran Parameshwara, Ibrahim Radwan, Akshay Asthana, Iman Abbasnejad, Ramanathan Subramanian, Roland Goecke |
| 2023 | Efficient Micro-Expression Spotting Based on Main Directional Mean Optical Flow Feature. Jun Yu, Zhongpeng Cai, Shenshen Du, Xiaxin Shen, Lei Wang, Fang Gao |
| 2023 | Efficient Multi-View Graph Clustering with Local and Global Structure Preservation. Yi Wen, Suyuan Liu, Xinhang Wan, Siwei Wang, Ke Liang, Xinwang Liu, Xihong Yang, Pei Zhang |
| 2023 | Efficient Multimedia Computing: Unleashing the Power of AutoML. Debanjan Datta, Gerald Friedland |
| 2023 | Efficient Parallel Multi-Scale Detail and Semantic Encoding Network for Lightweight Semantic Segmentation. Xiao Liu, Xiuya Shi, Lufei Chen, Linbo Qing, Chao Ren |
| 2023 | Efficient Spatio-Temporal Video Grounding with Semantic-Guided Feature Decomposition. Weikang Wang, Jing Liu, Yuting Su, Weizhi Nie |
| 2023 | Elucidate Gender Fairness in Singing Voice Transcription. Xiangming Gu, Wei Zeng, Ye Wang |
| 2023 | Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition. Jiaxin Ye, Yujie Wei, Xin-Cheng Wen, Chenglong Ma, Zhizhong Huang, Kunhong Liu, Hongming Shan |
| 2023 | Emotion Recognition ToolKit (ERTK): Standardising Tools For Emotion Recognition Research. Aaron Keesing, Yun Sing Koh, Vithya Yogarajan, Michael Witbrock |
| 2023 | Emotion-Prior Awareness Network for Emotional Video Captioning. Peipei Song, Dan Guo, Xun Yang, Shengeng Tang, Erkun Yang, Meng Wang |
| 2023 | EmotionKD: A Cross-Modal Knowledge Distillation Framework for Emotion Recognition Based on Physiological Signals. Yucheng Liu, Ziyu Jia, Haichao Wang |
| 2023 | Emotionally Situated Text-to-Speech Synthesis in User-Agent Conversation. Yuchen Liu, Haoyu Zhang, Shichao Liu, Xiang Yin, Zejun Ma, Qin Jin |
| 2023 | Encoding and Decoding Narratives: Datafication and Alternative Access Models for Audiovisual Archives. Yuchen Yang |
| 2023 | End-to-end XY Separation for Single Image Blind Deblurring. Liuhan Chen, Yirou Wang, Yongyong Chen |
| 2023 | Enhanced CatBoost with Stacking Features for Social Media Prediction. Shijian Mao, Wudong Xi, Lei Yu, Gaotian Lü, Xingxing Xing, Xingchen Zhou, Wei Wan |
| 2023 | Enhanced Image Deblurring: An Efficient Frequency Exploitation and Preservation Network. Shuting Dong, Zhe Wu, Feng Lu, Chun Yuan |
| 2023 | Enhancing Adversarial Robustness of Multi-modal Recommendation via Modality Balancing. Yu Shang, Chen Gao, Jiansheng Chen, Depeng Jin, Huimin Ma, Yong Li |
| 2023 | Enhancing Domain-Invariant Parts for Generalized Zero-Shot Learning. Yang Zhang, Songhe Feng |
| 2023 | Enhancing Fake News Detection in Social Media via Label Propagation on Cross-modal Tweet Graph. Wanqing Zhao, Yuta Nakashima, Haiyuan Chen, Noboru Babaguchi |
| 2023 | Enhancing Multi-modal Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation. Qian Yang, Qian Chen, Wen Wang, Baotian Hu, Min Zhang |
| 2023 | Enhancing Product Representation with Multi-form Interactions for Multimodal Conversational Recommendation. Wenzhe Du, Haoyang Su, Cam-Tu Nguyen, Jian Sun |
| 2023 | Enhancing Real-Time Super Resolution with Partial Convolution and Efficient Variance Attention. Zhou Zhou, Jiahao Chao, Jiali Gong, Hongfan Gao, Zhenbing Zeng, Zhengfeng Yang |
| 2023 | Enhancing Sentence Representation with Visually-supervised Multimodal Pre-training. Zhe Li, Laurence T. Yang, Xin Nie, Bocheng Ren, Xianjun Deng |
| 2023 | Enhancing Visibility in Nighttime Haze Images Using Guided APSF and Gradient Adaptive Convolution. Yeying Jin, Beibei Lin, Wending Yan, Yuan Yuan, Wei Ye, Robby T. Tan |
| 2023 | Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner. Zikang Liu, Sihan Chen, Longteng Guo, Handong Li, Xingjian He, Jing Liu |
| 2023 | Enhancing Visually-Rich Document Understanding via Layout Structure Modeling. Qiwei Li, Zuchao Li, Xiantao Cai, Bo Du, Hai Zhao |
| 2023 | Entropy Neural Estimation for Graph Contrastive Learning. Yixuan Ma, Xiaolin Zhang, Peng Zhang, Kun Zhan |
| 2023 | Entropy-based Optimization on Individual and Global Predictions for Semi-Supervised Learning. Zhen Zhao, Meng Zhao, Ye Liu, Di Yin, Luping Zhou |
| 2023 | Equivariant Learning for Out-of-Distribution Cold-start Recommendation. Wenjie Wang, Xinyu Lin, Liuhui Wang, Fuli Feng, Yinwei Wei, Tat-Seng Chua |
| 2023 | Event-Diffusion: Event-Based Image Reconstruction and Restoration with Diffusion Models. Quanmin Liang, Xiawu Zheng, Kai Huang, Yan Zhang, Jie Chen, Yonghong Tian |
| 2023 | Event-Enhanced Multi-Modal Spiking Neural Network for Dynamic Obstacle Avoidance. Yang Wang, Bo Dong, Yuji Zhang, Yunduo Zhou, Haiyang Mei, Ziqi Wei, Xin Yang |
| 2023 | Event-based Motion Deblurring with Modality-Aware Decomposition and Recomposition. Wen Yang, Jinjian Wu, Leida Li, Weisheng Dong, Guangming Shi |
| 2023 | Event-guided Frame Interpolation and Dynamic Range Expansion of Single Rolling Shutter Image. Guixu Lin, Jin Han, Mingdeng Cao, Zhihang Zhong, Yinqiang Zheng |
| 2023 | Expand BERT Representation with Visual Information via Grounded Language Learning with Multimodal Partial Alignment. Cong-Duy Nguyen, The-Anh Vu-Le, Thong Nguyen, Tho Quan, Anh Tuan Luu |
| 2023 | Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface. Ruiqi Zhang, Jie Chen, Qiang Wang |
| 2023 | Exploiting Fine-Grained DCT Representations for Hiding Image-Level Messages within JPEG Images. Junxue Yang, Xin Liao |
| 2023 | Exploiting Low-confidence Pseudo-labels for Source-free Object Detection. Zhihong Chen, Zilei Wang, Yixin Zhang |
| 2023 | Exploiting Time-Frequency Conformers for Music Audio Enhancement. Yunkee Chae, Junghyun Koo, Sungho Lee, Kyogu Lee |
| 2023 | Exploring Coarse-to-Fine Action Token Localization and Interaction for Fine-grained Video Action Recognition. Baoli Sun, Xinchen Ye, Zhihui Wang, Haojie Li, Zhiyong Wang |
| 2023 | Exploring Correlations in Degraded Spatial Identity Features for Blind Face Restoration. Qian Ning, Fangfang Wu, Weisheng Dong, Xin Li, Guangming Shi |
| 2023 | Exploring Dual Representations in Large-Scale Point Clouds: A Simple Weakly Supervised Semantic Segmentation Framework. Jiaming Liu, Yue Wu, Maoguo Gong, Qiguang Miao, Wenping Ma, Cai Xu |
| 2023 | Exploring High-Correlation Source Domain Information for Multi-Source Domain Adaptation in Semantic Segmentation. Yuxiang Cai, Meng Xi, Yongheng Shang, Jianwei Yin |
| 2023 | Exploring Hyperspectral Histopathology Image Segmentation from a Deformable Perspective. Xingran Xie, Ting Jin, Boxiang Yun, Qingli Li, Yan Wang |
| 2023 | Exploring Inconsistent Knowledge Distillation for Object Detection with Data Augmentation. Jiawei Liang, Siyuan Liang, Aishan Liu, Ke Ma, Jingzhi Li, Xiaochun Cao |
| 2023 | Exploring Motion Cues for Video Test-Time Adaptation. Runhao Zeng, Qi Deng, Huixuan Xu, Shuaicheng Niu, Jian Chen |
| 2023 | Exploring Shape Embedding for Cloth-Changing Person Re-Identification via 2D-3D Correspondences. Yubin Wang, Huimin Yu, Yuming Yan, Shuyi Song, Biyang Liu, Yichong Lu |
| 2023 | Exploring Universal Principles for Graph Contrastive Learning: A Statistical Perspective. Jinyong Wen, Shiming Xiang, Chunhong Pan |
| 2023 | Exploring the Adversarial Robustness of Video Object Segmentation via One-shot Adversarial Attacks. Kaixun Jiang, Lingyi Hong, Zhaoyu Chen, Pinxue Guo, Zeng Tao, Yan Wang, Wenqiang Zhang |
| 2023 | Exploring the Knowledge Transferred by Response-Based Teacher-Student Distillation. Liangchen Song, Xuan Gong, Helong Zhou, Jiajie Chen, Qian Zhang, David S. Doermann, Junsong Yuan |
| 2023 | External Knowledge Dynamic Modeling for Image-text Retrieval. Song Yang, Qiang Li, Wenhui Li, Min Liu, Xuanya Li, Anan Liu |
| 2023 | FCBoost-Net: A Generative Network for Synthesizing Multiple Collocated Outfits via Fashion Compatibility Boosting. Dongliang Zhou, Haijun Zhang, Jianghong Ma, Jicong Fan, Zhao Zhang |
| 2023 | FDCNet: Feature Drift Compensation Network for Class-Incremental Weakly Supervised Object Localization. Sejin Park, Taehyung Lee, Yeejin Lee, Byeongkeun Kang |
| 2023 | FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos. Joo Chan Lee, Daniel Rho, Jong Hwan Ko, Eunbyung Park |
| 2023 | FME '23: 3rd Facial Micro-Expression Workshop. Adrian K. Davison, Jingting Li, Moi Hoon Yap, John See, Wen-Huang Cheng, Xiaobai Li, Xiaopeng Hong, Su-Jing Wang |
| 2023 | FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical Flow. Mufeng Yao, Jiaqi Wang, Jinlong Peng, Mingmin Chi, Chao Liu |
| 2023 | FSNet: Frequency Domain Guided Superpixel Segmentation Network for Complex Scenes. Hua Li, Junyan Liang, Wenjie Li, Wenhui Wu |
| 2023 | FSR-Net: Deep Fourier Network for Shadow Removal. Jun Yu, Peng He, Ziqi Peng |
| 2023 | Face Encryption via Frequency-Restricted Identity-Agnostic Attacks. Xin Dong, Rui Wang, Siyuan Liang, Aishan Liu, Lihua Jing |
| 2023 | Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment. Zhengyan Sheng, Yang Ai, Yan-Nian Chen, Zhen-Hua Ling |
| 2023 | Facial Auto Rigging from 4D Expressions via Skinning Decomposition. Zhihe Zhao, Dongdong Weng, Hanzhi Guo, Jing Hou, Jixiang Zhou |
| 2023 | Factorized Omnidirectional Representation based Vision GNN for Anisotropic 3D Multimodal MR Image Segmentation. Bo Zhang, Yunpeng Tan, Zheng Zhang, Wu Liu, Hui Gao, Zhijun Xi, Wendong Wang |
| 2023 | FashionDiff: A Controllable Diffusion Model Using Pairwise Fashion Elements for Intelligent Design. Han Yan, Haijun Zhang, Xiangyu Mu, Jicong Fan, Zhao Zhang |
| 2023 | FastLLVE: Real-Time Low-Light Video Enhancement with Intensity-Aware Look-Up Table. Wenhao Li, Guangyang Wu, Wenyi Wang, Peiran Ren, Xiaohong Liu |
| 2023 | FastReID: A Pytorch Toolbox for General Instance Re-identification. Lingxiao He, Xingyu Liao, Wu Liu, Xinchen Liu, Peng Cheng, Tao Mei |
| 2023 | Faster Video Moment Retrieval with Point-Level Supervision. Xun Jiang, Zailei Zhou, Xing Xu, Yang Yang, Guoqing Wang, Heng Tao Shen |
| 2023 | FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions. Jiaming Gu, Jingyu Zhang, Muyang Zhang, Weiliang Meng, Shibiao Xu, Jiguang Zhang, Xiaopeng Zhang |
| 2023 | Fearless Luminance Adaptation: A Macro-Micro-Hierarchical Transformer for Exposure Correction. Gehui Li, Jinyuan Liu, Long Ma, Zhiying Jiang, Xin Fan, Risheng Liu |
| 2023 | Feature Decoupling-Recycling Network for Fast Interactive Segmentation. Huimin Zeng, Weinong Wang, Xin Tao, Zhiwei Xiong, Yu-Wing Tai, Wenjie Pei |
| 2023 | Feature-Suppressed Contrast for Self-Supervised Food Pre-training. Xinda Liu, Yaohui Zhu, Linhu Liu, Jiang Tian, Lili Wang |
| 2023 | FedAA: Using Non-sensitive Modalities to Improve Federated Learning while Preserving Image Privacy. Dong Chen, Siliang Tang, Zijin Shen, Guoming Wang, Jun Xiao, Yueting Zhuang, Carl Yang |
| 2023 | FedCD: A Classifier Debiased Federated Learning Framework for Non-IID Data. Yunfei Long, Zhe Xue, Lingyang Chu, Tianlong Zhang, Junjiang Wu, Yu Zang, Junping Du |
| 2023 | FedCE: Personalized Federated Learning Method based on Clustering Ensembles. Luxin Cai, Naiyue Chen, Yuanzhouhan Cao, Jiahuan He, Yidong Li |
| 2023 | FedGH: Heterogeneous Federated Learning with Generalized Global Header. Liping Yi, Gang Wang, Xiaoguang Liu, Zhuan Shi, Han Yu |
| 2023 | FedVQA: Personalized Federated Visual Question Answering over Heterogeneous Scenes. Mingrui Lao, Nan Pu, Zhun Zhong, Nicu Sebe, Michael S. Lew |
| 2023 | Federated Deep Multi-View Clustering with Global Self-Supervision. Xinyue Chen, Jie Xu, Yazhou Ren, Xiaorong Pu, Ce Zhu, Xiaofeng Zhu, Zhifeng Hao, Lifang He |
| 2023 | Federated Learning with Label-Masking Distillation. Jianghu Lu, Shikun Li, Kexin Bao, Pengju Wang, Zhenxing Qian, Shiming Ge |
| 2023 | Feeling Positive? Predicting Emotional Image Similarity from Brain Signals. Tuukka Ruotsalo, Kalle Mäkelä, Michiel M. A. Spapé, Luis A. Leiva |
| 2023 | Feeling Present! From Physical to Virtual Cinematography Lighting Education with Metashadow. Zheng Wei, Xian Xu, Lik-Hang Lee, Wai Tong, Huamin Qu, Pan Hui |
| 2023 | Few-shot Multimodal Sentiment Analysis Based on Multimodal Probabilistic Fusion Prompts. Xiaocui Yang, Shi Feng, Daling Wang, Yifei Zhang, Soujanya Poria |
| 2023 | Filling in the Blank: Rationale-Augmented Prompt Tuning for TextVQA. Gangyan Zeng, Yuan Zhang, Yu Zhou, Bo Fang, Guoqing Zhao, Xin Wei, Weiping Wang |
| 2023 | Filling the Information Gap between Video and Query for Language-Driven Moment Retrieval. Daizong Liu, Xiaoye Qu, Jianfeng Dong, Guoshun Nan, Pan Zhou, Zichuan Xu, Lixing Chen, He Yan, Yu Cheng |
| 2023 | Finding Efficient Pruned Network via Refined Gradients for Pruned Weights. Jangho Kim, Jayeon Yoo, Yeji Song, KiYoon Yoo, Nojun Kwak |
| 2023 | Fine-Grained Multimodal Named Entity Recognition and Grounding with a Generative Framework. Jieming Wang, Ziyan Li, Jianfei Yu, Li Yang, Rui Xia |
| 2023 | Fine-Grained Music Plagiarism Detection: Revealing Plagiarists through Bipartite Graph Matching and a Comprehensive Large-Scale Dataset. Wenxuan Liu, Tianyao He, Chen Gong, Ning Zhang, Hua Yang, Junchi Yan |
| 2023 | Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning. Minghao Zhu, Xiao Lin, Ronghao Dang, Chengju Liu, Qijun Chen |
| 2023 | Fine-Grained Visual Prompt Learning of Vision-Language Models for Image Recognition. Hongbo Sun, Xiangteng He, Jiahuan Zhou, Yuxin Peng |
| 2023 | Fine-grained Key-Value Memory Enhanced Predictor for Video Representation Learning. Xiaojie Li, Jianlong Wu, Shaowei He, Shuo Kang, Yue Yu, Liqiang Nie, Min Zhang |
| 2023 | Fine-grained Pseudo Labels for Scene Text Recognition. Xiaoyu Li, Xiaoxue Chen, Zuming Huang, Lele Xie, Jingdong Chen, Ming Yang |
| 2023 | Finetuning Language Models for Multimodal Question Answering. Xin Zhang, Wen Xie, Ziqi Dai, Jun Rao, Haokun Wen, Xuan Luo, Meishan Zhang, Min Zhang |
| 2023 | FlatGAN: A Holistic Approach for Robust Flat-Coloring in High-Definition with Understanding Line Discontinuity. Han Kim, Chunggi Lee, Junsoo Lee, Dohyun Kim, Kwangjin Lee, Moohyun Oh, Daesik Kim |
| 2023 | FlexIcon: Flexible Icon Colorization via Guided Images and Palettes. Shukai Wu, Yuhang Yang, Shuchang Xu, Weiming Liu, Xiao Yan, Sanyuan Zhang |
| 2023 | Flexible and Secure Watermarking for Latent Diffusion Model. Cheng Xiong, Chuan Qin, Guorui Feng, Xinpeng Zhang |
| 2023 | Focusing on Flexible Masks: A Novel Framework for Panoptic Scene Graph Generation with Relation Constraints. Jiarui Yang, Chuan Wang, Zeming Liu, Jiahong Wu, Dongsheng Wang, Liang Yang, Xiaochun Cao |
| 2023 | Follow-me: Deceiving Trackers with Fabricated Paths. Shengtao Lou, Buyu Liu, Jun Bao, Jiajun Ding, Jun Yu |
| 2023 | Food-500 Cap: A Fine-Grained Food Caption Benchmark for Evaluating Vision-Language Models. Zheng Ma, Mianzhi Pan, Wenhan Wu, Kanzhi Cheng, Jianbing Zhang, Shujian Huang, Jiajun Chen |
| 2023 | Foreground/Background-Masked Interaction Learning for Spatio-temporal Action Detection. Keke Chen, Xiangbo Shu, Guo-Sen Xie, Rui Yan, Jinhui Tang |
| 2023 | FourLLIE: Boosting Low-Light Image Enhancement by Fourier Frequency Information. Chenxi Wang, Hongjun Wu, Zhi Jin |
| 2023 | Free Fine-tuning: A Plug-and-Play Watermarking Scheme for Deep Neural Networks. Run Wang, Jixing Ren, Boheng Li, Tianyi She, Wenhui Zhang, Liming Fang, Jing Chen, Lina Wang |
| 2023 | Freq-HD: An Interpretable Frequency-based High-Dynamics Affective Clip Selection Method for in-the-Wild Facial Expression Recognition in Videos. Zeng Tao, Yan Wang, Zhaoyu Chen, Boyang Wang, Shaoqi Yan, Kaixun Jiang, Shuyong Gao, Wenqiang Zhang |
| 2023 | Frequency Perception Network for Camouflaged Object Detection. Runmin Cong, Mengyao Sun, Sanyi Zhang, Xiaofei Zhou, Wei Zhang, Yao Zhao |
| 2023 | Frequency Representation Integration for Camouflaged Object Detection. Chenxi Xie, Changqun Xia, Tianshu Yu, Jia Li |
| 2023 | Frequency-based Zero-Shot Learning with Phase Augmentation. Wanting Yin, Hongtao Xie, Lei Zhang, Jiannan Ge, Pandeng Li, Chuanbin Liu, Yongdong Zhang |
| 2023 | G-PCC++: Enhanced Geometry-based Point Cloud Compression. Junzhe Zhang, Tong Chen, Dandan Ding, Zhan Ma |
| 2023 | G2-DUN: Gradient Guided Deep Unfolding Network for Image Compressive Sensing. Wenxue Cui, Xingtao Wang, Xiaopeng Fan, Shaohui Liu, Chen Ma, Debin Zhao |
| 2023 | GCL: Gradient-Guided Contrastive Learning for Medical Image Segmentation with Multi-Perspective Meta Labels. Yixuan Wu, Jintai Chen, Jiahuan Yan, Yiheng Zhu, Danny Z. Chen, Jian Wu |
| 2023 | GCMA: Generative Cross-Modal Transferable Adversarial Attacks from Images to Videos. Kai Chen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang |
| 2023 | Gaze Analysis System for Immersive 360° Video for Preservice Teacher Education. Chris Lenart, Pegah Ahadian, Yuxin Yang, Simon Suo, Ashton Corsello, Karl W. Kosko, Qiang Guan |
| 2023 | General Debiasing for Multimodal Sentiment Analysis. Teng Sun, Juntong Ni, Wenjie Wang, Liqiang Jing, Yinwei Wei, Liqiang Nie |
| 2023 | Generalizable Label Distribution Learning. Xingyu Zhao, Lei Qi, Yuexuan An, Xin Geng |
| 2023 | Generalized Universal Domain Adaptation with Generative Flow Networks. Didi Zhu, Yinchuan Li, Yunfeng Shao, Jianye Hao, Fei Wu, Kun Kuang, Jun Xiao, Chao Wu |
| 2023 | Generalizing Face Forgery Detection via Uncertainty Learning. Yanqi Wu, Xue Song, Jingjing Chen, Yu-Gang Jiang |
| 2023 | Generating Explanations for Embodied Action Decision from Visual Observation. Xiaohan Wang, Yuehu Liu, Xinhang Song, Beibei Wang, Shuqiang Jiang |
| 2023 | Generative Neutral Features-Disentangled Learning for Facial Expression Recognition. Zhenqian Wu, Yazhou Ren, Xiaorong Pu, Zhifeng Hao, Lifang He |
| 2023 | Giving Text More Imagination Space for Image-text Matching. Xinfeng Dong, Longfei Han, Dingwen Zhang, Li Liu, Junwei Han, Huaxiang Zhang |
| 2023 | Globally-Robust Instance Identification and Locally-Accurate Keypoint Alignment for Multi-Person Pose Estimation. Fangzheng Tian, Sungchan Kim |
| 2023 | GoRec: A Generative Cold-start Recommendation Framework. Haoyue Bai, Min Hou, Le Wu, Yonghui Yang, Kun Zhang, Richang Hong, Meng Wang |
| 2023 | GraMMaR: Ground-aware Motion Model for 3D Human Motion Reconstruction. Sihan Ma, Qiong Cao, Hongwei Yi, Jing Zhang, Dacheng Tao |
| 2023 | Gradient Boost Tree Network based on Extensive Feature Analysis for Popularity Prediction of Social Posts. Chih-Chung Hsu, Chia-Ming Lee, Xiu-Yu Hou, Chi-Han Tsai |
| 2023 | Gradient-Free Textual Inversion. Zhengcong Fei, Mingyuan Fan, Junshi Huang |
| 2023 | Graph Convolutional Incomplete Multi-modal Hashing. Xiaobo Shen, Yinfan Chen, Shirui Pan, Weiwei Liu, Yuhui Zheng |
| 2023 | Graph Spectral Perturbation for 3D Point Cloud Contrastive Learning. Yuehui Han, Jiaxin Chen, Jianjun Qian, Jin Xie |
| 2023 | Graph based Spatial-temporal Fusion for Multi-modal Person Re-identification. Yaobin Zhang, Jianming Lv, Chen Liu, Hongmin Cai |
| 2023 | Graph to Grid: Learning Deep Representations for Multimodal Emotion Recognition. Ming Jin, Jinpeng Li |
| 2023 | Graph-Based Video-Language Learning with Multi-Grained Audio-Visual Alignment. Chenyang Lyu, Wenxi Li, Tianbo Ji, Longyue Wang, Liting Zhou, Cathal Gurrin, Linyi Yang, Yi Yu, Yvette Graham, Jennifer Foster |
| 2023 | GraphMedia: Communication-balanced Graph Searching for Billion-scale Social Media Access. Xinbiao Gan, Jiaqi Guo, Peilin Guo, Guang Wu, Jiaqi Si, Songzhu Mei, Cong Liu, Tiejun Li |
| 2023 | GridFormer: Towards Accurate Table Structure Recognition via Grid Prediction. Pengyuan Lyu, Weihong Ma, Hongyi Wang, Yuechen Yu, Chengquan Zhang, Kun Yao, Yang Xue, Jingdong Wang |
| 2023 | GrooveMeter: Enabling Music Engagement-aware Apps by Detecting Reactions to Daily Music Listening via Earable Sensing. Euihyeok Lee, Chulhong Min, Jaeseung Lee, Jin Yu, Seungwoo Kang |
| 2023 | Ground-to-Aerial Person Search: Benchmark Dataset and Approach. Shizhou Zhang, Qingchun Yang, De Cheng, Yinghui Xing, Guoqiang Liang, Peng Wang, Yanning Zhang |
| 2023 | Guided Image Synthesis via Initial Image Editing in Diffusion Model. Jiafeng Mao, Xueting Wang, Kiyoharu Aizawa |
| 2023 | H2V4Sports: Real-Time Horizontal-to-Vertical Video Converter for Sports Lives via Fast Object Detection and Tracking. Yi Han, Kaidong Li, Zihan Song, Wei Feng, Xiang Cao, Shida Guo, Xin Wang, Xuguang Duan, Wenwu Zhu |
| 2023 | HAAN: Human Action Aware Network for Multi-label Temporal Action Detection. Zikai Gao, Peng Qiao, Yong Dou |
| 2023 | HARP: Let Object Detector Undergo Hyperplasia to Counter Adversarial Patches. Junzhe Cai, Shuiyan Chen, Heng Li, Beihao Xia, Zimin Mao, Wei Yuan |
| 2023 | HCMA '23: 4th International Workshop on Human-Centric Multimedia Analysis. Jingkuan Song, Wu Liu, Xinchen Liu, Dingwen Zhang, Chaowei Fang, Hongyuan Zhu, Wenbing Huang, John Smith, Xin Wang |
| 2023 | HCSD-Net: Single Image Desnowing with Color Space Transformation. Ting Zhang, Nanfeng Jiang, HongXin Wu, Keke Zhang, Yuzhen Niu, Tiesong Zhao |
| 2023 | HELIOS: Hyper-Relational Schema Modeling from Knowledge Graphs. Yuhuan Lu, Bangchao Deng, Weijian Yu, Dingqi Yang |
| 2023 | HSIC-based Moving Weight Averaging for Few-Shot Open-Set Object Detection. Binyi Su, Hua Zhang, Zhong Zhou |
| 2023 | HSVLT: Hierarchical Scale-Aware Vision-Language Transformer for Multi-Label Image Classification. Shuyi Ouyang, Hongyi Wang, Ziwei Niu, Zhenjia Bai, Shiao Xie, Yingying Xu, Ruofeng Tong, Yen-Wei Chen, Lanfen Lin |
| 2023 | Handling Label Uncertainty for Camera Incremental Person Re-Identification. Zexian Yang, Dayan Wu, Wanqian Zhang, Bo Li, Weiping Wang |
| 2023 | Handwritten Chemical Structure Image to Structure-Specific Markup Using Random Conditional Guided Decoder. Jinshui Hu, Hao Wu, Mingjun Chen, Chenyu Liu, Jiajia Wu, Shi Yin, Baocai Yin, Bing Yin, Cong Liu, Jun Du, Lirong Dai |
| 2023 | Haptic-aware Interaction: Design and Evaluation. Ying Fang |
| 2023 | Hardware-friendly Scalable Image Super Resolution with Progressive Structured Sparsity. Fangchen Ye, Jin Lin, Hongzhan Huang, Jianping Fan, Zhongchao Shi, Yuan Xie, Yanyun Qu |
| 2023 | Hashing One With All. Jiaguo Yu, Yuming Shen, Haofeng Zhang |
| 2023 | Hawkeye: A PyTorch-based Library for Fine-Grained Image Recognition with Deep Learning. Jiabei He, Yang Shen, Xiu-Shen Wei, Ye Wu |
| 2023 | Hermes: Leveraging Implicit Inter-Frame Correlation for Bandwidth-Efficient Mobile Volumetric Video Streaming. Yizong Wang, Dong Zhao, Huanhuan Zhang, Chenghao Huang, Teng Gao, Zixuan Guo, Liming Pang, Huadong Ma |
| 2023 | Hi-SIGIR: Hierachical Semantic-Guided Image-to-image Retrieval via Scene Graph. Yulu Wang, Pengwen Dai, Xiaojun Jia, Zhitao Zeng, Rui Li, Xiaochun Cao |
| 2023 | Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023. Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song, Qing Wang, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu, Ya Jiang, Shi Cheng, Jie Zhang, Yuzhe Weng |
| 2023 | Hierarchical Category-Enhanced Prototype Learning for Imbalanced Temporal Recommendation. Xiyue Gao, Zhuoqi Ma, Jiangtao Cui, Xiaofang Xia, Cai Xu |
| 2023 | Hierarchical Dynamic Image Harmonization. Haoxing Chen, Zhangxuan Gu, Yaohui Li, Jun Lan, Changhua Meng, Weiqiang Wang, Huaxiong Li |
| 2023 | Hierarchical Masked 3D Diffusion Model for Video Outpainting. Fanda Fan, Chaoxu Guo, Litong Gong, Biao Wang, Tiezheng Ge, Yuning Jiang, Chunjie Luo, Jianfeng Zhan |
| 2023 | Hierarchical Prompt Learning Using CLIP for Multi-label Classification with Single Positive Labels. Ao Wang, Hui Chen, Zijia Lin, Zixuan Ding, Pengzhang Liu, Yongjun Bao, Weipeng Yan, Guiguang Ding |
| 2023 | Hierarchical Reasoning Network with Contrastive Learning for Few-Shot Human-Object Interaction Recognition. Jiale Yu, Baopeng Zhang, Qirui Li, Haoyang Chen, Zhu Teng |
| 2023 | Hierarchical Semantic Enhancement Network for Multimodal Fake News Detection. Qiang Zhang, Jiawei Liu, Fanrui Zhang, Jingyi Xie, Zheng-Jun Zha |
| 2023 | Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline. Zhigang Chang, Weitai Hu, Qing Yang, Shibao Zheng |
| 2023 | Hierarchical Visual Attribute Learning in the Wild. Kongming Liang, Xinran Wang, Haiwen Zhang, Zhanyu Ma, Jun Guo |
| 2023 | High Fidelity Face Swapping via Semantics Disentanglement and Structure Enhancement. Fengyuan Liu, Lingyun Yu, Hongtao Xie, Chuanbin Liu, Zhiguo Ding, Quanwei Yang, Yongdong Zhang |
| 2023 | High Visual-Fidelity Learned Video Compression. Meng Li, Yibo Shi, Jing Wang, Yunqi Huang |
| 2023 | High-Order Tensor Recovery Coupling Multilayer Subspace Priori with Application in Video Restoration. Hao Tan, Weichao Kong, Feng Zhang, Wenjin Qin, Jianjun Wang |
| 2023 | High-order Complementarity Induced Fast Multi-View Clustering with Enhanced Tensor Rank Minimization. Jintian Ji, Songhe Feng |
| 2023 | HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection. Zeyu Jin, Zixuan Wang, Qixin Wang, Jia Jia, Ye Bai, Yi Zhao, Hao Li, Xiaorui Wang |
| 2023 | HumVis: Human-Centric Visual Analysis System. Dongkai Wang, Shiliang Zhang, Yaowei Wang, Yonghong Tian, Tiejun Huang, Wen Gao |
| 2023 | Human-Object-Object Interaction: Towards Human-Centric Complex Interaction Detection. Mingxuan Zhang, Xiao Wu, Zhaoquan Yuan, Qi He, Xiang Huang |
| 2023 | Hybrid Interaction Temporal Knowledge Graph Embedding Based on Householder Transformations. Sensen Zhang, Xun Liang, Hui Tang, Zhenyu Guan |
| 2023 | HypLL: The Hyperbolic Learning Library. Max van Spengler, Philipp Wirth, Pascal Mettes |
| 2023 | Hypergraph-Enhanced Hashing for Unsupervised Cross-Modal Retrieval via Robust Similarity Guidance. Fangming Zhong, Chenglong Chu, Zijie Zhu, Zhikui Chen |
| 2023 | Hyperspectral Image Denoising with Spectrum Alignment. Jiahua Xiao, Yantao Ji, Xing Wei |
| 2023 | ICMH-Net: Neural Image Compression Towards both Machine Vision and Human Vision. Lei Liu, Zhihao Hu, Zhenghao Chen, Dong Xu |
| 2023 | IDDR-NGP: Incorporating Detectors for Distractors Removal with Instant Neural Radiance Field. Xianliang Huang, Jiajie Gou, Shuhang Chen, Zhizhou Zhong, Jihong Guan, Shuigeng Zhou |
| 2023 | IFS-SED: Incremental Few-Shot Sound Event Detection Using Explicit Learning and Calibration. Ming Feng, Kele Xu, Hengxing Cai |
| 2023 | IGG: Improved Graph Generation for Domain Adaptive Object Detection. Pengteng Li, Ying He, F. Richard Yu, Pinhao Song, Dongfu Yin, Guang Zhou |
| 2023 | IN/ACTive: A Distance-Technology-Mediated Stage for Performer-Audience Telepresence and Environmental Control. Ray LC, Sijia Liu, Qiaosheng Lyu |
| 2023 | IRCasTRF: Inverse Rendering by Optimizing Cascaded Tensorial Radiance Fields, Lighting, and Materials From Multi-view Images. Wenpeng Xing, Jie Chen, Ka Chun Cheung, Simon See |
| 2023 | IS2Net: Intra-domain Semantic and Inter-domain Style Enhancement for Semi-supervised Medical Domain Generalization. Shiao Xie, Ziwei Niu, Huimin Huang, Hao Sun, Rui Qin, Yen-Wei Chen, Lanfen Lin |
| 2023 | IXR '23: 2nd International Workshop on Interactive eXtended Reality. Irene Viola, Hadi Amirpour, Stephanie Arévalo Arboleda, Maria Torres Vega |
| 2023 | Implicit Decouple Network for Efficient Pose Estimation. Lei Zhao, Le Han, Min Yao, Nenggan Zheng |
| 2023 | Implicit Obstacle Map-driven Indoor Navigation Model for Robust Obstacle Avoidance. Wei Xie, Haobo Jiang, Shuo Gu, Jin Xie |
| 2023 | Improvements on SadTalker-based Approach for ViCo Conversational Head Generation Challenge. Wei Dai |
| 2023 | Improving Anomaly Segmentation with Multi-Granularity Cross-Domain Alignment. Ji Zhang, Xiao Wu, Zhi-Qi Cheng, Qi He, Wei Li |
| 2023 | Improving Cross-Modal Recipe Retrieval with Component-Aware Prompted CLIP Embedding. Xu Huang, Jin Liu, Zhizhong Zhang, Yuan Xie |
| 2023 | Improving Federated Person Re-Identification through Feature-Aware Proximity and Aggregation. Pengling Zhang, Huibin Yan, Wenhui Wu, Shuoyao Wang |
| 2023 | Improving Few-shot Image Generation by Structural Discrimination and Textural Modulation. Mengping Yang, Zhe Wang, Wenyi Feng, Qian Zhang, Ting Xiao |
| 2023 | Improving Human-Object Interaction Detection via Virtual Image Learning. Shuman Fang, Shuai Liu, Jie Li, Guannan Jiang, Xianming Lin, Rongrong Ji |
| 2023 | Improving Image Captioning through Visual and Semantic Mutual Promotion. Jing Zhang, Yingshuai Xie, Xiaoqiang Liu |
| 2023 | Improving Rumor Detection by Class-based Adversarial Domain Adaptation. Jingqiu Li, Lanjun Wang, Jianlin He, Yongdong Zhang, Anan Liu |
| 2023 | Improving Scene Graph Generation with Superpixel-Based Interaction Learning. Jingyi Wang, Can Zhang, Jinfa Huang, Botao Ren, Zhidong Deng |
| 2023 | Improving Semi-Supervised Semantic Segmentation with Dual-Level Siamese Structure Network. Zhibo Tian, Xiaolin Zhang, Peng Zhang, Kun Zhan |
| 2023 | Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts. Yunshi Lan, Xiang Li, Xin Liu, Yang Li, Wei Qin, Weining Qian |
| 2023 | Improving the Transferability of Adversarial Examples with Arbitrary Style Transfer. Zhijin Ge, Fanhua Shang, Hongying Liu, Yuanyuan Liu, Liang Wan, Wei Feng, Xiaosen Wang |
| 2023 | In-processing User Constrained Dominant Sets for User-Oriented Fairness in Recommender Systems. Zhongxuan Han, Chaochao Chen, Xiaolin Zheng, Weiming Liu, Jun Wang, Wenjie Cheng, Yuyuan Li |
| 2023 | Incomplete Multi-View Clustering with Regularized Hierarchical Graph. Shuping Zhao, Lunke Fei, Jie Wen, Bob Zhang, Pengyang Zhao |
| 2023 | Incorporating Domain Knowledge Graph into Multimodal Movie Genre Classification with Self-Supervised Attention and Contrastive Learning. Jiaqi Li, Guilin Qi, Chuanyi Zhang, Yongrui Chen, Yiming Tan, Chenlong Xia, Ye Tian |
| 2023 | Incremental Few Shot Semantic Segmentation via Class-agnostic Mask Proposal and Language-driven Classifier. Leo Shan, Wenzhang Zhou, Grace Zhao |
| 2023 | Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization. Tianyu Liu, Peng Zhang, Wei Huang, Yufei Zha, Tao You, Yanning Zhang |
| 2023 | Informative Classes Matter: Towards Unsupervised Domain Adaptive Nighttime Semantic Segmentation. Shiqin Wang, Xin Xu, Xianzheng Ma, Kui Jiang, Zheng Wang |
| 2023 | InspirNET: An Unsupervised Generative Adversarial Network with Controllable Fine-grained Texture Disentanglement for Fashion Generation. Han Yan, Haijun Zhang, Jie Hou, Jicong Fan, Zhao Zhang |
| 2023 | Integrating VideoMAE based model and Optical Flow for Micro- and Macro-expression Spotting. Ke Xu, Kang Chen, Licai Sun, Zheng Lian, Bin Liu, Gong Chen, Haiyang Sun, Mingyu Xu, Jianhua Tao |
| 2023 | Interactive Image Style Transfer Guided by Graffiti. Quan Wang, Yanli Ren, Xinpeng Zhang, Guorui Feng |
| 2023 | Interactive Interior Design Recommendation via Coarse-to-fine Multimodal Reinforcement Learning. He Zhang, Ying Sun, Weiyu Guo, Yafei Liu, Haonan Lu, Xiaodong Lin, Hui Xiong |
| 2023 | Internet of Video Things: Technical Challenges and Emerging Applications. Chang Wen Chen |
| 2023 | Interpolation Normalization for Contrast Domain Generalization. Mengzhu Wang, Junyang Chen, Huan Wang, Huisi Wu, Zhidan Liu, Qin Zhang |
| 2023 | Intra- and Inter-Modal Curriculum for Multimodal Learning. Yuwei Zhou, Xin Wang, Hong Chen, Xuguang Duan, Wenwu Zhu |
| 2023 | Invariant Meets Specific: A Scalable Harmful Memes Detection Framework. Chuanpeng Yang, Fuqing Zhu, Jizhong Han, Songlin Hu |
| 2023 | Invisible Video Watermark Method Based on Maximum Voting and Probabilistic Superposition. Kangshuai Guo, Zhijian Xu, Shichao Luo, Feigao Wei, Yan Wang, Yanru Zhang |
| 2023 | Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks. Jun Guo, Xingyu Zheng, Aishan Liu, Siyuan Liang, Yisong Xiao, Yichao Wu, Xianglong Liu |
| 2023 | Iterative Learning with Extra and Inner Knowledge for Long-tail Dynamic Scene Graph Generation. Yiming Li, Xiaoshan Yang, Changsheng Xu |
| 2023 | JAVP: Joint-Aware Video Processing with Edge-Cloud Collaboration for DNN Inference. Zheming Yang, Wen Ji, Qi Guo, Zhi Wang |
| 2023 | Joint Local Relational Augmentation and Global Nash Equilibrium for Federated Learning with Non-IID Data. Xinting Liao, Chaochao Chen, Weiming Liu, Pengyang Zhou, Huabin Zhu, Shuheng Shen, Weiqiang Wang, Mengling Hu, Yanchao Tan, Xiaolin Zheng |
| 2023 | Joint Searching and Grounding: Multi-Granularity Video Content Retrieval. Zhiguo Chen, Xun Jiang, Xing Xu, Zuo Cao, Yijun Mo, Heng Tao Shen |
| 2023 | Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation. Alexander Martin, Haitian Zheng, Jie An, Jiebo Luo |
| 2023 | Karma: Adaptive Video Streaming via Causal Sequence Modeling. Bowei Xu, Hao Chen, Zhan Ma |
| 2023 | Kernel Dimension Matters: To Activate Available Kernels for Real-time Video Super-Resolution. Shuo Jin, Meiqin Liu, Chao Yao, Chunyu Lin, Yao Zhao |
| 2023 | KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration. Xu Bao, Zhi-Qi Cheng, Jun-Yan He, Wangmeng Xiang, Chenyang Li, Jingdong Sun, Hanbing Liu, Wei Liu, Bin Luo, Yifeng Geng, Xuansong Xie |
| 2023 | Knowledge Decomposition and Replay: A Novel Cross-modal Image-Text Retrieval Continual Learning Method. Rui Yang, Shuang Wang, Huan Zhang, Siyuan Xu, Yanhe Guo, Xiutiao Ye, Biao Hou, Licheng Jiao |
| 2023 | Knowledge Prompt-tuning for Sequential Recommendation. Jianyang Zhai, Xiawu Zheng, Chang-Dong Wang, Hui Li, Yonghong Tian |
| 2023 | LDRM: Degradation Rectify Model for Low-light Imaging via Color-Monochrome Cameras. Junhong Lin, Shufan Pei, Bing Chen, Nanfeng Jiang, Wei Gao, Tiesong Zhao |
| 2023 | LGFat-RGCN: Faster Attention with Heterogeneous RGCN for Medical ICD Coding Generation. Zhenghan Chen, Changzeng Fu, Ruoxue Wu, Ye Wang, Xunzhu Tang, Xiaoxuan Liang |
| 2023 | LGM3A '23: 1st Workshop on Large Generative Models Meet Multimodal Applications. Zheng Wang, Cheng Long, Shihao Xu, Bingzheng Gan, Wei Shi, Zhao Cao, Tat-Seng Chua |
| 2023 | LGViT: Dynamic Early Exiting for Accelerating Vision Transformer. Guanyu Xu, Jiawei Hao, Li Shen, Han Hu, Yong Luo, Hui Lin, Jialie Shen |
| 2023 | LHAct: Rectifying Extremely Low and High Activations for Out-of-Distribution Detection. Yue Yuan, Rundong He, Zhongyi Han, Yilong Yin |
| 2023 | LHNet: A Low-cost Hybrid Network for Single Image Dehazing. Shenghai Yuan, Jijia Chen, Jiaqi Li, Wenchao Jiang, Song Guo |
| 2023 | LUNA: Language as Continuing Anchors for Referring Expression Comprehension. Yaoyuan Liang, Zhao Yang, Yansong Tang, Jiashuo Fan, Ziran Li, Jingang Wang, Philip H. S. Torr, Shao-Lun Huang |
| 2023 | LaDI-VTON: Latent Diffusion Textual-Inversion Enhanced Virtual Try-On. Davide Morelli, Alberto Baldrati, Giuseppe Cartella, Marcella Cornia, Marco Bertini, Rita Cucchiara |
| 2023 | LandmarkGait: Intrinsic Human Parsing for Gait Recognition. Zengbin Wang, Saihui Hou, Man Zhang, Xu Liu, Chunshui Cao, Yongzhen Huang, Shibiao Xu |
| 2023 | Language-Guided Visual Aggregation Network for Video Question Answering. Xiao Liang, Di Wang, Quan Wang, Bo Wan, Lingling An, Lihuo He |
| 2023 | Language-guided Human Motion Synthesis with Atomic Actions. Yuanhao Zhai, Mingzhen Huang, Tianyu Luan, Lu Dong, Ifeoma Nwogu, Siwei Lyu, David S. Doermann, Junsong Yuan |
| 2023 | Latent-space Unfolding for MRI Reconstruction. Jiawei Jiang, Yuchao Feng, Jiacheng Chen, Dongyan Guo, Jianwei Zheng |
| 2023 | Layout Sequence Prediction From Noisy Mobile Modality. Haichao Zhang, Yi Xu, Hongsheng Lu, Takayuki Shimizu, Yun Fu |
| 2023 | LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation. Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie, Tat-Seng Chua |
| 2023 | Learnable Graph Filter for Multi-view Clustering. Peng Zhou, Liang Du |
| 2023 | Learning Causality-inspired Representation Consistency for Video Anomaly Detection. Yang Liu, Zhaoyang Xia, Mengyang Zhao, Donglai Wei, Yuzheng Wang, Siao Liu, Bobo Ju, Gaoyun Fang, Jing Liu, Liang Song |
| 2023 | Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification. Shuanglin Yan, Neng Dong, Jun Liu, Liyan Zhang, Jinhui Tang |
| 2023 | Learning Discriminative Feature Representation for Open Set Action Recognition. Hongjie Zhang, Yi Liu, Yali Wang, Limin Wang, Yu Qiao |
| 2023 | Learning Dynamic Point Cloud Compression via Hierarchical Inter-frame Block Matching. Shuting Xia, Tingyu Fan, Yiling Xu, Jenq-Neng Hwang, Zhu Li |
| 2023 | Learning Event-Specific Localization Preferences for Audio-Visual Event Localization. Shiping Ge, Zhiwei Jiang, Yafeng Yin, Cong Wang, Zifeng Cheng, Qing Gu |
| 2023 | Learning Generalized Representations for Open-Set Temporal Action Localization. Junshan Hu, Liansheng Zhuang, Weisong Dong, Shiming Ge, Shafei Wang |
| 2023 | Learning High-frequency Feature Enhancement and Alignment for Pan-sharpening. Yingying Wang, Yunlong Lin, Ge Meng, Zhenqi Fu, Yuhang Dong, Linyu Fan, Hedeng Yu, Xinghao Ding, Yue Huang |
| 2023 | Learning Implicit Entity-object Relations by Bidirectional Generative Alignment for Multimodal NER. Feng Chen, Jiajia Liu, Kaixiang Ji, Wang Ren, Jian Wang, Jingdong Chen |
| 2023 | Learning Intra and Inter-Camera Invariance for Isolated Camera Supervised Person Re-identification. Menglin Wang, Xiaojin Gong |
| 2023 | Learning Non-Uniform-Sampling for Ultra-High-Definition Image Enhancement. Wei Yu, Qi Zhu, Naishan Zheng, Jie Huang, Man Zhou, Feng Zhao |
| 2023 | Learning Occlusion Disentanglement with Fine-grained Localization for Occluded Person Re-identification. Wenfeng Liu, Xudong Wang, Lei Tan, Yan Zhang, Pingyang Dai, Yongjian Wu, Rongrong Ji |
| 2023 | Learning Pixel-wise Alignment for Unsupervised Image Stitching. Qi Jia, Xiaomei Feng, Yu Liu, Xin Fan, Longin Jan Latecki |
| 2023 | Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning. Huiguo He, Tianfu Wang, Huan Yang, Jianlong Fu, Nicholas Jing Yuan, Jian Yin, Hongyang Chao, Qi Zhang |
| 2023 | Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. Yaya Shi, Haowei Liu, Haiyang Xu, Zongyang Ma, Qinghao Ye, Anwen Hu, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha |
| 2023 | Learning Shared Semantic Information from Multimodal Bio-signals for Brain-Muscle Modulation Analysis. Tian-Yu Xiang, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Hong-Jun Yang, Zhen-Qiu Feng, Mei-Jiang Gui, Hao Li, De-Xing Huang, Zeng-Guang Hou |
| 2023 | Learning Spectral-wise Correlation for Spectral Super-Resolution: Where Similarity Meets Particularity. Hongyuan Wang, Lizhi Wang, Chang Chen, Xue Hu, Fenglong Song, Hua Huang |
| 2023 | Learning Style-Invariant Robust Representation for Generalizable Visual Instance Retrieval. Tianyu Chang, Xun Yang, Xin Luo, Wei Ji, Meng Wang |
| 2023 | Learning a Graph Neural Network with Cross Modality Interaction for Image Fusion. Jiawei Li, Jiansheng Chen, Jinyuan Liu, Huimin Ma |
| 2023 | Learning and Evaluating Human Preferences for Conversational Head Generation. Mohan Zhou, Yalong Bai, Wei Zhang, Ting Yao, Tiejun Zhao, Tao Mei |
| 2023 | Learning from Easy to Hard Pairs: Multi-step Reasoning Network for Human-Object Interaction Detection. Yuchen Zhou, Guang Tan, Mengtang Li, Chao Gou |
| 2023 | Learning from More: Combating Uncertainty Cross-multidomain for Facial Expression Recognition. Hanwei Liu, Huiling Cai, Qingcheng Lin, Xuefeng Li, Hui Xiao |
| 2023 | Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation. Federico Betti, Jacopo Staiano, Lorenzo Baraldi, Lorenzo Baraldi, Rita Cucchiara, Nicu Sebe |
| 2023 | Leveraging the Latent Diffusion Models for Offline Facial Multiple Appropriate Reactions Generation. Jun Yu, Ji Zhao, Guochen Xie, Fengxin Chen, Ye Yu, Liang Peng, Minglei Li, Zonghong Dai |
| 2023 | LiFT: Transfer Learning in Vision-Language Models for Downstream Adaptation and Generalization. Jingzheng Li, Hailong Sun |
| 2023 | Lifelong Scene Text Recognizer via Expert Modules. Shifeng Xia, Lin Geng, Ningzhong Liu, Han Sun, Jie Qin |
| 2023 | Light-VQA: A Multi-Dimensional Quality Assessment Model for Low-Light Video Enhancement. Yunlong Dong, Xiaohong Liu, Yixuan Gao, Xunchu Zhou, Tao Tan, Guangtao Zhai |
| 2023 | Lightweight Super-Resolution Head for Human Pose Estimation. Haonan Wang, Jie Liu, Jie Tang, Gangshan Wu |
| 2023 | Limited-Reference Image Quality Assessment: Paradigms and Discussions. Keke Zhang |
| 2023 | Lite-MKD: A Multi-modal Knowledge Distillation Framework for Lightweight Few-shot Action Recognition. Baolong Liu, Tianyi Zheng, Peng Zheng, Daizong Liu, Xiaoye Qu, Junyu Gao, Jianfeng Dong, Xun Wang |
| 2023 | Little Strokes Fell Great Oaks: Boosting the Hierarchical Features for Multi-exposure Image Fusion. Pan Mu, Zhiying Du, Jinyuan Liu, Cong Bai |
| 2023 | LocLoc: Low-level Cues and Local-area Guides for Weakly Supervised Object Localization. Xinzi Cao, Xiawu Zheng, Yunhang Shen, Ke Li, Jie Chen, Yutong Lu, Yonghong Tian |
| 2023 | Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning. Linbo Wang, Jing Wu, Xianyong Fang, Zhengyi Liu, Chenjie Cao, Yanwei Fu |
| 2023 | LocalPose: Object Pose Estimation with Local Geometry Guidance. Yang Xiao, Bo Duan, Mingwei Sun, Jingwei Huang |
| 2023 | Localization-assisted Uncertainty Score Disentanglement Network for Action Quality Assessment. Yanli Ji, Lingfeng Ye, Huili Huang, Lijing Mao, Yang Zhou, Lingling Gao |
| 2023 | Localized and Balanced Efficient Incomplete Multi-view Clustering. Jie Wen, Gehui Xu, Chengliang Liu, Lunke Fei, Chao Huang, Wei Wang, Yong Xu |
| 2023 | Locate and Verify: A Two-Stream Network for Improved Deepfake Detection. Chao Shuai, Jieming Zhong, Shuang Wu, Feng Lin, Zhibo Wang, Zhongjie Ba, Zhenguang Liu, Lorenzo Cavallaro, Kui Ren |
| 2023 | Long Short-Term Graph Memory Against Class-imbalanced Over-smoothing. Liang Yang, Jiayi Wang, Tingting Zhang, Dongxiao He, Chuan Wang, Yuanfang Guo, Xiaochun Cao, Bingxin Niu, Zhen Wang |
| 2023 | M2ATS: A Real-world Multimodal Air Traffic Situation Benchmark Dataset and Beyond. Dongyue Guo, Yi Lin, Xuehang You, Zhongping Yang, Jizhe Zhou, Bo Yang, Jianwei Zhang, Han Shi, Shasha Hu, Zheng Zhang |
| 2023 | M3Net: Multi-view Encoding, Matching, and Fusion for Few-shot Fine-grained Action Recognition. Hao Tang, Jun Liu, Shuanglin Yan, Rui Yan, Zechao Li, Jinhui Tang |
| 2023 | M3R: Masked Token Mixup and Cross-Modal Reconstruction for Zero-Shot Learning. Peng Zhao, Qiangchang Wang, Yilong Yin |
| 2023 | MADiMa '23: 8th International Workshop on Multimedia Assisted Dietary Management. Stavroula G. Mougiakakou, Keiji Yanai, Dario Allegra |
| 2023 | MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition. Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao |
| 2023 | MAGIC-TBR: Multiview Attention Fusion for Transformer-based Bodily Behavior Recognition in Group Settings. Surbhi Madan, Rishabh Jain, Gulshan Sharma, Ramanathan Subramanian, Abhinav Dhall |
| 2023 | MATK: The Meme Analytical Tool Kit. Ming Shan Hee, Aditi Kumaresan, Nguyen-Khoi Hoang, Nirmalendu Prakash, Rui Cao, Roy Ka-Wei Lee |
| 2023 | MCG-MNER: A Multi-Granularity Cross-Modality Generative Framework for Multimodal NER with Instruction. Junjie Wu, Chen Gong, Ziqiang Cao, Guohong Fu |
| 2023 | MCUNeRF: Packing NeRF into an MCU with 1MB Memory. Zhixiang Ye, Qinghao Hu, Tianli Zhao, Wangping Zhou, Jian Cheng |
| 2023 | MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid. Zhuo Chen, Jiaoyan Chen, Wen Zhang, Lingbing Guo, Yin Fang, Yufeng Huang, Yichi Zhang, Yuxia Geng, Jeff Z. Pan, Wenting Song, Huajun Chen |
| 2023 | MEDIC: A Multimodal Empathy Dataset in Counseling. Zhouan Zhu, Chenguang Li, Jicai Pan, Xin Li, Yufei Xiao, Yanan Chang, Feiyi Zheng, Shangfei Wang |
| 2023 | MEGC2023: ACM Multimedia 2023 ME Grand Challenge. Adrian K. Davison, Jingting Li, Moi Hoon Yap, John See, Wen-Huang Cheng, Xiaobai Li, Xiaopeng Hong, Su-Jing Wang |
| 2023 | MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning. Zheng Lian, Haiyang Sun, Licai Sun, Kang Chen, Mingyu Xu, Kexin Wang, Ke Xu, Yu He, Ying Li, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao |
| 2023 | MFR-Net: Multi-faceted Responsive Listening Head Generation via Denoising Diffusion Model. Jin Liu, Xi Wang, Xiaomeng Fu, Yesheng Chai, Cai Yu, Jiao Dai, Jizhong Han |
| 2023 | MIEP: Channel Pruning with Multi-granular Importance Estimation for Object Detection. Liangwei Jiang, Jiaxin Chen, Di Huang, Yunhong Wang |
| 2023 | MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation. Jinpeng Wang, Ziyun Zeng, Yunxiao Wang, Yuting Wang, Xingyu Lu, Tianxiang Li, Jun Yuan, Rui Zhang, Hai-Tao Zheng, Shu-Tao Xia |
| 2023 | MLIC: Multi-Reference Entropy Model for Learned Image Compression. Wei Jiang, Jiayu Yang, Yongqi Zhai, Peirong Ning, Feng Gao, Ronggang Wang |
| 2023 | MM-AU: Towards Multimodal Understanding of Advertisement Videos. Digbalay Bose, Rajat Hebbar, Tiantian Feng, Krishna Somandepalli, Anfeng Xu, Shrikanth Narayanan |
| 2023 | MMSports '23: 6th International Workshop on Multimedia Content Analysis in Sports. Hideo Saito, Thomas B. Moeslund, Rainer Lienhart |
| 2023 | MORE: A Multimodal Object-Entity Relation Extraction Dataset with a Benchmark Evaluation. Liang He, Hongke Wang, Yongchang Cao, Zhen Wu, Jianbing Zhang, Xinyu Dai |
| 2023 | MRAC'23: 1st International Workshop on Multimodal and Responsible Affective Computing. Zheng Lian, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao |
| 2023 | MSECNet: Accurate and Robust Normal Estimation for 3D Point Clouds by Multi-Scale Edge Conditioning. Haoyi Xiu, Xin Liu, Weimin Wang, Kyoung-Sook Kim, Masashi Matsuoka |
| 2023 | MTSN: Multiscale Temporal Similarity Network for Temporal Action Localization. Xiaodong Jin, Taiping Zhang |
| 2023 | MUP: Multi-granularity Unified Perception for Panoramic Activity Recognition. Meiqi Cao, Rui Yan, Xiangbo Shu, Jiachao Zhang, Jinpeng Wang, Guo-Sen Xie |
| 2023 | MV-Diffusion: Motion-aware Video Diffusion Model. Zijun Deng, Xiangteng He, Yuxin Peng, Xiongwei Zhu, Lele Cheng |
| 2023 | MVCIR-net: Multi-view Clustering Information Reinforcement Network. Shaokui Gu, Xu Yuan, Liang Zhao, Zhenjiao Liu, Yan Hu, Zhikui Chen |
| 2023 | MVFlow: Deep Optical Flow Estimation of Compressed Videos with Motion Vector Prior. Shili Zhou, Xuhao Jiang, Weimin Tan, Ruian He, Bo Yan |
| 2023 | MaTCR: Modality-Aligned Thought Chain Reasoning for Multimodal Task-Oriented Dialogue Generation. Yiting Liu, Liang Li, Beichen Zhang, Shan Huang, Zheng-Jun Zha, Qingming Huang |
| 2023 | Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image. Liao Shen, Xingyi Li, Huiqiang Sun, Juewen Peng, Ke Xian, Zhiguo Cao, Guosheng Lin |
| 2023 | Making Users Indistinguishable: Attribute-wise Unlearning in Recommender Systems. Yuyuan Li, Chaochao Chen, Xiaolin Zheng, Yizhao Zhang, Zhongxuan Han, Dan Meng, Jun Wang |
| 2023 | Mamba: Bringing Multi-Dimensional ABR to WebRTC. Yueheng Li, Zicheng Zhang, Hao Chen, Zhan Ma |
| 2023 | Margin MCC: Chance-Robust Metric for Video Boundary Detection with Allowed Margin. Kosuke Mizufune, Shunsuke Tanaka, Toshihide Yukitake, Tatsushi Matsubayashi |
| 2023 | Mask Again: Masked Knowledge Distillation for Masked Video Modeling. Xiaojie Li, Shaowei He, Jianlong Wu, Yue Yu, Liqiang Nie, Min Zhang |
| 2023 | Mask to Reconstruct: Cooperative Semantics Completion for Video-text Retrieval. Han Fang, Zhifei Yang, Xianghao Zang, Chao Ban, Zhongjiang He, Hao Sun, Lanxiang Zhou |
| 2023 | Mask-Guided Progressive Network for Joint Raindrop and Rain Streak Removal in Videos. Hongtao Wu, Yijun Yang, Haoyu Chen, Jingjing Ren, Lei Zhu |
| 2023 | Masked Text Modeling: A Self-Supervised Pre-training Method for Scene Text Detection. Keran Wang, Hongtao Xie, Yuxin Wang, Dongming Zhang, Yadong Qu, Zuan Gao, Yongdong Zhang |
| 2023 | McGE '23: 1st International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice. Cheng Jin, Liang He, Mingli Song, Rui Wang |
| 2023 | MetaCast: A Self-Driven Metaverse Announcer Architecture Based on Quality of Experience Evaluation Model. Zhonghao Lin, Haihan Duan, Jiaye Li, Xinyao Sun, Wei Cai |
| 2023 | MetaFBP: Learning to Learn High-Order Predictor for Personalized Facial Beauty Prediction. Luojun Lin, Zhifeng Shen, Jia-Li Yin, Qipeng Liu, Yuanlong Yu, Weijie Chen |
| 2023 | Micro-Expression Spotting with Face Alignment and Optical Flow. Wenfeng Qin, Bochao Zou, Xin Li, Weiping Wang, Huimin Ma |
| 2023 | Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes. Chongyang Zhao, Yuankai Qi, Qi Wu |
| 2023 | MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion. Yizhuo Lu, Changde Du, Qiongyi Zhou, Dianpeng Wang, Huiguang He |
| 2023 | Mining High-quality Samples from Raw Data and Majority Voting Method for Multimodal Emotion Recognition. Qifei Li, Yingming Gao, Ya Li |
| 2023 | Mirror-NeRF: Learning Neural Radiance Fields for Mirrors with Whitted-Style Ray Tracing. Junyi Zeng, Chong Bao, Rui Chen, Zilong Dong, Guofeng Zhang, Hujun Bao, Zhaopeng Cui |
| 2023 | Mixture-of-Experts Learner for Single Long-Tailed Domain Generalization. Mengzhu Wang, Jianlong Yuan, Zhibin Wang |
| 2023 | Mixup-Augmented Temporally Debiased Video Grounding with Content-Location Disentanglement. Xin Wang, Zihao Wu, Hong Chen, Xiaohan Lan, Wenwu Zhu |
| 2023 | MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text. Junchen Zhu, Huan Yang, Wenjing Wang, Huiguo He, Zixi Tuo, Yongsheng Yu, Wen-Huang Cheng, Lianli Gao, Jingkuan Song, Jianlong Fu, Jiebo Luo |
| 2023 | Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the Edge. Jingzong Li, Yik Hong Cai, Libin Liu, Yu Mao, Chun Jason Xue, Hong Xu |
| 2023 | Modal-aware Bias Constrained Contrastive Learning for Multimodal Recommendation. Wei Yang, Zhengru Fang, Tianle Zhang, Shiguang Wu, Chi Lu |
| 2023 | Modal-aware Visual Prompting for Incomplete Multi-modal Brain Tumor Segmentation. Yansheng Qiu, Ziyuan Zhao, Hongdou Yao, Delin Chen, Zheng Wang |
| 2023 | Modality Profile - A New Critical Aspect to be Considered When Generating RGB-D Salient Object Detection Training Set. Xuehao Wang, Shuai Li, Chenglizhao Chen, Aimin Hao, Hong Qin |
| 2023 | Modality-agnostic Augmented Multi-Collaboration Representation for Semi-supervised Heterogenous Face Recognition. Decheng Liu, Weizhao Yang, Chunlei Peng, Nannan Wang, Ruimin Hu, Xinbo Gao |
| 2023 | Model Inversion Attack via Dynamic Memory Learning. Gege Qi, Yuefeng Chen, Xiaofeng Mao, Binyuan Hui, Xiaodan Li, Rong Zhang, Hui Xue |
| 2023 | Model-Contrastive Learning for Backdoor Elimination. Zhihao Yue, Jun Xia, Zhiwei Ling, Ming Hu, Ting Wang, Xian Wei, Mingsong Chen |
| 2023 | Modeling Multi-Relational Connectivity for Personalized Fashion Matching. Yujuan Ding, P. Y. Mok, Yi Bin, Xun Yang, Zhiyong Cheng |
| 2023 | Moiré Backdoor Attack (MBA): A Novel Trigger for Pedestrian Detectors in the Physical World. Hui Wei, Hanxun Yu, Kewei Zhang, Zhixiang Wang, Jianke Zhu, Zheng Wang |
| 2023 | Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning. Wenrui Li, Xi-Le Zhao, Zhengyu Ma, Xingtao Wang, Xiaopeng Fan, Yonghong Tian |
| 2023 | MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images. Junchen Zhu, Huan Yang, Huiguo He, Wenjing Wang, Zixi Tuo, Wen-Huang Cheng, Lianli Gao, Jingkuan Song, Jianlong Fu |
| 2023 | MuSe 2023 Challenge: Multimodal Prediction of Mimicked Emotions, Cross-Cultural Humour, and Personalised Recognition of Affects. Shahin Amiriparian, Lukas Christ, Andreas König, Alan Cowen, Eva-Maria Meßner, Erik Cambria, Björn W. Schuller |
| 2023 | Multi-Domain Lifelong Visual Question Answering via Self-Critical Distillation. Mingrui Lao, Nan Pu, Yu Liu, Zhun Zhong, Erwin M. Bakker, Nicu Sebe, Michael S. Lew |
| 2023 | Multi-Frame Self-Supervised Depth Estimation with Multi-Scale Feature Fusion in Dynamic Scenes. Jiquan Zhong, Xiaolin Huang, Xiao Yu |
| 2023 | Multi-Granularity Interactive Transformer Hashing for Cross-modal Retrieval. Yishu Liu, Qingpeng Wu, Zheng Zhang, Jingyi Zhang, Guangming Lu |
| 2023 | Multi-Layer Acoustic & Linguistic Feature Fusion for ComParE-23 Emotion and Requests Challenge. Siddhant R. Viksit, Vinayak Abrol |
| 2023 | Multi-Modal and Multi-Scale Temporal Fusion Architecture Search for Audio-Visual Video Parsing. Jiayi Zhang, Weixin Li |
| 2023 | Multi-Part Token Transformer with Dual Contrastive Learning for Fine-grained Image Classification. Chuanming Wang, Huiyuan Fu, Huadong Ma |
| 2023 | Multi-Scale Similarity Aggregation for Dynamic Metric Learning. Dingyi Zhang, Yingming Li, Zhongfei Zhang |
| 2023 | Multi-Spectral Image Stitching via Spatial Graph Reasoning. Zhiying Jiang, Zengxi Zhang, Jinyuan Liu, Xin Fan, Risheng Liu |
| 2023 | Multi-Speed Global Contextual Subspace Matching for Few-Shot Action Recognition. Tianwei Yu, Peng Chen, Yuanjie Dang, Ruohong Huan, Ronghua Liang |
| 2023 | Multi-View Graph Convolutional Network for Multimedia Recommendation. Penghang Yu, Zhiyi Tan, Guanming Lu, Bing-Kun Bao |
| 2023 | Multi-View Representation Learning via View-Aware Modulation. Ren Wang, Haoliang Sun, Xiushan Nie, Yuxiu Lin, Xiaoming Xi, Yilong Yin |
| 2023 | Multi-label Emotion Analysis in Conversation via Multimodal Knowledge Distillation. Sidharth Anand, Naresh Kumar Devulapally, Sreyasee Das Bhattacharjee, Junsong Yuan |
| 2023 | Multi-modal Social Bot Detection: Learning Homophilic and Heterophilic Connections Adaptively. Shilong Li, Boyu Qiao, Kun Li, Qianqian Lu, Meng Lin, Wei Zhou |
| 2023 | Multi-scale Conformer Fusion Network for Multi-participant Behavior Analysis. Qiya Song, Renwei Dian, Bin Sun, Jie Xie, Shutao Li |
| 2023 | Multi-scale Spatial-Spectral Attention Guided Fusion Network for Pansharpening. Yong Yang, Mengzhen Li, Shuying Huang, Hangyuan Lu, Wei Tu, Weiguo Wan |
| 2023 | Multi-scale Target-Aware Framework for Constrained Splicing Detection and Localization. Yuxuan Tan, Yuanman Li, Limin Zeng, Jiaxiong Ye, Wei Wang, Xia Li |
| 2023 | Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture Recognition. Yujun Ma, Benjia Zhou, Ruili Wang, Pichao Wang |
| 2023 | Multi-teacher Self-training for Semi-supervised Node Classification with Noisy Labels. Yujing Liu, Zongqian Wu, Zhengyu Lu, Guoqiu Wen, Junbo Ma, Guangquan Lu, Xiaofeng Zhu |
| 2023 | Multi-view Graph Clustering via Efficient Global-Local Spectral Embedding Fusion. Penglei Wang, Danyang Wu, Rong Wang, Feiping Nie |
| 2023 | Multi-view Self-Expressive Subspace Clustering Network. Jinrong Cui, Yuting Li, Yulu Fu, Jie Wen |
| 2023 | MultiMediate '23: Engagement Estimation and Bodily Behaviour Recognition in Social Interactions. Philipp Müller, Michal Balazia, Tobias Baur, Michael Dietz, Alexander Heimerl, Dominik Schiller, Mohammed Guermal, Dominike Thomas, François Brémond, Jan Alexandersson, Elisabeth André, Andreas Bulling |
| 2023 | MultiMediate 2023: Engagement Level Detection using Audio and Video Features. Chunxi Yang, Kangzhong Wang, Peter Q. Chen, MK Michael Cheung, Youqian Zhang, Eugene Yujun Fu, Grace Ngai |
| 2023 | Multimodal AI & LLMs for Peacekeeping and Emergency Response. Alejandro Jaimes |
| 2023 | Multimodal Adaptive Emotion Transformer with Flexible Modality Inputs on A Novel Dataset with Continuous Labels. Wei-Bang Jiang, Xuan-Hao Liu, Wei-Long Zheng, Bao-Liang Lu |
| 2023 | Multimodal Color Recommendation in Vector Graphic Documents. Qianru Qiu, Xueting Wang, Mayu Otani |
| 2023 | Multimodal Emotion Interaction and Visualization Platform. Zheng Zhang, Songling Chen, Mixiao Hou, Guangming Lu |
| 2023 | Multimodal Emotion Recognition in Noisy Environment Based on Progressive Label Revision. Sunan Li, Hailun Lian, Cheng Lu, Yan Zhao, Chuangao Tang, Yuan Zong, Wenming Zheng |
| 2023 | Multimodal Physiological Signals Fusion for Online Emotion Recognition. Tongjie Pan, Yalan Ye, Hecheng Cai, Shudong Huang, Yang Yang, Guoqing Wang |
| 2023 | Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation. Shihao Zou, Xianying Huang, Xudong Shen |
| 2023 | Multispectral Object Detection via Cross-Modal Conflict-Aware Learning. Xiao He, Chang Tang, Xin Zou, Wei Zhang |
| 2023 | Mutual Information-driven Triple Interaction Network for Efficient Image Dehazing. Hao Shen, Zhong-Qiu Zhao, Yulun Zhang, Zhao Zhang |
| 2023 | Mutual-Guided Dynamic Network for Image Fusion. Yuanshen Guan, Ruikang Xu, Mingde Yao, Lizhi Wang, Zhiwei Xiong |
| 2023 | My Brother Helps Me: Node Injection Based Adversarial Attack on Social Bot Detection. Lanjun Wang, Xinran Qiao, Yanwei Xie, Weizhi Nie, Yongdong Zhang, Anan Liu |
| 2023 | NIF: A Fast Implicit Image Compression with Bottleneck Layers and Modulated Sinusoidal Activations. Lorenzo Catania, Dario Allegra |
| 2023 | NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos. Ziyu Yang, Sucheng Ren, Zongwei Wu, Nanxuan Zhao, Junle Wang, Jing Qin, Shengfeng He |
| 2023 | NarSUM '23: The 2nd Workshop on User-Centric Narrative Summarization of Long Videos. Mohan S. Kankanhalli, Ioannis (Yiannis) Patras, Jianquan Liu, Yongkang Wong, Takahiro Komamizu, Satoshi Yamazaki, Karen Stephen, Kajal Kansal |
| 2023 | Neural Image Popularity Assessment with Retrieval-augmented Transformer. Liya Ji, Chan Ho Park, Zhefan Rao, Qifeng Chen |
| 2023 | Neural Video Compression with Spatio-Temporal Cross-Covariance Transformers. Zhenghao Chen, Lucas Relic, Roberto Azevedo, Yang Zhang, Markus Gross, Dong Xu, Luping Zhou, Christopher Schroers |
| 2023 | NightHazeFormer: Single Nighttime Haze Removal Using Prior Query Transformer. Yun Liu, Zhongsheng Yan, Sixiang Chen, Tian Ye, Wenqi Ren, Erkang Chen |
| 2023 | Noise-Robust Continual Test-Time Domain Adaptation. Zhiqi Yu, Jingjing Li, Zhekai Du, Fengling Li, Lei Zhu, Yang Yang |
| 2023 | Non-Exemplar Class-Incremental Learning via Adaptive Old Class Reconstruction. Shaokun Wang, Weiwei Shi, Yuhang He, Yifan Yu, Yihong Gong |
| 2023 | Non-Local Geometry and Color Gradient Aggregation Graph Model for No-Reference Point Cloud Quality Assessment. Songtao Wang, Xiaoqi Wang, Hao Gao, Jian Xiong |
| 2023 | Normality Learning-based Graph Anomaly Detection via Multi-Scale Contrastive Learning. Jingcan Duan, Pei Zhang, Siwei Wang, Jingtao Hu, Hu Jin, Jiaxin Zhang, Haifang Zhou, Xinwang Liu |
| 2023 | Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator. Jing Zhao, Heliang Zheng, Chaoyue Wang, Long Lan, Wanrong Huang, Wenjing Yang |
| 2023 | OCSKB: An Object Component Sketch Knowledge Base for Fast 6D Pose Estimation. Guangming Shi, Xuyang Li, Xuemei Xie, Mingxuan Yu, Chengwei Rao, Jiakai Luo |
| 2023 | Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection. Bingqing Zhang, Sen Wang, Yifan Liu, Brano Kusy, Xue Li, Jiajun Liu |
| 2023 | Object Part Parsing with Hierarchical Dual Transformer. Jiamin Chen, Jianlou Si, Naihao Liu, Yao Wu, Li Niu, Chen Qian |
| 2023 | Object Segmentation by Mining Cross-Modal Semantics. Zongwei Wu, Jingjing Wang, Zhuyun Zhou, Zhaochong An, Qiuping Jiang, Cédric Demonceaux, Guolei Sun, Radu Timofte |
| 2023 | OccluBEV: Occlusion Aware Spatiotemporal Modeling for Multi-view 3D Object Detection. Ziteng Wen, Hai Xu, Chenyu Liu, Tao Guo, Jinshui Hu, Xuming He, Fengren Wang, Shun Lou, Haibo Fan |
| 2023 | Occluded Skeleton-Based Human Action Recognition with Dual Inhibition Training. Zhenjie Chen, Hongsong Wang, Jie Gui |
| 2023 | On Physically Occluded Fake Identity Document Detection. Haoyue Wang, Sheng Li, Silu Cao, Rui Yang, Jishen Zeng, Zhenxing Qian, Xinpeng Zhang |
| 2023 | On Regularizing Multiple Clusterings for Ensemble Clustering by Graph Tensor Learning. Man-Sheng Chen, Jia-Qi Lin, Chang-Dong Wang, Wudong Xi, Dong Huang |
| 2023 | On the Impact of Interactive eXtended Reality: Challenges and Opportunities for Multimedia Research. Irene Viola, Maria Torres Vega |
| 2023 | On the Importance of Spatial Relations for Few-shot Action Recognition. Yilun Zhang, Yuqian Fu, Xingjun Ma, Lizhe Qi, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang |
| 2023 | On the Performance of Subjective Visual Quality Assessment Protocols for Nearly Visually Lossless Image Compression. Michela Testolina, Davi Lazzarotto, Rafael Rodrigues, Shima Mohammadi, João Ascenso, António M. G. Pinheiro, Touradj Ebrahimi |
| 2023 | One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer. Hang Guo, Tao Dai, Mingyan Zhu, Guanghao Meng, Bin Chen, Zhi Wang, Shu-Tao Xia |
| 2023 | Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation. Wei Ji, Xiangyan Liu, An Zhang, Yinwei Wei, Yongxin Ni, Xiang Wang |
| 2023 | Open-RoadAtlas: Leveraging VLMs for Road Condition Survey with Real-Time Mobile Auditing. Djamahl Etchegaray, Yadan Luo, Zachary FitzChance, Anthony Southon, Jinjiang Zhong |
| 2023 | Open-Scenario Domain Adaptive Object Detection in Autonomous Driving. Zeyu Ma, Ziqiang Zheng, Jiwei Wei, Xiaoyong Wei, Yang Yang, Heng Tao Shen |
| 2023 | Open-Vocabulary Object Detection via Scene Graph Discovery. Hengcan Shi, Munawar Hayat, Jianfei Cai |
| 2023 | OpenDMC: An Open-Source Library and Performance Evaluation for Deep-learning-based Multi-frame Compression. Wei Gao, Shangkun Sun, Huiming Zheng, Yuyang Wu, Hua Ye, Yongchi Zhang |
| 2023 | OpenFastVC: An Open Source Library for Video Coding Fast Algorithm Implementation. Hang Yuan, Wei Gao |
| 2023 | Optimizing Adaptive Video Streaming with Human Feedback. Tianchi Huang, Rui-Xiao Zhang, Chenglei Wu, Lifeng Sun |
| 2023 | OraclePoints: A Hybrid Neural Representation for Oracle Character. Runhua Jiang, Yongge Liu, Boyuan Zhang, Xu Chen, Deng Li, Yahong Han |
| 2023 | Orthogonal Temporal Interpolation for Zero-Shot Video Recognition. Yan Zhu, Junbao Zhuo, Bin Ma, Jiajia Geng, Xiaoming Wei, Xiaolin Wei, Shuhui Wang |
| 2023 | Orthogonal Uncertainty Representation of Data Manifold for Robust Long-Tailed Learning. Yanbiao Ma, Licheng Jiao, Fang Liu, Shuyuan Yang, Xu Liu, Lingling Li |
| 2023 | P2I-NET: Mapping Camera Pose to Image via Adversarial Learning for New View Synthesis in Real Indoor Environments. Xujie Kang, Kanglin Liu, Jiang Duan, Yuanhao Gong, Guoping Qiu |
| 2023 | PAIF: Perception-Aware Infrared-Visible Image Fusion for Attack-Tolerant Semantic Segmentation. Zhu Liu, Jinyuan Liu, Benzhuang Zhang, Long Ma, Xin Fan, Risheng Liu |
| 2023 | PBFormer: Capturing Complex Scene Text Shape with Polynomial Band Transformer. Ruijin Liu, Ning Lu, Dapeng Chen, Cheng Li, Zejian Yuan, Wei Peng |
| 2023 | PDE-based Progressive Prediction Framework for Attribute Compression of 3D Point Clouds. Xiaodong Yang, Yiting Shao, Shan Liu, Thomas H. Li, Ge Li |
| 2023 | PEARL: Preprocessing Enhanced Adversarial Robust Learning of Image Deraining for Semantic Segmentation. Xianghao Jiao, Yaohua Liu, Jiaxin Gao, Xinyuan Chu, Xin Fan, Risheng Liu |
| 2023 | PI-NeRF: A Partial-Invertible Neural Radiance Fields for Pose Estimation. Zhihao Li, Kexue Fu, Haoran Wang, Manning Wang |
| 2023 | PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao |
| 2023 | PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions. Wenjie Xuan, Shanshan Zhao, Yu Yao, Juhua Liu, Tongliang Liu, Yixin Chen, Bo Du, Dacheng Tao |
| 2023 | POAR: Towards Open Vocabulary Pedestrian Attribute Recognition. Yue Zhang, Suchen Wang, Shichao Kan, Zhenyu Weng, Yigang Cen, Yap-Peng Tan |
| 2023 | POV: Prompt-Oriented View-Agnostic Learning for Egocentric Hand-Object Interaction in the Multi-view World. Boshen Xu, Sipeng Zheng, Qin Jin |
| 2023 | PSNEA: Pseudo-Siamese Network for Entity Alignment between Multi-modal Knowledge Graphs. Wenxin Ni, Qianqian Xu, Yangbangyan Jiang, Zongsheng Cao, Xiaochun Cao, Qingming Huang |
| 2023 | PVG: Progressive Vision Graph for Vision Recognition. Jiafu Wu, Jian Li, Jiangning Zhang, Boshen Zhang, Mingmin Chi, Yabiao Wang, Chengjie Wang |
| 2023 | Pagoda: Privacy Protection for Volumetric Video Streaming through Poisson Diffusion Model. Rui Lu, Lai Wei, Shuntao Zhu, Chuang Hu, Dan Wang |
| 2023 | Painterly Image Harmonization using Diffusion Model. Lingxiao Lu, Jiangtong Li, Junyan Cao, Li Niu, Liqing Zhang |
| 2023 | Panel: Multimodal Large Foundation Models. Mohan S. Kankanhalli, Marcel Worring |
| 2023 | Parameter Exchange for Robust Dynamic Domain Generalization. Luojun Lin, Zhifeng Shen, Zhishu Sun, Yuanlong Yu, Lei Zhang, Weijie Chen |
| 2023 | Parameter-Efficient Transfer Learning for Audio-Visual-Language Tasks. Hongye Liu, Xianhai Xie, Yang Gao, Zhou Yu |
| 2023 | Pareto Invariant Representation Learning for Multimedia Recommendation. Shanshan Huang, Haoxuan Li, Qingsong Li, Chunyuan Zheng, Li Liu |
| 2023 | ParliRobo: Participant Lightweight AI Robots for Massively Multiplayer Online Games (MMOGs). Jianwei Zheng, Changnan Xiao, Mingliang Li, Zhenhua Li, Feng Qian, Wei Liu, Xudong Wu |
| 2023 | Parsing is All You Need for Accurate Gait Recognition in the Wild. Jinkai Zheng, Xinchen Liu, Shuai Wang, Lihao Wang, Chenggang Yan, Wu Liu |
| 2023 | Partial Annotation-based Video Moment Retrieval via Iterative Learning. Wei Ji, Renjie Liang, Lizi Liao, Hao Fei, Fuli Feng |
| 2023 | Partitioned Saliency Ranking with Dense Pyramid Transformers. Chengxiao Sun, Yan Xu, Jialun Pei, Haopeng Fang, He Tang |
| 2023 | Patch-Aware Representation Learning for Facial Expression Recognition. Yi Wu, Shangfei Wang, Yanan Chang |
| 2023 | PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification. Yizhen Yuan, Rui Kong, Shenghao Xie, Yuanchun Li, Yunxin Liu |
| 2023 | Patchmatch Stereo++: Patchmatch Binocular Stereo with Continuous Disparity Optimization. Wenjia Ren, Qingmin Liao, Zhijing Shao, Xiangru Lin, Xin Yue, Yu Zhang, Zongqing Lu |
| 2023 | Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval. Fei Shen, Xiangbo Shu, Xiaoyu Du, Jinhui Tang |
| 2023 | Peering into The Sketch: Ultra-Low Bitrate Face Compression for Joint Human and Machine Perception. Yudong Mao, Peilin Chen, Shurun Wang, Shiqi Wang, Dapeng Wu |
| 2023 | Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene Text Detector. Yan Shu, Wei Wang, Yu Zhou, Shaohui Liu, Aoting Zhang, Dongbao Yang, Weiping Wang |
| 2023 | Personalized Behavior-Aware Transformer for Multi-Behavior Sequential Recommendation. Jiajie Su, Chaochao Chen, Zibin Lin, Xi Li, Weiming Liu, Xiaolin Zheng |
| 2023 | Personalized Content Recommender System via Non-verbal Interaction Using Face Mesh and Facial Expression. Yuya Moroto, Rintaro Yanagi, Naoki Ogawa, Kyohei Kamikawa, Keigo Sakurai, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama |
| 2023 | Personalized Image Aesthetics Assessment with Attribute-guided Fine-grained Feature Representation. Hancheng Zhu, Zhiwen Shao, Yong Zhou, Guangcheng Wang, Pengfei Chen, Leida Li |
| 2023 | Personalized Single Image Reflection Removal Network through Adaptive Cascade Refinement. Mengyi Wang, Xinxin Zhang, Yongshun Gong, Yilong Yin |
| 2023 | PetalView: Fine-grained Location and Orientation Extraction of Street-view Images via Cross-view Local Search. Wenmiao Hu, Yichen Zhang, Yuxuan Liang, Xianjing Han, Yifang Yin, Hannes Kruppa, See-Kiong Ng, Roger Zimmermann |
| 2023 | Physical Invisible Backdoor Based on Camera Imaging. Yusheng Guo, Nan Zhong, Zhenxing Qian, Xinpeng Zhang |
| 2023 | Physics-Based Adversarial Attack on Near-Infrared Human Detector for Nighttime Surveillance Camera Systems. Muyao Niu, Zhuoxiao Li, Yifan Zhan, Huy H. Nguyen, Isao Echizen, Yinqiang Zheng |
| 2023 | PiPa: Pixel- and Patch-wise Self-supervised Learning for Domain Adaptative Semantic Segmentation. Mu Chen, Zhedong Zheng, Yi Yang, Tat-Seng Chua |
| 2023 | Pixel Adapter: A Graph-Based Post-Processing Approach for Scene Text Image Super-Resolution. Wenyu Zhang, Xin Deng, Baojun Jia, Xingtong Yu, Yifan Chen, Jin Ma, Qing Ding, Xinming Zhang |
| 2023 | PixelFace+: Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks. Xiaoxiong Du, Jun Peng, Yiyi Zhou, Jinlu Zhang, Siting Chen, Guannan Jiang, Xiaoshuai Sun, Rongrong Ji |
| 2023 | PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation. Hanbing Liu, Jun-Yan He, Zhi-Qi Cheng, Wangmeng Xiang, Qize Yang, Wenhao Chai, Gaoang Wang, Xu Bao, Bin Luo, Yifeng Geng, Xuansong Xie |
| 2023 | Point-aware Interaction and CNN-induced Refinement Network for RGB-D Salient Object Detection. Runmin Cong, Hongyu Liu, Chen Zhang, Wei Zhang, Feng Zheng, Ran Song, Sam Kwong |
| 2023 | PointCRT: Detecting Backdoor in 3D Point Cloud via Corruption Robustness. Shengshan Hu, Wei Liu, Minghui Li, Yechao Zhang, Xiaogeng Liu, Xianlong Wang, Leo Yu Zhang, Junhui Hou |
| 2023 | Points-to-3D: Bridging the Gap between Sparse Points and Shape-Controllable Text-to-3D Generation. Chaohui Yu, Qiang Zhou, Jingliang Li, Zhe Zhang, Zhibin Wang, Fan Wang |
| 2023 | Practical Deep Dispersed Watermarking with Synchronization and Fusion. Hengchang Guo, Qilong Zhang, Junwei Luo, Feng Guo, Wenbin Zhang, Xiaodong Su, Minglei Li |
| 2023 | Practical Edge Detection via Robust Collaborative Learning. Yuanbin Fu, Xiaojie Guo |
| 2023 | Precise Target-Oriented Attack against Deep Hashing-based Retrieval. Wenshuo Zhao, Jingkuan Song, Shengming Yuan, Lianli Gao, Yang Yang, Hengtao Shen |
| 2023 | Predictive Sampling for Efficient Pairwise Subjective Image Quality Assessment. Shima Mohammadi, João Ascenso |
| 2023 | Preserving Local and Global Information: An Effective Metric-based Subspace Clustering. Yixi Liu, Yuze Tan, Hongjie Wu, Shudong Huang, Yazhou Ren, Jiancheng Lv |
| 2023 | Pretrained Implicit-Ensemble Transformer for Open-Set Authentication on Multimodal Mobile Biometrics. Jaeho Yoon, Jaewoo Park, Kensuke Wagata, Hojin Park, Andrew Beng Jin Teoh |
| 2023 | Prior Knowledge-driven Dynamic Scene Graph Generation with Causal Inference. Jiale Lu, Lianggangxu Chen, Youqi Song, Shaohui Lin, Changbo Wang, Gaoqi He |
| 2023 | Prior-Guided Accuracy-Bias Tradeoff Learning for CTR Prediction in Multimedia Recommendation. Dugang Liu, Yang Qiao, Xing Tang, Liang Chen, Xiuqiang He, Zhong Ming |
| 2023 | Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection. Rui Cao, Ming Shan Hee, Adriel Kuek, Wen-Haw Chong, Roy Ka-Wei Lee, Jing Jiang |
| 2023 | ProTegO: Protect Text Content against OCR Extraction Attack. Yanru He, Kejiang Chen, Guoqiang Chen, Zehua Ma, Kui Zhang, Jie Zhang, Huanyu Bian, Han Fang, Weiming Zhang, Nenghai Yu |
| 2023 | Probability Distribution Based Frame-supervised Language-driven Action Localization. Shuo Yang, Zirui Shang, Xinxiao Wu |
| 2023 | Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023 Abdulmotaleb El Saddik, Tao Mei, Rita Cucchiara, Marco Bertini, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, M. Shamim Hossain |
| 2023 | Progressive Domain-style Translation for Nighttime Tracking. Jinpu Zhang, Ziwen Li, Ruonan Wei, Yuehuan Wang |
| 2023 | Progressive Positive Association Framework for Image and Text Retrieval. Wenhui Li, Yan Wang, Yuting Su, Lanjun Wang, Weizhi Nie, An-An Liu |
| 2023 | Progressive Spatio-temporal Perception for Audio-Visual Question Answering. Guangyao Li, Wenxuan Hou, Di Hu |
| 2023 | Progressive Visual Content Understanding Network for Image Emotion Classification. Jicai Pan, Shangfei Wang |
| 2023 | Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction. Xuming Hu, Junzhe Chen, Aiwei Liu, Shiao Meng, Lijie Wen, Philip S. Yu |
| 2023 | PromptMTopic: Unsupervised Multimodal Topic Modeling of Memes using Large Language Models. Nirmalendu Prakash, Han Wang, Nguyen-Khoi Hoang, Ming Shan Hee, Roy Ka-Wei Lee |
| 2023 | Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning. Jiahang Zhang, Lilang Lin, Jiaying Liu |
| 2023 | Propagation is All You Need: A New Framework for Representation Learning and Classifier Training on Graphs. Jiaming Zhuo, Can Cui, Kun Fu, Bingxin Niu, Dongxiao He, Yuanfang Guo, Zhen Wang, Chuan Wang, Xiaochun Cao, Liang Yang |
| 2023 | ProtoHPE: Prototype-guided High-frequency Patch Enhancement for Visible-Infrared Person Re-identification. Guiwei Zhang, Yongfei Zhang, Zichang Tan |
| 2023 | Prototype-guided Cross-modal Completion and Alignment for Incomplete Text-based Person Re-identification. Tiantian Gong, Guodong Du, Junsheng Wang, Yongkang Ding, Liyan Zhang |
| 2023 | Prototype-guided Knowledge Transfer for Federated Unsupervised Cross-modal Hashing. Jingzhi Li, Fengling Li, Lei Zhu, Hui Cui, Jingjing Li |
| 2023 | Prototypical Cross-domain Knowledge Transfer for Cervical Dysplasia Visual Inspection. Yichen Zhang, Yifang Yin, Ying Zhang, Zhenguang Liu, Zheng Wang, Roger Zimmermann |
| 2023 | Pseudo Object Replay and Mining for Incremental Object Detection. Dongbao Yang, Yu Zhou, Xiaopeng Hong, Aoting Zhang, Xin Wei, Linchengxi Zeng, Zhi Qiao, Weiping Wang |
| 2023 | QA-CLIMS: Question-Answer Cross Language Image Matching for Weakly Supervised Semantic Segmentation. Songhe Deng, Wei Zhuo, Jinheng Xie, Linlin Shen |
| 2023 | Quality-Aware RGBT Tracking via Supervised Reliability Learning and Weighted Residual Guidance. Lei Liu, Chenglong Li, Yun Xiao, Jin Tang |
| 2023 | Query-aware Long Video Localization and Relation Discrimination for Deep Video Understanding. Yuanxing Xu, Yuting Wei, Bin Wu |
| 2023 | RAHNet: Retrieval Augmented Hybrid Network for Long-tailed Graph Classification. Zhengyang Mao, Wei Ju, Yifang Qin, Xiao Luo, Ming Zhang |
| 2023 | RAIRNet: Region-Aware Identity Rectification for Face Forgery Detection. Mingqi Fang, Lingyun Yu, Hongtao Xie, Junqiang Wu, Zezheng Wang, Jiahong Li, Yongdong Zhang |
| 2023 | RAMM: Retrieval-augmented Biomedical Visual Question Answering with Multi-modal Pre-training. Zheng Yuan, Qiao Jin, Chuanqi Tan, Zhengyun Zhao, Hongyi Yuan, Fei Huang, Songfang Huang |
| 2023 | RD-FGFS: A Rule-Data Hybrid Framework for Fine-Grained Footstep Sound Synthesis from Visual Guidance. Qiutang Qi, Haonan Cheng, Yang Wang, Long Ye, Shaobin Li |
| 2023 | REACT2023: The First Multiple Appropriate Facial Reaction Generation Challenge. Siyang Song, Micol Spitale, Cheng Luo, Germán Barquero, Cristina Palmero, Sergio Escalera, Michel F. Valstar, Tobias Baur, Fabien Ringeval, Elisabeth André, Hatice Gunes |
| 2023 | ROAD: Robust Unsupervised Domain Adaptation with Noisy Labels. Yanglin Feng, Hongyuan Zhu, Dezhong Peng, Xi Peng, Peng Hu |
| 2023 | RTQ: Rethinking Video-language Understanding Based on Image-text Model. Xiao Wang, Yaoyu Li, Tian Gan, Zheng Zhang, Jingjing Lv, Liqiang Nie |
| 2023 | ReCo: A Dataset for Residential Community Layout Planning. Xi Chen, Yun Xiong, Siqi Wang, Haofen Wang, Tao Sheng, Yao Zhang, Yu Ye |
| 2023 | Read Ten Lines at One Glance: Line-Aware Semi-Autoregressive Transformer for Multi-Line Handwritten Mathematical Expression Recognition. Wentao Yang, Zhe Li, Dezhi Peng, Lianwen Jin, Mengchao He, Cong Yao |
| 2023 | Real-time Facial Animation for 3D Stylized Character with Emotion Dynamics. Ye Pan, Ruisi Zhang, Jingying Wang, Yu Ding, Kenny Mitchell |
| 2023 | Real20M: A Large-scale E-commerce Dataset for Cross-domain Retrieval. Yanzhe Chen, Huasong Zhong, Xiangteng He, Yuxin Peng, Lele Cheng |
| 2023 | Recognizing High-Speed Moving Objects with Spike Camera. Junwei Zhao, Jianming Ye, Shiliang Zhang, Zhaofei Yu, Tiejun Huang |
| 2023 | RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D Scenes. Bingchen Gong, Yuehao Wang, Xiaoguang Han, Qi Dou |
| 2023 | Reconnecting the Broken Civilization: Patchwork Integration of Fragments from Ancient Manuscripts. Yuqing Zhang, Zhou Fang, Xinyu Yang, Shengyu Zhang, Baoyi He, Huaiyong Dou, Junchi Yan, Yongquan Zhang, Fei Wu |
| 2023 | Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection. Xinhao Deng, Pingping Zhang, Wei Liu, Huchuan Lu |
| 2023 | Recurrent Self-Supervised Video Denoising with Denser Receptive Field. Zichun Wang, Yulun Zhang, Debing Zhang, Ying Fu |
| 2023 | Recurrent Spike-based Image Restoration under General Illumination. Lin Zhu, Yunlong Zheng, Mengyue Geng, Lizhi Wang, Hua Huang |
| 2023 | Reducing Intrinsic and Extrinsic Data Biases for Moment Localization with Natural Language. Jiong Yin, Liang Li, Jiehua Zhang, Chenggang Yan, Lei Zhang, Zunjie Zhu |
| 2023 | Redundancy-aware Transformer for Video Question Answering. Yicong Li, Xun Yang, An Zhang, Chun Feng, Xiang Wang, Tat-Seng Chua |
| 2023 | Reference-based Dense Pose Estimation via Partial 3D Point Cloud Matching. Rintaro Yanagi, Atsushi Hashimoto, Naoya Chiba, Yoshitaka Ushiku |
| 2023 | RefineTAD: Learning Proposal-free Refinement for Temporal Action Detection. Yue Feng, Zhengye Zhang, Rong Quan, Limin Wang, Jie Qin |
| 2023 | Regress Before Construct: Regress Autoencoder for Point Cloud Self-supervised Learning. Yang Liu, Chen Chen, Can Wang, Xulin King, Mengyuan Liu |
| 2023 | Reinforcement Graph Clustering with Unknown Cluster Number. Yue Liu, Ke Liang, Jun Xia, Xihong Yang, Sihang Zhou, Meng Liu, Xinwang Liu, Stan Z. Li |
| 2023 | Reinforcement Learning-based Adversarial Attacks on Object Detectors using Reward Shaping. Zhenbo Shi, Wei Yang, Zhenbo Xu, Zhidong Yu, Liusheng Huang |
| 2023 | Relation Triplet Construction for Cross-modal Text-to-Video Retrieval. Xue Song, Jingjing Chen, Yu-Gang Jiang |
| 2023 | Relational Contrastive Learning for Scene Text Recognition. Jinglei Zhang, Tiancheng Lin, Yi Xu, Kai Chen, Rui Zhang |
| 2023 | Relative NN-Descent: A Fast Index Construction for Graph-Based Approximate Nearest Neighbor Search. Naoki Ono, Yusuke Matsui |
| 2023 | Relit-NeuLF: Efficient Relighting and Novel View Synthesis via Neural 4D Light Field. Zhong Li, Liangchen Song, Zhang Chen, Xiangyu Du, Lele Chen, Junsong Yuan, Yi Xu |
| 2023 | Reparo: QoE-Aware Live Video Streaming in Low-Rate Networks by Intelligent Frame Recovery. Fulin Wang, Qing Li, Wanxin Shi, Gareth Tyson, Yong Jiang, Lianbo Ma, Peng Zhang, Yulong Lan, Zhicheng Li |
| 2023 | Reservoir Computing Transformer for Image-Text Retrieval. Wenrui Li, Zhengyu Ma, Liang-Jian Deng, Penghong Wang, Jinqiao Shi, Xiaopeng Fan |
| 2023 | Resolve Domain Conflicts for Generalizable Remote Physiological Measurement. Weiyu Sun, Xinyu Zhang, Hao Lu, Ying Chen, Yun Ge, Xiaolin Huang, Jie Yuan, Yingcong Chen |
| 2023 | Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks. Jue Chen, Huan Yuan, Jianchao Tan, Bin Chen, Chengru Song, Di Zhang |
| 2023 | Restoration of Multiple Image Distortions using a Semi-dynamic Deep Neural Network. Hongming Luo, Fei Zhou, Zehong Zhou, Kin-Man Lam, Guoping Qiu |
| 2023 | Rethinking Missing Modality Learning from a Decoding Perspective. Tao Jin, Xize Cheng, Linjun Li, Wang Lin, Ye Wang, Zhou Zhao |
| 2023 | Rethinking Neighborhood Consistency Learning on Unsupervised Domain Adaptation. Chang Liu, Lichen Wang, Yun Fu |
| 2023 | Rethinking Neural Style Transfer: Generating Personalized and Watermarked Stylized Images. Quan Wang, Sheng Li, Xinpeng Zhang, Guorui Feng |
| 2023 | Rethinking Pseudo-Label-Based Unsupervised Person Re-ID with Hierarchical Prototype-based Graph. Ben Sha, Baopu Li, Tao Chen, Jiayuan Fan, Tao Sheng |
| 2023 | Rethinking Voice-Face Correlation: A Geometry View. Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj |
| 2023 | Rethinking the Localization in Weakly Supervised Object Localization. Rui Xu, Yong Luo, Han Hu, Bo Du, Jialie Shen, Yonggang Wen |
| 2023 | RetouchingFFHQ: A Large-scale Dataset for Fine-grained Face Retouching Detection. Qichao Ying, Jiaxin Liu, Sheng Li, Haisheng Xu, Zhenxing Qian, Xinpeng Zhang |
| 2023 | Retrieval-based Knowledge Augmented Vision Language Pre-training. Jiahua Rao, Zifei Shan, Longpo Liu, Yao Zhou, Yuedong Yang |
| 2023 | Revisiting Disentanglement and Fusion on Modality and Context in Conversational Multimodal Emotion Recognition. Bobo Li, Hao Fei, Lizi Liao, Yu Zhao, Chong Teng, Tat-Seng Chua, Donghong Ji, Fei Li |
| 2023 | Revisiting Learning Paradigms for Multimedia Data Generation. Xu Tan |
| 2023 | Robust Image Steganography against General Scaling Attacks. Qingliang Liu, Jiangqun Ni, Xianglei Hu |
| 2023 | Robust Spectral Embedding Completion Based Incomplete Multi-view Clustering. Chao Zhang, Jingwen Wei, Bo Wang, Zechao Li, Chunlin Chen, Huaxiong Li |
| 2023 | RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture. Liangchen Song, Liangliang Cao, Hongyu Xu, Kai Kang, Feng Tang, Junsong Yuan, Zhao Yang |
| 2023 | S-OmniMVS: Incorporating Sphere Geometry into Omnidirectional Stereo Matching. Zisong Chen, Chunyu Lin, Lang Nie, Zhijie Shen, Kang Liao, Yuanzhouhan Cao, Yao Zhao |
| 2023 | S3DS: Self-supervised Learning of 3D Skeletons from Single View Images. Jianwei Hu, Ningna Wang, Baorong Yang, Gang Chen, Xiaohu Guo, Bin Wang |
| 2023 | SA-GDA: Spectral Augmentation for Graph Domain Adaptation. Jinhui Pang, Zixuan Wang, Jiliang Tang, Mingyan Xiao, Nan Yin |
| 2023 | SAAML: A Framework for Semi-supervised Affective Adaptation via Metric Learning. Minh Tran, Yelin Kim, Che-Chun Su, Cheng-Hao Kuo, Mohammad Soleymani |
| 2023 | SAUNet: Spatial-Attention Unfolding Network for Image Compressive Sensing. Ping Wang, Xin Yuan |
| 2023 | SCLAV: Supervised Cross-modal Contrastive Learning for Audio-Visual Coding. Chao Sun, Min Chen, Jialiang Cheng, Han Liang, Chuanbo Zhu, Jincai Chen |
| 2023 | SD-Net: Spatially-Disentangled Point Cloud Completion Network. Junxian Chen, Ying Liu, Yiqi Liang, Dandan Long, Xiaolin He, Ruihui Li |
| 2023 | SDDNet: Style-guided Dual-layer Disentanglement Network for Shadow Detection. Runmin Cong, Yuchen Guan, Jinpeng Chen, Wei Zhang, Yao Zhao, Sam Kwong |
| 2023 | SEAM: Searching Transferable Mixed-Precision Quantization Policy through Large Margin Regularization. Chen Tang, Kai Ouyang, Zenghao Chai, Yunpeng Bai, Yuan Meng, Zhi Wang, Wenwu Zhu |
| 2023 | SEAR: Semantically-grounded Audio Representations. Rajat Hebbar, Digbalay Bose, Shrikanth Narayanan |
| 2023 | SGDiff: A Style Guided Diffusion Model for Fashion Synthesis. Zhengwentai Sun, Yanghong Zhou, Honghong He, P. Y. Mok |
| 2023 | SIEGE: Self-Supervised Incremental Deep Graph Learning for Ethereum Phishing Scam Detection. Shucheng Li, Runchuan Wang, Hao Wu, Sheng Zhong, Fengyuan Xu |
| 2023 | SMM: Self-supervised Multi-Illumination Color Constancy Model with Multiple Pretext Tasks. Ziyu Feng, Zheming Xu, Haina Qin, Congyan Lang, Bing Li, Weihua Xiong |
| 2023 | SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge. Bo Wu, Peiye Liu, Wen-Huang Cheng, Bei Liu, Zhaoyang Zeng, Jia Wang, Qiushi Huang, Jiebo Luo |
| 2023 | SSPU-Net: A Structure Sensitive Point Cloud Upsampling Network with Multi-Scale Spatial Refinement. Jin Wang, Jiade Chen, Yunhui Shi, Nam Ling, Baocai Yin |
| 2023 | STIRER: A Unified Model for Low-Resolution Scene Text Image Recovery and Recognition. Minyi Zhao, Shijie Xuyang, Jihong Guan, Shuigeng Zhou |
| 2023 | SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification. Siyuan Huang, Bo Zhang, Botian Shi, Hongsheng Li, Yikang Li, Peng Gao |
| 2023 | SUMAC '23: 5th Workshop on the analySis, Understanding and proMotion of heritAge Contents: Advances in Machine Learning, Signal Processing, Multimodal Techniques and Human-machine Interaction. Valérie Gouet-Brunet, Ronak Kosti, Li Weng |
| 2023 | SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models. Shanshan Zhong, Zhongzhan Huang, Wushao Wen, Jinghui Qin, Liang Lin |
| 2023 | Saliency Prototype for RGB-D and RGB-T Salient Object Detection. Zihao Zhang, Jie Wang, Yahong Han |
| 2023 | Sample Less, Learn More: Efficient Action Recognition via Frame Feature Restoration. Harry Cheng, Yangyang Guo, Liqiang Nie, Zhiyong Cheng, Mohan S. Kankanhalli |
| 2023 | Scalable Incomplete Multi-View Clustering with Structure Alignment. Yi Wen, Siwei Wang, Ke Liang, Weixuan Liang, Xinhang Wan, Xinwang Liu, Suyuan Liu, Jiyuan Liu, En Zhu |
| 2023 | Scale-space Tokenization for Improving the Robustness of Vision Transformers. Lei Xu, Rei Kawakami, Nakamasa Inoue |
| 2023 | ScaleFlow: Efficient Deep Vision Pipeline with Closed-Loop Scale-Adaptive Inference. Yuyang Leng, Renyuan Liu, Hongpeng Guo, Songqing Chen, Shuochao Yao |
| 2023 | Scene Graph Masked Variational Autoencoders for 3D Scene Generation. Rui Xu, Le Hui, Yuehui Han, Jianjun Qian, Jin Xie |
| 2023 | Scene Text Segmentation with Text-Focused Transformers. Haiyang Yu, Xiaocong Wang, Ke Niu, Bin Li, Xiangyang Xue |
| 2023 | Scene-Generalizable Interactive Segmentation of Radiance Fields. Songlin Tang, Wenjie Pei, Xin Tao, Tanghui Jia, Guangming Lu, Yu-Wing Tai |
| 2023 | Scene-aware Human Pose Generation using Transformer. Jieteng Yao, Junjie Chen, Li Niu, Bin Sheng |
| 2023 | Scene-text Oriented Visual Entailment: Task, Dataset and Solution. Nan Li, Pijian Li, Dongsheng Xu, Wenye Zhao, Yi Cai, Qingbao Huang |
| 2023 | Screen-based 3D Subjective Experiment Software. Songlin Fan, Wei Gao |
| 2023 | ScribbleVC: Scribble-supervised Medical Image Segmentation with Vision-Class Embedding. Zihan Li, Yuan Zheng, Xiangde Luo, Dandan Shan, Qingqi Hong |
| 2023 | Secondary Labeling: A Novel Labeling Strategy for Image Manipulation Detection. Yang Wei, Bin Xiao, Xiuli Bi, Zhuoran Ma, Yang Liu, Zhuo Ma |
| 2023 | Securing Fixed Neural Network Steganography. Zicong Luo, Sheng Li, Guobiao Li, Zhenxing Qian, Xinpeng Zhang |
| 2023 | SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection. Pengfei Zhou, Weiqing Min, Yang Zhang, Jiajun Song, Ying Jin, Shuqiang Jiang |
| 2023 | Seeing in Flowing: Adapting CLIP for Action Recognition with Motion Prompts Learning. Qiang Wang, Junlong Du, Ke Yan, Shouhong Ding |
| 2023 | Selecting Learnable Training Samples is All DETRs Need in Crowded Pedestrian Detection. Feng Gao, Jiaxu Leng, Ji Gan, Xinbo Gao |
| 2023 | Self-Contrastive Graph Diffusion Network. Yixuan Ma, Kun Zhan |
| 2023 | Self-Distillation Dual-Memory Online Hashing with Hash Centers for Streaming Data Retrieval. Chong-Yu Zhang, Xin Luo, Yu-Wei Zhan, Peng-Fei Zhang, Zhen-Duo Chen, Yongxin Wang, Xun Yang, Xin-Shun Xu |
| 2023 | Self-PT: Adaptive Self-Prompt Tuning for Low-Resource Visual Question Answering. Bowen Yuan, Sisi You, Bing-Kun Bao |
| 2023 | Self-Reference Image Super-Resolution via Pre-trained Diffusion Large Model and Window Adjustable Transformer. Guangyuan Li, Wei Xing, Lei Zhao, Zehua Lan, Jiakai Sun, Zhanjie Zhang, Quanwei Zhang, Huaizhong Lin, Zhijie Lin |
| 2023 | Self-Relational Graph Convolution Network for Skeleton-Based Action Recognition. Sophyani Banaamwini Yussif, Ning Xie, Yang Yang, Heng Tao Shen |
| 2023 | Self-Supervised Cross-Language Scene Text Editing. Fuxiang Yang, Tonghua Su, Xiang Zhou, Donglin Di, Zhongjie Wang, Songze Li |
| 2023 | Self-supervised Video Summarization Guided by Semantic Inverse Optimal Transport. Yutong Wang, Hongteng Xu, Dixin Luo |
| 2023 | SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces. Ziqiao Peng, Yihao Luo, Yue Shi, Hao Xu, Xiangyu Zhu, Hongyan Liu, Jun He, Zhaoxin Fan |
| 2023 | Semantic-Aware Generator and Low-level Feature Augmentation for Few-shot Image Generation. Zhe Wang, Jiaoyan Guan, Mengping Yang, Ting Xiao, Ziqiu Chi |
| 2023 | Semantic-Guided Feature Distillation for Multimodal Recommendation. Fan Liu, Huilin Chen, Zhiyong Cheng, Liqiang Nie, Mohan S. Kankanhalli |
| 2023 | Semantic-aware Consistency Network for Cloth-changing Person Re-Identification. Peini Guo, Hong Liu, Jianbing Wu, Guoquan Wang, Tao Wang |
| 2023 | Semantic-based Selection, Synthesis, and Supervision for Few-shot Learning. Jinda Lu, Shuo Wang, Xinyu Zhang, Yanbin Hao, Xiangnan He |
| 2023 | SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispectral Images. Wei Ji, Jingjing Li, Cheng Bian, Zhicheng Zhang, Li Cheng |
| 2023 | Semantics-Enriched Cross-Modal Alignment for Complex-Query Video Moment Retrieval. Xingyu Shen, Xiang Zhang, Xun Yang, Yibing Zhan, Long Lan, Jianfeng Dong, Hongzhou Wu |
| 2023 | Semantics2Hands: Transferring Hand Motion Semantics between Avatars. Zijie Ye, Jia Jia, Junliang Xing |
| 2023 | Semi-Supervised Convolutional Vision Transformer with Bi-Level Uncertainty Estimation for Medical Image Segmentation. Huimin Huang, Yawen Huang, Shiao Xie, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen, Yuexiang Li, Yefeng Zheng |
| 2023 | Semi-Supervised Multimodal Emotion Recognition with Class-Balanced Pseudo-labeling. Haifeng Chen, Chujia Guo, Yan Li, Peng Zhang, Dongmei Jiang |
| 2023 | Semi-Supervised Multimodal Emotion Recognition with Expression MAE. Zebang Cheng, Yuxiang Lin, Zhaoru Chen, Xiang Li, Shuyi Mao, Fan Zhang, Daijun Ding, Bowen Zhang, Xiaojiang Peng |
| 2023 | Semi-Supervised Panoptic Narrative Grounding. Danni Yang, Jiayi Ji, Xiaoshuai Sun, Haowei Wang, Yinan Li, Yiwei Ma, Rongrong Ji |
| 2023 | Semi-supervised Deep Multi-view Stereo. Hongbin Xu, Weitao Chen, Yang Liu, Zhipeng Zhou, Haihong Xiao, Baigui Sun, Xuansong Xie, Wenxiong Kang |
| 2023 | Semi-supervised Domain Adaptation via Joint Contrastive Learning with Sensitivity. Keyu Tu, Zilei Wang, Junjie Li, Yixin Zhang |
| 2023 | Semi-supervised Semantic Segmentation with Mutual Knowledge Distillation. Jianlong Yuan, Jinchao Ge, Zhibin Wang, Yifan Liu |
| 2023 | Sensing Micro-Motion Human Patterns using Multimodal mmRadar and Video Signal for Affective and Psychological Intelligence. Yiwei Ru, Peipei Li, Muyi Sun, Yunlong Wang, Kunbo Zhang, Qi Li, Zhaofeng He, Zhenan Sun |
| 2023 | SepMark: Deep Separable Watermarking for Unified Source Tracing and Deepfake Detection. Xiaoshuai Wu, Xin Liao, Bo Ou |
| 2023 | Separable Modulation Network for Efficient Image Super-Resolution. Zhijian Wu, Jun Li, Dingjiang Huang |
| 2023 | Separate and Locate: Rethink the Text in Text-based Visual Question Answering. Chengyang Fang, Jiangnan Li, Liang Li, Can Ma, Dayong Hu |
| 2023 | Sequential Affinity Learning for Video Restoration. Tian Ye, Sixiang Chen, Yun Liu, Wenhao Chai, Jinbin Bai, Wenbin Zou, Yunchen Zhang, Mingchao Jiang, Erkang Chen, Chenghao Xue |
| 2023 | SetterVision: Motion-based Tactical Training System for Volleyball Setters in Virtual Reality. Yu-Hsuan Chen, Chen-Wei Fu, Wei-Lun Huang, Ming-Cong Su, Hsin-Yu Huang, Andrew Chen, Tse-Yu Pan |
| 2023 | Shift Pruning: Equivalent Weight Pruning for CNN via Differentiable Shift Operator. Tao Niu, Yihang Lou, Yinglei Teng, Jianzhong He, Yiding Liu |
| 2023 | Shifted GCN-GAT and Cumulative-Transformer based Social Relation Recognition for Long Videos. Haorui Wang, Yibo Hu, Yangfu Zhu, Jinsheng Qi, Bin Wu |
| 2023 | SiFDetectCracker: An Adversarial Attack Against Fake Voice Detection Based on Speaker-Irrelative Features. Xuan Hai, Xin Liu, Yuan Tan, Qingguo Zhou |
| 2023 | SimHMR: A Simple Query-based Framework for Parameterized Human Mesh Reconstruction. Zihao Huang, Min Shi, Chengxin Liu, Ke Xian, Zhiguo Cao |
| 2023 | Simple Techniques are Sufficient for Boosting Adversarial Transferability. Chaoning Zhang, Philipp Benz, Adil Karjauv, In So Kweon, Choong Seon Hong |
| 2023 | SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation. Lingyi Hong, Wei Zhang, Shuyong Gao, Hong Lu, Wenqiang Zhang |
| 2023 | Single Domain Generalization via Unsupervised Diversity Probe. Kehua Guo, Rui Ding, Tian Qiu, Xiangyuan Zhu, Zheng Wu, Liwei Wang, Hui Fang |
| 2023 | Single-Stage Multi-human Parsing via Point Sets and Center-based Offsets. Jiaming Chu, Lei Jin, Xiaojin Fan, Yinglei Teng, Yunchao Wei, Yuqiang Fang, Junliang Xing, Jian Zhao |
| 2023 | Skeletal Spatial-Temporal Semantics Guided Homogeneous-Heterogeneous Multimodal Network for Action Recognition. Chenwei Zhang, Yuxuan Hu, Min Yang, Chengming Li, Xiping Hu |
| 2023 | Skeleton MixFormer: Multivariate Topology Representation for Skeleton-based Action Recognition. Wentian Xin, Qiguang Miao, Yi Liu, Ruyi Liu, Chi-Man Pun, Cheng Shi |
| 2023 | Sketch Input Method Editor: A Comprehensive Dataset and Methodology for Systematic Input Recognition. Guangming Zhu, Siyuan Wang, Qing Cheng, Kelong Wu, Hao Li, Liang Zhang |
| 2023 | SkipStreaming: Pinpointing User-Perceived Redundancy in Correlated Web Video Streaming through the Lens of Scenes. Wei Liu, Xinlei Yang, Zhenhua Li, Feng Qian |
| 2023 | Sliding Window Seq2seq Modeling for Engagement Estimation. Jun Yu, Keda Lu, Mohan Jing, Ziqi Liang, Bingyuan Zhang, Jianqing Sun, Jiaen Liang |
| 2023 | Slow-Fast Time Parameter Aggregation Network for Class-Incremental Lip Reading. Xueyi Zhang, Chengwei Zhang, Tao Wang, Jun Tang, Songyang Lao, Haizhou Li |
| 2023 | Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition. Guangzhao Dai, Xiangbo Shu, Rui Yan, Peng Huang, Jinhui Tang |
| 2023 | SpaceCLIP: A Vision-Language Pretraining Framework With Spatial Reconstruction On Text. Bo Zou, Chao Yang, Chengbin Quan, Youjian Zhao |
| 2023 | Sparse Sharing Relation Network for Panoptic Driving Perception. Fan Jiang, Zilei Wang |
| 2023 | Spatial-angular Quality-aware Representation Learning for Blind Light Field Image Quality Assessment. Jianjun Xiang, Yuanjie Dang, Peng Chen, Ronghua Liang, Ruohong Huan, Zhengyu Zhang |
| 2023 | Spatio-Temporal Branching for Motion Prediction using Motion Increments. Jiexin Wang, Yujie Zhou, Wenwen Qiang, Ying Ba, Bing Su, Ji-Rong Wen |
| 2023 | Spatio-Temporal Catcher: A Self-Supervised Transformer for Deepfake Video Detection. Maosen Li, Xurong Li, Kun Yu, Cheng Deng, Heng Huang, Feng Mao, Hui Xue, Minghao Li |
| 2023 | Speech-Driven 3D Face Animation with Composite and Regional Facial Movements. Haozhe Wu, Songtao Zhou, Jia Jia, Junliang Xing, Qi Wen, Xiang Wen |
| 2023 | SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody. Hui Lu, Xixin Wu, Zhiyong Wu, Helen Meng |
| 2023 | StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability. Tengchuan Kou, Xiaohong Liu, Wei Sun, Jun Jia, Xiongkuo Min, Guangtao Zhai, Ning Liu |
| 2023 | StegaDDPM: Generative Image Steganography based on Denoising Diffusion Probabilistic Model. Yinyin Peng, Donghui Hu, Yaofei Wang, Kejiang Chen, Gang Pei, Weiming Zhang |
| 2023 | Stepwise Refinement Short Hashing for Image Retrieval. Yuan Sun, Dezhong Peng, Jian Dai, Zhenwen Ren |
| 2023 | Striking a Balance: Unsupervised Cross-Domain Crowd Counting via Knowledge Diffusion. Haiyang Xie, Zhengwei Yang, Huilin Zhu, Zheng Wang |
| 2023 | Stroke-based Neural Painting and Stylization with Dynamically Predicted Painting Region. Teng Hu, Ran Yi, Haokun Zhu, Liang Liu, Jinlong Peng, Yabiao Wang, Chengjie Wang, Lizhuang Ma |
| 2023 | Style Transfer Meets Super-Resolution: Advancing Unpaired Infrared-to-Visible Image Translation with Detail Enhancement. Yirui Shen, Jingxuan Kang, Shuang Li, Zhenjie Yu, Shuigen Wang |
| 2023 | Style-Controllable Generalized Person Re-identification. Yuke Li, Jingkuan Song, Hao Ni, Heng Tao Shen |
| 2023 | StyleEDL: Style-Guided High-order Attention Network for Image Emotion Distribution Learning. Peiguang Jing, Xianyi Liu, Ji Wang, Yinwei Wei, Liqiang Nie, Yuting Su |
| 2023 | StylePrompter: All Styles Need Is Attention. Chenyi Zhuang, Pan Gao, Aljosa Smolic |
| 2023 | Suspected Objects Matter: Rethinking Model's Prediction for One-stage Visual Grounding. Yang Jiao, Zequn Jie, Jingjing Chen, Lin Ma, Yu-Gang Jiang |
| 2023 | Swin-UNIT: Transformer-based GAN for High-resolution Unpaired Image Translation. Yifan Li, Yaochen Li, Wenneng Tang, Zhifeng Zhu, Jinhuo Yang, Yuehu Liu |
| 2023 | Symmetrical Linguistic Feature Distillation with CLIP for Scene Text Recognition. Zixiao Wang, Hongtao Xie, Yuxin Wang, Jianjun Xu, Boqiang Zhang, Yongdong Zhang |
| 2023 | Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling. Zhao Yang, Bing Su, Ji-Rong Wen |
| 2023 | Synthesizing Videos from Images for Image-to-Video Adaptation. Junbao Zhuo, Xingyu Zhao, Shuhui Wang, Huimin Ma, Qingming Huang |
| 2023 | TE-KWS: Text-Informed Speech Enhancement for Noise-Robust Keyword Spotting. Dong Liu, Qirong Mao, Lijian Gao, Qinghua Ren, Zhenghan Chen, Ming Dong |
| 2023 | TIRDet: Mono-Modality Thermal InfraRed Object Detection Based on Prior Thermal-To-Visible Translation. Zeyu Wang, Fabien Colonnier, Jinghong Zheng, Jyotibdha Acharya, Wenyu Jiang, Kejie Huang |
| 2023 | TIVA-KG: A Multimodal Knowledge Graph with Text, Image, Video and Audio. Xin Wang, Benyuan Meng, Hong Chen, Yuan Meng, Ke Lv, Wenwu Zhu |
| 2023 | TMac: Temporal Multi-Modal Graph Learning for Acoustic Event Classification. Meng Liu, Ke Liang, Dayu Hu, Hao Yu, Yue Liu, Lingyuan Meng, Wenxuan Tu, Sihang Zhou, Xinwang Liu |
| 2023 | TSSAT: Two-Stage Statistics-Aware Transformation for Artistic Style Transfer. Haibo Chen, Lei Zhao, Jun Li, Jian Yang |
| 2023 | TTPOINT: A Tensorized Point Cloud Network for Lightweight Action Recognition with Event Cameras. Hongwei Ren, Yue Zhou, Haotian Fu, Yulong Huang, Renjing Xu, Bojun Cheng |
| 2023 | Taking a Part for the Whole: An Archetype-agnostic Framework for Voice-Face Association. Guancheng Chen, Xin Liu, Xing Xu, Yiu-ming Cheung, Taihao Li |
| 2023 | Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow. Junhong Gou, Siyu Sun, Jianfu Zhang, Jianlou Si, Chen Qian, Liqing Zhang |
| 2023 | Target-Guided Composed Image Retrieval. Haokun Wen, Xian Zhang, Xuemeng Song, Yinwei Wei, Liqiang Nie |
| 2023 | Task-Adversarial Adaptation for Multi-modal Recommendation. Hongzu Su, Jingjing Li, Fengling Li, Lei Zhu, Ke Lu, Yang Yang |
| 2023 | TeViS: Translating Text Synopses to Video Storyboards. Xu Gu, Yuchong Sun, Feiyue Ni, Shizhe Chen, Xihua Wang, Ruihua Song, Boyuan Li, Xiang Cao |
| 2023 | Temporal Sentence Grounding in Streaming Videos. Tian Gan, Xiao Wang, Yan Sun, Jianlong Wu, Qingpei Guo, Liqiang Nie |
| 2023 | Temporally Efficient Gabor Transformer for Unsupervised Video Object Segmentation. Jiaqing Fan, Tiankang Su, Kaihua Zhang, Bo Liu, Qingshan Liu |
| 2023 | Text-Only Training for Visual Storytelling. Yuechen Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li |
| 2023 | Text-based Person Search without Parallel Image-Text Data. Yang Bai, Jingyao Wang, Min Cao, Chen Chen, Ziqiang Cao, Liqiang Nie, Min Zhang |
| 2023 | Text-to-Audio Generation using Instruction Guided Latent Diffusion Model. Deepanway Ghosal, Navonil Majumder, Ambuj Mehrish, Soujanya Poria |
| 2023 | Text-to-Image Diffusion Models can be Easily Backdoored through Multimodal Data Poisoning. Shengfang Zhai, Yinpeng Dong, Qingni Shen, Shi Pu, Yuejian Fang, Hang Su |
| 2023 | Text-to-Metaverse: Towards a Digital Twin-Enabled Multimodal Conditional Generative Metaverse. Ahmed Elhagry |
| 2023 | TextPainter: Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design. Yifan Gao, Jinpeng Lin, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang |
| 2023 | That's What I Said: Fully-Controllable Talking Face Generation. Youngjoon Jang, Kyeongha Rho, Jong-Bin Woo, Hyeongkeun Lee, Jihwan Park, Youshin Lim, Byeong-Yeol Kim, Joon Son Chung |
| 2023 | The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests. Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Alexander Barnhill, Maurice Gerczuk, Andreas Triantafyllopoulos, Alice E. Baird, Panagiotis Tzirakis, Chris Gagne, Alan S. Cowen, Nikola Lackovic, Marie-José Caraty, Claude Montacié |
| 2023 | The ACM Multimedia 2023 Deep Video Understanding Grand Challenge. Keith Curtis, George Awad, Afzal Godil, Ian Soboroff |
| 2023 | The Effects of Viewing Formats and Song Genres on Audience Experiences in Virtual Avatar Concerts. Sebin Lee, Daye Kim, Jungjin Lee |
| 2023 | The Silent Manipulator: A Practical and Inaudible Backdoor Attack against Speech Recognition Systems. Zhicong Zheng, Xinfeng Li, Chen Yan, Xiaoyu Ji, Wenyuan Xu |
| 2023 | Think before You Leap: Content-Aware Low-Cost Edge-Assisted Video Semantic Segmentation. Mingxuan Yan, Yi Wang, Xuedou Xiao, Zhiqing Luo, Jianhua He, Wei Wang |
| 2023 | TikTalk: A Video-Based Dialogue Dataset for Multi-Modal Chitchat in Real World. Hongpeng Lin, Ludan Ruan, Wenke Xia, Peiyu Liu, Jingyuan Wen, Yixin Xu, Di Hu, Ruihua Song, Wayne Xin Zhao, Qin Jin, Zhiwu Lu |
| 2023 | Tile Classification Based Viewport Prediction with Multi-modal Fusion Transformer. Zhihao Zhang, Yiwei Chen, Weizhan Zhang, Caixia Yan, Qinghua Zheng, Qi Wang, Wangdu Chen |
| 2023 | TopicCAT: Unsupervised Topic-Guided Co-Attention Transformer for Extreme Multimodal Summarisation. Peggy Tang, Kun Hu, Lei Zhang, Junbin Gao, Jiebo Luo, Zhiyong Wang |
| 2023 | Topological Structure Learning for Weakly-Supervised Out-of-Distribution Detection. Rundong He, Rongxue Li, Zhongyi Han, Xihong Yang, Yilong Yin |
| 2023 | Toward High Quality Facial Representation Learning. Yue Wang, Jinlong Peng, Jiangning Zhang, Ran Yi, Liang Liu, Yabiao Wang, Chengjie Wang |
| 2023 | Toward Human Perception-Centric Video Thumbnail Generation. Tao Yang, Fan Wang, Junfan Lin, Zhongang Qi, Yang Wu, Jing Xu, Ying Shan, Changwen Chen |
| 2023 | Toward Intelligent Interactive Design: A Generation Framework Based on Cross-domain Fashion Elements. Jianyang Shi, Haijun Zhang, Dongliang Zhou, Zhao Zhang |
| 2023 | Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach. Sha Guo, Zhuo Chen, Yang Zhao, Ning Zhang, Xiaotong Li, Lingyu Duan |
| 2023 | Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations. Xiaolei Diao, Daqian Shi, Jian Li, Lida Shi, Mingzhe Yue, Ruihua Qi, Chuntao Li, Hao Xu |
| 2023 | Towards Accurate Lip-to-Speech Synthesis in-the-Wild. Sindhu B. Hegde, Rudrabha Mukhopadhyay, C. V. Jawahar, Vinay P. Namboodiri |
| 2023 | Towards Adaptable Graph Representation Learning: An Adaptive Multi-Graph Contrastive Transformer. Yan Li, Liang Zhang, Xiangyuan Lan, Dongmei Jiang |
| 2023 | Towards Balanced Active Learning for Multimodal Classification. Meng Shen, Yizheng Huang, Jianxiong Yin, Heqing Zou, Deepu Rajan, Simon See |
| 2023 | Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering. Yifan Dong, Suhang Wu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jianxin Lin, Jinsong Su |
| 2023 | Towards Decision-based Sparse Attacks on Video Recognition. Kaixun Jiang, Zhaoyu Chen, Xinyu Zhou, Jingyu Zhang, Lingyi Hong, Jiafeng Wang, Bo Li, Yan Wang, Wenqiang Zhang |
| 2023 | Towards Deconfounded Image-Text Matching with Causal Inference. Wenhui Li, Xinqi Su, Dan Song, Lanjun Wang, Kun Zhang, An-An Liu |
| 2023 | Towards End-to-End Unsupervised Saliency Detection with Self-Supervised Top-Down Context. Yicheng Song, Shuyong Gao, Haozhe Xing, Yiting Cheng, Yan Wang, Wenqiang Zhang |
| 2023 | Towards Explainable In-the-Wild Video Quality Assessment: A Database and a Language-Prompted Approach. Haoning Wu, Erli Zhang, Liang Liao, Chaofeng Chen, Jingwen Hou, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin |
| 2023 | Towards Fast and Stable Federated Learning: Confronting Heterogeneity via Knowledge Anchor. Jinqian Chen, Jihua Zhu, Qinghai Zheng |
| 2023 | Towards Flexible and Universal: A Novel Endpoint-based Framework for Vessel Structural Information Extraction. Xiyao Ma, Shiqi Liu, Xiaoliang Xie, Xiao-Hu Zhou, Zengguang Hou, Xinkai Qu, Wenzheng Han, Ming Wang, Meng Song, Lin-Sen Zhang |
| 2023 | Towards Real-Time Neural Video Codec for Cross-Platform Application Using Calibration Information. Kuan Tian, Yonghang Guan, Jinxi Xiang, Jun Zhang, Xiao Han, Wei Yang |
| 2023 | Towards Real-Time Sign Language Recognition and Translation on Edge Devices. Shiwei Gan, Yafeng Yin, Zhiwei Jiang, Lei Xie, Sanglu Lu |
| 2023 | Towards Realistic Conversational Head Generation: A Comprehensive Framework for Lifelike Video Synthesis. Meng Liu, Yongqiang Li, Shuyan Zhai, Weili Guan, Liqiang Nie |
| 2023 | Towards Robust Real-Time Scene Text Detection: From Semantic to Instance Representation Learning. Xugong Qin, Pengyuan Lyu, Chengquan Zhang, Yu Zhou, Kun Yao, Peng Zhang, Hailun Lin, Weiping Wang |
| 2023 | Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark. Shuyu Yang, Yinan Zhou, Zhedong Zheng, Yaxiong Wang, Li Zhu, Yujiao Wu |
| 2023 | Towards Visual Taxonomy Expansion. Tinghui Zhu, Jingping Liu, Jiaqing Liang, Haiyun Jiang, Yanghua Xiao, Zongyu Wang, Rui Xie, Yunsen Xian |
| 2023 | Train One, Generalize to All: Generalizable Semantic Segmentation from Single-Scene to All Adverse Scenes. Ziyang Gong, Fuhao Li, Yupeng Deng, Wenjun Shen, Xianzheng Ma, Zhenming Ji, Nan Xia |
| 2023 | Training Multimedia Event Extraction With Generated Images and Captions. Zilin Du, Yunxin Li, Xu Guo, Yidan Sun, Boyang Li |
| 2023 | Tran-GCN: Multi-label Pattern Image Retrieval via Transformer Driven Graph Convolutional Network. Ying Li, Chunming Guan, Rui Cai, Erwan Ye, Ding Yuxiang, Jiaquan Gao |
| 2023 | Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation. Yuanbin Wang, Shaofei Huang, Yulu Gao, Zhen Wang, Rui Wang, Kehua Sheng, Bo Zhang, Si Liu |
| 2023 | Transformer-based Open-world Instance Segmentation with Cross-task Consistency Regularization. Xizhe Xue, Dongdong Yu, Lingqiao Liu, Yu Liu, Satoshi Tsutsui, Ying Li, Zehuan Yuan, Ping Song, Mike Zheng Shou |
| 2023 | Transformer-based Point Cloud Generation Network. Rui Xu, Le Hui, Yuehui Han, Jianjun Qian, Jin Xie |
| 2023 | Transition and Adaptability: The Cornerstone of Resilience in Future Networked Multimedia Systems and Beyond. Ralf Steinmetz |
| 2023 | Triple Correlations-Guided Label Supplementation for Unbiased Video Scene Graph Generation. Wenqing Wang, Kaifeng Gao, Yawei Luo, Tao Jiang, Fei Gao, Jian Shao, Jianwen Sun, Jun Xiao |
| 2023 | Triple-Granularity Contrastive Learning for Deep Multi-View Subspace Clustering. Jing Wang, Songhe Feng, Gengyu Lyu, Zhibin Gu |
| 2023 | TwinStar: A Practical Multi-path Transmission Framework for Ultra-Low Latency Video Delivery. Haiping Wang, Zhenhua Yu, Ruixiao Zhang, Siping Tao, Hebin Yu, Shu Shi |
| 2023 | Two-stage Content-Aware Layout Generation for Poster Designs. Shang Chai, Liansheng Zhuang, Fengying Yan, Zihan Zhou |
| 2023 | U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion. Siran Peng, Chenhao Guo, Xiao Wu, Liang-Jian Deng |
| 2023 | UAVM '23: 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. Zhedong Zheng, Yujiao Shi, Tingyu Wang, Jun Liu, Jianwu Fang, Yunchao Wei, Tat-Seng Chua |
| 2023 | UER: A Heuristic Bias Addressing Approach for Online Continual Learning. Huiwei Lin, Shanshan Feng, Baoquan Zhang, Hongliang Qiao, Xutao Li, Yunming Ye |
| 2023 | UMMAFormer: A Universal Multimodal-adaptive Transformer Framework for Temporal Forgery Localization. Rui Zhang, Hongxia Wang, Mingshan Du, Hanqing Liu, Yang Zhou, Qiang Zeng |
| 2023 | Unambiguous Object Tracking by Exploiting Target Cues. Jie Gao, Bineng Zhong, Yan Chen |
| 2023 | Unbalanced Multi-view Deep Learning. Cai Xu, Zehui Li, Ziyu Guan, Wei Zhao, Xiangyu Song, Yue Wu, Jianxin Li |
| 2023 | Uncertainty-Aware Variate Decomposition for Self-supervised Blind Image Deblurring. Runhua Jiang, Yahong Han |
| 2023 | Uncertainty-Driven Dynamic Degradation Perceiving and Background Modeling for Efficient Single Image Desnowing. Sixiang Chen, Tian Ye, Chenghao Xue, Haoyu Chen, Yun Liu, Erkang Chen, Lei Zhu |
| 2023 | Uncertainty-Guided End-to-End Audio-Visual Speaker Diarization for Far-Field Recordings. Chenyu Yang, Mengxi Chen, Yanfeng Wang, Yu Wang |
| 2023 | Uncertainty-Guided Spatial Pruning Architecture for Efficient Frame Interpolation. Ri Cheng, Xuhao Jiang, Ruian He, Shili Zhou, Weimin Tan, Bo Yan |
| 2023 | Uncovering the Unseen: Discover Hidden Intentions by Micro-Behavior Graph Reasoning. Zhuo Zhou, Wenxuan Liu, Danni Xu, Zheng Wang, Jian Zhao |
| 2023 | Understanding User Behavior in Volumetric Video Watching: Dataset, Analysis and Prediction. Kaiyuan Hu, Haowen Yang, Yili Jin, Junhua Liu, Yongting Chen, Miao Zhang, Fangxin Wang |
| 2023 | Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy. Yi Tang, Hiroshi Kawasaki, Takafumi Iwaguchi |
| 2023 | Unfolding Once is Enough: A Deployment-Friendly Transformer Unit for Super-Resolution. Yong Liu, Hang Dong, Boyang Liang, Songwei Liu, Qingji Dong, Kai Chen, Fangmin Chen, Lean Fu, Fei Wang |
| 2023 | Uni-Dual: A Generic Unified Dual-Task Medical Self-Supervised Learning Framework. Boxiang Yun, Xingran Xie, Qingli Li, Yan Wang |
| 2023 | Uni-paint: A Unified Framework for Multimodal Image Inpainting with Pretrained Diffusion Model. Shiyuan Yang, Xiaodong Chen, Jing Liao |
| 2023 | UniFaRN: Unified Transformer for Facial Reaction Generation. Cong Liang, Jiahe Wang, Haofan Zhang, Bing Tang, Junshan Huang, Shangfei Wang, Xiaoping Chen |
| 2023 | UniNeXt: Exploring A Unified Architecture for Vision Recognition. Fangjian Lin, Jianlong Yuan, Sitong Wu, Fan Wang, Zhibin Wang |
| 2023 | UniSA: Unified Generative Framework for Sentiment Analysis. Zaijing Li, Ting-En Lin, Yuchuan Wu, Meng Liu, Fengxiao Tang, Ming Zhao, Yongbin Li |
| 2023 | UniSinger: Unified End-to-End Singing Voice Synthesis With Cross-Modality Information Matching. Zhiqing Hong, Chenye Cui, Rongjie Huang, Lichao Zhang, Jinglin Liu, Jinzheng He, Zhou Zhao |
| 2023 | Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding. Shengkai Sun, Daizong Liu, Jianfeng Dong, Xiaoye Qu, Junyu Gao, Xun Yang, Xun Wang, Meng Wang |
| 2023 | UnifiedGesture: A Unified Gesture Synthesis Model for Multiple Skeletons. Sicheng Yang, Zilin Wang, Zhiyong Wu, Minglei Li, Zhensong Zhang, Qiaochu Huang, Lei Hao, Songcen Xu, Xiaofei Wu, Changpeng Yang, Zonghong Dai |
| 2023 | Uniformly Distributed Category Prototype-Guided Vision-Language Framework for Long-Tail Recognition. Xiaoxuan He, Siming Fu, Xinpeng Ding, Yuchen Cao, Hualiang Wang |
| 2023 | Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval. Yi Bin, Haoxuan Li, Yahui Xu, Xing Xu, Yang Yang, Heng Tao Shen |
| 2023 | Unite-Divide-Unite: Joint Boosting Trunk and Structure for High-accuracy Dichotomous Image Segmentation. Jialun Pei, Zhangjun Zhou, Yueming Jin, He Tang, Pheng-Ann Heng |
| 2023 | Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character Recognition. Jiacheng Deng, Li Dong, Jiahao Chen, Diqun Yan, Rangding Wang, Dengpan Ye, Lingchen Zhao, Jinyu Tian |
| 2023 | Universal Domain Adaptive Network Embedding for Node Classification. Jushuo Chen, Feifei Dai, Xiaoyan Gu, Jiang Zhou, Bo Li, Weiping Wang |
| 2023 | Unlearnable Examples Give a False Sense of Security: Piercing through Unexploitable Data with Learnable Examples. Wan Jiang, Yunfeng Diao, He Wang, Jianxin Sun, Meng Wang, Richang Hong |
| 2023 | Unlocking the Power of Cross-Dimensional Semantic Dependency for Image-Text Matching. Kun Zhang, Lei Zhang, Bo Hu, Mengxiao Zhu, Zhendong Mao |
| 2023 | Unlocking the Power of Multimodal Learning for Emotion Recognition in Conversation. Yunxiao Wang, Meng Liu, Zhe Li, Yupeng Hu, Xin Luo, Liqiang Nie |
| 2023 | Unsupervised Domain Adaptation for Referring Semantic Segmentation. Haonan Shi, Wenwen Pan, Zhou Zhao, Mingmin Zhang, Fei Wu |
| 2023 | Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning. Mengze Li, Haoyu Zhang, Juncheng Li, Zhou Zhao, Wenqiao Zhang, Shengyu Zhang, Shiliang Pu, Yueting Zhuang, Fei Wu |
| 2023 | Unsupervised Hashing with Contrastive Learning by Exploiting Similarity Knowledge and Hidden Structure of Data. Zhenpeng Song, Qinliang Su, Jiayang Chen |
| 2023 | Unsupervised Multiplex Graph learning with Complementary and Consistent Information. Liang Peng, Xin Wang, Xiaofeng Zhu |
| 2023 | Unsupervised Visible-Infrared Person ReID by Collaborative Learning with Neighbor-Guided Label Refinement. De Cheng, Xiaojian Huang, Nannan Wang, Lingfeng He, Zhihui Li, Xinbo Gao |
| 2023 | Unveiling Subtle Cues: Backchannel Detection Using Temporal Multimodal Attention Networks. Kangzhong Wang, MK Michael Cheung, Youqian Zhang, Chunxi Yang, Peter Q. Chen, Eugene Yujun Fu, Grace Ngai |
| 2023 | Unveiling the Power of CLIP in Unsupervised Visible-Infrared Person Re-Identification. Zhong Chen, Zhizhong Zhang, Xin Tan, Yanyun Qu, Yuan Xie |
| 2023 | Up to Thousands-fold Storage Saving: Towards Efficient Data-Free Distillation of Large-Scale Visual Classifiers. Fanfan Ye, Bingyi Lu, Liang Ma, Qiaoyong Zhong, Di Xie |
| 2023 | V2Depth: Monocular Depth Estimation via Feature-Level Virtual-View Simulation and Refinement. Zizhang Wu, Zhuozheng Li, Zhi-Gang Fan, Yunzhe Wu, Jian Pu, Xianzhi Li |
| 2023 | VCMaster: Generating Diverse and Fluent Live Video Comments Based on Multimodal Contexts. Manman Zhang, Ge Luo, Yuchen Ma, Sheng Li, Zhenxing Qian, Xinpeng Zhang |
| 2023 | VPA: Fully Test-Time Visual Prompt Adaptation. Jiachen Sun, Mark Ibrahim, Melissa Hall, Ivan Evtimov, Z. Morley Mao, Cristian Canton-Ferrer, Caner Hazirbas |
| 2023 | VQBA: Visual-Quality-Driven Bit Allocation for Low-Latency Point Cloud Streaming. Shuoqian Wang, Mufeng Zhu, Na Li, Mengbai Xiao, Yao Liu |
| 2023 | VTLayout: A Multi-Modal Approach for Video Text Layout. Yuxuan Zhao, Jin Ma, Zhongang Qi, Zehua Xie, Yu Luo, Qiusheng Kang, Ying Shan |
| 2023 | VTQA2023: ACM Multimedia 2023 Visual Text Question Answering Challenge. Kang Chen, Tianli Zhao, Xiangqian Wu |
| 2023 | VTQAGen: BART-based Generative Model For Visual Text Question Answering. Haoru Chen, Tianjiao Wan, Zhimin Lin, Kele Xu, Jin Wang, Huaimin Wang |
| 2023 | Variance-Aware Bi-Attention Expression Transformer for Open-Set Facial Expression Recognition in the Wild. Junjie Zhu, Bingjun Luo, Ao Sun, Jinghang Tan, Xibin Zhao, Yue Gao |
| 2023 | Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space. Haoyu Wang, Haozhe Wu, Junliang Xing, Jia Jia |
| 2023 | Video Entailment via Reaching a Structure-Aware Cross-modal Consensus. Xuan Yao, Junyu Gao, Mengyuan Chen, Changsheng Xu |
| 2023 | Video Frame Interpolation with Flow Transformer. Pan Gao, Haoyue Tian, Jie Qin |
| 2023 | Video Infringement Detection via Feature Disentanglement and Mutual Information Maximization. Zhenguang Liu, Xinyang Yu, Ruili Wang, Shuai Ye, Zhe Ma, Jianfeng Dong, Sifeng He, Feng Qian, Xiaobo Zhang, Roger Zimmermann, Lei Yang |
| 2023 | Video Inverse Tone Mapping Network with Luma and Chroma Mapping. Peihuan Huang, Gaofeng Cao, Fei Zhou, Guoping Qiu |
| 2023 | Video Scene Graph Generation with Spatial-Temporal Knowledge. Tao Pu |
| 2023 | Video-based Visible-Infrared Person Re-Identification via Style Disturbance Defense and Dual Interaction. Chuhao Zhou, Jinxing Li, Huafeng Li, Guangming Lu, Yong Xu, Min Zhang |
| 2023 | View while Moving: Efficient Video Recognition in Long-untrimmed Videos. Ye Tian, Mengyu Yang, Lanshan Zhang, Zhizhen Zhang, Yang Liu, Xiaohui Xie, Xirong Que, Wendong Wang |
| 2023 | VioLET: Vision-Language Efficient Tuning with Collaborative Multi-modal Gradients. Yaoming Wang, Yuchen Liu, Xiaopeng Zhang, Jin Li, Bowen Shi, Chenglin Li, Wenrui Dai, Hongkai Xiong, Qi Tian |
| 2023 | Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences. Dingyi Yang, Hongyu Chen, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin |
| 2023 | Visual Causal Scene Refinement for Video Question Answering. Yushen Wei, Yang Liu, Hong Yan, Guanbin Li, Liang Lin |
| 2023 | Visual Redundancy Removal of Composite Images via Multimodal Learning. Wuyuan Xie, Shukang Wang, Rong Zhang, Miaohui Wang |
| 2023 | WRAP: Watermarking Approach Robust Against Film-coating upon Printed Photographs. Gaozhi Liu, Yichao Si, Zhenxing Qian, Xinpeng Zhang, Sheng Li, Wanli Peng |
| 2023 | WaterFlow: Heuristic Normalizing Flow for Underwater Image Enhancement and Beyond. Zengxi Zhang, Zhiying Jiang, Jinyuan Liu, Xin Fan, Risheng Liu |
| 2023 | Weakly-Supervised Text Instance Segmentation. Xinyan Zu, Haiyang Yu, Bin Li, Xiangyang Xue |
| 2023 | Weakly-supervised Video Scene Graph Generation via Unbiased Cross-modal Learning. Ziyue Wu, Junyu Gao, Changsheng Xu |
| 2023 | What2comm: Towards Communication-efficient Collaborative Perception via Feature Decoupling. Kun Yang, Dingkang Yang, Jingyu Zhang, Hanqi Wang, Peng Sun, Liang Song |
| 2023 | When Masked Image Modeling Meets Source-free Unsupervised Domain Adaptation: Dual-Level Masked Network for Semantic Segmentation. Gang Li, Xianzheng Ma, Zhao Wang, Hao Li, Qifei Zhang, Chao Wu |
| 2023 | When Measures are Unreliable: Imperceptible Adversarial Perturbations toward Top-k Multi-Label Learning. Yuchen Sun, Qianqian Xu, Zitai Wang, Qingming Huang |
| 2023 | When Perceptual Authentication Hashing Meets Neural Architecture Search. Yuanding Zhou, Xinran Li, Yaodong Fang, Chuan Qin |
| 2023 | Where and How: Mitigating Confusion in Neural Radiance Fields from Sparse Inputs. Yanqi Bao, Yuxin Li, Jing Huo, Tianyu Ding, Xinyue Liang, Wenbin Li, Yang Gao |
| 2023 | Where to Find Fascinating Inter-Graph Supervision: Imbalanced Graph Classification with Kernel Information Bottleneck. Hui Tang, Xun Liang |
| 2023 | Whether you can locate or not? Interactive Referring Expression Generation. Fulong Ye, Yuxing Long, Fangxiang Feng, Xiaojie Wang |
| 2023 | Who is Speaking Actually? Robust and Versatile Speaker Traceability for Voice Conversion. Yanzhen Ren, Hongcheng Zhu, Liming Zhai, Zongkun Sun, Rubing Shen, Lina Wang |
| 2023 | WormTrack: Dataset and Benchmark for Multi-Object Tracking in Worm Crowds. Zhiyu Jin, Hanyang Yu, Chen Haul, Linxiang Wang, Zuobin Zhu, Qiu Shen, Xun Cao |
| 2023 | YOGA: Yet Another Geometry-based Point Cloud Compressor. Junteng Zhang, Tong Chen, Dandan Ding, Zhan Ma |
| 2023 | Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination. Haoxuan Li, Yi Bin, Junrong Liao, Yang Yang, Heng Tao Shen |
| 2023 | Your tone speaks louder than your face! Modality Order Infused Multi-modal Sarcasm Detection. Mohit Tomar, Abhisek Tiwari, Tulika Saha, Sriparna Saha |
| 2023 | ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation. Bo Zhang, Jian Wang, Hui Ma, Bo Xu, Hongfei Lin |
| 2023 | Zero-Shot Image Retrieval with Human Feedback. Lorenzo Agnolucci, Alberto Baldrati, Marco Bertini, Alberto Del Bimbo |
| 2023 | Zero-Shot Learning by Harnessing Adversarial Samples. Zhi Chen, Peng-Fei Zhang, Jingjing Li, Sen Wang, Zi Huang |
| 2023 | Zero-Shot Learning for Computer Vision Applications. Sandipan Sarma |
| 2023 | Zero-Shot Object Detection by Semantics-Aware DETR with Adaptive Contrastive Loss. Huan Liu, Lu Zhang, Jihong Guan, Shuigeng Zhou |
| 2023 | Zero-TextCap: Zero-shot Framework for Text-based Image Captioning. Dongsheng Xu, Wenye Zhao, Yi Cai, Qingbao Huang |
| 2023 | Zero-shot Micro-video Classification with Neural Variational Inference in Graph Prototype Network. Junyang Chen, Jialong Wang, Zhijiang Dai, Huisi Wu, Mengzhu Wang, Qin Zhang, Huan Wang |
| 2023 | Zero-shot Skeleton-based Action Recognition via Mutual Information Estimation and Maximization. Yujie Zhou, Wenwen Qiang, Anyi Rao, Ning Lin, Bing Su, Jiaqi Wang |
| 2023 | mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM. Qinghao Ye, Haiyang Xu, Ming Yan, Chenlin Zhao, Junyang Wang, Xiaoshan Yang, Ji Zhang, Fei Huang, Jitao Sang, Changsheng Xu |
| 2023 | pmBQA: Projection-based Blind Point Cloud Quality Assessment via Multimodal Learning. Wuyuan Xie, Kaimin Wang, Yakun Ju, Miaohui Wang |
| 2023 | pyUDLF: A Python Framework for Unsupervised Distance Learning Tasks. Gustavo Leticio, Lucas Pascotti Valem, Leonardo Tadeu Lopes, Daniel Carlos Guimarães Pedronette |