| 2025 | 2S-DGM. Junjie Wu, Yumeng Fu, Nan Yu, Chen Gong, Guohong Fu |
| 2025 | 3D Human Motion Corpus Moment Retrieval via Multi-Granularity Semantic Alignment. Wenlong Wang, Dahua Gao, Pengfei He, Xinyu Liu, Danhua Liu |
| 2025 | 3D-Contrastive Anchors and Structure Enhancement for Multi-modal Representations. Mingkai Sheng, Jichao Wang, Yi Liu, Wen Cheng, Lingfang Zeng |
| 2025 | 3DGCoding: Novel Framework for 3D Gaussian Video Incremental Training and Coding. Peiheng Wang, Haodan Zhang, Quanlu Jia, Jiangkai Wu, Liming Liu, Haoyang Wang, Xinggong Zhang |
| 2025 | 3DGlobalFormer: Three Domain Global Feature Fusion in 3D Human Estimation. Tianyi Ma, Muqing Wu, Zijian Zhang |
| 2025 | A Deep Single Image Rectification Approach for Pan-Tilt-Zoom Cameras. Teng Xiao, Qi Hu, Qingsong Yan, Wei Liu, Zhiwei Ye, Fei Deng |
| 2025 | A Depth Semantic Perception Network for Camouflage Object Detection. Zijun Wei, Songlin Li, Xiuhong Li, Boyuan Li, Zhenhong Jia, Haochu Ku |
| 2025 | A Domain Generalization Framework Based on Wavelet-Driven Structural Enhancement and Contrastive Alignment. Yuheng Xu, Taiping Zhang, Yang Liu |
| 2025 | A Fourier priors-Guided Diffusion Model for Image Harmonization with Structure-Preservation and Illumination-Consistency. Tianyou Wang, Xun Cai, Yanbo Gao, Yibo Wang, Shuai Li |
| 2025 | A GAN Framework for Asymmetric Embedding Costs Learning in JPEG Steganography. Bohong Li, Weiqi Luo, Peijia Zheng, Shunquan Tan, Jiwu Huang |
| 2025 | A GAN-based Data Poisoning Backdoor Attack Method for Palmprint Recognition CNNs. Yuqi Wang, Bob Zhang |
| 2025 | A Generalizable and Expressive Meta-Diffusion Policy for RTC Bandwidth Prediction. Zhiyuan Chen, Nuowen Kan, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong |
| 2025 | A Knowledge Noise Mitigation Framework for Knowledge-based Visual Question Answering. Zhiyue Liu, Sihang Liu, Jinyuan Liu, Xinru Zhang |
| 2025 | A Low-Rank Defense Method for Adversarial Attack on Diffusion Models. Jiaxuan Zhu, Siyu Huang |
| 2025 | A Multi-Branch Network for Pose Trajectory Smoothing and Refinement. Panpan Chen, Ying Jiang, Haidong Hu, Chuangye Wang, Haolun Li, Hao Gao |
| 2025 | A Multi-Grained Perception Model for Sentiment Analysis with Perceived Contrastive Focal Loss. Jin Wei, Jiajie Lin, Zhenguo Yang, Haoran Xie, Fuqiang Yu, Xiaoping Li |
| 2025 | A Multi-Stage Framework for Multimodal Controllable Speech Synthesis. Rui Niu, Weihao Wu, Jie Chen, Long Ma, Zhiyong Wu |
| 2025 | A Multi-stage and Multi-target Knowledge Distillation Framework for Multimodal Conversational Emotion Recognition. Taiyu Niu, Geng Tu, Hui Wang, Bing Qin, Ruifeng Xu |
| 2025 | A Novel Differential Privacy Federated Learning Framework: An Adaptive Budget Allocation and Reversion Method. Xu Zhao, Gang Li, Jun Cai |
| 2025 | A Novel Framework for Realistic 3D Scene Regeneration with Graph of Thoughts. Yitian Kou, Kaiwei Zhang, Dandan Zhu, Xiongkuo Min, Guangtao Zhai |
| 2025 | A Novel Perspective on Leveraging Hubness in VAE for Eliminating Representative Shift Vectors in Few-Shot Learning. Quanlin Chen, Chunjin Ye, Yiming Ma, Jiahui Pan, Jingcong Li |
| 2025 | A Progressive Generation Framework with Speech Pre-trained Model for Expressive Voice Conversion. Tianrui Wang, Meng Ge, Zhikang Niu, Cheng Gong, Chunyu Qiang, Haoyu Wang, Zikang Huang, Ziyang Ma, Xiaobao Wang, Xie Chen, Longbiao Wang, Jianwu Dang |
| 2025 | A Refined ECG Delineation Framework Incorporating Single-Beat Mode and Conditional Random Field. Zhenqin Chen, Yuying Bao, Fengbo Wang, Yiwei Lin, Jinshan Xu |
| 2025 | A Semantic-Enhanced Heterogeneous Graph Learning Method for Flexible Objects Recognition. Kunshan Yang, Wenwei Luo, Yuguo Hu, Jiafu Yan, Mengmeng Jing, Lin Zuo |
| 2025 | A Simple and Better Baseline for Visual Grounding. Jingchao Wang, Wenlong Zhang, Dingjiang Huang, Hong Wang, Yefeng Zheng |
| 2025 | A Spatial-Frequency Domain Joint Mechanism Network for Cross-modal Semantic Segmentation. Yiheng Qu, Zhibing Zhang, Liqiang He |
| 2025 | A Synthetic-to-Real Dehazing Method based on Domain Unification. Zhiqiang Yuan, Jie Zhou, Jinchao Zhang |
| 2025 | A Temporal Modeling Framework for Video Pre-Training on Video Instance Segmentation. Qing Zhong, Peng-Tao Jiang, Wen Wang, Guodong Ding, Lin Wu, Kaiqi Huang |
| 2025 | A Unified Inverse-Tone-Mapped HDR Video Quality Assessment Method across Two HDR Formats. Leidong Fan, Xiongkuo Min, Qing Li, Anjie Wang |
| 2025 | A Watermark Updating Framework for Multi-stage Image Content Distribution. Yanyan Liu, Bin Liu, Jie Zhang, Xiang Zhang, Zehua Ma, Nenghai Yu |
| 2025 | A Zero Decoding Approach to Video Classification. Chen Ye Gan, Jiangtao Wen, Yuxing Han |
| 2025 | A-MESS: Anchor-based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition. Yaomin Shen, Xiaojian Lin, Wei Fan |
| 2025 | AAAD: Asynchronous Inter-Variable Relationship-Aware Anomaly Detection for Multivariate Time Series. Hongyi Liu, Xiaosong Huang, Mengxi Jia, Lingzhe Zhang, Tong Jia, Zhonghai Wu, Ying Li |
| 2025 | AADN++: Latent Feature Improves Adversarial Defense Transferability on Object Tracking. Zhewei Wu, Ruilong Yu, Shilin Qiu, Qihe Liu, Shijie Zhou, Zhun Zhang |
| 2025 | ABC-GS: Alignment-Based Controllable Style Transfer for 3D Gaussian Splatting. Wenjie Liu, Zhongliang Liu, Xiaoyan Yang, Man Sha, Yang Li |
| 2025 | ACCL: A Plug-and-play Adaptive Confusion-aware Contrastive Loss for UAV-to-Satellite Geolocalization. Yining Zhu, Zihao Deng, Jun Wang, Boxuan Li, Long Xiao, Jikun Shen, Yuan Yao |
| 2025 | ADoP: A Universal, Robust, Efficient, and Plug-and-Play Adversarial Example Detector. Rui Yang, Qindong Sun, Jiaming Cai, Jiangtao Yu |
| 2025 | AGFT-Tracker: Adaptive Game-Based PEFT for Object Tracking with PLMs. Mingyu Cao, Xihuai He, Xueqiong Li, Kedi Zhang, Yuhua Tang, Wanrong Huang, Huibin Tan |
| 2025 | AIM-VR: All-in-One Video Restoration via Dual-Path Mamba with Frequency Adaptive Fusion. Zhizhou Lu, Tianrui Liu, Zihan Chen, Junjie Huang, Xueqiong Li, Baili Xiao, Wentao Zhao |
| 2025 | AKVQ-VL: Attention-Aware KV Cache Adaptive 2-Bit Quantization for Vision-Language Models. Zunhai Su, Wang Shen, Linge Li, Zhe Chen, Hanyu Wei, Huangqi Yu, Kehong Yuan |
| 2025 | ALCReg: Active Label Correction for Partial Point Cloud Registration. Zongyi Xu, Xinqi Jiang, Xinyu Gao, Shanshan Zhao, Qianni Zhang, Weisheng Li, Xinbo Gao |
| 2025 | ALIVE: Asynchronous Lower Body Pose Estimation with Images, Visual-Inertial Odometry and Electromyography. Guoming Du, Zhen Ding, Xinrun Li, Su Wang, Wendi Peng, Hong Huang, Feng Jiang |
| 2025 | AMMSM: Adaptive Motion Magnification and Sparse Mamba for Micro-Expression Recognition. Xuxiong Liu, Tengteng Dong, Fei Wang, Weijie Feng, Xiao Sun |
| 2025 | AMS-Counter: Text-Guided Zero-shot Object Counting via Adaptive Multi-view Similarity-map. Cheng Qian, Jiwu Cao, Ying Mao, Kai Liu, Peng Zhu, Jun Sang |
| 2025 | AMUSE: Adaptive Multi-Segment Encoding for Dataset Watermarking. Saeed Ranjbar Alvar, Mohammad Akbari, David Ming Xuan Yue, Yong Zhang |
| 2025 | AS-Memory: Adaptive Sparse Memory Meeting Video-Language Models. Bimei Wang, Huilin Song, Jisheng Dang, Fei Shen, Hui Zhang, Liting Wang, Mangang Xie, Jizhao Liu, Jiasi Weng |
| 2025 | ASTAnet: Transformer-based Siamese Network for Robust Audio-to-Audio Alignment in Amateur User Generated Audio Clips. Malya Singh, Priyankar Choudhary, Abdulmotaleb El Saddik, Mukesh Saini |
| 2025 | ASimp: Automatic High-Poly 3D Mesh Simplification for Preprocessing Based on QoE. Lehao Lin, Hong Kang, Yuqi Shi, Haihan Duan, Abdulmotaleb El Saddik, Wei Cai |
| 2025 | ATD-AMSMamba: Improving Robustness of State Space Models for Multimodal Sentiment Analysis. Yahong Li, Zhanxun Dong, Zhou Fang, Lai Li |
| 2025 | ATM-NeRF: Learning Adaptive Tone Mapping for Normal-Light Neural Radiance Field Reconstruction. Min Wang, Xin Huang, Qing Wang |
| 2025 | AU-TTT: Vision Test-Time Training model for Facial Action Unit Detection. Bohao Xing, Kaishen Yuan, Zitong Yu, Xin Liu, Heikki Kälviäinen |
| 2025 | AVENet: Disentangling Features by Approximating Average Features for Voice Conversion. Wenyu Wang, Yiquan Zhou, Jihua Zhu, Hongwu Ding, Jiacheng Xu, Shihao Li |
| 2025 | AWUR: Adaptive Wavelet and Uncertainty Refinement for Semi-Supervised Medical Image Segmentation. Hailan Shen, Yuqi Li, Zailiang Chen, Hui Liu, Wenyan Zhong, Yudi Wang |
| 2025 | Accurate and Efficient Privacy-Preserving Image SURF Feature Extraction. Xiangyu Gao, Zhekai Luo, Peijia Zheng, Jian Li, Rui Yang |
| 2025 | Achieving Seamless Camouflage: Attention Fusion Diffusion Model for Image Synthesis. Hao Xi, Meiqin Liu, Zechen Yang, Ping Wei |
| 2025 | Achieving Zero-Glance Unlearning with Data-Free Inversion and Selective Parameters Suppression. Puwei Lian, Xiao Ke, Zhou Tan, Jianping Cai, Ximeng Liu |
| 2025 | Action Decomposition-based Actor-Critic for Supply Chain Optimization. Zhengrong Chen, Qinghua Zhu, An Zeng, Yuzhu Ji, Baoyao Yang, Dan Pan |
| 2025 | Active Object Tracking with Occluded Targets Estimation and Adversarial Reinforcement Learning. Zheng Chen, Wengang Zhou, Houqiang Li |
| 2025 | AdaMHF: Adaptive Multimodal Hierarchical Fusion for Survival Prediction. Shuaiyu Zhang, Xun Lin, Rongxiang Zhang, Yu Bai, Yong Xu, Tao Tan, Xubin Zheng, Zitong Yu |
| 2025 | Adapting Cross-Modal Semantic Discrepancy in Text-based Person Search. Xinpan Yuan, Jiabao Li, Wei Xia, Wenguang Gan, Mengxi Ying, Liujie Hua |
| 2025 | Adaptive Distribution-Aware Modeling for Transformer Tracking. Mingyu Cao, Huibin Tan, Xueqiong Li, Wanrong Huang, Kedi Zhang, Yuhua Tang, Shaowu Yang |
| 2025 | Adaptive Frequency Threshold Pooling for Mitigating Aliasing in Few-Shot Segmentation. Shangjing Chen, Feng Xu, Xin Lyu, Xin Li |
| 2025 | Adaptive Gaussian Mixture Model with Hierarchical Propagation for One-Class Graph Fraud Detection. Xiaoxiang Li, Xinyu Jiang, Zining Wang, Chang Liu, Zhibin Ni, Hai Wan, Xibin Zhao |
| 2025 | Adaptive Gradient Quantization with Bit Allocation for Distributed Deep Learning. Fei Gao, Xingyu Yan, Jian Jin, Wenhan Yang, Lingyu Duan, Zhuo Chen |
| 2025 | Adaptive Illumination Transfer Network for Shadow Removal. Yinan Wang, Si Wu, Yong Xu, Yan Huang, Patrick Le Callet |
| 2025 | Adaptive Mobile Agent for Dynamic Interactions. Yanda Li, Chi Zhang, Wenjia Jiang, Wanqi Yang, Bin Fu, Pei Cheng, Xin Chen, Meng Fang, Ling Chen, Yunchao Wei |
| 2025 | Adaptive Optimization Strategy for Semi-supervised Arbitrary-oriented Object Detection. Jiecong Chen, Chenlin Fu, Yingying Zhu |
| 2025 | Adaptive Pixel Classification and Equivalent Large Kernels for Lightweight Image Super-Resolution. Pengyu Lin, Xunxun Zeng, Wanling Liu, Huayi Chen, Fei Chen |
| 2025 | Adaptive Semantic Alignment for Automated Radiology Report Generation via Cross-Modal Knowledge Integration. Sibo Ju, Zhaozhen Chen, Yulong Xiao, Yiqing Shen, Yanzhou Su, Kai Chen, Xiangwen Liao |
| 2025 | Adaptive Semantic Compression: Compatible Bitstream for Scalable Human-Machine Perception Sample Adaption. Shaokang Wang, Dingquan Li, Guoqing Xiang, Jinchang Xu, Shanghang Zhang, Xiaodong Xie |
| 2025 | Adaptive Strategy Weighting with Fault Tolerant Localization for Object Navigation. Yanwei Zheng, Shaopu Feng, Bowen Huang, Changrui Li, Xiao Zhang, Dongxiao Yu |
| 2025 | Adaptive Training Meets Progressive Scaling: Elevating Efficiency in Diffusion Models. Wenhao Li, Xiu Su, Yu Han, Shan You, Tao Huang, Chang Xu |
| 2025 | Adaptive Weighted Parameter Fusion with CLIP for Class-Incremental Learning. Juncen Guo, Xiaoguang Zhu, Liangyu Teng, Hao Yang, Jing Liu, Yang Liu, Liang Song |
| 2025 | AdaptiveFusion: LiDAR-Camera Adaptive Fusion for 3D Object Detection. Yuhan Zhou, Xiaotian Li, Baojie Fan |
| 2025 | Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance. Yuanchao Li, Azalea Gui, Dimitra Emmanouilidou, Hannes Gamper |
| 2025 | Advanced Backdoor Threats and Countermeasures in Dataset Condensation. Canhui Wu, Wei Xi, Dashan Gao, He Yang, Jizhong Zhao |
| 2025 | Advancing Multi-Hop Question Answering via Alternating Retrieval and Reasoning over Multi-view Knowledge Integration. Mengchao Liu, Chao Yang, Bin Jiang, Chenglong Lei |
| 2025 | Advancing Safe Language Generation: Exploring Alternative Constrained RLHF. Fanyu Meng, Zhixin Bai, Yanming Wang, Jing Huo, Boyan Wang, Xi Yang, Yang Gao |
| 2025 | Adversarial Attacks and Robust Defenses in Speaker Embedding based Zero-Shot Text-to-Speech System. Ze Li, Yao Shi, Yunfei Xu, Ming Li |
| 2025 | Adversarial Examples Detection Based on Adversarial Attack Sensitivity. Cong Ming, Haojie Yuan, Xiangwen Wang, Qi Chu, Tao Gong, Bin Liu, Nenghai Yu |
| 2025 | Align-AV-HuBERT: AV-HuBERT with Audio-Visual Temporal Alignment. Cancan Li, Fei Su, Juan Liu |
| 2025 | AlignKT: Explicitly Modeling Knowledge State for Knowledge Tracing with Ideal State Alignment. Jing Xiao, Chang You, Zhiyu Chen |
| 2025 | An EEG Dataset with Subjective-Objective Perception Data for Assessing Stereoscopic Visual Discomfort Induced by 3D Motion Videos. Na Lu, Xiaojie Zhao, Li Yao |
| 2025 | An End-To-End Class-Aware Transformer Framework For Weakly-Supervised Semantic Segmentation. Wenzhe Gu, Kaiwen Li, Bin Zhang, Baosheng Liu |
| 2025 | An End-to-End Model for Photo-Sharing Multi-Modal Dialogue Generation. Peiming Guo, Sinuo Liu, Yanzhao Zhang, Dingkun Long, Pengjun Xie, Meishan Zhang, Min Zhang |
| 2025 | An Enhanced Palmprint Adversarial Attack Against Visible and Invisible Features. Jinrong Cui, Qiuli Zhang, Ziqi Wang, Jinghua Wang, Qi Zhu |
| 2025 | An Investigation on Audio-Prompt and Structure Guided Long-Duration Music Generation Based on Diffusion Models. Ziyu Zhao, Zilu Guo, Jun Du, Feng Ma, Jia Pan |
| 2025 | Analysing and Predicting Radiologists' Expertise Using Eye-Tracking Data: Insights for Diagnostic Decision-Making. Yueran Ma, Jiang Liu, Yixiao Li, Yingying Wu, Richard D. White, Phillip Wardle, Gualtiero Colombo, Padraig Corcoran, Wei Zhou, Hantao Liu |
| 2025 | AnoCLIP: Text-Guided Zero-shot Anomaly Localization via Self-Supervised Adaptation. Hanqiu Deng, Zhaoxiang Zhang, Jinan Bao, Xingyu Li |
| 2025 | AnyArtisticGlyph: Multilingual Controllable Artistic Glyph Generation. Xiongbo Lu, Yaxiong Chen, Shengwu Xiong |
| 2025 | Aparecium: Revealing Secrets from Physical Photographs. Zhe Lei, Jie Zhang, Jingtao Li, Tianwei Zhang, Haibin Kan, Weiming Zhang, Nenghai Yu |
| 2025 | ArtTypo: Multi-Level Controlled Artistic Typography with Iterative Feedback. Kaiyue Liu, Lei Wu, Mingzhe Yu, Xiaole Liu, Yajie Xu, Xiangxu Meng |
| 2025 | Aspect-attentioned Prompting for Multimodal Sentiment Analysis. Yutian Li, Jiaming Yang, Yiwen Hu, Lap-Kei Lee, Fu Lee Wang, Zhenguo Yang |
| 2025 | Assessing the Generalizability of Deep Models without Out-of-Distribution Data. Guoqing Zhu, Xiaojie Gan, Lingye Zhao, Luojun Lin |
| 2025 | Attribute-Guided Zero-Shot CLIP in Image Classification. Guoxi Qiu, Xiangyu Zhang, Yong Xu, Jinghua Wang |
| 2025 | Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification. Shijian Wang, Linxin Song, Ryotaro Shimizu, Masayuki Goto, Hanqian Wu |
| 2025 | Audio-Driven Emotion-Aware 3D Talking Face Generation from Single Image. Chun-Shuo Qiu, Feng-Lin Liu, Hongbo Fu, Fan Zhang, Yan-Pei Cao, Yu-Kun Lai, Lin Gao |
| 2025 | Audio-Driven Gesture Generation via Deviation Feature in the Latent Space. Jiahui Chen, Huan Yang, Runhua Shi, Chaofan Ding, Xiaoqi Mo, Siyu Xiong, Yinong He |
| 2025 | AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis. Dan Luo, Chengyuan Ma, Weiqin Li, Jun Wang, Wei Chen, Zhiyong Wu |
| 2025 | Automated Radiology Report Generation Based on Topic-Keyword Semantic Guidance. Jing Xiao, Hongfei Liu, Ruiqi Dong, Jimin Liu, Haoyong Yu |
| 2025 | Automatic Natural Image Matting via Dual Encoder Aggregation. Meng-Lun Yu, Wen-Jiin Tsai |
| 2025 | BAMNet: A Brain Area Mapping-Based Multimodal Saliency Prediction Method. Shibo Wang |
| 2025 | BEAR: A Video Dataset For Fine-grained Behaviors Recognition Oriented with Action and Environment Factors. Chengyang Hu, Yuduo Chen, Lizhuang Ma |
| 2025 | BEV-MMC: Bird's-Eye-View-Based Multimodal Compression for Enhanced Visual Recognition. Zhiwei Dong, Ying Liu |
| 2025 | BFPS: A Boundary-Focused Polyp Segmentation Model via Frequency Domain Separation. Wanqi Ma, Huanhuan Lv, Songru Jiang, Jiale Wu |
| 2025 | BI-RADS Boosted Breast Cancer Diagnosis With Masked Pretraining On Imbalanced Ultrasound Data. Xueqian Pang, Ziyun Li, Junhui Lv, Ruiquan Ge, Zhuoxuan Wu, Fei Gao |
| 2025 | BMCA: Weakly Supervised Semantic Segmentation via Beta Modulation and Cross-Modality Alignment. Ying Gao, Jing Lin, Wentian Cai, Yandan Chen, Zihao Huang, Zhiyong Xia |
| 2025 | BPCLIP: A Bottom-up Image Quality Assessment from Distortion to Semantics Based on CLIP. Chenyue Song, Chen Hui, Wei Zhang, Haiqi Zhu, Shaohui Liu, Hong Huang, Feng Jiang |
| 2025 | BanditRewriter: Training-free Adaptive Prompt Optimization for Text-to-Image Generation. Ao Shen, Yue Liu, Yanlei Shang |
| 2025 | Bayesian-Inspired Cross-Spectral Fusion Network for Robust Depth Estimation. Jiafu Yan, Wenwei Luo, Yuguo Hu, Changhua Zhang, Mengmeng Jing, Lin Zuo |
| 2025 | BeatFM: Improving Beat Tracking with Pre-trained Music Foundation Model. Ganghui Ru, Jieying Wang, Jiahao Zhao, Yulun Wu, Yi Yu, Nannan Jiang, Wei Wang, Wei Li |
| 2025 | Beckman Adversarial Defense. A. V. Subramanyam |
| 2025 | Beyond Macro-Actions: A Bio-Inspired Framework for Fine-Grained Micro-Action Recognition. Yiwei Ru, Churan Yu, Dongsen Zhang, Mupei Li, Yongji Liu, Zhaofeng He |
| 2025 | Beyond Multimodal Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization. Zhiyuan Zhao, Bin Wang, Linke Ouyang, Xiaoyi Dong, Jiaqi Wang, Conghui He |
| 2025 | Beyond Sliders: Mastering the Art of Diffusion-based Image Manipulation. Yufei Tang, Daiheng Gao, Pingyu Wu, Wenbo Zhou, Bang Zhang, Weiming Zhang |
| 2025 | Beyond Statistical Correlation: Causal Insights into Emotion Recognition. Tao Chen, Yanrong Guo, Shijie Hao, Richang Hong |
| 2025 | Beyond the Label: Unveiling Fairness through Dynamic Attribute Projections in Classification. Haoze Jiang, Zunlei Feng, Jiacong Hu, Binde Hu, Mingli Song, Yuanyu Wan |
| 2025 | Bi-Grid Reconstruction for Image Anomaly Detection. Huichuan Huang, Zhiqing Zhong, Guangyu Wei, Yonghao Wan, Wenlong Sun, Aimin Feng |
| 2025 | BiFD: A Bidirectional Feature Discrepancy Defense against Hijacking Attack in Split Learning. Xiaoyang Xu, Wenzhe Yi, Juan Wang, Yong Zhuang, Mengda Yang, Ziang Li, Yaxin Liu |
| 2025 | Bidirectional Feature Fusion and Adaptive Decision Network for Multimodal Fake News Detection. Dilxat Abdureyim, Bo Ma, Yating Yang, Rui Dong, YiDu Chen, Azmat Anwar, Lei Wang |
| 2025 | Bilateral Enhanced Complementary Network for Camouflaged Object Detection. Yejing Guo, Ziqi Wang, Xia Yuan, Chunxia Zhao |
| 2025 | Black-box Universal Adversarial Perturbations for Image and Video Quality Assessment Methods. Georgii Bychkov, Sergey Lavrushkin, Dmitriy S. Vatolin |
| 2025 | Blended-Target Domain Adaptation via Multi-Prompt Coordination Learning. Yuwu Lu, Yihan Yang |
| 2025 | Boosting Adversarial Transferability by Constructing Adversarial Trajectories. Qiang Wan, Sanshuai Cui, Anjie Peng, Hui Zeng, Rong Wei |
| 2025 | Boosting Audio-Visual Segmentation via Triple-Modalities Alignment. Yujian Lee, Peng Gao, Zailong Chen, Wentao Fan, Guquan Jing, Yiyang Hu |
| 2025 | Boosting Road Event Detection with Adaptive Multi-Modal Models. Linkai Liu, Xiaoyan Xiao, Yijian Yang, Yuchen Zhou, Zipeng Guo, Chao Gou |
| 2025 | Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization. Weifei Jin, Junjie Su, Hejia Wang, Yulin Ye, Jie Hao |
| 2025 | Brainstorming Brings Power to Large Language Models of Knowledge Reasoning. Zining Qin, Chenhao Wang, Jianxiong Guo, Huiling Qin, Weijia Jia |
| 2025 | Bridging the Gap: Balancing Human Perception and Detector Attention in Adversarial Attacks. Mingye Xie, Suncheng Xiang, Xian Gao, Ting Liu, Yuzhuo Fu |
| 2025 | Bridging the One-to-Many Gap: Multi-label Semantic Learning and Relay for Video Captioning. Shuqin Chen, Yikang Hu, Li Yang, Zhixin Sun, Liangjun Yu, Xian Zhong |
| 2025 | C3S3: Complementary Competition and Contrastive Selection for Semi-Supervised Medical Image Segmentation. Jiaying He, Yitong Lin, Jiahe Chen, Honghui Xu, Jianwei Zheng |
| 2025 | CA-Diff: Collaborative Anatomy Diffusion for Brain Tissue Segmentation. Qilong Xing, Zikai Song, Yuteng Ye, Yuke Chen, Youjia Zhang, Na Feng, Junqing Yu, Wei Yang |
| 2025 | CAMCKG: A Framework for Trigger-Action Recommendation Combining Attention Mechanism and Continuous Kernel Graph Convolution. Jiangfeng Li, Shijie Wang, Zijun Huang, Yifan Li |
| 2025 | CAP: An Advanced No-Reference Quality Assessment Method for AI-Generated 3D Meshes. Yingjie Zhou, Farong Wen, Zicheng Zhang, Yanwei Jiang, Jun Jia, Xiaohong Liu, Xiongkuo Min, Guangtao Zhai |
| 2025 | CAPAA: Classifier-Agnostic Projector-Based Adversarial Attack. Zhan Li, Mingyu Zhao, Xin Dong, Haibin Ling, Bingyao Huang |
| 2025 | CASA: Class-Agnostic Shared Attributes in Vision-Language Models for Efficient Incremental Object Detection. Mingyi Guo, Yuyang Liu, Zhiyuan Yan, Zongying Lin, Peixi Peng, Yonghong Tian |
| 2025 | CASD: Counterfactual Augmentation for Social Bot Detection on Twitter. Pin Xu, Fangfang Yuan, Yueshan Wang, Diandian Guo, Cong Cao, Yanbing Liu |
| 2025 | CCUP: A Controllable Synthetic Data Generation Pipeline for Pretraining Cloth-Changing Person Re-Identification Models. Yujian Zhao, Chengru Wu, Yinong Xu, Xuanzheng Du, Ruiyu Li, Guanglin Niu |
| 2025 | CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion. Boyuan Meng, Xiaohan Zhang, Peilin Li, Zhe Wu, Yiming Li, Wenkai Zhao, Beinan Yu, Hui-Liang Shen |
| 2025 | CDIQA: Collaborative Learning with Diffusion Extension for Semi-supervised Blind Image Quality Assessment. Xudong Wang |
| 2025 | CE-LoRA: Consistent Person Synthesis by Exploring the Model's Spatial Consistency. Delong Liu, Cheng Lei, Shuai Jiang, Zhicheng Zhao, Fei Su |
| 2025 | CEFW: A Comprehensive Evaluation Framework for Watermark in Large Language Models. Shuhao Zhang, Bo Cheng, Jiale Han, Yuli Chen, Zhixuan Wu, Changbao Li, Pingli Gu |
| 2025 | CFF: Coarse-to-Fine-to-Fusion Semantic Prototype Generation for Zero-Shot Classification. Yuting Lin, Xuanwen Su, Tengfei Liang, Yi Jin, Tao Wang, Yidong Li |
| 2025 | CFPER: Coarse-to-Fine Part-Experts Retrieval for Efficient Person Re-identification. Shiyu Wang, Mingming Lu |
| 2025 | CHRIS: Clothed Human Reconstruction with Side View Consistency. Dong Liu, Yifan Yang, Zixiong Huang, Yuxin Gao, Mingkui Tan |
| 2025 | CI-MER: A Novel Causal Intervention Framework For Micro-Expression Recognition. Xiqiao Fang, Qingfeng Wu, Lu Cao |
| 2025 | CLAP: Overcoming Language Priors via Contrastive Learning and Answer Perturbation. Haoquan Wang, Yong Chen, Shengbo Chen, Hong Rao |
| 2025 | CLEARSTR: Contextual Learning with Edge-guided and Adaptive-texture Reconstruction for Scene Text Removal. Sanhita Pathak, Vinay Kaushik, Brejesh Lall |
| 2025 | CLGC: Continuous Layout Guidance for Consistent Text-to-Video Editing. Xuancheng Xu, Ming Tao, Bing-Kun Bao |
| 2025 | CLIP Brings Better Features to Visual Aesthetics Learners. Liwu Xu, Jinjin Xu, Yuzhe Yang, Xilu Wang, Yi-Jie Huang, Yaqian Li |
| 2025 | CLIP Guided Multimodal Prototype Learning for One-Shot Semantic Segmentation. Yulei Jian, Lingma Sun, Xiaofeng Wang, Jin Tang |
| 2025 | CLIP-EBC: CLIP Can Count Accurately through Enhanced Blockwise Classification. Yiming Ma, Victor Sanchez, Tanaya Guha |
| 2025 | CLIP-based Robust Pedestrian Attribute Recognition via Attribute Localization and Data Augmentation. Yunpeng Zhou, Qiwen Liang, Xin Li, Jianping Ren, Liujinxiang Zhu, Shuhua Liu |
| 2025 | CLIP-driven Few-Shot Continual Learning. Ziqi Gu, Chunyan Xu, Zhen Cui |
| 2025 | CMRFusion: Efficient Feature Decomposition for RGB-T Fusion via Cross Modality Mask Reconstruction. Chao Yang, Chao Tian, Guoqing Zhu, Qiang Wang, Zhenyu He |
| 2025 | CPMDiff: Classifier Probability Measurement for Out-of-Distribution Detection via Diffusion Models. Yongheng Xu, Kaiyu Song, Hanjiang Lai |
| 2025 | CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency. Kangsheng Wang, Xiao Zhang, Juntao Lyu, Tianyu Hu, Huimin Ma |
| 2025 | CSDet: Clutter Suppression-Aided SAR Inshore Ship Detection Network. Yao Wang, Shuang Li, Ganggang Dong, Hongwei Liu |
| 2025 | CT-MIE: Computed Tomography Multi-Task Image Enhancement via Vision-Language Model. Yucheng Zeng, Aihua Mao, Xianghong Wang, Tianye Niu |
| 2025 | Can Drowsiness be seen in the eyes? A new detection method of driver drowsiness levels based on eye-tracking data. Runlin Zhang, Qing Xu, Yueming Zhu, Chuntie Chen |
| 2025 | Can MLLMs Tell Jokes Based on Images? A Visual Context-Driven Humor Generation Framework. Meixuan Chen, Chen Wang, Liu Hui, Yujun Wu, Ying Sha |
| 2025 | Causal Deconfounding for Spurious Correlation in Domain Generalization. Bin Qin, Yi Li, Jiangmeng Li, Xuesong Wu, Yupeng Wang, Jianwen Cao |
| 2025 | Causal Intervention with Active Learning for Large Vision-Language Models in Egocentric Contexts. Wenxin Meng, Shenshen Li, Lei Wang, Hao Yang, Chong Peng, Peng Yan, Xing Xu |
| 2025 | Center-Oriented Prototype Contrastive Clustering. Shihao Dong, Xiaotong Zhou, Yuhui Zheng, Huiying Xu, Xinzhong Zhu |
| 2025 | Challenging Dataset and Multi-Modal Gated Mixture of Experts Model for Remote Sensing Copy-Move Forgery Understanding. Ze Zhang, Enyuan Zhao, Yi Jiang, Jie Nie, Xinyue Liang |
| 2025 | Characterizing High-order Interactions between Eye Movement and Head Motion Variables in Augmented Reality-based Navigation Experience. Qing Xu, Shunbo Wang, Yunxiang Jiang, Simon Parkinson, Klaus Schoeffmann, Chuntie Chen |
| 2025 | Chinese-LiPS: A Chinese Audio-Visual Speech Recognition Dataset with Lip-Reading and Presentation Slides. Jinghua Zhao, Yuhang Jia, Shiyao Wang, Jiaming Zhou, Hui Wang, Yong Qin |
| 2025 | Clouds and Haze Co-Removal Based on Saliency-Guided Multi-Scale Diffusion Model for Remote Sensing Images. Jingxuan Zhang, Libao Zhang |
| 2025 | CoDiff-SaK: Controllable Diffusion Model with Segment Anything Knowledge for Low-dose CT Image Denoising. Fenghang Zhang, Guang Feng, Xizhan Gao, Wanying Wu, Sijie Niu |
| 2025 | Coarse-To-Fine Graph Reasoning for 3D Hand Mesh Reconstruction. Dan Fu, Wai Keung Wong, Lunke Fei, Tingting Chai, Yuzhu Ji, Qinghua Zhu |
| 2025 | Coding-Free Multiscale Latent Variables for Lossless Point Cloud Attribute Compression. Qiang Xu, Lixuan Meng, Guangjie Zhang, Wei Gao, Ge Li |
| 2025 | Cognitive Inspired Generalization Boosting for Face Forgery Detection. Yunwen Huang, Hua Yang |
| 2025 | Combating the Negative Optimization in Source-Free Domain Adaptive Medical Image Segmentation via Selective Online Self-Training. Wenjuan Zhou, Wei Chen, Yulin He, Chen Li |
| 2025 | Complementary Multi-dimensional Variance Attention Learning for 3D Human Mesh Reconstruction from Videos. Tuo Xiong, Suping Wu, Xiang Zhang, Ruijie Peng, Bing Wang, Xitie Zhang, Zhijian Duan |
| 2025 | Component Adaptive Clustering for Generalized Category Discovery. Mingfu Yan, Jiancheng Huang, Yifan Liu, Shifeng Chen |
| 2025 | Compositional Text-Modality Completion Model for Partially Relevant Video Retrieval. Yi Pan, Yujia Zhang, Xiaoguang Zhao |
| 2025 | Compression Metadata-assisted RoI Extraction and Adaptive Inference for Efficient Video Analytics. Chengzhi Wang, Peng Yang |
| 2025 | Computational Measures of Gaze Behavior Using the Concept of Situational Awareness. Yunxiang Jiang, Qing Xu, Aoxing Xu, Simon Parkinson, Klaus Schoeffmann, Chuntie Chen |
| 2025 | ConAvatar: Harnessing Facial Mesh for Controllable Avatar Animation. Zhen Tan, Wei Wei |
| 2025 | Concept-Centric Learning for Weakly-Supervised Temporal Sentence Grounding. Yaru Zhang, Haichao Shi, Xiaoyu Zhang |
| 2025 | Concretely Efficient Three-party Oblivious Selection. Shang Song, Lin Liu, Rongmao Chen, Wei Peng |
| 2025 | Conditional Residual Coding with Explicit-Implicit Temporal Buffering for Learned Video Compression. Yi-Hsin Chen, Kuan-Wei Ho, Martin Benjak, Jörn Ostermann, Wen-Hsiao Peng |
| 2025 | Confidence Breeds Success: Improving Fake News Video Detection via LVLM-Assisted Inference. Yuchen Zhang, Mingxin Li, Chao Gao, Xianghua Li |
| 2025 | Confidence-Aware Self-Distillation for Multimodal Sentiment Analysis with Incomplete Modalities. Yanqian Luo, Shijin Wang, Zhongxing Xu, Yulong Li, Feilong Tang, Jionglong Su |
| 2025 | Consistency Change Detection Framework for Unsupervised Remote Sensing Change Detection. Yating Liu, Yan Lu |
| 2025 | Consolidating Selective SSM with Spatial-Angular and Bidirectional Structural Fusion Perception for Light Field Semantic Segmentation. Wenbin Yan, Qingwei Wu, Hua Chen, Xiaogang Zhang, Shengjie Hu |
| 2025 | Construct a Powerful Discriminative Relationship for Few-Shot Action Recognition. Qianhan Tang, Yanan Liu, Ningxin Wang, Kangjian He, Hao Zhang, Dan Xu |
| 2025 | Content-Adaptive Motion Compensated Temporal Filter for Versatile Video Coding. Yunrui Jian, Yi Xue, Yue Huang, Xueli Cheng, Weilun Feng, Zhenan Lin, Chao Zhou |
| 2025 | Content-Style Disentangled Audio Style Transfer via Diffusion Model. Yiran Wang, Jiasheng Lu, Jun Chen, Xinyu Zhang, Yingshan Liang, Zhicheng Du, Qingyang Shi, Shao-Lun Huang |
| 2025 | Context Consistency Learning via Sentence Removal for Semi-Supervised Video Paragraph Grounding. Yaokun Zhong, Siyu Jiang, Jian Zhu, Jian-Fang Hu |
| 2025 | Context-Enhanced Zero-Shot Video Temporal Grounding with Adaptive Boundary Refinement. Fangkai Li, Hao Hu, Feiyu Pan, Yanzhen Wang, Yiyou Guo, Xiankai Lu |
| 2025 | Contextualizing Borderline ECG Analysis via Multi-Modal Feature Extraction and Large Language Model Inference. Yanlin Xu, Yiwei Ru, Dongsen Zhang, Yongji Liu, Zhenan Sun |
| 2025 | Continuous Lane Detection Network with Hybrid Feature Fusion and Differential Aggregation. Zhiqiang Zeng, Longpei Wu, Xiaodong Wang, Fei Yan, Haiyan Huang |
| 2025 | Contrastive Adversarial Learning for Region-Aware Weakly Annotated Object Segmentation in Hazy Remote Sensing Images. Wanning Zhu, Libao Zhang |
| 2025 | Contrastive Intent-Disentangled Variational AutoEncoder for Sequential Recommendation. Yafan Yuan, Zhen Liu, Xinxin Yang, Sibo Lu |
| 2025 | Contrastive Invariant Risk Minimization for Grounded Situation Recognition. Zhaoquan Yuan, Chengbin Zhao, Yuting Tang, Lishu Guo, Xiao Wu, Changsheng Xu |
| 2025 | ContrastiveGaussian: High-Fidelity 3D Generation with Contrastive Learning and Gaussian Splatting. Junbang Liu, Enpei Huang, Dongxing Mao, Hui Zhang, Xinyuan Song, Yongxin Ni |
| 2025 | Controllable Continual Test-Time Adaptation. Ziqi Shi, Fan Lyu, Ye Liu, Fanhua Shang, Fuyuan Hu, Wei Feng, Zhang Zhang, Liang Wang |
| 2025 | Controllable Expressive 3D Facial Animation via Diffusion in a Unified Multimodal Space. Kangwei Liu, Junwu Liu, Xiaowei Yi, Jinlin Guo, Yun Cao |
| 2025 | Coordinated Uni-modal Assistance for Enhancing Multi-modal Learning. Hongpeng Pan, Yang Yang |
| 2025 | Corer: Concept Residue Erasing in Text-to-Image Diffusion Models. Yufan Liu, Jinyang An, Huashan Chen, Wanqian Zhang, Ming Li, Dayan Wu, Jingzi Gu, Zheng Lin, Weiping Wang |
| 2025 | CosGaussian: Towards Text-to-3D Semantically Controllable 3D Object Style Transfer with Gaussian Splatting. Wendong Li, Gaojie Wu, Xiang Huang, Wei-Shi Zheng |
| 2025 | Counterfactual-Augmented Representation Learning based Event Prediction. Cheng Hu, Fangfang Yuan, Cong Cao, Pu Li, Guangjie Zeng, Yanbing Liu, Hao Peng, Philip S. Yu |
| 2025 | Create Anything Anywhere: Layout-Controllable Personalized Diffusion Model for Multiple Subjects. Wei Li, Hebei Li, Yansong Peng, Siying Wu, Yueyi Zhang, Xiaoyan Sun |
| 2025 | Cross Knowledge Distillation between Artificial and Spiking Neural Networks. Shuhan Ye, Yuanbin Qian, Chong Wang, Sunqi Lin, Jiazhen Xu, Jiangbo Qian, Yuqi Li |
| 2025 | Cross-Modal Semantic-Aware Network for Audio-Visual Event Localization. Liang Liu, Shuaiyong Li, Yongqiang Zhu |
| 2025 | Cross-Modal Task Verification via Hypergraph-based Sequential Matching. Zhiyi Huang, Xun Jiang, Zheng Wang, Fumin Shen, Jingkuan Song, Xing Xu |
| 2025 | Cross-Structure and Semantic Enhancement for Diabetic Retinopathy Grading. Xue Xia, Zipeng Lin, Jingying Zhu, Jiebin Yan, Yuming Fang |
| 2025 | Cross-View Neighborhood Contrastive Multi-View Clustering with View Mixup Feature Learning. Yixuan Ye, Yang Zhang, Liang Peng, Rui Li, Cheng Liu, Si Wu, Hau-San Wong |
| 2025 | Cross-modal Shared Concept Learning for Text-to-Image Person Retrieval. Di He, Xinshan Zhu, Lan Zhang, Siyu Wang, Zhong Zhang |
| 2025 | CrossMuSim: A Cross-Modal Framework for Music Similarity Retrieval with LLM-Powered Text Description Sourcing and Mining. Tristan Tsoi, Jiajun Deng, Yaolong Ju, Benno Weck, Holger Kirchhoff, Simon Lui |
| 2025 | Culture-based Adversarial Attack on Text-to-Image Models. Fuyi Yang, Chenyu Zhang, Lanjun Wang |
| 2025 | Customizing Image Codecs for Text-Rich Screen Content with Plugin Processing Networks. Hao Wang, Junyan Huo, Shuai Wan, Kun Yang, Gaoxing Chen, Fuzheng Yang |
| 2025 | D2AD: Diffusion Distillation for Unsupervised Image Anomaly Detection. Yuheng Shao, Zhangkai Ni, Qinyuan Liu |
| 2025 | DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion. Yuchen Guo, Ruoxiang Xu, Rongcheng Li, Weifeng Su |
| 2025 | DAG-AFL: Directed Acyclic Graph-based Asynchronous Federated Learning. Shuaipeng Zhang, Lanju Kong, Yixin Zhang, Wei He, Yongqing Zheng, Han Yu, Lizhen Cui |
| 2025 | DAGait: Generalized Skeleton-Guided Data Alignment for Gait Recognition. Zhengxian Wu, Chuanrui Zhang, Hangrui Xu, Peng Jiao, Haoqian Wang |
| 2025 | DAPL: Integration of Positive and Negative Descriptions in Text-Based Person Search. Yuchuan Deng, Zhanpeng Hu, Zijie Xin, Chuang Deng, Qijun Zhao |
| 2025 | DATE: Dual Asymmetric Textual Embedding guided Person Re-Identification. Pengqi Yin, Hantao Yao, Changsheng Xu |
| 2025 | DATTA: Domain Diversity Aware Test-Time Adaptation for Dynamic Domain Shift Data Streams. Chuyang Ye, Dongyan Wei, Zhendong Liu, Yuanyi Pang, Yixi Lin, Qinting Jiang, Jingyan Jiang, Dongbiao He |
| 2025 | DB-NeRF: An Effective Dual-Branch Representation for Neural Radiance Fields. Hailan Shen, Yixiang Jiang, Zailiang Chen, Xujing Liu, Jian Zhang |
| 2025 | DBE: Dual Branch re-Extraction for Unseen Diffusion-Generated Image Detection. Shixiang Cai, Liangzhen Liu, Zhirui Kuai, Li Kuang, Lingyan Zhang |
| 2025 | DCCL: Discriminative Cosine Center Learning for 3D Cross-Modal Retrieval with Real-world Image. Zengyu Liu, Zhitao Liu, Yi Li, Zhenjiang Du, Lei Zhang, Ning Xie |
| 2025 | DCGNet: Detail and Context Guided Small Object Detection Network with Decoupled Detection Head. Yixin Qiao, Shiyong Lan, Wenwu Wang, Haohan Chen, Yao Li, Guonan Deng |
| 2025 | DCSA-UNet: Lightweight UNet with Dual Cross-Shaped Attention For Skin Lesion Segmentation. Boyu Chen, Lu Han, Zherui Zhang, Li Guo, Shibiao Xu |
| 2025 | DEQuant: Distribution-Enhanced Reconstruction for Post-Training Quantization. Guoming Lu, Guodong Zou, Dongnan Liu, Heng Yin, Jielei Wang, Guangchun Luo |
| 2025 | DF-Net: A Dual Fusion Network for Accurate Video Temporal Grounding. Haolong Yan, Binghao Tang, Boda Lin, Jiachen Li, Si Li |
| 2025 | DFDUN: Deep Infrared and Visible Image Fusion with Diffusion Prior Unfolding Network. Maoyi Xiong, Jun-Jie Huang, Zihan Chen, Tianrui Liu, Xueqiong Li, Lin Liu, Wentao Zhao, Yuhua Tang |
| 2025 | DLLM: Enhancing Open-World Object Detection with Dynamic Learning and Large Models. Yangyang Huang, Xing Xi, Ronghua Luo |
| 2025 | DLVQA: A Dynamic Loss Approach For Visual Question Answering with Language Biases. Shuocheng Wang, Zhenzhen Wang, Qingfeng Wu |
| 2025 | DMDH: Decentralized Multi-agent Distributed Hashing for Multimedia Retrieval. Yunfei Chen, Yitian Long, Zhan Yang, Jun Long |
| 2025 | DMDM: Photorealistic Face Age Transformation by Dual-Modal Collaborative Attention using Diffusion Models. Zepeng Su, Zhulin Liu, Zongyan Zhang, Tong Zhang, C. L. Philip Chen |
| 2025 | DPCD: A Quality Assessment Database for Dynamic Point Clouds. Yating Liu, Yujie Zhang, Qi Yang, Yiling Xu, Zhu Li, Ye-Kui Wang |
| 2025 | DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing. Xiaolong Wang, Zhiqi Cheng, Jue Wang, Huizi Xue, Xiaojiang Peng |
| 2025 | DRMOE: Towards Better Mixture of Experts via Dual Routing Strategy. Haiyang Liu, Shaojian Qiu, Hai Lin, Yingjie Kuang, Shunpeng Li |
| 2025 | DRTNet: Diffusion Reconstruction Texture Network for AI-generated Image Detection. Qian Yao, Jun-Jie Huang, Yongjun Wang, Zihan Chen |
| 2025 | DTAD: A Distribution-Transformed Supervised Anomaly Detection Method. Lingxing Chen, Yang Gu, Yi Guo, Jianqi Chen, Yingting Zhu, Yehong Zhuo, Dongmei Jiang, Yiqiang Chen |
| 2025 | DTSNet: A Denoising Teacher-Student Network with Reverse Distillation for Anomaly Detection. Taixiang Lin, Shuyuan Lin, Yanjie Liang, Rong Chen, Yang Lu |
| 2025 | DUPL: Domain-agnostic Unknown-aware Prompt Learning for Threshold-free Open-set Domain Generalization. Fangbin Xu, Dongyue Chen, Shizhuo Deng, Tong Jia, Hao Wang |
| 2025 | DWS-FedSeg: A Federated Learning Framework for Automatic Segmentation of CT and MRI Images. Yunhe Feng, Lingren Wang, Jiaxin Wang |
| 2025 | Dancing with Noise: Advancing Generative Speech Enhancement with Distribution Augmentation. Yue Lei, Siqi Yang, Wenxin Tai, Xueting Liu, Ting Zhong, Fan Zhou |
| 2025 | Data-Free Knowledge Distillation with Diffusion Models. Xiaohua Qi, Renda Li, Long Peng, Qiang Ling, Jun Yu, Ziyi Chen, Peng Chang, Mei Han, Jing Xiao |
| 2025 | Dataset Pruning: Optimizing Image Datasets with a Cross-Validation Method. Yanmin Chen, Shuo Wang, Mengyao Zhou, Chenglin Liu, Jun Luo |
| 2025 | Dataset Quantization Augmentation: Improving Dataset Compression Through Complexity-Guided Sampling and Augmentation. Ziyang Li, Qin Liu, Fengshan Zhao, Yujie Wang, Takeshi Ikenaga |
| 2025 | Decoding Emotional Silences: Reliable Multimodal Sentiment Analysis with Bipolar Uncertainty. Yutao Wei, Hongzhu Fu, Yuxiang Li, Yichen Xin, Xovee Xu, Fan Zhou, Ting Zhong |
| 2025 | Decoupled and Interactive Regression Modeling for High-performance One-stage 3D Object Detection. Weiping Xiao, Yiqiang Wu, Yu Qin, Chenghai Mao, Jia Liu, Xiaomao Li |
| 2025 | Decoupling Overlapped Feature Spaces: When Continual Learning Meets Fine-Grain Classification. Zhikun Feng, Mingyu Wu, Ping Kuang, Kang Dang, Mian Zhou, Liu Yu |
| 2025 | Decoupling Representations with Quantized Vectors for Semi-Supervised Action Quality Assessment. Lingfeng Ye, Kumie Gedamu, Jie Shao |
| 2025 | Defect Detection-Guided Reconstruction Network for Ground Penetrating Radar B-Scan Images. Zilong Ling, Xinran Zhong, Siyu Zhou, Yu Yang, Zhongcheng Gui, Huabin Wang |
| 2025 | Degradation-Aware Multi-Task Image Restoration with State Space Models. Tao Wu, Purui Bai, Huaibo Huang, Jie Cao, Yuang Ai, Ran He |
| 2025 | Delight-UPS: Uncalibrated Photometric Stereo via Diffusion Model-Based Relighting. Zhenyu Qiao, Jiajun Sun, MingYun He, Liu Yu, Rui Zhou, Ping Kuang |
| 2025 | Denoising Diffusion Probabilistic Model for Point Cloud Compression at Low Bit-Rates. Gabriele Spadaro, Alberto Presta, Jhony H. Giraldo, Marco Grangetto, Wei Hu, Giuseppe Valenzise, Attilio Fiandrotti, Enzo Tartaglione |
| 2025 | Detecting AI-Generated Video via Frame Consistency. Long Ma, Zhiyuan Yan, Qinglang Guo, Yong Liao, Haiyang Yu, Pengyuan Zhou |
| 2025 | Determined Multi-Label Learning via Similarity-Based Prompt. Meng Wei, Zhongnian Li, Peng Ying, Ridong Han, Tongfeng Sun, Xinzheng Xu |
| 2025 | DiBAN: Dual-Drive Broad Attentive Network for Speech Emotion Recognition. Gongli Zhang, C. L. Philip Chen, Tong Zhang, Zhulin Liu, Xiaoman Hu, Bianna Chen |
| 2025 | Dialogue Director: Bridging the Gap in Dialogue Visualization for Multimodal Storytelling. Min Zhang, Zilin Wang, Liyan Chen, Kunhong Liu, Juncong Lin |
| 2025 | DialogueAgents: A Hybrid Agent-Based Speech Synthesis Framework for Multi-Party Dialogue. Xiang Li, Duyi Pan, Hongru Xiao, Jiale Han, Jing Tang, Jiabao Ma, Wei Wang, Bo Cheng |
| 2025 | Diff-Art: Category-level Articulation Pose Estimation via Conditional Diffusion. Yukang Huo, Xianhui Meng, Li Zhang, Haonan Jiang, Yan Zhong, Mingyuan Yao, Haihua Wang |
| 2025 | Diff-Cleanse: Identifying and Mitigating Backdoor Attacks in Diffusion Models. Hao Jiang, Jin Xiao, Xiaoguang Hu, Tianyou Chen, Jiajia Zhao |
| 2025 | DiffDeid: High-Quality Face De-identification and Recovery via Diffusion Inversion. Zheyuan Liu, Jun Jia, Hongyi Miao, Yiwei Yang, Yanwei Jiang, Yingjie Zhou, Zhi Liu, Guangtao Zhai |
| 2025 | DiffLane: Diffusion Model-Based Lane Mask Generation for Accurate Video Lane Detection. Wenxiang Liu, Yongkang Liu, Weiliang Meng, Gaoqi He, Jianhua Li |
| 2025 | DiffMissing: Denoising Diffusion Model for Multivariate Time Series Forecasting with Variable Missing. Bingheng Pang, Wei Li, Zhuoxuan Liang, Yidan Chen, Zhihong Wang, Moustafa Youssef |
| 2025 | Diffusion-Based Hierarchical Image Steganography. Youmin Xu, Xuanyu Zhang, Xiandong Meng, Chong Mou, Jian Zhang |
| 2025 | Diffusion-Driven Source Consistency for Gradual Domain Adaptation. Wenwei Luo, Yuguo Hu, Jiafu Yan, Mengmeng Jing, Lin Zuo |
| 2025 | DiffusionTalker: Efficient and Compact Speech-Driven 3D Talking Head via Personalizer-Guided Distillation. Peng Chen, Xiaobao Wei, Ming Lu, Hui Chen, Feng Tian |
| 2025 | Direct Preference Optimization for LLM-Enhanced Recommendation Systems. Chao Sun, Yaobo Liang, Yaming Yang, Shilin Xu, Tianmeng Yang, Yunhai Tong |
| 2025 | Discrimination-based Method for Image Object Detection with Random Distinct Proposals. Jingzhi Zhang, Chengjie Bai |
| 2025 | DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model. Kangwei Liu, Junwu Liu, Yun Cao, Jinlin Guo, Xiaowei Yi |
| 2025 | Distraction Suppression and Feature Modulation Network for Camouflaged Object Detection. Han Lyu, Meijun Sun, Haowei Ran, Yipu Liu, Xinyu Yan, Zheng Wang |
| 2025 | Distributed Cloud-Edge Scheduling for Multimedia Data Requests: A MARL Approach. Chong Geng, Zhen Liu, Yannan Wang, Yiran Li |
| 2025 | Distribution-Aware Hadamard Quantization for Hardware-Efficient Implicit Neural Representations. Wenyong Zhou, Jiachen Ren, Taiqiang Wu, Yuxin Cheng, Zhengwu Liu, Ngai Wong |
| 2025 | Diverse Audio Caption Generation with Semantic-aware Diffusion Model. Hualei Wang, Yiming Li, Hong Liu, Xiangdong Wang |
| 2025 | Divide-And-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Splatting. Zhiying Yan, Yiyuan Liang, Shilv Cai, Tao Zhang, Sheng Zhong, Luxin Yan, Xu Zou |
| 2025 | Domain Generalization via Discrete Codebook Learning. Shaocong Long, Qianyu Zhou, Xi Jiang, Chenhao Ying, Lizhuang Ma, Yuan Luo |
| 2025 | Double-Shrink: Enhancing Model Robustness under SDN Noise by Reducing Uncertain Confidence. Naihao Wang, Can Zhang, Yunfeng Liu, Wentao Chen, Ruirui Li |
| 2025 | DreamAnimate: Temporal Consistency and Detail Preservation for Character Animation. Lulu Tian, Hongxun Yao, Zhaopan Xu, Jiankun Zhu, Xi Chen, Yuxin Hou |
| 2025 | DreamPBR: Text-driven High-Resolution SVBRDF Generation with Multimodal Guidance. Linxuan Xin, Zheng Zhang, Zhiyi Pan, Jinfu Wei, Duan Gao, Wei Gao |
| 2025 | DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling. Yueming Zhao, Xuening Yuan, Hongyu Yang, Di Huang |
| 2025 | DuMo: A Dual-Model Framework for Effective Long-tailed Object Detection. Chenbo Zhang, Yinglu Zhang, Jihong Guan, Shuigeng Zhou |
| 2025 | Dual Information Speech Language Models for Emotional Conversations. Chun Wang, Chenyang Liu, Wenze Xu, Weihong Deng |
| 2025 | Dual Mutual Information-Driven Multimodal Recommendation with Denoising Graph Autoencoder. Mengduo Yang, Jie Zhou, Meng Xi, Xiaohua Pan, Ying Li, Yangyang Wu, Jinshan Zhang, Jianwei Yin |
| 2025 | Dual-Branch Attention Network for Salient Object Detection in Optical Remote Sensing Images. Yaqian Wang, Chunyang Ma, Yumei Tong, Liejun Wang, Panpan Zheng |
| 2025 | Dual-Domain Iterative Refinement Network for Camouflaged Object Detection. Qingzheng Wang, Ning Li, Jiazhi Xie |
| 2025 | DyPho-SLAM: Real-time Photorealistic SLAM in Dynamic Environments. Yi Liu, Keyu Fan, Bin Lan, Houde Liu |
| 2025 | DynaGS-SLAM: Robust Dynamic SLAM with 3D Gaussian Splatting. Ziyi Huang, Binbin Yan, Dongliang Wang, Jinglun Feng, Shuo Chen, Xiangcheng Yi |
| 2025 | DynaSplat: Dynamic-Static Gaussian Splatting with Hierarchical Motion Decomposition for Scene Reconstruction. Junli Deng, Ping Shi, Qipei Li, Jinyang Guo |
| 2025 | Dynamic Feature-Focusing with Cross-Modal Semantic Alignment for Video Moment Retrieval and Highlight Detection. Xuehui Liang, Ruomei Wang, Baoquan Zhao, Jiawei Feng |
| 2025 | Dynamic Importance in Diffusion U-Net for Enhanced Image Synthesis. Xi Wang, Ziqi He, Yang Zhou |
| 2025 | Dynamic Token Selective Transformer for Aerial-Ground Person Re-Identification. Yuhai Wang, Maryam Pishgar |
| 2025 | Dynamic Weighting Loss for Decision Boundary Adjustment based on Robust Distance in Adversarial Training. Yiqun Xu, Zhen Wei, Zhehao Li, Xing Wei, Yang Lu |
| 2025 | DynamicGaussian: Spatio-temporally Consistent 4D Gaussian Splatting for High-Fidelity Monocular Videos Reconstruction. Wenhao Dong, Youwen Yuan, Bowen Zhang, Xi Zhao |
| 2025 | EAV-Mamba: Efficient Audio-Visual Representation Learning for Weakly-Supervised Temporal Action Localization. Quan Zhang, Jinwei Fang, Yuxin Qi, Mingyang Wan, Guojun Ma, Ke Zhang, Chun Yuan |
| 2025 | ECAIF: Efficient Context Aware Information Fusion Network for Medical Image Segmentation. Luyao Ren, Wenxin Yu, Zhiqiang Zhang, Chang Liu, Jun Gong |
| 2025 | ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis. Yubao Zhao, Jiaju Kang, Tian Zhang, Puyu Han, Tong Chen |
| 2025 | EFDiT: Efficient Fine-grained Image Generation Using Diffusion Transformer Models. Kun Wang, Donglin Di, Tonghua Su, Lei Fan |
| 2025 | EG-Gaussian: Epipolar Geometry and Graph Network Enhanced 3D Gaussian Splatting. Beizhen Zhao, Yifan Zhou, Zijian Wang, Hao Wang |
| 2025 | EIAD: Explainable Industrial Anomaly Detection Via Multi-Modal Large Language Models. Zongyun Zhang, Jiacheng Ruan, Xian Gao, Ting Liu, Yuzhuo Fu |
| 2025 | EMGPose: An Efficient Multi-Granularity Representation for Human Pose Estimation. Guonan Deng, Shiyong Lan, Wenwu Wang, Yixin Qiao, Yao Li, Haohan Chen, Hongyu Yang |
| 2025 | EPIC: Efficient Prompt Interaction for Text-Image Classification. Xinyao Yu, Hao Sun, Zeyu Ling, Ziwei Niu, Zhenjia Bai, Rui Qin, Yen-Wei Chen, Lanfen Lin |
| 2025 | ES-Parkour: Advanced Robot Parkour with Bio-inspired Event Camera and Spiking Neural Network. Qiang Zhang, Jiahang Cao, Jingkai Sun, Yecheng Shao, Gang Han, Wen Zhao, Yijie Guo, Renjing Xu |
| 2025 | ESTI: An Efficient Spatial-Temporal Interaction Network For Video-Based Person Re-Identification. Guquan Jing, Peng Gao, Yiyang Hu, Yujian Lee, Hui Zhang |
| 2025 | ESTJ: Efficient Semantic Segmentation via Token Joint Merging. Ziniu Liu, Mingqing Liu, Fengxia Han, Xingtong Liu, Chuan Liu, Xi Zhang, Hao Deng, Shengjie Zhao |
| 2025 | ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos. Xilei Zhu, Huiyu Duan, Liu Yang, Yucheng Zhu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet |
| 2025 | ET-Talk: Effective Training Strategy to Enhance Synchrony and Fidelity for Talking Face Generation. Baiqin Wang, Xiangyu Zhu, Fan Shen, Hao Xu, Shukai Chen, Zhen Lei |
| 2025 | EarlyMix: Hierarchical Mixing for Early Time Series Classification. Shuguo Hu, Jun Hu, Junwei Lv, Huaiwen Zhang |
| 2025 | EasySplat: View-Adaptive Learning makes 3D Gaussian Splatting Easy. Ao Gao, Luosong Guo, Tao Chen, Zhao Wang, Ying Tai, Jian Yang, Zhenyu Zhang |
| 2025 | Edge and Localization Feature Guidance Network for Accurate Polyp Segmentation. Yulong Bai, Songlin Li, Xiuhong Li, Kuan Wang, Rong Wan, Haochu Ku, Mengge Lu |
| 2025 | Eff-DFQT: Efficient Model Inversion for Data-free Quantization of Vision Transformers. Mengkui Li, Xinrui Chen, Hai Chen, Kang Zhao, Yanping Zhang, Shu Zhao, Fulan Qian |
| 2025 | Effective Linear Vision Transformer Via Selective Sampling Softmax and Multi-Feature Enhancement. Xianchao Zhang, Senqi Guan, Yunlong Gao, Linlin Zong, Wenxin Liang, Xinyue Liu |
| 2025 | Efficient Binarized Neural Network Intellectual Property Protection. Bowen Chen, Jiehua Zhang, Yuchen Sun, Li Liu |
| 2025 | Efficient Diffusion Bridge with Initial-Value Correction Strategy for Super-Resolution. Jiati Cai, Yue Lei, Wenxin Tai, Xing He, Ting Zhong, Jia Chen, Fan Zhou |
| 2025 | Efficient Explicit Joint-level Interaction Modeling with Mamba for Text-guided HOI Generation. Guohong Huang, Ling-An Zeng, Zexin Zheng, Shengbo Gu, Wei-Shi Zheng |
| 2025 | Efficient Knowledge Transfer in Multi-Task Learning through Task-Adaptive Low-Rank Representation. Xiao Zhang, Kangsheng Wang, Tianyu Hu, Huimin Ma |
| 2025 | Efficient Local-Global Collaboration Transcoding for JPEG AI. Yiming Wang, Zhaobin Zhang, Yaojun Wu, Qian Huang, Bin Tang, Kai Zhang, Li Zhang |
| 2025 | Efficient Prompt Tuning for Hierarchical Ingredient Recognition. Yinxuan Gui, Bin Zhu, Jingjing Chen, Chong-Wah Ngo |
| 2025 | Efficient RGBT Tracking via Heterogeneous Hierarchical Knowledge Distillation. Dengdi Sun, Shiqi Liu, Chenglong Li, Andong Lu |
| 2025 | Efficient Shared KVCache Attention Inference for Multimodal Large Language Models. Shouxu Kuang, Limin Cheng, Yixin Chen, Hang Qin, Ling Li |
| 2025 | Efficient Text-to-Motion via Multi-Head Generative Masked Modeling. Heng Li, Xing Liufu, Xiaotong Lin, Jian Zhu, Jian-Fang Hu |
| 2025 | Egocentric Online Action Segmentation with Behavior-Centred Feature Augmentation. Zhangye Han, Xun Jiang, Zheng Wang, Xin Liu, Fumin Shen, Xing Xu |
| 2025 | Elastic Architecture Search for Efficient Language Models. Shang Wang |
| 2025 | ElimPCL: Eliminating Noise Accumulation with Progressive Curriculum Labeling for Source-Free Domain Adaptation. Jie Cheng, Hao Zheng, Meiguang Zheng, Lei Wang, Hao Wu, Jian Zhang |
| 2025 | Embedding Compression Distortion in Video Coding for Machines. Yuxiao Sun, Yao Zhao, Meiqin Liu, Chao Yao, Weisi Lin |
| 2025 | EmoHead: Emotional Talking Head via Manipulating Semantic Expression Parameters. Xuli Shen, Hua Cai, Dingding Yu, Weilin Shen, Qing Xu, Xiangyang Xue |
| 2025 | Enabling Communication-efficient and Robust Federated Learning over Packet Lossy Networks via Random Interleaved Vector Quantization. Yixuan Guan, Jianwei Niu, Tao Ren, Xuefeng Liu |
| 2025 | Enabling Haptic-Integrated Interactive Holographic Video Streaming Powered by 5G Edge Computing. Peng Qian, Ning Wang, Carl C. Udora, Carlos Velez Redondo, Jingxuan Men, Rahim Tafazolli |
| 2025 | Encryption and Authentication with a Lensless Camera Based on a Programmable Mask. Eric Bezzam, Martin Vetterli |
| 2025 | End to End Text to Sign Language Generation using MultiGAU. Nabeela Khan |
| 2025 | End-To-End Casual Video Reconstruction: Geometry, Pose and Motion. Wenyu Li, Peng Qiao, Sidun Liu, Zongxin Ye, Ziteng Zhang, Zhenglun Sun, Yong Dou |
| 2025 | End-to-End Lyric-to-Melody Generation via Chord Integration and Bar-Level Modeling. Ke Gu, Peng Bai, Zhen Lei, Yue Zhou, Zhicong Wu, Xiaodong Shi |
| 2025 | Enhanced Cross-modal 3D Retrieval via Tri-modal Reconstruction. Junlong Ren, Hao Wang |
| 2025 | Enhanced Multimodal Chain-of-Thought with Visual Self-Contrastive Distillation. Guangmin Zheng, Jun Kong, Jin Wang, Xuejie Zhang |
| 2025 | Enhanced Self-Supervised Multi-View Representations with Modality-Missing Robustness for Audio-Visual Speech Recognition. Fei Su, Cancan Li, Juan Liu |
| 2025 | Enhancing 3D Gaussian Splatting Compression via Spatial Condition-based Prediction. Jingui Ma, Yang Hu, Luyang Tang, Jiayu Yang, Yongqi Zhai, Ronggang Wang |
| 2025 | Enhancing Cross-modal Semantic Consistency via Key Token Alignment for Image-text Retrieval. Huilong Lin, Yangtao Wang, Meie Fang, Yanzhao Xie, Da Chen, Xiaocui Li, Weilong Peng, Siyuan Chen, Maobin Tang, Ping Li |
| 2025 | Enhancing Data-Free Substitute Training for Black-Box Adversarial Attacks. Zijian Ling, Wenyu Zhou, Yi Ouyang, Yuting Zhou, Man Zhou |
| 2025 | Enhancing Diffusion-based Dataset Distillation via Adversary-Guided Curriculum Sampling. Lexiao Zou, Gongwei Chen, Yanda Chen, Miao Zhang |
| 2025 | Enhancing Dynamic CAPTCHA Verification Based on Multimodal Trustworthiness Fusion Network. Chenxi Liu, Huayu Shou, Yuqing Yin, Xu Yang, Qiang Niu |
| 2025 | Enhancing Federated Learning Robustness with Pre-trained Staged Modular Distillation. Jiankang Wei, Xu Ma, Yuan Ma, Hongwei Zhou, Jingtong Huang, Xiaoyu Zhang |
| 2025 | Enhancing Few-Shot Class-Incremental Learning via Cross-Modal Bias Alignment. Desen Wang, Zhiming Chen, Xiang Qiu, Yishu Liu, Bingzhi Chen |
| 2025 | Enhancing Handwritten Mathematical Expression Recognition with Structure and Counting Aware Network. Shiqi Mou, Zijie Li, Juxiang Zhou, Jun Wang, Jianhou Gan |
| 2025 | Enhancing Hateful Meme Detection via Modality Enhancement and Multi-View Fusion. Ying Zeng, Meiling Liu, Jiyun Zhou, Jingfeng Zhang |
| 2025 | Enhancing Human Motion Prediction via Multi-range Decoupling Decoding with Gating-adjusting Aggregation. Jiexin Wang, Wenwen Qiang, Zhao Yang, Bing Su |
| 2025 | Enhancing Large Multimodal Models with Adaptive Sparsity and KV Cache Compression. Te Zhang, Yuheng Li, Junxiang Wang, Lujun Li |
| 2025 | Enhancing Long Video Understanding via Hierarchical Event-Based Memory. Dingxin Cheng, Mingda Li, Jingyu Liu, Yongxin Guo, Bin Jiang, Qingbin Liu, Xi Chen, Bo Zhao |
| 2025 | Enhancing Multi-modal Models with Heterogeneous MoE Adapters for Fine-tuning. Sashuai Zhou, Yan Xia, Hai Huang |
| 2025 | Enhancing Multimodal Chain-of-Thought Reasoning with Tree-Searched Self-Training. Yiwen Luo, Tao Wei, Yong Luo, Zengmao Wang |
| 2025 | Enhancing Object Coherence in Layout-to-Image Synthesis. Yibin Wang, Changhai Zhou, Honghui Xu |
| 2025 | Enhancing Object-Attribute Alignment in Diffusion Models via Training-Free Contrastive Parallel Denoising. Wentao Xie, Xingyu Li |
| 2025 | Enhancing Open-Vocabulary Panoptic Segmentation with Semantic-Guided Q-Tuning. Yanxiang Huang, Kai Zhang, Yuxiang Wang, Dongtai Du, Yuping Yuan, Zheng Zhao |
| 2025 | Enhancing Personalized Recommendation via Metacognitive Profile. Jiaqi Yin, Jingyang Qiao, Tiong-Thye Goh, Yi Hu |
| 2025 | Evading Deepfake Detectors via Adversarially Degrading and Restoring Forged Images. Zhengli Shi, Chenhao Lin, Zhengyu Zhao, Peter Peer, Chao Shen |
| 2025 | Evidential Graph Contrastive Alignment for Source-Free Blending-Target Domain Adaptation. Juepeng Zheng, Yibin Wen |
| 2025 | ExGAT: Build Explicit Dependencies for Incomplete Multi-Modal Learning via Graph Attention Network. Binyu Zhao, Wei Zhang, Zhaonian Zou |
| 2025 | ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image. Tianyi Gong, Boyan Li, Yifei Zhong, Fangxin Wang |
| 2025 | Expansive Supervision for Neural Radiance Fields. Weixiang Zhang, Wei Yao, Shuzhao Xie, Shijia Ge, Chen Tang, Zhi Wang |
| 2025 | Exploiting Long and Short Temporal Dependence for Low-Light Video Enhancement. Hao Luo, Lingyu Zhu, Yudong Mao, Yixuan Li, Zhiwei Zhong, Shanshe Wang, Shiqi Wang |
| 2025 | Explore the Asymmetric Interference Sound Field for High-precision Localization. Xiaojie Yu, Mingzhi Pang, Zhongxu Bao, Xu Yang, Qiang Niu, Yuqing Yin |
| 2025 | Exploring Active Learning for Label-Efficient Training of Semantic Neural Radiance Field. Yuzhe Zhu, Lile Cai, Kangkang Lu, Fayao Liu, XuLei Yang |
| 2025 | Exploring Compression Strategies for Blendshape-Based Avatar Facial Animation: Subjective and Objective Analysis. Anthony Trioux, Wei Zhang, Giuseppe Valenzise, Fuzheng Yang |
| 2025 | Exploring Flexibility in Incremental Few-Shot Object Detection. Dongdong Gong, Tengfei Gong, Yaxiong Chen, Jinglin Yuan, Shengwu Xiong |
| 2025 | Exploring Part-Informed Visual-Language Learning for Person Re-Identification. Yin Lin, Yehansen Chen, Baocai Yin, Jinshui Hu, Bing Yin, Cong Liu, Zengfu Wang |
| 2025 | Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model. Tianpei Zhang, Yiming Zhu, Jufeng Zhao, Guangmang Cui, Yuchen Zheng |
| 2025 | Extended Short- and Long-Range Mesh Learning for Fast and Generalised Garment Simulation. Aoran Liu, Kun Hu, Clinton Mo, Changyang Li, Zhiyong Wang |
| 2025 | FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding OptimizatioN. Zeyuan Li, Yangfan He, Lewei He, Jianhui Wang, Tianyu Shi, Bin Lei, Yuchen Li, Qiuwu Chen |
| 2025 | FAST: Facial Avatar Animation via Spatial-Temporal Aggregation. Gangyi Hong, Ming Lu, Senmao Tian, Xiangyi Chen, Hui Zhang |
| 2025 | FDAVS: Exploring Frequency-Driven Modality Enhancement in Audio-Visual Segmentation. Mengyuan Zhu, Yunzhi Zhuge, Sitong Gong, Lu Zhang, Huchuan Lu |
| 2025 | FDG-Diff: Frequency-Domain-Guided Diffusion Framework for Compressed Hazy Image Restoration. Ruicheng Zhang, Kanghui Tian, Zeyu Zhang, Qixiang Liu, Zhi Jin |
| 2025 | FLR: Feature-based Label Recovery in Federated Learning with Classifier-free Communication. Yibin Wang, Yucan Zhou, Xiaoyan Gu, Weiping Wang |
| 2025 | FMNet: Frequency-Assisted Mamba-Like Linear Attention Network for Camouflaged Object Detection. Ming Deng, Sijin Sun, Zihao Li, Xiaochuan Hu, Xing Wu |
| 2025 | FOCUS: Fine-grained Optimization with Semantic Guided Understanding for Pedestrian Attributes Recognition. Hongyan An, Kuan Zhu, Xin He, Haiyun Guo, Chaoyang Zhao, Ming Tang, Jinqiao Wang |
| 2025 | FSRF: Factorization-guided Semantic Recovery for Incomplete Multimodal Sentiment Analysis. Ziyang Liu, Pengjunfei Chu, Shuming Dong, Chen Zhang, Mingcheng Li, Jin Wang |
| 2025 | FairFHTL: Achieving Task-Agnostic Fairness in Federated Hetero-Task Learning. Teng Zhang, Yiqiang Chen, Xinlong Jiang, Wuliang Huang, Qian Chen, Chenlong Gao, Zhirui Wang, Bingjie Yan |
| 2025 | False Negatives Consensus Suppression for Text-to-Image Person Re-identification. Ruigeng Zeng, Wentao Ma, Qinglin Wang, Xinjun Mao, Jie Liu |
| 2025 | Fast CU Partition Algorithm For 360-Degree Videos on VVC. Dayong Wang, Shijie Du, Yu Sun, Shuyin Xia, Frédéric Dufaux, Hongwei Guo, Guo-Yin Wang, Ce Zhu |
| 2025 | Fast and Physically-based Neural Explicit Surface for Relightable Human Avatars. Jiacheng Wu, Ruiqi Zhang, Jie Chen, Hui Zhang |
| 2025 | FastAno: Accelerating Defect Image Generation with Efficient Sampling. Haoyu Guan, Qianzi Yu, Kai Zhu, Yang Cao, Yu Kang |
| 2025 | Faster-SNN: Towards Faster and Better Spiking Neural Networks with Hybrid Neural Coding. Yinsheng Chen, Jilong Luo, Zhiyi Yu, Shanlin Xiao |
| 2025 | Feature Affinity based Clustering for Test-Time Adaptation for Image Quality Assessment. Meghna Kapoor, Vinit Jakhetiya, Badri Narayan Subudhi, Ankur Bansal, Weisi Lin |
| 2025 | Feature and Temporal Disruption Attacks from Images to Videos. Zhanpeng Liu, Yuqiang Zhang, Tianlong Yu, Xi Lin, Yang Yang, Chenxi Huang, Bin Wang |
| 2025 | Fed3D: Enhancing Security in Federated Learning with Dataset Distillation. Canhui Wu, Wei Xi, Yuwei Fan, Yuhao Shen, Jizhong Zhao |
| 2025 | FedAdamZO: a Zeroth-order Adaptive Momentum Method for Memory-efficient Fine-tuning of Federated Large Language Models. Bo Ma, Yongqiang Gao, Yongmei Liu |
| 2025 | FedMPQ: Secure and Efficient Federated Learning with Multi-codebook Product Quantization. Xu Yang, Zhuo Tang, Boyao Hao, Xiong Xiao, Jiapeng Zhang |
| 2025 | FedRF: Input-side Client Drift Mitigation for Federated Learning via Reusing Features. Lingxiao Kong, Jiahui Jiang, Wenchao Xu, Haozhao Wang, Ruixuan Li |
| 2025 | Federated Open-Set Domain Generalization with Adaptive Adjustment Boundary and Weights. Haoyuan Liang, Shilei Cao, Yushan Lai, Juepeng Zheng |
| 2025 | Few-Shot 3D Face Generation via a Controllable Diffusion Model Guided by Text and Images. Jinfu Wei, Zheng Zhang, Qinchuan Zhang, Ran Liao, Duan Gao |
| 2025 | Few-shot Prompt Learning with Large Vision-Language Model for Image Deep Hashing. Ye Liu, Yan Pan, Jian Yin |
| 2025 | Fine-Grained Body Part Control in Text-Driven Motion Synthesis with Interactive Intention. Siyuan Fan, Longling Sun, Bo Peng, Bo Du, Xiantao Cai |
| 2025 | Fine-tuned Multimodal Large Language Models are Zero-shot Learners in Image Quality Assessment. Rui Xiong, Li Chen, Zhida Feng, Jiaxiang Liu, Shikun Feng |
| 2025 | Fitted-Singer: Singing Voice Synthesis with Style Control and Rhythm Control. Yu Cao, Sijia Li, Shiguang Liu |
| 2025 | Flexible Streaming Temporal Action Segmentation with Diffusion Models. Jinrong Zhang, Wenjun Wen, Shenglan Liu, Sifan Zhang, Yuning Ding, Lin Feng |
| 2025 | FlowJD: Your Imagination Can Help You Jailbreak in Visual Language Models. Xiaotian Zou, Yongkang Chen, Qianqian Han, Ke Li |
| 2025 | FoCTTA: Low-Memory Continual Test-Time Adaptation with Focus. Youbing Hu, Yun Cheng, Zimu Zhou, Anqi Lu, Zhiqiang Cao, Zhijun Li |
| 2025 | FoodWeight1.4M: A Large-scale Multi-modal Dataset for Weight Estimation. Lu Yuan, Zhenbo Xu, Dehua Ma, Jinghan Yang, Liuyu Xiang, Huijia Wu, Zhaofeng He |
| 2025 | ForeNet: Unlocking Long-Term Series Forecasting in High-Dimensional Scenario via Forest Structure. Xinyu Li, Hao Xu, Zhiheng Yang, Hongxiang Zhou, Hong Lu, Xin Wang, Jin Zhao |
| 2025 | Foreground Focus: Enhancing Coherence and Fidelity in Camouflaged Image Generation. Pei-Chi Chen, Yi Yao, Chan-Feng Hsu, Hong-Xia Xie, Hung-Jen Chen, Hong-Han Shuai, Wen-Huang Cheng |
| 2025 | Forensicability Assessment: Not All Samples Qualify for Recapture Detection. Yongqi Chen, Lin Zhao, Rizhao Cai, Zitong Yu, Changsheng Chen, Bin Li |
| 2025 | Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition. Shun Zou, Yi Zou, Mingya Zhang, Shipeng Luo, Zhihao Chen, Guangwei Gao |
| 2025 | Free Try-On: Virtual Try-On without Garment-Agnostic Images and Warped Garments. Wei Zhang, Xuekang Peng, Zhichao Lian |
| 2025 | Frequency-guided Camouflaged Object Detection with Perceptual Enhancement and Dynamic Balance. Yuetong Li, Yilin Zhao, Qing Zhang, Qiangqiang Zhou, Yanjiao Shi |
| 2025 | From 2D Images to 3D Model: Weakly Supervised Multi-View Face Reconstruction with Deep Fusion. Weiguang Zhao, Chaolong Yang, Jianan Ye, Rui Zhang, Yuyao Yan, Xi Yang, Bin Dong, Amir Hussain, Kaizhu Huang |
| 2025 | From Camera to World: A Plug-and-Play Module for Human Mesh Transformation. Changhai Ma, Ziyu Wu, Yunkang Zhang, Qijun Ying, Boyan Liu, Xiaohui Cai |
| 2025 | From History to Goal: Enhanced Vision-and-Language Navigation with Historical Traceability. Xinguang Zhu, Min Wang, Li Li, Wengang Zhou, Houqiang Li |
| 2025 | G-TADS: GUI Task-Ability Decoupling Strategy for High-Adaptability Multimodal Intelligent Agents. Zhiqiang Xia, Xinyuan Zhang, Yang Li, Yuchen Liu, Runyu Shi, Jiaming Xu |
| 2025 | G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models. Tianjiao Zhang, Fei Zhang, Jiangchao Yao, Ya Zhang, Yanfeng Wang |
| 2025 | GA-Clip: Semantic-Aware Graph Augmentation for Contrastive Learning. Shuaiqi Lu, Yi Guo, Zhenlin An, Yan Zhu, Ning Huang |
| 2025 | GAMED-Snake: Gradient-aware Adaptive Momentum Evolution Deep Snake Model for Multi-organ Segmentation. Ruicheng Zhang, Haowei Guo, Zeyu Zhang, Puxin Yan, Shen Zhao |
| 2025 | GASEM: Boosting Generalized and Actionable Parts Segmentation and Pose Estimation via Object Motion Perception. Liu Liu, Ran Zhang, Wenbo Xu, Li Zhang, Yiming Tang, Qi Wu, Hao Wu |
| 2025 | GC-ConsFlow: Leveraging Optical Flow Residuals and Global Context for Robust Deepfake Detection. Jiaxin Chen, Miao Hu, Dengyong Zhang, Jingyang Meng |
| 2025 | GCA-SUNet: A Gated Context-Aware Swin-UNet for Exemplar-Free Counting. Yuzhe Wu, Yipeng Xu, Tianyu Xu, Jialu Zhang, Jianfeng Ren, Xudong Jiang |
| 2025 | GDNeRF: Generalizable Depth-based NeRF for sparse view synthesis. Sergio Montoya de Paco, Iván Huerta, Josep Escrig |
| 2025 | GE-Talker: Generalizable and Efficient Neural Rendering for Talking Head Generation. Zixuan Wang, Li Fang, Fei Hu, Long Ye |
| 2025 | GEST: Dual Structured Exploration with Graph ODE for Spatio-Temporal Dynamic System Modeling. Yonghao Li, Xiangyu Zhao, Ping Ye, Qingxuan Jia |
| 2025 | GLRB: Heterogeneous Federated Continual Learning via Global and Local Rebalance. Haodong Zhang, Liu Yang, Zihan Jiang |
| 2025 | GT-free_XAI: A Ground Truth-Free XAI Framework for Decision Interpretation and Evaluation. Yanchu Wu, Feng Tian |
| 2025 | GTPC-SSCD: Gate-guided Two-level Perturbation Consistency-based Semi-Supervised Change Detection. Yan Xing, Qi'ao Xu, Zongyu Guo, Rui Huang, Yuxiang Zhang |
| 2025 | GameMLD: A Game-Sourced Motion-Language Dataset for Stylized Motion Generation. Yiyu Fu, Ziming Cheng, Yihao Liao, Jiangfeiyang Wang, Ruomei Wang, Guanghui Yue, Chenlei Lv, Baoquan Zhao |
| 2025 | GateM2Net: A Gated Multi-Modal Network for Joint Emotion and Sentiment Analysis. Li Yin, Baigang Mi, Yi Fan |
| 2025 | GauSurfaceAvatar: A Realistic Human Head Model with Variable Texture Based on 2D Gaussians. Lijie Geng, Junli Zhao, Lin Gao, Ran Yi, Fuqing Duan, Zhenkuan Pan, Yong-Jin Liu |
| 2025 | Gaze4ASD: A Novel Dataset and Visual Saliency Map-Based Method for Autism Screening. Yizhang Yang, Jinshi Cui, Xi Guo, Xing Su, Wei Ni, Junshi Lu, Li Wang, Huimin Ma |
| 2025 | General Distortion Metric Based Multiple Histograms Modification for Reversible Data Hiding. Yinan Xiao, Shijun Xiang |
| 2025 | Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy. Botao Zhao, Zuheng Kang, Yayun He, Xiaoyang Qu, Junqing Peng, Jing Xiao, Jianzong Wang |
| 2025 | Generative Adversarial Network-based Image and Tabular Data Generation with Differential Privacy. Jiming Yang, Xu Wang, Yi Jin, Yidong Li, Hui Yu |
| 2025 | Generative Image Coding with Diffusion Prior. Jianhui Chang |
| 2025 | Geometric-Aware Mapping and Uncertainty Modeling for Semantic Scene Completion. Xianzhu Liu, Yuhe Zhu, Weiyu Zhao, Chen Hui, Jianping Zhong |
| 2025 | Geometrically-Inspired Irregular Expansion Techniques for Graph-based Point Cloud Learning. Qi Zhang, Haoqian Wang, Yuanxi Peng, Teng Li |
| 2025 | Geometrically-plausible and Semantically-consistent Generation of Indoor Panoramas. Zhiliang Zeng, Mengyang Wu, Xianzhi Li, Wenzhao Gao, Shaohui Jiao, Chi-Wing Fu |
| 2025 | GiVE: Guiding Visual Encoder to Perceive Overlooked Information. Junjie Li, Jianghong Ma, Xiaofeng Zhang, Yuhang Li, Jianyang Shi |
| 2025 | GlanceVAD: Exploring Glance Supervision for Label-efficient Video Anomaly Detection. Huaxin Zhang, Xiang Wang, Xiaohao Xu, Xiaonan Huang, Changxin Gao, Yuehuan Wang, Shanjun Zhang, Nong Sang |
| 2025 | Global Intervention and Distillation for Federated Out-of-Distribution Generalization. Zhuang Qi, Runhui Zhang, Lei Meng, Wei Wu, Yachong Zhang, Xiangxu Meng |
| 2025 | Global Perception Federated Recommender System for Click-Through Rate Prediction. Yicheng Di, Jiansong Fan, Rui Zhang, Song Shen, Jiayu Bao, Rongsheng Hu, Yuan Liu |
| 2025 | Global Semantic Extraction for Adaptive Cross-Semantic Learning: A Novel Framework for Remote Sensing Change Caption. Qiaoli Sun, Yan Wang, Xiaoyu Song, Hongyi Dong |
| 2025 | Global-Local Aware Scene Text Editing. Fuxiang Yang, Tonghua Su, Donglin Di, Yin Chen, Xiangqian Wu, Zhongjie Wang, Lei Fan |
| 2025 | Global-to-Local Color Correction with Full-Region Coverage for Multi-view Light Field Images. Yixu Huang, Rui Zhong, Ségolène Rogge, Adrian Munteanu |
| 2025 | Gradient-guided Attention Fusion Network for Camouflaged Object Detection. Wenrui Li, Meijun Sun, Cheng Liu, Xinyu Yan, Zheng Wang |
| 2025 | Graph Anomaly Detection via Structure to Attribute Reconstruction. Xingshen Wei, Wei Liu, Wenzhong Li, Sanglu Lu |
| 2025 | Graph-based Meta-Learning and Feature Disentanglement for Domain Generalization Crowd Counting. Yang Qu, Zhencai Shen, Yingyi Chen, Ping Zhong |
| 2025 | GraphDEH: Graph Diffusion Enhanced Hypergraph Method for Class-Imbalanced Node Classification. Liu Yang, Mengni Chen, Tingxuan Chen, Jinqi Hu, Zidong Wang |
| 2025 | GraphTEN: Graph Enhanced Texture Encoding Network. Bo Peng, Jintao Chen, Mufeng Yao, Chenhao Zhang, Jianghui Zhang, Mingmin Chi, Jiang Tao |
| 2025 | Group-On: Boosting One-Shot Segmentation with Supportive Query. Hanjing Zhou, Mingze Yin, Danny Z. Chen, Jian Wu, Jintai Chen |
| 2025 | Guiding Yourself with Your Own Insights: Student-Driven Knowledge Distillation. Dacheng Qi, Huayu Zhang, Yufeng Wang, Shuangkang Fang, Zehao Zhang, Zesheng Wang, Wenrui Ding |
| 2025 | HCMA-UNet: A Hybrid CNN-Mamba UNet with Axial Self-Attention for Efficient Breast Cancer Segmentation. Haoxuan Li, Wei Song, Peiwu Qin, Xi Yuan, Zhenglin Chen |
| 2025 | HDCompression-DNA: Hybrid-Diffusion Neural Image Compression via DNA Storage. Cihan Ruan, Lei Lu, Rongduo Han, Wei Jiang, Wei Wang, Haoyu Wu, Qiming Yuan, Yanting Guo, Yanzhi Wang, Nam Ling |
| 2025 | HGCL: Semi-Supervised Polyp Segmentation via Hierarchical Granularity Contrastive Learning. Xiaogang Du, Dong Wang, Tao Lei, Tongfei Liu, Yingbo Wang, Asoke K. Nandi |
| 2025 | HLV-1K: A Large-scale Hour-Long Video Benchmark for Time-Specific Long Video Understanding. Heqing Zou, Tianze Luo, Guiyang Xie, Victor Xiao Jie Zhang, Fengmao Lv, Guangcong Wang, Junyang Chen, Zhuochen Wang, Hansheng Zhang, Huaijian Zhang |
| 2025 | HMPNet: A Feature Aggregation Architecture for Maritime Object Detection from a Shipborne Perspective. Yu Zhang, Fengyuan Liu, Juan Lyu, Yi Wei, Changdong Yu |
| 2025 | HMSformer: Hierarchical Multi-Scale Transformer for Multivariate Long-Term Series Forecasting. Xinyu Li, Yunqi Cai, Hao Xu, Xinyu Sun, Zhiheng Yang, Hong Lu, Xin Wang, Jin Zhao |
| 2025 | HRGR: Enhancing Image Manipulation Detection via Hierarchical Region-aware Graph Reasoning. Xudong Wang, Jiaran Zhou, Huiyu Zhou, Junyu Dong, Yuezun Li |
| 2025 | HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection. Qi'ao Xu, Pengfei Wang, Yanjun Li, Tianwen Qian, Xiaoling Wang |
| 2025 | HSRMamba: Efficient Wavelet Stripe State Space Model for Hyperspectral Image Super-Resolution. Baisong Li, Xingwang Wang, Haixiao Xu |
| 2025 | HSS-IAD: A Heterogeneous Same-Sort Industrial Anomaly Detection Dataset. Qishan Wang, Shuyong Gao, Junjie Hu, Jiawen Yu, Xuan Tong, You Li, Wenqiang Zhang |
| 2025 | Harmony in Chaos: A Progressive Noise-Resilient Network for Robust Fake News Video Detection. Xiangzheng Kong, Zhi Zeng, Chenxi Zhu, Zihan Ma, Minnan Luo |
| 2025 | HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment. Zitong Xu, Huiyu Duan, Guangji Ma, Liu Yang, Jiarui Wang, Qingbo Wu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet |
| 2025 | Harnessing Counterfactual Reasoning for Explainable Multi-Modal Fact Verification with Large Language Models. Chaozhuo Li, Hui Pang, Xi Zhang, Litian Zhang, Feiran Huang, Ming Lu |
| 2025 | Harnessing Pre-trained Language Models for EEG-based Epilepsy Detection. Tao Lu, Shangyang Li |
| 2025 | HeteroGNN: A Heterogeneous Stage Division Based GNN Training Framework to Maximize CPU-GPU Parallelism. Xiangrui Yang, Zhihao Zeng, Jiawei Yang, Yekang Zhan, Qiang Cao, Jie Yao |
| 2025 | Hier-pFedMe: Hierarchical Personalized Federated Learning with Moreau Envelopes. Xi Liu, Fanfan Ji, Bo Liu, Xiao-Tong Yuan |
| 2025 | Hierarchical Graph Learning Framework for Multimodal Conversational Emotion Recognition. Jiandong Shi, Ming Li, Guoheng Huang, Siwei Zhou, Yongchun Gu, Zhanle Zhu |
| 2025 | Hierarchical Sub-action Tree for Continuous Sign Language Recognition. Dejie Yang, Zhu Xu, Xinjie Gao, Yang Liu |
| 2025 | High Resolution Wire Segmentation with Domain Adaption. Yu Zhong, Tao Xie, Anna Zhu |
| 2025 | HingeNet: A Harmonic-Aware Fine-Tuning Approach for Beat Tracking. Ganghui Ru, Jieying Wang, Jiahao Zhao, Yulun Wu, Yi Yu, Nannan Jiang, Wei Wang, Wei Li |
| 2025 | History Tracker: Retrieving Historical Image Embeddings for Efficient Fine-Grained Reasoning in Vision-Language Models. Jiahua Bao, Siyao Cheng, Jiaxing Du, Ziqian Li, Changjiang He, Jie Liu |
| 2025 | Human-Inspired Situated Question Answering with Large Language Models. Xinyu Zhao, Weichen Xu, Jian Cao, Tianhao Fu, Ruilong Ren, Xing Zhang |
| 2025 | Human-MoE: Multimodal Full-Body Human Image Synthesis with Component-driven Mixture of Experts. Yu-Jiu Huang, I-Chen Lin |
| 2025 | HyperMAN: Hypergraph-enhanced Meta-learning Adaptive Network for Next POI Recommendation. Jinze Wang, Tiehua Zhang, Lu Zhang, Yang Bai, Xin Li, Jiong Jin |
| 2025 | Hyperbolic Space Learning Method Leveraging Temporal Motion Priors for Human Mesh Recovery. Xiang Zhang, Suping Wu, Weibin Qiu, Zhaocheng Jin, Sheng Yang |
| 2025 | Hypergraph Self-Supervised Learning for Survival Prediction on Whole Slide Images. Yining Zhao, Hao Liu, Jielong Yan, Yongji Tian, Xiangmin Han |
| 2025 | Hyperspherical Dataset Distillation via Contrastive Embedding Alignment. Shuoxi Zhang, Hanpeng Liu, Stephen Lin, Kun He |
| 2025 | I-Lora: Iterative Merging of Routing-Tuned Low-Rank Adapters for Multi-Task Learning. Guoqing Zhao, Qi Zhang, Shaopeng Zhai, Dazhong Shen, Tianyi Zhang, Yu Qiao, Tong Xu |
| 2025 | ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo. Yuxi Hu, Jun Zhang, Zhe Zhang, Rafael Weilharter, Yuchen Rao, Kuangyi Chen, Runze Yuan, Friedrich Fraundorfer |
| 2025 | IEEE International Conference on Multimedia and Expo, ICME 2025, Nantes, France, June 30 - July 4, 2025 |
| 2025 | IGDiT: Illumination-Guided Low-light Image Enhancement with Diffusion Transformer Models. Bin Niu, Zhibin Zhang, Liqiang He |
| 2025 | IMTrack: Interlayer Interoperability and Multi-scene Optimization for Visual Multimodal Target Tracking. Rui Zhu, Zhaokang Lu, Bohan Liu, Yun Yang, Hua Yue, Chaogang Wang, Zixin Zhou |
| 2025 | IP-KGQA: Intent-Aware Prompt Learning for Knowledge Graph Question Answering. Zheng Dai, Chun Ding, Tianyi Chen, Si Wu, Yong Xu, Runzhe Liang, Tianshi Xu, Yedong Li, Dapeng Wu |
| 2025 | IRSTS Generalist: Improving Generalization in Infrared Small Target Segmentation Using One Shot. Bingbing Dan, Xinyu Tian, Meihui Li, Tao Tang, Jing Zhang |
| 2025 | ITJP: Image and Text Joint Prompts for Few-Shot Whole Slide Image Classification. Ziwei Zhu, Xinzhu Zhang, Zhikang Zhao, Jing Zhao |
| 2025 | Identity-Preserving Talking Head Cross-Identity Reenactment with Adaptive Structure Normalization. Zhao Jing, Hongxia Bie, Haobo Lei, Jiali Wang, Yichen Zhi, Zhisong Bie |
| 2025 | IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models. Yiming Zhang, Zicheng Zhang, Xinyi Wei, Xiaohong Liu, Guangtao Zhai, Xiongkuo Min |
| 2025 | Image Demoiréing Using Dual Camera Fusion on Mobile Phones. Yanting Mei, Zhilu Zhang, Xiaojun Wu, Wangmeng Zuo |
| 2025 | Imperceptible Beam-Sensitive Adversarial Attacks for LiDAR-based Object Detection in Autonomous Driving. Fuyao Cai, Daizong Liu, Xiang Fang, Jixiang Yu, Keke Tang, Pan Zhou |
| 2025 | Imperceptible and Robust Adversarial Perturbation: Attention-Guided Watermark Vaccine Against Watermark Removal. Yujiang Li, Zhili Zhou, Zhongliang Yang, Baowei Wang, Tao Qi, Xiaohua Xie, Jiantao Zhou |
| 2025 | Improving Human-AI Collaboration in Medical Diagnosis with Combination Advice. Xuehan Zhao, Jiaqi Liu, Zhiwen Yu, Bin Guo |
| 2025 | Incongruity-aware Cross-modal Interaction Network for Multimodal Sarcasm Detection. Yujun Wu, Chen Wang, Meixuan Chen, Tongguan Wang, Ying Sha |
| 2025 | Incorporating Audio-Guided Visual Attention into Sound Event Localization and Detection with Source Distance Estimation. Qing Wang, Jun Du, Hengyi Hong, Maocheng Hu, Mingqi Cai, Xin Fang |
| 2025 | Incrementally Constrained Tucker Decomposition for Feature Extraction of Structural Diffusion Tensor Imaging Data. Fei He, Houji Du, Fan Zhang, Yipeng Liu, Ce Zhu |
| 2025 | Infrared Small Target Detection via Multi-Path Deep Conduction. Yongji Li, Luping Wang |
| 2025 | Injecting Cross-modal Fine-Grained Perception into LLMs for 3D Object-of-Interest Understanding. Qianqian Sun, Lu Shi, Linna Zhang, Gaoyun An, Yi Jin, Yidong Li, Yigang Cen |
| 2025 | InpaintFormer: Prompt-guided High-Quality Face Inpainting with Mask-Aware Self-Attention. Zhouhao Ouyang, Wen Xue, Tianyi Chen, Yan Huang, Si Wu, Yong Xu, Patrick Le Callet, Dapeng Oliver Wu |
| 2025 | Instance-Distance Active Learning for Source-Free Cross-Domain Object Detection. Kangrui Du, YuJun Qian, Juepeng Zheng |
| 2025 | Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models. Bin Li, Dehong Gao, Yeyuan Wang, Linbo Jin, Shanqing Yu, Xiaoyan Cai, Libin Yang |
| 2025 | Instruction-aware Memory Network for Video Recognition. Bimei Wang, Haijiang Li, Jisheng Dang, Yun Wang, Zhixuan Chen, Jiyuan Lin, Teng Wang, Jun Yang |
| 2025 | Insulator Defect Detection Method Based on Lightweight Feature Extraction and Efficient Cross-Scale Fusion. Zhi Yang, Chunyang Ma, Liejun Wang, Zhiqing Guo |
| 2025 | IntegralCAM: Integral-based contribution estimation and visualization for convolutional neural networks. Teng-Yok Lee |
| 2025 | Integrate-and-Fire Compressor: Learning to Compress Context for LLMs Adaptively. Yunlong Zhao, Xiyun Li, Ziyi Wang, Haoran Wu, Minglun Han, Bo Xu |
| 2025 | InterID: Improving Multi-ID Interaction for Personalized Image Generation. Siting Chen, Weijie Chen, Jiji Tang, Rongsheng Zhang, Xiaoshuai Sun |
| 2025 | InterLayer: Efficient Inference with Interleaved Scheduling and Layer-Specific Optimization. Limin Cheng, Hang Qin, Shouxu Kuang, Xinyu Wang, Ling Li, Yanjun Wu, Chen Zhao |
| 2025 | Interact with me: Joint Egocentric Forecasting of Intent to Interact, Attitude and Social Actions. Tongfei Bian, Yiming Ma, Mathieu Chollet, Victor Sanchez, Tanaya Guha |
| 2025 | Interactive Sketch-Based Person Re-Identification with Text Feedback. Xinyi Wu, Cuiqun Chen, Hui Zeng, Zhiping Cai, Bo Du, Mang Ye |
| 2025 | Inversion-Free Image Editing via Rectified Flow. Zhengwei Peng, Conghan Yue, Tong Duan, Dongyu Zhang |
| 2025 | InvoxSVC: Any-to-any Zero-shot Singing Voice Conversion with In-Context Learning in Latent Flow Matching. Wangjin Zhou, Tianjiao Du, Wenhao Guan, Meng Xiao, Chenglin Xu, Yi Zhao, Tatsuya Kawahara |
| 2025 | Iterative Multi-Collaborative Training Network for Point Cloud Learning with Noisy Annotations. Xiao Shao, Weiqi Yan, Yu Zang |
| 2025 | JGHand: Joint-Driven Animatable Hand Avatar via 3D Gaussian Splatting. Zhoutao Sun, Xukun Shen, Yong Hu, Yuyou Zhong, Xueyang Zhou |
| 2025 | Joint Feature Learning and Mixing via State Space Model for Remote Sensing Change Detection. Bin Chen, Shenglong Hu, Huihui Song, Kaihua Zhang |
| 2025 | JointDeblur-Gs: Joint Blur-Aware Gaussian Splatting. Sijia Hu, Peng Chen, Xinxiao Wang, Luyue Sun, Guanghao Li, Hongyu Wang, Jian Pu |
| 2025 | JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation. Tiancong Cheng, Ying Zhang, Yuxuan Liang, Roger Zimmermann, Zhiwen Yu, Bin Guo |
| 2025 | KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection. Xingyuan Li, Ruichao Hou, Tongwei Ren, Gangshan Wu |
| 2025 | KMoP: Knowledge-injected Mixture-of-Prefix for Joint Multimodal Aspect-Based Sentiment Analysis. Xinzhong Wang, Lingyong Fang, Jidong Li, Yichen Zhou, Gongshen Liu |
| 2025 | Key-semantics Alignment Learning with Contextual Understanding for Video Moment Retrieval. Chenghua Gao, Min Li, Junxing Ren, Lin Chen, Jitao Fu, Wenwen Su |
| 2025 | Keyword-Oriented Multimodal Modeling for Euphemism Identification. Yuxue Hu, Junsong Li, Meixuan Chen, Dongyu Su, Tongguan Wang, Ying Sha |
| 2025 | Knowledge Calibration Distillation. Chun Xie, Huimin Tong, Guoxi Xu, Yipeng Chen, Li Luking, Yiwei Chen |
| 2025 | Knowledge Distilled Group Prompts Learning for HOI Detection with Large Vision-Language Models. Xiaoqian Han, Guanglin Niu, Mingliang Zhou, Xiaowei Zhang |
| 2025 | Knowledge Graphs Acquisition via Forward-Reverse Relation Enhanced Contrastive Pretraining from Large-scale Models. Liu Yu, Fenghui Tian, Ping Kuang, Zhikun Feng, Fan Zhou |
| 2025 | Knowledge Rumination for Client Utility Evaluation in Heterogeneous Federated Learning. Xiaorui Jiang, Yu Gao, Hengwei Xu, Qi Zhang, Yong Liao, Peng Yuan Zhou |
| 2025 | Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation. Shun Zhang, Xuechao Zou, Kai Li, Congyan Lang, Shiying Wang, Pin Tao, Tengfei Cao |
| 2025 | LD Nana Zhang, Qian Liu, Dandan Zhu, Kun Zhu, Xiongkuo Min, Guangtao Zhai |
| 2025 | LFNet: Cross-Modal LiDAR-Fisheye Fusion Network for 3D Semantic Segmentation. Weijian Zhang, Zhiwei Zhang, Tianfang Sun, Zhizhong Zhang, Xin Tan, Yuan Xie |
| 2025 | LKPM: Large Kernel Point Mamba for 3D Point Clouds. Song Zhao, Shuhua Wang, Xiaobing Zhou |
| 2025 | LL4G: Self-Supervised Dynamic Optimization for Graph-Based Personality Detection. Lingzhi Shen, Yunfei Long, Xiaohao Cai, Guanming Chen, Yuhan Wang, Imran Razzak, Shoaib Jameel |
| 2025 | LLDNet: Joint Low-light Enhancement and Local Motion Deblurring in the Dark. Haigen Liu, Yanyang Yan, Wenqi Ren |
| 2025 | LM-net: Integrating Linear Temporal Features and Multi-Scale Attention for Crop Yield Estimation. Hu Li, Long Long, Lin Cheng, Zichen Liu, Jing Wang, Yucheng Zhang, Feng Dai |
| 2025 | LV-VTON: Long-Video Virtual Try-On via Enhanced Visual Autoregressive Modeling. Lulu Tian, Hongxun Yao, Ming Li |
| 2025 | Label-guided Facial Retouching Reversion. Guanhua Zhao, Yu Gu, Xuhan Sheng, Yujie Hu, Jian Zhang |
| 2025 | Language-Conditioned Waypoint Predictor for Continuous Vision-and-Language Navigation. Zeyu Wang, Yuankai Qi, Dong An, Xu Yang, Hongxin Li, Zhaoxiang Zhang |
| 2025 | Large Language Models Meet Contrastive Learning: Zero-Shot Emotion Recognition Across Languages. Heqing Zou, Fengmao Lv, Desheng Zheng, Eng Siong Chng, Deepu Rajan |
| 2025 | Latent Diffusion-based Face Anonymization with Identity and Attribute Decoupling. Chenrui Liu, Zhichao Lian |
| 2025 | Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection. Jingwei Sun, Xuchong Zhang, Changfeng Sun, Qicheng Bai, Hongbin Sun |
| 2025 | Latent-Info and Low-Dimensional Learning for Human Mesh Recovery and Parallel Optimization. Xiang Zhang, Suping Wu, Sheng Yang |
| 2025 | Layer-wise Parameter Robustness for Continual Test-time Adaptation. Haoyu Xiong, Qiuxia Yang, Chengchao Wang, Tianze Zhong, Zhengpeng Zhao, Yuanyuan Pu |
| 2025 | LeAffordNav: Enhancing Open-vocabulary Mobile Manipulation with LLM-guided Exploration and Affordance-aware Navigation. Yuanwen Chen, Haoran Li, Yaran Chen, Dongbin Zhao |
| 2025 | Learning Adaptive High-Frequency Semantic Guidance for Low-light Image Enhancement. Hao Li, Jingxuan Zhou, Jinlong Wang, Jiangmeng Li, Xiongxin Tang, Fanjiang Xu |
| 2025 | Learning Dual-Domain Multi-Scale Representations for Single Image Deraining. Shun Zou, Yi Zou, Mingya Zhang, Shipeng Luo, Guangwei Gao, Guojun Qi |
| 2025 | Learning Physics-Informed Color-Aware Transforms for Low-Light Image Enhancement. Xingxing Yang, Jie Chen, Zaifeng Yang |
| 2025 | Learning from Global to Local: Adaptive Frequency-Aware and Spatial-Alignment for Knowledge Distillation. Wenkuan Li, Xubin Wu, Shuo Gao, Haifang Li |
| 2025 | Learning from Noisy Data Using Pretrained Vision-Language Representations. Yuqi Liao, Aodong Li, Yisha Chen, Qianfang Xu, Jiarui Xie, Anxin Li, Bo Xiao |
| 2025 | Learning from Stochastic Labels. Meng Wei, Xinzheng Xu, Peng Ying, Renke Sun, Guanjun Wang, Zhongnian Li |
| 2025 | Learning to Unify Audio, Visual and Text for Audio-Enhanced Visual Answer Localization. Zhibin Wen, Bin Li |
| 2025 | Leave the Bias in Bias: Mitigating the Label Noise Effects in Continual Visual Instruction Fine-Tuning. Xiaoyu Tan, Teqi Hao, Xihe Qiu, Shaojie Shi, Yuan Cheng, Wei Chu, Yinghui Xu, Yuan Qi |
| 2025 | Lesion Localization for Medical Imaging Using Counter-factual Generation Prompt Learning. Yang Wei, Yi Pan, Limai Jiang, Juan He, Bokai Yang, Yufu Huo, Yunpeng Cai, Ruitao Xie |
| 2025 | Leveraging 2D Annotations for Cost-Effective Dynamic Urban Scene Reconstruction. Chuming Wang, Yingshuang Zou, Haoqian Wang |
| 2025 | Leveraging Hierarchical Spatio-Temporal Distribution Prompt for Zero-Shot Species Recognition. Tie Liu, Yue Yang, Peng Chen, Qijun Zhao |
| 2025 | Leveraging Multiple Deep Experts for Online Class-incremental Learning. Zhe Tao, Lu Yu, Hantao Yao, Changsheng Xu |
| 2025 | LiPlan: A Multimodal Dataset for Livable Urban Environment Layout Generation. Jianrong Wang, Shuyun Zhang, Ying Guo, Qi Li, Ju Zhang, Di Jin |
| 2025 | LiVo: Bandwidth-Efficient Live Volumetric Video Streaming with Compact Capture and Encoding. Yizong Wang, Mingjia Yang, Liming Pang, Dong Zhao, Siwei Ma, Wen Gao |
| 2025 | Lightweight Learning-Based In-Loop Filter for Real-Time Video Coding. Yanchen Zhao, Wenhong Duan, Jiaqi Zhang, Zhimeng Huang, Lin Li, Qi Wang, Siwei Ma |
| 2025 | Lightweight Video Super-Resolution Network Based on Pyramid Optical Flow Extraction and Alignment. Xiaoqiang Cui, Kaixuan Hou, Jianping Luo |
| 2025 | LiveImage: Motion Condition Guided Diffusion Model for Video Motion Transfer. Gaurav Rai, Ojaswa Sharma |
| 2025 | Local Data Quantity-Aware Weighted Averaging for Federated Learning with Dishonest Clients. Leming Wu, Yaochu Jin, Kuangrong Hao, Han Yu |
| 2025 | Local Model Trajectory Matching for Data Heterogeneity in Federated Learning. Man Zhao, Tingting Leng, Jun Zhou |
| 2025 | Localization Hints Exploration for Object Matting. Yu Qiao, Tianyu Meng, Huilin Ge, Xinning Wang, Jiayue Zhao, Qianchen Xia, Xin Yang |
| 2025 | Localizing Step-by-Step: Multimodal Long Video Temporal Grounding with LLM. Houlun Chen, Xin Wang, Hong Chen, Wei Feng, Zihan Song, Jia Jia, Wenwu Zhu |
| 2025 | Location-Oriented Sound Event Localization and Detection with Spatial Mapping and Regression Localization. Xueping Zhang, Yaxiong Chen, Ruilin Yao, Yunfei Zi, Shengwu Xiong |
| 2025 | LogiCoTab: Controllable Tabular Data Synthesis with Logical Relationships Awareness. Ziyue Wang, Hongwei Ding, Yunqi Liu, Yan Feng, Xiaohui Cui |
| 2025 | Long-Tailed Federated Learning with Fixed Classifier. Yi Li, Weichao Li, Xin Zheng, Haiyan Fu, Yanqing Guo |
| 2025 | Low-Redundancy Knowledge Generation and Modality-Aware Interaction for Multimodal Information Extraction in Social Media. Shizhou Huang, Bo Xu, Changqun Li, Yang Yu, Xin Lin |
| 2025 | MACA-VQA: Quality Assessment of UGC Videos via Multi-level Distortion Adaptation and Spatiotemporal Cross-Attention Fusion. Bo Hu, Yimeng Zhao, Leida Li, Lihuo He, Wen Lu, Xinbo Gao |
| 2025 | MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs. Wei Tao, Xiaoyang Qu, Kai Lu, Jiguang Wan, Guokuan Li, Jianzong Wang |
| 2025 | MAMF-Net: Modality-Adaptive Masked Fusion Network for Speech Emotion Recognition. Hengrui Li, Tianyi Lu, Jianfeng Wang, Xiaopei Chen, Yongbing Zhang, Shaohui Liu |
| 2025 | MAO: Efficient Model-Agnostic Optimization of Prompt Tuning for Vision-Language Models. Haoyang Li, Siyu Zhou, Liang Wang, Guodong Long |
| 2025 | MAPLE: Modality-Agnostic Prototype Learning for Egocentric Action Recognition. Da Li, Di Zhou, Yishan Zou, Shenghua Li, Meng Liu |
| 2025 | MCSMoG: Multi-Conditional Diffusion for Stylized Motion Generation with Parametric Control. Yi Yang, Xinzhu Li, Yufeng Chen, Guanghui Yue, Wei Zhou, Zhuo Su, Ruomei Wang, Fan Zhou, Baoquan Zhao |
| 2025 | MDC: Modality Distribution Consistent Distillation for Multi-View 3D Object Detection. Huikai Liu, Junyin Wang, Wenqian Zhu, Bin Fu, Shengwu Xiong, Cheng Liu |
| 2025 | MDMU: Multimodal Dynamic Mamba UNet for Multimodal sentiment analysis. Weibin Li, Jiazheng Huang, Bohuan Xue, Wenhao Shao, Yijun Liu, Xiaoyu Tang |
| 2025 | MEScan360: A Memory-Enhanced Scanpath Prediction Model for Omnidirectional Images. Yuchen Zhang, Dandan Zhu, Kaiwei Zhang, Fei Jiang, Guangtao Zhai |
| 2025 | MFA-Net: A Multi-Stage Network for Facial Acupoint Localization with Global-Local Feature Fusion and Acupoint Encoding. Chao Liu, Chuanlin Liao, Tingting Zhang, Yi Lin |
| 2025 | MG-STK: Weakly Supervised Multi-Granularity Learning Guided by Semantic Topological Knowledge. Qi Shen, Liu Yang, Canguang Ruan |
| 2025 | MIINT: Infuse Intuitive Data Correspondence for Model Interpretation. Yuyang Wang, Ligeng Chen, Bing Mao |
| 2025 | MIPP-FL: Personalized Layer Privacy Protection Federated Learning Based on Mutual Information. Xijun Zhao, Gang Li, Hongming Chen |
| 2025 | MLLM-DataEngine: Closing the Loop of Multimodal Instruction Tuning Data Generation. Zhiyuan Zhao, Bin Wang, Linke Ouyang, Yiqi Lin, Pan Zhang, Xiaoyi Dong, Jiaqi Wang, Conghui He |
| 2025 | MMPX: Multi-modal Mamba Prompter to Large Vision Foundation Model for RGB-X Semantic Segmentation. Pengfei Wu, Ye Liu, Hao Gao, Jun Liu |
| 2025 | MP-FIRE: An End-to-End Cross-Modal Framework for Complex Multi-Page Document Question Answering. Yongqi Yu, Jinxu Zhang, Yu Zhang |
| 2025 | MPCSFL: A Privacy-Preserving Split Federated Learning Framework in Edge Network. Jianfeng Guan, Haoyang Meng, Yizhong Hu, Pengcheng Wang, Kexian Liu |
| 2025 | MPT-CLIP: Multi-modal Patch-level Prompt Alignment in CLIP for Zero-shot Semantic Segmentation. Boliang Hao, Fangyu Wu, Yifan Lu, Bailing Zhang |
| 2025 | MRKD: Monotonic Relationship-based Knowledge Distillation for SAR Image Recognition. Jielei Wang, Zihan Cheng, Guoming Lu, Kexin Li, Guangchun Luo |
| 2025 | MROSS: Multi-Round Region-based Optimization for Scene Sketching. Yiqi Liang, Ying Liu, Dandan Long, Ruihui Li |
| 2025 | MSA-SAM2Net: A Polyp Segmentation Framework Based on Large Kernel Multi-Scale Attention. Jiyun Li, Jie Pan, Chen Qian, Ying Shen, Jiabao Zhao |
| 2025 | MSAF-Net: A Multi-Scale Adaptive Fusion Network for Facial Expression Recognition in Mental Health Patients. Guolong Liu, Jiayu Ye, Hao Wang, Qingxiang Wang |
| 2025 | MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception. Xiaoshuai Hao, Guanqun Liu, Yuting Zhao, Yuheng Ji, Mengchuan Wei, Haimei Zhao, Lingdong Kong, Rong Yin, Yu Liu |
| 2025 | MSC-Net: Multi-Scale Cross-Modal Network for Point Cloud Completion. Yan Zhang, Zhenjiang Du, Lei Zhang, Zhitao Liu, Mingda Tang, Feng Tian, Ning Xie |
| 2025 | MSD-HENet: Multi-Scale Detail-Preserving Holistic Enhancement Network for Infrared Images. Yijing Zhao, Chao Wang, Guanyu Liu, Yumeng Liu, Ruiheng Zhang |
| 2025 | MSDet: Receptive Field Enhanced Multiscale Detection for Tiny Pulmonary Nodule. Guohui Cai, Ruicheng Zhang, Hongyang He, Zeyu Zhang, Daji Ergu, Yuanzhouhan Cao, Jinman Zhao, Binbin Hu, Zhibin Liao, Yang Zhao, Ying Cai |
| 2025 | MSPoint-Gait: Multi-Scale Point Cloud Analysis for 3D Gait Recognition via Cross-Modal Learning. Xinzhu Li, Yi Yang, Yikun Chen, Guanghui Yue, Wei Zhou, Ruomei Wang, Xudong Mao, Juepeng Zheng, Fan Zhou, Ziqi Qiu, Baoquan Zhao |
| 2025 | MTSD: Simple Yet Effective Self-Distillation for Generalizable Deepfake Detection. Dexu Zhu, Jie Cao, Jiangnan Shao, Zhida Zhang, Junxian Duan, Ran He |
| 2025 | MVPS: Multi-View Adaptive Prompt Synergy for Zero-shot Anomaly Detection. Longzhao Huang, Wenhao Xu, Changwei Wang, Rongtao Xu, Peng Lu, Shibiao Xu |
| 2025 | Magnetic Framelet-Based Graph Contrastive Learning for Signed-Directed Graph. Yuting Chu, Yanfeng Sun, Fujiao Ju, Junbin Gao, Shaofan Wang, Baocai Yin |
| 2025 | Magnitude-Phase Dual-Path Speech Enhancement Network based on Self-Supervised Embedding and Perceptual Contrast Stretch Boosting. Alimjan Mattursun, Liejun Wang, Yinfeng Yu, Chunyang Ma |
| 2025 | Make Multi-source Task Greater Again: Adaptive Causal Diffusion Strategy. Ziyun Cai, Yawen Huang, Jie Song, Chang-Hui Hu, Tengfei Zhang |
| 2025 | Make Prototypes Perform Again: Prior-Prototypes Based Feature learning Framework for Few-Shot Hashing. Yi Lu, Shu Li, Huanglong Dong, Shuxiang Hou, Yurong Qian |
| 2025 | Making Small Language Model Excellent Symptom Inference Expert for Mental Disorders Detection. Meiling Li, Xiaotian Xu, Shicheng Li, Bin Wu |
| 2025 | MalDenoise: Enhancing Robustness of API-Based Malware Detection Against Adversarial Attacks. Xiaohui Chen, Xin Wang, Zuhui Yue, Zheng Li, Peipei Liu, Hongsong Zhu |
| 2025 | MamFusion: Multi-Mamba with Temporal Fusion for Partially Relevant Video Retrieval. Xinru Ying, Jiaqi Mo, Jingyang Lin, Canghong Jin, Fangfang Wang, Lina Wei |
| 2025 | Mamba-Based Blind Stitched Wide Field of View Light Field Image Quality Assessment via Dual-Viewport Sampling. Rui Zhou, Gangyi Jiang, Linwei Zhu, Yeyao Chen, Yueli Cui, Ting Luo, Haiyong Xu |
| 2025 | Mamba-SLAM: Enhancing Neural Implicit SLAM with Uncertainty and Mamba. Jiaming Lu, Yunrui Zhu, Ruyu Liu, Xu Cheng, Jianhua Zhang, Bo Sun, Xiufeng Liu |
| 2025 | MambaMIC: An Efficient Baseline for Microscopic Image Classification with State Space Models. Shun Zou, Zhuo Zhang, Yi Zou, Guangwei Gao |
| 2025 | MambaPose: Efficient 2D Human Pose Estimation with Pose-Prior Guided State Space Model. Yalong Xu, Mengting Jiang, Yang Gao, Junlong Mu, Di Wang, Lin Zhao |
| 2025 | Mask-Guided Transformer with Hybrid Supervision for 3D Instance Segmentation. Qi Zeng, Jianwei Guo, Haobo Qin, Yinchang Zhou, Weiliang Meng, Xiaopeng Zhang |
| 2025 | Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds. Hongliang Zeng, Ping Zhang, Fang Li, Jiahua Wang, Tingyu Ye, Zichen Wei |
| 2025 | Masked Self-Supervised Learning and Semantic Noise Separation for Video Anomaly Detection. Qiao Wang, Menghao Zhang, Lei Zhang, Qi Qi, Haifeng Sun, Pengfei Ren, Bo He, Jing Wang, Jingyu Wang |
| 2025 | MdCoT: Medical Diagnosis Chain-of-Thought with Self-Diagnostic Refinement for Alzheimer's Disease. Chunlin Lu, Yongheng Zhang, Peng Wang, Wenpeng Lu, Libo Qin |
| 2025 | Measuring and Controlling the Spectral Bias for Self-Supervised Image Denoising. Wang Zhang, Huaqiu Li, Xiaowan Hu, Tao Jiang, Zikang Chen, Haoqian Wang |
| 2025 | Merge Mode for Template-based Intra Mode Derivation (TIMD) in ECM. Mohsen Abdoli, Ramin G. Youvalari, Frank Plowman, Alexandre Tissier |
| 2025 | Meta-Learning Empowered Meta-Face: Personalized Speaking Style Adaptation for Audio-Driven 3D Talking Face Animation. Xukun Zhou, Fengxin Li, Ziqiao Peng, Xinyu Wang, Hongyan Liu, Zhaoxin Fan, Jun He |
| 2025 | Mimicking Real-world Knowledge to Generate 3D Adversarial Point Clouds. Tengjun Liu, Qianbin Guo, Xuanchi Gong, Huan Zhang, Xianyi Chen |
| 2025 | Missing Pieces, Complete Picture: Navigating Micro-Video Popularity with Flexible Mixture of Modality Experts. Yang Liu, Zhangtao Cheng, Bin Chen, Yan Liu, Xing He, Ting Zhong, Fan Zhou |
| 2025 | Mitigating Cache Noise in Test-Time Adaptation for Large Vision-Language Models. Haotian Zhai, Xinyu Chen, Can Zhang, Tianming Sha, Ruirui Li |
| 2025 | Mitigating Hallucination in Large Video-Language Models with Injected Semantics. Bimei Wang, Fan Wen, Jisheng Dang, Huiguo He, Xiwen Wang, Nannan Zhu, Jiasi Weng |
| 2025 | Mitigating Knowledge Forgetting by Generative Knowledge Replay and Forgetting-aware Aggregation in Semi-Supervised Federated Learning. Hongquan Liu, Yixin Ren, Jihong Guan, Shuigeng Zhou |
| 2025 | Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization. Yixiao He, Haifeng Sun, Qi Qi, Zirui Zhuang, Pengfei Ren, Huazheng Wang, Yafeng Nan, Jing-Yu Wang |
| 2025 | MixLGN: Mixed Local-Global Network for 3D Human Pose Generation. Xinyang Liu, Sanyi Zhang, Chixuan Wei, Yinghao Yang, Long Ye |
| 2025 | Mixture-of-Modality-Experts for Unified Image Aesthetic Assessment with Multi-Level Adaptation. Fei Gao, Jiaqi Shi, Yuhao Lin, Xiaodan Zhang, Lihuo He, Nannan Wang |
| 2025 | MoE-based Mamba for Multi-scene Universal Remote Sensing Semantic Segmentation. Jie Zhang, Mingwen Shao, Xiaodong Tan, Xiangyong Cao |
| 2025 | MoMuSE: Momentum Multi-modal Target Speaker Extraction for Real-time Scenarios with Impaired Visual Cues. Junjie Li, Ke Zhang, Shuai Wang, Kong Aik Lee, Man-Wai Mak, Haizhou Li |
| 2025 | MoPE: Mixture of Policy Experts and Verification with Multimodal Information for Instance ImageGoal Navigation. Yijie Zeng, Xinyi Chen, Kexun Chen, Zhixuan Shen, Haonan Luo, Tianrui Li |
| 2025 | Mobile-StereoHPE: Real-Time Mobile 3D Hand Pose Estimation from Stereo Gray Images. Dongfang Zhao, Menghe Zhang, Yangwen Liang, Shuangquan Wang, Kee-Bong Song, Donghoon Kim |
| 2025 | Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction. Qingchao Jiang, Zhishuo Xu, Zhiying Zhu, Ning Chen, Haoyue Wang, Zhongjie Ba |
| 2025 | Model-Guardian: Protecting against Data-Free Model Stealing Using Gradient Representations and Deceptive Predictions. Yunfei Yang, Xiaojun Chen, Yuexin Xuan, Zhendong Zhao |
| 2025 | MotionFlow: Learning Implicit Motion Flow for Complex Camera Trajectory Control in Video Generation. Guojun Lei, Chi Wang, Yikai Wang, Hong Li, Ying Song, Weiwei Xu |
| 2025 | Multi-Attention Guided Knowledge Distillation For High-Performance Object Detection. Zhihao Kong, Qifeng Lin, Qishen Shen, Jiayi Qiu, Gang Fu, Yuanlong Yu |
| 2025 | Multi-Grained Alignment for Visual Grounding. Hongbing Li, Bo Xiao, Linyi Yang, Xinran Wang, Qi Li |
| 2025 | Multi-Granularity Based Collaborative Learning for Semi-Supervised Hashing. Shuai Cheng, Lin Wang, Xiaoshuai Hao, Wanqian Zhang, Xiaohua Chen, Wei Wang |
| 2025 | Multi-Hypothesis 3D Hand Mesh Recovering from a Single Blurry Image. Yuming Chen, Rongyu Chen, Zhongqun Zhang, Yihua Cheng, Hyung Jin Chang |
| 2025 | Multi-Level Graph Pruning-Based Framework for Graph Retrieval-Augmented Generation. Hongxu Li, Xiaodi Li, Fulin Su, Qinglang Guo |
| 2025 | Multi-Level Normalizing Flow for Comprehensive Anomaly Detection and Localization. Jie Shi, Xin Wen, Shijie Guo, Robert H. Deng, Jianan Xie, Rui Cao |
| 2025 | Multi-Modal Contrastive Fusion for Consensus Learning in Sequential Group Recommendation. Yue Kou, Dong Li, Qixiang Tang, Derong Shen, Tiezheng Nie |
| 2025 | Multi-Modality Representation Learning for Antibody-Antigen Interactions Prediction. Peijin Guo, Minghui Li, Hewen Pan, Ruixiang Huang, Lulu Xue, Shengqing Hu, Zikang Guo, Wei Wan, Shengshan Hu |
| 2025 | Multi-Passage Retrieval-Augmented Multimodal Language Generation Model for Knowledge-Based Visual Question Answering. Siyu Cheng, Chao Yang, Bin Jiang |
| 2025 | Multi-Resolution Infrared-Visible Image Fusion using Multi-Scale Residual Quantization. Honglin Wu, Jun-Jie Huang, Huibin Tan, Wanrong Huang, Yuhua Tang, Xueqiong Li |
| 2025 | Multi-Scale Core-Peripheral Attention Network for Camouflaged Object Detection. Yueqian Quan, Tiancheng Pan, Chuangjie Fang, Yan Li, Jianwei Zheng |
| 2025 | Multi-Scale Diffusion: Enhancing Spatial Layout in High-Resolution Panoramic Image Generation. Xiaoyu Zhang, Teng Zhou, Xinlong Zhang, Jia Wei, Yongchuan Tang |
| 2025 | Multi-Scale Hypergraph Relational Reasoning for Weakly Supervised Recognition of Group Activities. Chongyang Xu, Runtian Zheng, Ziliang Feng, Chengfang Zhang |
| 2025 | Multi-Scale Tubularity-Aware U-Net. Yue Sun, Jie Song, Ziyun Cai, Ying Wang, Liang Xiao, Yawen Huang |
| 2025 | Multi-Task Self-Supervised Learning for Automated Measurement of Left Ventricular Ejection Fraction in Echocardiography. Zhanpeng Xu, Yu Lu, Wei Zhang, Xiaoqing Li, Shijie Shi, Xianghua Fu |
| 2025 | Multi-branch Strong Perturbation Contrastive Learning for Semi-supervised Medical Image Segmentation. Feng Xiao |
| 2025 | Multi-granularity Frequency Difference-Aware Attention for Video Question Answering. Mingyang Liu, Fan Zhou, Ruomei Wang, Baoquan Zhao |
| 2025 | Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? Yiwen Guan, Viet Anh Trinh, Vivek Voleti, Jacob Whitehill |
| 2025 | Multi-mode Bidirectional Feature Fusion and Domain-consistency Refinement for Real-time Monocular 6D Object Pose Estimation. Shuo Yang, Junyi Wang, Yue Qi |
| 2025 | Multi-sentence Video Grounding for Long Video Generation. Wei Feng, Xin Wang, Hong Chen, Zeyang Zhang, Wenwu Zhu |
| 2025 | Multi-soft-label Guided Supervised Contrastive Learning for Gait Emotion Recognition. Chengju Zhou, Mengxin Xu, Xiaotong Fan, Liangyu Lu, Jiahui Pan, Lewei He |
| 2025 | Multi-view Video Coding with Decoupled Neural Representation for Multi-modal Traffic Data. Siqian Nie, Xin Ding, Jiabo Wu, Sihan Lin, Qiong Liu |
| 2025 | Multimodal Causal Reasoning-Guided Intrinsic Goals for Efficient Task Completion in Reinforcement Learning. Tong Wu, Yi Wen, Guangchun Luo, Lingfu Wang, Qiuran Li, Dayong Zhu |
| 2025 | Multimodal Conversational Emotion Analysis with Robustness to Incomplete Modality Details. Sidharth Anand, Chaitanya Sai Chandu Yendru, Sreyasee Das Bhattacharjee, Junsong Yuan |
| 2025 | Multimodal Emotion Recognition in Conversations via Graph Structure Learning. Feng Xiong, Geng Tu, Yice Zhang, Jun Wang, Shiwei Chen, Bin Liang, Yue Yu, Min Yang, Ruifeng Xu |
| 2025 | Multimodal Mixture of Low-Rank Experts for Sentiment Analysis and Emotion Recognition. Shuo Zhang, Jinsong Zhang, Zhejun Zhang, Lei Li |
| 2025 | Multimodal Representation Learning Techniques for Comprehensive Facial State Analysis. Kaiwen Zheng, Xuri Ge, Junchen Fu, Jun Peng, Joemon M. Jose |
| 2025 | MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach. Xin Zhang, Siting Huang, Xiangyang Luo, Yifan Xie, Weijiang Yu, Heng Chang, Fei Ma, Fei Yu |
| 2025 | Mutual Guidance and Residual Integration for Image Enhancement. Kun Zhou, Xinyu Lin |
| 2025 | Mutual Semantic Bridged Tri-Tower Fusion for Audio-Visual Segmentation. Jingqi Qu, Hui Yu, Dongchen Zhu, Jiamao Li |
| 2025 | Mutual Teaching: Semi-supervised Medical Image Classification with Cross Structural Consistency Learning. Chuankai Xu, Junhao Li, Ruxin Wang |
| 2025 | NLOS-R Yi Wang, Ruixu Geng, Jiarui Zhang, Xiaolong Du, Yan Chen, Yang Hu |
| 2025 | NLOSdiffuser: Generalized Steady-State Non-Line-of-sight Imaging toward Indoor Scenarios. Xian Gao, Luyang Wang, Jiacheng Ruan, Yuyang Zhang, Zongyun Zhang, Ting Liu, Yuzhuo Fu |
| 2025 | NOVA3D: Normal Aligned Video Diffusion Model for Single Image to 3D Generation. Yuxiao Yang, Peihao Li, Yuhong Zhang, Junzhe Lu, Xianglong He, Minghan Qin, Weitao Wang, Haoqian Wang |
| 2025 | NVPose: Novel View Data Augmentation for Human Pose Estimation. Yiqing Xu, Liwei Liao, Ronggang Wang |
| 2025 | Navigating the Implicit Map: Community-Aware Disentangled Experts for Multi-Modal Knowledge Graph Completion. Shichong Li, Bin Chen, Yichen Xin, Zhangtao Cheng, Qing Chen, Ting Zhong, Fan Zhou |
| 2025 | NeRFSwap: A NeRF-Based Generative Model for Face Swapping. Shuangyi Tan, Mingzhi Mao, Guanbin Li |
| 2025 | Neeko: Model Hijacking Attacks Against Generative Adversarial Networks. Junjie Chu, Yugeng Liu, Xinlei He, Michael Backes, Yang Zhang, Ahmed Salem |
| 2025 | Neural Implicit Reconstruction and Fast Rendering Based on Dual Spherical Shell. Zijian Wang, Yuqi Liu, Yan Zhao, Binghao Wang, Shen Cai, Yanting Zhang |
| 2025 | Neural Representations for Scalable Video Coding. Yiying Wei, Hadi Amirpour, Christian Timmerer |
| 2025 | Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding. Yueyang Li, Zijian Kang, Shengyu Gong, Wenhao Dong, Weiming Zeng, Hongjie Yan, Wai Ting Siok, Nizhuan Wang |
| 2025 | Noise Mitigation for Unsupervised Cross-Domain Image Retrieval. Jiayang Liu, Kai Wang, Zheng Wang, Xin Liu, Fumin Shen, Xing Xu |
| 2025 | Non-Parametric Media Quality Recovery from Spammer-Affected Subjectively Annotated Datasets. Lohic Fotio Tiotsop, Andrés Altieri, Giuseppe Valenzise |
| 2025 | Nucleus-SAM: Point-Supervised SAM for Nucleus Segmentation. Yu Zhou, Xing Wu, Liangshan Zhu, Chengliang Wang, Zailin Yang, Yao Liu |
| 2025 | Nutrition Prediction from Food Images Using Foundation Models. Vitalii Emelianov, Niki Martinel |
| 2025 | OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model. Liuhan Chen, Zongjian Li, Bin Lin, Bin Zhu, Qian Wang, Shenghai Yuan, Xing Zhou, Xinhua Cheng, Li Yuan |
| 2025 | OFF3D: Object-Centric Feature Field for 3D Scene Segmentation. Qinwei Lin, Bing Wang, Junjie Zhao, Jun Xu, Haoqian Wang |
| 2025 | OG-Mapping: Octree-based Structured 3D Gaussians for Online Dense Mapping. Meng Wang, Junyi Wang, Changqun Xia, Chen Wang, Yue Qi |
| 2025 | OGS-Mapping: Object-Level 3D Gaussian Splatting Mapping. Xinyu Liu, Zhenghao Qi, Rong Ding |
| 2025 | OSA: Object-level Scale Alignment for Small Object Detection in Large-Scale Images. Yuxiang Wang, Yixuan Ji, Xiangqin Chen, Chuanyuan Tan, Yajin Li, Haozhong Xue, Zheng Zhao |
| 2025 | OSLLM: A Retrieve-Reason-Refine Framework for Multi-Domain Relation Extraction with Large Language Models. Jie Zhou, Yongxue Shan, Meihan Wu, Fei Hu, Li Zheng, Xiaodong Wang |
| 2025 | Object Isolated Attention for Consistent Story Visualization. Xiangyang Luo, Junhao Cheng, Yifan Xie, Xin Zhang, Tao Feng, Zhou Liu, Fei Ma, Fei Yu |
| 2025 | Object Placement for Anything. Bingjie Gao, Bo Zhang, Li Niu |
| 2025 | Object-Centric Feature Enrichment for Single-Domain Generalized Object Detection. Shukuan Yuan, Zihao Zhang, Yahong Han |
| 2025 | OcSplats: Rendering Occluded Humans with Prior Knowledge. Jie Zhang, Qiongjie Cui, XuLei Yang, Na Zhao |
| 2025 | Omni-AD: Learning to Reconstruct Global and Local Features for Multi-class Anomaly Detection. Jiajie Quan, Ao Tong, Yuxuan Cai, Xinwei He, Yulong Wang, Yang Zhou |
| 2025 | OmniRestore: Robust Universal Image Restoration from Combined and Unspecified Degradations. Anjusree Karnavar, Yang Li, Jiajun Liu, Jun Zhou, Junhu Wang |
| 2025 | OmniStyle: Attention-Optimized Global and Local Image Stylization with Diffusion Model Inversion. Jiarong Cheng, Xihang Qiu, Qing Zhou, Ming Li, Chun Li, Yao Lu, Fei Richard Yu |
| 2025 | One General Plug-In for Facial Heatmap-based Keypoint Detection. Hanyu Jiang, Jian Xue, Xing Lan, Ke Lu |
| 2025 | One-Shot Federated Learning with Classifier-Free Diffusion Models. Obaidullah Zaland, Shutong Jin, Florian T. Pokorny, Monowar Bhuyan |
| 2025 | Only One Stage: A Chemical-Aware Model for Accurate Combustion Chemical Kinetics Prediction. Zhenglun Sun, Peng Qiao, Yong Dou, Rongchun Li, Sidun Liu, Wenyu Li, Wenjie Hu |
| 2025 | Open-Scene Understanding-oriented 3D Scene Graph Generation. Yuansu Hao, Fei Yu, Yanhao Wang, Yuehua Li, Quan Deng, Yuan Yu, Chen Huang, Nan Che |
| 2025 | OpenDUN: To Discover Unknown Number of Visual Categories. Sik Chit Wu, Munan Ning, Dong Wei, Yefeng Zheng, Donghuan Lu, Li Yuan |
| 2025 | OptiDiff: Unsupervised Deep-Sea Image Enhancement via Optical Priors Guided Stable Diffusion. Wenhui Wu, Yuemiao Wang, Hua Li, Yuanhao Gong |
| 2025 | Optimization of Multimodal Inputs Based on Diffusion Models: Zero-Shot Semantic Image Generation. Leilei Wang, Renjie Lu, Fengzhao Sun, Yunxiang Zhang, Jun Yu, Qingsong Liu, Jianqing Sun, Jiaen Liang |
| 2025 | Optimizing Efficiency and Visual-Textual Alignment for LLM-Based Radiology Report Generation. Zailong Chen, Peng Gao, Yujian Lee, Johan Barthelemy, Luping Zhou, Lei Wang |
| 2025 | Optimizing and Attacking Embodied Intelligence: Instruction Decomposition and Adversarial Robustness. Minghao Li, Wenpeng Xing, Yong Liu, Wei Zhang, Meng Han |
| 2025 | Overcoming Feature Contamination by Unidirectional Information Modeling for Vision-Language Tracking. Jingchao Wang, Zhijian Wu, Wenlong Zhang, Wenhui Liu, Jianwei Zhang, Dingjiang Huang |
| 2025 | P2WNet: Homography Estimation for Part-To-Whole and Cross-Modality Scenarios. ShangXuan Xie, Haifeng Wu, Wen Li, Lixin Duan |
| 2025 | PCM-SAR: Physics-Driven Contrastive Mutual Learning for SAR Classification. Pengfei Wang, Hao Zheng, Zhigang Hu, Aikun Xu, Meiguang Zheng, Liu Yang |
| 2025 | PDFIN: Prompt-Guided Dynamic Feature Integration Network for Few-Shot Class-Incremental Remote Sensing Scene Classification. Kaili Lu, Jian Ji, Ruoxue Li, Falin Wang, Chengwei Xu |
| 2025 | PDMambaNet: Poisson Denoising-Aided Twin-Path Mamba for Brain MRI Image Segmentation. Dayong Ren, Feifei Zhang, Fei Shi, Aoxue Chen |
| 2025 | PDNet: Patch-Wise Deformation Network for Cross-Modal Point Cloud Completion. Jingwen He, Zhenjiang Du, Ning Xie, Lei Zhang |
| 2025 | PGD-N2L: A Parameter-Guided Disentanglement Approach for Normal-To-Lombard Speech Conversion. Hongyang Chen, Yuhong Yang, Xinmeng Xu, Xingyu Liu, Weiping Tu, Zhongyuan Wang, Cedar Lin, Xin Zhao |
| 2025 | PSFD: Proactive Spatial-Frequency Defense against Malicious Exemplar-Guided Image Editing. Li Zeng, Xiaojun Mo, Meng Xie, Hangtao Zhang, Yixiang Liu, Yezhuo Peng, Yanchun Li |
| 2025 | PSUMatch: Unifying Open-Set Semi-Supervised Learning with Progressive Semantic Universum. Chenyang Song, Songcan Chen |
| 2025 | Partially View-aligned Clustering with Unbiased Semantic Learning. Liang Zhao, Ziyue Wang, Yukun Yuan |
| 2025 | Patch-Wise Hypergraph Contrastive Learning with Dual Normal Distribution Weighting for Multi-Domain Stain Transfer. Haiyan Wei, Hangrui Xu, Bingxu Zhu, Yulian Geng, Aolei Liu, Wenfei Yin, Jian Liu |
| 2025 | PatchSegDet: Attack-Agnostic Detection of Physical Adversarial Patches in Face Recognition Systems. Zhiqiang Shen, Qinfeng Li, Xuhong Zhang, Yuxiang Cai, Xiaochu Chen, Ping An, Haiqin Weng, Yang Liu |
| 2025 | Pedestrian Trajectory Prediction Driven by Bidirectional Intention-Interaction. Hang Yu, Yansen Yu, Jiayan Qiu |
| 2025 | Perceiving Smoothness: Temporal Consistency Learning for Multi-Frame-Rate Video Quality Assessment. Jinliang Han, Xiongkuo Min, Wei Sun, Guangtao Zhai |
| 2025 | Perception-Oriented Latent Coding for High-Performance Compressed Domain Semantic Inference. Xu Zhang, Ming Lu, Yan Chen, Zhan Ma |
| 2025 | Perspective Makes Perfect: Prompt-tuning Vision-Language Models for Action Recognition with Diversified Multi-Modal Observation. Hailun Zhang, Qijun Zhao, Zhen Zhai, Xinrui Wang |
| 2025 | Perturbing Confounders via Causal Disentanglement for Domain Generalization. Jingliang Bian, Junhao Li, Jian Xu, Ruxin Wang |
| 2025 | PhD-GS: Real-World Underwater Scene Reconstruction Using Gaussian Splatting. Yu Du, Runfa Chen, Wenhang Ge, Fuchun Sun, Ling Wang, Xiao Lv |
| 2025 | PhysFFTFormer: A Frequency Domain-based Vision Transformer for Efficient Remote Physiological Measurement. Fangyuan Liu, Sirui Zhao, Tong Xu, Yu Sun, Hao Wang, Suojuan Zhang, Enhong Chen |
| 2025 | PhysLight: Accurate rPPG Heart Rate Measurement with Adaptive Video Relighting. Menglin Zhang, Xiaoxin Guo, Bohao Qu, Xiaofeng Cao, Shuifa Sun, Qing Guo |
| 2025 | PiCo: Jailbreaking Multimodal Large Language Models via Pictorial Code Contextualization. Aofan Liu, Lulu Tang, Ting Pan, Yuguo Yin, Bin Wang, Ao Yang |
| 2025 | Pioneer: Encrypted Video Traffic Identification for Mixed Transmission of Video-Audio Segments. Weitao Tang, Taizhong Xu, Meijie Du, Die Hu, Xu Tang, Qingyun Liu |
| 2025 | Pixel-Level Adaptive Refinement Framework with Knowledge Distillation for Weakly Supervised Semantic Segmentation. Yulian Li, Xinfang Qin, Zhengwen Shen, Shuyu Han, Jun Wang |
| 2025 | Pixel-wise Single Image Reflection Removal Method Based on Reinforcement Learning. Yucheng Wang, Xueshi Yu, Zhengzhe Zhang, Xiankai Lu, Yilong Yin, Qian Zheng, Wenjia Meng |
| 2025 | Poison in the Well: Feature Embedding Disruption in Backdoor Attacks. Zhou Feng, Jiahao Chen, Chunyi Zhou, Yuwen Pu, Qingming Li, Shouling Ji |
| 2025 | Pop-Diffuseq: Controllable Symbolic Music Multi-Instrument Infilling and Accompaniment Generation with Long-Axis Attention. Yi Zou, Haonan Cheng, Long Ye, Qin Zhang |
| 2025 | PopuDet: Autism Spectrum Disorder Detection in Population Graphs via Micro-macro Relationship Construction and Multi-feature Fusion. Manman Yuan, Ting Xu, Jiazhen Ye, Peican Zhu, Jiacheng Wang, Keke Tang |
| 2025 | Pretrain like Your Inference: Masked Tuning Improves Zero-Shot Composed Image Retrieval. Junyang Chen, Hanjiang Lai |
| 2025 | Prior-Guided Test Time Adaptation for Blind Image Quality Assessment. Shishun Tian, Fangjie Hou, Guanghui Yue, Yuanhao Gong, Wenbin Zou, Ting Su |
| 2025 | Privacy-Preserving Anti-Recompression Video Watermarking in Bitstream Domain. Zhekai Luo, Xiangyu Gao, Peijia Zheng, Jian Li, Weiqi Luo |
| 2025 | Privacy-Preserving Gait Authentication Scheme Based on Partial Euclidean Distance in Cloud Computing. Tong Ji, Yunting Tao, Fanyu Kong, Guoyan Zhang, Yuliang Shi, Jia Yu |
| 2025 | ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing. Tianwen Zhou, Jing Wang, Songtao Wu, Kuanhong Xu |
| 2025 | Probabilistic Embeddings with Causal Constraint for Error Detection in Egocentric Procedural Videos. Tong Hou, Shenshen Li, Xun Jiang, Zheng Wang, Fumin Shen, Xing Xu |
| 2025 | Progressively Enhanced Camouflaged Object Detection via Boundary Awareness. Jinyang Wang, Wei Wu |
| 2025 | Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling. Yunhan Ren, Ruihuang Li, Lingbo Liu, Changwen Chen |
| 2025 | Promoting Segment Anything Model towards Highly Accurate Dichotomous Image Segmentation. Xianjie Liu, Keren Fu, Yao Jiang, Qijun Zhao |
| 2025 | Prompt-Based Two-Stage Enhancement for Low-Light Object Detection. Bohan Xiong, Kan Chang, Mingyang Ling, Shilin Huang, Shucheng Xia, Yujian Yuan |
| 2025 | Prompt-Guided Multi-Task Decoupling for Speech Presentation Skills Assessment. Zihua Xiong, Jiachen Tan, Tingting Zhang, Bin Wu, Chunping Zheng |
| 2025 | Prompt-driven Multi-modal Unsupervised Domain Adaptation for 3D Semantic Segmentation. Mingwei Xing, Yao Wu, Yachao Zhang, Yanyun Qu |
| 2025 | Prototype Guided Multi-Scale Class Aggregation for Generalized Few-Shot Semantic Segmentation. Wenxin Jiang, Peng Qin, Guanhua Zhang, Kai Wu, Shengke Wang |
| 2025 | Prototype Optimal Transport for Box-Supervised 3D Instance Segmentation. Ye Zhou, Wenfei Yang, Tianzhu Zhang, Xiang Liu |
| 2025 | Prototype-Based Communication Topology Optimization for Decentralized Federated Learning. Xinlin Leng, Kangyu Hu, Hanlin Gu, Xiangui Kang, Wenyuan Yang |
| 2025 | Prototype-guided Vision Foundation Models fine-tuning for Domain Generalized Semantic Segmentation. Fengwen Liu, Huan Hu, Xiangbin Wu, Wenqiang Hu |
| 2025 | Puzzle-MAE: A Puzzle-Inspired Mask Autoencoder for Multi-Modal Fusion. Xin Li, Bochao Zou, Rongquan Wang, Huimin Ma |
| 2025 | Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection. Nasar Iqbal, Niki Martinel |
| 2025 | QCG-SLAM: Quadtree-based Condensed Gaussian Splatting for Visual SLAM. Xun Fang, Zixuan Hua, Xiao Zhao, Lihua Zhang |
| 2025 | QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation. Jiaqi Li, Ruowei Wang, Yu Liu, Qijun Zhao |
| 2025 | QTG-VQA: Question-Type-Guided Architectural for VideoQA Systems. Zhixian He, Pengcheng Zhao, Shujin Lin |
| 2025 | QoE Evaluation of Remote Physiotherapy in Volumetric Video and Video-Based Real-Time Communication. Ashutosh Singla, Irene Viola, Jack Jansen, Pablo César |
| 2025 | Quality Control For HEVC: A Deep Reinforcement Learning Approach. Yichen Guo, Rui Ding, Mai Xu, Lai Jiang, Shengxi Li, Xin Deng |
| 2025 | Quality-Guided Dynamic Memory for LLMs-based Long-Term Video Understanding. Bimei Wang, Jingmei Jiao, Jisheng Dang, Qingrun Jiang, Jiyuan Lin, Zhixuan Chen, Teng Wang, Jun Yang |
| 2025 | Quantized Memory-Efficient Full-Parameter Tuning with Sign Descent Optimization. Xuezhi Zhao, Haichen Bai, Qiang Li, Qi Wang |
| 2025 | RBDN: A Robust Background Denoising Network for Weakly Supervised Temporal Language Grounding. Yifan Lyu, Zehua Zang, Hongzhou Wu, Lixiang Liu, Jiangmeng Li |
| 2025 | RDFNet: Real-time Object Detection Framework for Foggy Scenes. Tianle Fang, Zhenbing Liu, Yutao Tang, Yingxin Huang, Haoxiang Lu, Chuangtao Zheng |
| 2025 | REAL: Retrieval-Augmented Prototype Alignment for Improved Fake News Video Detection. Yili Li, Jian Lang, Rongpei Hong, Qing Chen, Zhangtao Cheng, Jia Chen, Ting Zhong, Fan Zhou |
| 2025 | RIDE: Robust and Decentralized Federated Learning with Input Validation. Zhi Lu, Mengyuan Zou, Samir M. Umran, Yuhao Long, Songfeng Lu, Junjun Wu, Mu Wang |
| 2025 | RKU: Relevant Knowledge-aware Unlearning for Federated Continual Learning. Haodong Zhang, Liu Yang, Zihan Jiang |
| 2025 | RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model. Zhuan Shi, Jing Yan, Xiaoli Tang, Lingjuan Lyu, Boi Faltings |
| 2025 | RLK-Net: An Efficient Residual Large Kernel Convolution with Channel-Wise Adaptive Feature Fusion for Medical Image Segmentation. Qingxue Zhao, Zhongjie Pan, Di Wu, Ge Tang, Jun Tian |
| 2025 | ROMA: Regularization for Out-of-distribution Detection with Masked Autoencoders. Xiaochen Feng, Yuan Jiang, Hao Sha, Yongbing Zhang |
| 2025 | RWKV-UI: UI Understanding with Enhanced Perception and Reasoning. Jiaxi Yang, Haowen Hou |
| 2025 | ReCLIP: Reconstruction-Refined Zero-/Few-Shot Anomaly Classification and Segmentation. Lanning Zhang, Yali Shi, Shujie Lan, Fei Gao, Hao Qin, Nannan Wang |
| 2025 | ReDet: Effective Real-time Object Detection via Efficient Multi-scale Extraction Aggregation. Jian Li, Xin Jiang, Lu Jin, Zechao Li |
| 2025 | ReF-LLE: Personalized Low-Light Enhancement via Reference-Guided Deep Reinforcement Learning. Ming Zhao, Pingping Liu, Tongshun Zhang, Zhe Zhang |
| 2025 | ReFEdit: Rehearsal-Free Lifelong Knowledge Editing for Large Language Models. Xianjie Mo, Youcheng Pan, Yongshuai Hou, Ping Luo, Yang Xiang |
| 2025 | Real-World Retrieval Support Zero-Shot Learning: A Novel Learning Paradigm and an Efficient Balanced Generative Framework. Gang Yang, Xinyue Ju, Yipeng Xu, Yici Zhang |
| 2025 | Real-World Video Dehazing based on Optical Flow Deformable Attention Fusion and Contrastive Learning. Mengnan Zhang, Gang Zhou, Linghui Ma, Zhaoxi Liu, Li Zhang, Zhenhong Jia |
| 2025 | RealMind: Advancing Visual Decoding and Language Interaction via EEG Signals. Dongyang Li, Haoyang Qin, Mingyang Wu, Jiahua Tang, Chen Wei, Quanying Liu |
| 2025 | RealityAvatar: Comprehensive Head Avatar Generation with 360° Rendering. Houteng Yu, Hao Zhu, Xun Cao |
| 2025 | Recovering Human Mesh from Videos by 2D and 3D Deformable Attentions. Yulei Kang, Teng-Yue Chen, Xiaotong Lin, Siyu Jiang, Jian-Fang Hu |
| 2025 | Rectified Mixed-Label Learning for Semi-Supervised Medical Image Segmentation. Zeyu An, Zichong Chen |
| 2025 | Redefining Image-to-Recipe Retrieval with Nutritional and Ingredient Similarity. Satayu Parinayok, Shin'ichi Satoh, Kiyoharu Aizawa, Yoko Yamakata |
| 2025 | Redesigning Upsampling in Decoders with Aligned Feature Aggregation for Semantic Segmentation. Qinjie Hu, Fei Qi, Kaiwen Fu, Chengyuan Chang, Xiaotian Wang, Kun Liu, Guangming Shi |
| 2025 | Redundancy Optimization via Mutual Information for Unsupervised Domain Adaptation. Xing Wei, Dexuan Zhao, Fan Yang, Taizhang Hu, Chong Zhao, Yang Lu |
| 2025 | Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation. Hanbing Liu, Zhi-Qi Cheng, Wangmeng Xiang, Jun-Yan He, Bin Luo, Yifeng Geng, Xuansong Xie |
| 2025 | Refining Interactions: Enhancing Anisotropy in Graph Neural Networks with Language Semantics. Zhaoxing Li, Haifeng Zhang, Xiaoming Zhang, Chengxiang Liu |
| 2025 | Region Confidence Refinement with Progressive Semantic Mining for Source-Free Domain Adaptive Object Detection. Zichong Chen, Zeyu An, Jian Cheng |
| 2025 | Reinforced Model Merging. Jiaqi Han, Jingwen Ye, Shunyu Liu, Haofei Zhang, Jie Song, Zunlei Feng, Mingli Song |
| 2025 | Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game Approach. Chenglong Lu, Shen Liang, Xuewei Wang, Wei Wang |
| 2025 | Relation-Aware Graph Attention Network for Nuclei Classification. Lingbo Zhang, Ye Zhang, Linghan Cai, Xianchao Guan, Kai Zhang, Yongbing Zhang |
| 2025 | Relational Enhancement Network for Industrial Defect Detection. Haotian Linghu, Meiqin Liu, Senlin Zhang |
| 2025 | Representation Disentanglement for Semantic Coding. Jinming Liu, Junhao Geng, Lexiang Lv, Wenjun Zeng, Xin Jin |
| 2025 | Research on Audio-Visual Quality Assessment Dataset and Method for User-Generated Omnidirectional Video. Fei Zhao, Da Pan, Zelu Qi, Ping Shi |
| 2025 | Residual-based Efficient Bidirectional Diffusion Model for Image Dehazing and Haze Generation. Bing Liu, Le Wang, Hao Liu, Mingming Liu |
| 2025 | Rethinking 3D Robotic Perception: Elastic Voxel Representation with Splatting Distillation. Shaohui Pan, Yong Xu, Ruotao Xu, Zihan Zhou, Si Wu, Zhuliang Yu, Patrick Le Callet |
| 2025 | Rethinking Cross-Modality Fusion Mamba from a Frequency Domain Perspective. Zeyu Wang, Huiying Xu, Yun Liu, Chen Li, Xinzhong Zhu, Xiaolei Zhang, Hongbo Li |
| 2025 | Rethinking Cross-view Object Geo-Localization: Towards Many-to-Many Real-world Localization. YuanYuan Li, Qingwang Zhang, Yingying Zhu |
| 2025 | Rethinking DeNoising Training for DETR-based Object Detection. Bin Jiang, Yiming Fan, Chao Yang, Chenglong Lei, Ruiqi Hu, Zheng Zhou |
| 2025 | Rethinking Joint Optimization in Feature Compression: Insights from Person Re-Identification. Changsheng Gao, Zhuoyuan Li, Li Li, Dong Liu, Feng Wu, Weisi Lin |
| 2025 | Rethinking Steel Surface Defect Segmentation with Pseudo Mixup and Self Distillation. Jialin Xu, Jing Tang, Yankai Jin, Jun Liu, Zeyu Gong |
| 2025 | Retinal OCT Anomaly Detection Based on Suspicious Strategy and Relational Learning. Minghui Zhai, Xing Wu, Liangshan Zhu, Chengliang Wang, Yonggang Luo, Peng Wang |
| 2025 | RetouchDiffusion: Unsupervised Personalized Image Retouching via Diffusion Models. Yang Dong, Zhuoqi Ma, Zejun You, Yunan Li, Qiguang Miao |
| 2025 | Revisiting DETR for Small Object Detection via Noise-Resilient Query Optimization. Xiaocheng Fang, Jieyi Cai, Huanyu Liu, Wenxiu Cai, Yishu Liu, Bingzhi Chen |
| 2025 | RoGA: Towards Generalizable Deepfake Detection through Robust Gradient Alignment. Lingyu Qiu, Ke Jiang, Xiaoyang Tan |
| 2025 | Roadside Monocular 3D Detection for Small Objects: A Novel Feature Enhancement by Pyramid Depth Prediction and Regional Refinement. Jie Tang, Haoran Pan |
| 2025 | RobusTReID: Defending Vision Transformer for Robust Image ReID. Hua Zhang, Tingting Xiao, Li Sun, Qingli Li |
| 2025 | Robust Blind Spatio-Temporal Adaptive Video Watermarking Based on 3-D Symmetry. Fei Zhang, Hongxia Wang |
| 2025 | Robust Generalized Zero-Shot Learning via Dual-Stream Variational Autoencoders and Out-of-Distribution Detection. Xue Han, Zhixiang Li, Wenchuan Zhang, Hanyuan Huang, Wentao Fan |
| 2025 | RotatedMVPS: Multi-view Photometric Stereo with Rotated Natural Light. Songyun Yang, Yufei Han, Jilong Zhang, Kongming Liang, Peng Yu, Zhaowei Qu, Heng Guo |
| 2025 | S3SR: Towards Efficient Image Super-Resolution with Selective State Space Model. Pei Wang, Xiaotong Luo, Zekun Ai, Yanyun Qu |
| 2025 | SAG-KeyNet: Scale-Adaptive Keypoint Gaussian Heatmap Regression Network for Oriented SAR Ship Detection. Xu Wang, Yan Fu, Yanxia Wu, Dan Lin, Ye Yuan, Xue Zhang, Zhirou Ma |
| 2025 | SAM-FE: Segment Anything Model Guided Feature Enhancement for Semantic Change Detection of Remote Sensing Images. Junqing Huang, Tong Liu, Chan-Tong Lam, Xiaochen Yuan |
| 2025 | SAM-GA: SAM-Guided Grouped Aggregation Network for Weakly Supervised cardiac MRI Segmentation. Yang Li, Chengliang Wang, Xing Wu, Yonggang Luo, Peng Wang, Haidong Wang |
| 2025 | SAM2-Cap: Segment Anything 2 with using Parts and Object Spatial Hierarchical Relationships for Image Segmentation. Xiufeng Liu, Zhongqiu Zhao, Yi Yang, Donghui Hu, Zhao Zhang |
| 2025 | SAMDiffusion: Semantic Segmentation with Diffusion Model and Segmentation Anything Model. Yihao Wang, Xinyu Mu, Peixiang Liu, Zihao Zhang, Zhiyi Wang, Xiaoming Huang |
| 2025 | SANE: Enhancing Large-scale Scene Representation with Semantic-aware NeRF Experts. Zesheng Wang, Yufeng Wang, Shuangkang Fang, Xinrui Zhang, Dacheng Qi, Shengxi Li, Mai Xu, Wenrui Ding |
| 2025 | SASG: Semantic-Aware Salient Guidance for Day-to-Night Domain Adaptive Object Detection. Wei Yan, Xiaoman Zhao |
| 2025 | SAVE-GSL: Scalable and Expressive Graph Structure Learning for Large Graphs. Manxin Xu, Shengjie Zhao, Jin Zeng, Weichao Chen, Shilong Dong |
| 2025 | SCA-ZegCLIP: Shape- and Context-aware CLIP for Zero-shot Semantic Segmentation. Chunrui Li, Yi Zhang, Shu Hu |
| 2025 | SCGRL: Graph representation learning based on edge structure contrastive self-supervised framework. Ruishuang Sun, Ruiting Wang, Enguang Zuo, Junyu Zhu, Chen Chen, Cheng Chen, Xiaoyi Lv |
| 2025 | SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation. Weihong Chen, Xuemiao Xu, Haoxin Yang, Yi Xie, Peng Xiao, Cheng Xu, Huaidong Zhang, Pheng-Ann Heng |
| 2025 | SDS-TG: Secure Diffusion Steganography in Text-Guided Generative Images. Haozhong Yang, Hongxia Wang, Jinhe Li, Fei Zhang |
| 2025 | SE(3)-Equivariant Multi-Scale Graph Transformer for Multi-Resolution 3D Aneurysm Segmentation. Xudong Ru, Xingce Wang, Peng Du, Yanghui Yan, Shaolong Liu, Yi-Cheng Zhu, Wuyang Shui, Zhongke Wu |
| 2025 | SELIC: Semantic-Enhanced Learned Image Compression via High-Level Textual Guidance. Haisheng Fu, Jie Liang, Zhenman Fang, Jingning Han |
| 2025 | SFRP: Fine-Grained Point Cloud Classification via Interaction of Spatial and Feature Representation Points. Haoxiang Sun, Xiaomeng Li, Yanhao Ding, Qian Sun, Zhenbo Li |
| 2025 | SGAD: An Unsupervised Secondary-Guided Diffusion Model for Industrial Anomaly Detection. Wenze Kang, Yuanming Zhang, Libo Weng, Zhenbo Cheng, Fei Gao |
| 2025 | SI23DCQA: Perceptual Quality Assessment of Single Image-to-3D Content. Kang Fu, Huiyu Duan, Zicheng Zhang, Xiaohong Liu, Xiongkuo Min, Jia Wang, Guangtao Zhai |
| 2025 | SIR: Multi-view Inverse Rendering with Decomposable Shadow Under Indoor Intense Lighting. Xiaokang Wei, Zhuoman Liu, Ping Li, Yan Luximon |
| 2025 | SKL-CLIP: Learning Skeleton-Based Action Representations via Language Supervision. Kun Wang, Jiuxin Cao, Jiawei Ge, Chang Liu, Bo Liu |
| 2025 | SLGN: Spatiotemporal Language-Guided Graph Network for Referring Video Segmentation. Rongrong Lian, Xiangdong Li, Zhenkai Wu, Mengting Ma, Wei Zhang |
| 2025 | SMPL Normal Map Is All You Need for Single-view Textured Human Reconstruction. Wenhao Shen, Gangjian Zhang, Jianfeng Zhang, Yu Feng, Nanjie Yao, Xuanmeng Zhang, Hao Wang |
| 2025 | SODMAMBA-DETR: A Small Object DETR Detector Based on a Mamba Encoder. Yiheng Sun, Xiaopeng Hu, Fan Wang, Xinrong Wu, Ying Zhou, Jie Zhao, Rongqi Zhu |
| 2025 | SPMamba: Leveraging Long-Sequence Modeling with State Space Models for Speech Separation. Kai Li, Guo Chen, Runxuan Yang, Xiaolin Hu |
| 2025 | SQ-Delta: Ultra-High Delta Compression for LLMs via Joint Sparsification-Quantization. Yanfeng Jiang, Zelan Yang, Bohua Chen, Shen Li, Yong Li, Tao Li |
| 2025 | SS-MPP: Semi-Supervised Shape-Aware Medical Image Segmentation Based on Multi-Scale Pixel-Wise Prototype. Kanqi Wang, Xiaowei Lu, Haoyun Wang, Yang Zhao, Gang Liu |
| 2025 | SSTD: Stripe-Like Space Target Detection Using Single-Point Weak Supervision. Zijian Zhu, Ali Zia, Xuesong Li, Bingbing Dan, Yuebo Ma, Enhai Liu, Rujin Zhao |
| 2025 | STFTCodec: High-Fidelity Audio Compression through Time-Frequency Domain Representation. Tao Feng, Zhiyuan Zhao, Yifan Xie, Yuqi Ye, Xiangyang Luo, Xun Guan, Yu Li |
| 2025 | STGGait: A Graph Transformer Network for Pose-based Gait Recognition. Wansong Qin, Zhijie Han, Yaru Li |
| 2025 | STP4D: Spatio-Temporal-Prompt Consistent Modeling for Text-To-4D Gaussian Splatting. Yunze Deng, Haijun Xiong, Bin Feng, Xinggang Wang, Wenyu Liu |
| 2025 | STPM: Spatial-Temporal Point Mamba for Activity Recognition Using mmWave Radar Point Clouds. Yingru Chen, Zhihao Guo, Haimin Zhang, Min Xu |
| 2025 | STSA: Spatial-Temporal Semantic Alignment for Visual Dubbing. Zijun Ding, Mingdie Xiong, Congcong Zhu, Jingrun Chen |
| 2025 | SU-SAM: A Simple Unified Framework for Adapting SAM in Underperformed Scene. Yiran Song, Qianyu Zhou, Xuequan Lu, Zhiwen Shao, Lizhuang Ma |
| 2025 | SUEDE: Shared Unified Experts for Physical-Digital Face Attack Detection Enhancement. Zuying Xie, Changtao Miao, Ajian Liu, Jiabao Guo, Feng Li, Dan Guo, Yunfeng Diao |
| 2025 | Safety-constrained Reinforcement Learning with Interaction-aware for Decision-making of Autonomous Driving. Di Zhang, Haonan Luo, Honglin Dong, Jianfeng Lu |
| 2025 | ScNet: Scene-Consistency Network Learning for Multi-Agent Motion Forecasting. Jianxin Shi, Xiaolong Chen, Yusen Xie, Jinhao Chen, Fali Wang, Jun Ma, Tianyu Wo |
| 2025 | Scalable Multi-Kernel Clustering with Dynamic Procrustes. Lizhu Wu, Yan Chen, Peng Zhou, Liang Du |
| 2025 | Scanpath Prediction via Utilizing Peripheral Information of the Human Visual System. Kepei Zhang, Ge Tong, Xuetao Zhang |
| 2025 | Scene Graph Generation with Large Vision-Language Model and Its Applications. Wei-Xin Chen, Yong-Yong Chen, Shi-Chao Kan |
| 2025 | Scene Text Image Super-Resolution with Visual Text Cues Transfer and Enhancement. Mingjun Li, Zeming Zhuang, Feng Su |
| 2025 | Selective Masking Adversarial Attack on Automatic Speech Recognition Systems. Zheng Fang, Shenyi Zhang, Tao Wang, Bowen Li, Lingchen Zhao, Zhangyi Wang |
| 2025 | Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding. João Pereira, Vasco Lopes, David Semedo, João Neves |
| 2025 | Self-Relevance-Based Multimodal In-Context Learning for Multimodal Named Entity Recognition. Zhi Zhang, Bing Xu, Muyun Yang, Hailong Cao, Conghui Zhu, Wenpeng Lu, Tiejun Zhao |
| 2025 | Self-Supervised Learning for Transparent Object Depth Completion Using Depth from Non-Transparent Objects. Xianghui Fan, Zhaoyu Chen, Mengyang Pan, Anping Deng, Hang Yang |
| 2025 | Self-Supervised Point Cloud Completion based on Multi-View Augmentations of Single Partial Point Cloud. Jingjing Lu, Huilong Pi, Yunchuan Qin, Zhuo Tang, Ruihui Li |
| 2025 | Semantic Alignment and Hard Sample Retraining for Visible-Infrared Person Re-Identification. Jingchen Ni, Keyu Lyu, Yu Guo, Chun Yuan |
| 2025 | Semantic Communication Using Intent-guided Coarse- and Fine-grained Codec with Pre-trained Diffusion Models. Rui Tang, Dahua Gao, Minxi Yang |
| 2025 | Semantic Palette-Guided Color Propagation. Zi-Yu Zhang, Bing-Feng Seng, Ya-Feng Du, Kang Li, Zhe-Cheng Wang, Zheng-Jun Du |
| 2025 | Semantic and Temporal Integration in Latent Diffusion Space for High-Fidelity Video Super-Resolution. Yiwen Wang, Xinning Chai, Yuhong Zhang, Zhengxue Cheng, Jun Zhao, Rong Xie, Li Song |
| 2025 | Semantic-Aware Adaptation with Hierarchical Multimodal Prompts for Few-Shot Learning. Wenhao Li, Qiangchang Wang, Jing Li, Shengnan Zhao, Mindi Ruan, Yilong Yin |
| 2025 | Semantic-Guided Residual Learning for the Quality Assessment of Enhanced Images. Shishun Tian, Zhiwei Lan, Zhengyu Zhang, Ting Su, Xia Li, Lu Zhang |
| 2025 | Semantic-aware Fine-grained Point Augmentation for 3D Multi-modal Object Detection. Wei Li, Kuan Zhu, Haiyun Guo, Honghui Dong, Jinqiao Wang |
| 2025 | Semantic-guided Representation Learning for Multi-Label Recognition. Ruhui Zhang, Hezhe Qiao, Pengcheng Xu, Mingsheng Shang, Lin Chen |
| 2025 | SemanticLoom: Category-aware Dynamic Fusion for Multi-class Few-shot Image Synthesis. Jie Wang, Yan Huang, Yunfei Zhang, Tianyi Chen, Si Wu, Yong Xu, Patrick Le Callet |
| 2025 | Serial Low-rank Adaptation of Vision Transformer. Houqiang Zhong, Shaocheng Shen, Ke Cai, Zhenlong Wu, Jiangchao Yao, Yuan Cheng, Xuefei Li, Xiaoyun Zhang, Li Song, Qiang Hu |
| 2025 | Shape-Preserving and Surface-Fitting Network for 3D Lane Detection. Jianhua Li, Yongkang Liu, Gaoqi He, Wenxiang Liu, Weiliang Meng |
| 2025 | Shift-Driven Learning for Unsupervised Domain Adaptation. Wentang Chen, Yibin Wen, Juepeng Zheng |
| 2025 | SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation. Yifang Yin, Shengkai Chen, Yiyao Li, Lu Wang, Ruibing Jin, Wei Cui, Shili Xiang |
| 2025 | SituLM: Leveraging Visual Instruction Tuning and an Augmented SWiG Dataset for Enhanced Grounded Situation Recognition. Yuran Wang, Zhi-Qi Cheng |
| 2025 | SketchRef: a Multi-Task Evaluation Benchmark for Sketch Synthesis. Xingyue Lin, Xingjian Hu, Shuai Peng, Jianhua Zhu, Liangcai Gao |
| 2025 | Slot Inversion for Asymmetric Composed Image Retrieval. Haiwen Li, Zining Chen, Ying Liu, Fei Su, Zhicheng Zhao |
| 2025 | SmartEdit: Editing-driven Engagement Prediction and Enhancement of Short-Videos. Saumya Gupta, Ishita Dasgupta, Stefano Petrangeli, Somdeb Sarkhel |
| 2025 | Social Optimum Assisted Gradient Modulation for Imbalanced Multimodal Learning. Disen Hu, Xun Jiang, Zhe Sun, Hao Yang, Chong Peng, Peng Yan, Xing Xu |
| 2025 | Soften the Mask: Adaptive Temporal Soft Mask for Efficient Dynamic Facial Expression Recognition. Mengzhu Li, Quanxing Zha, Hongjun Wu |
| 2025 | Source-Free Domain Adaptation via Transformer-based Object-centric Perception. Ziyun Cai, Weilong Gao, Yawen Huang, Jie Song, Chang-Hui Hu, Tengfei Zhang |
| 2025 | Sparse-view 3D Open-vocabulary Gaussian Splatting via Collaborative Contrastive Learning. Guibiao Liao, Anjie Wang, Mingxuan Chen, Zhijun Fang |
| 2025 | SparseDM: Toward Sparse Efficient Diffusion Models. Kafeng Wang, Jianfei Chen, He Li, Zhenpeng Mi, Jun Zhu |
| 2025 | Spatial 3D-LLM: Exploring Spatial Awareness in 3D Vision-Language Models. Xiaoyan Wang, Zeju Li, Yifan Xu, Jiaxing Qi, Zhifei Yang, Ruifei Ma, Xiangde Liu, Chao Zhang |
| 2025 | Spatial-Spectral Aware Learning with Deformable Affinity for Weakly Supervised Semantic Segmentation. Yuzhen Zhou, Pan Gao, Li Yu |
| 2025 | Spatial-Spectral Fusion Neural Operator. Wei Li, Jiawei Jiang, Ni Xu, Ying Cui, Yan Li, Jianwei Zheng |
| 2025 | Spatial-Temporal Prior Knowledge Guidance for Long-term Action Anticipation. Yiming Li, Miao Ji, Sisi You, Bing-Kun Bao |
| 2025 | SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting. Jiale Zhang, Qianxi Jia, Yang Liu, Wei Zhang, Wei Wei, Xin Tian |
| 2025 | Spatio-Temporal Point Convolutional Network With Meta-motion Level Refinement for Point Cloud-Based Human Action Recognition. Qian Huang, Zhaoyu Chen, Ge Gao, Shihao Han, Qing Meng, Xing Li |
| 2025 | Spatio-Temporally Consistent Depth Estimation for Dynamic Scenes using 3D Scene Flows. Yu Cai, Tianjiao Jing, Chang Liu, Zhengxuan Lian, Shi-Sheng Huang, Hua Huang |
| 2025 | Spectral Enhanced Tuning: An Efficient Plug-and-Play Framework for Frequency-Aware Dehazing. Cheng Tang, Wenqi Lou, Qianyu Cheng, Jiayi Tuo, Wei Fu, Tianhao Jiang, Chao Wang, Xuehai Zhou |
| 2025 | Spectrum-Adaptive Distribution of 2D Gaussians for Image Representation and Compression. Zunian Wan, Jiancheng Zhao, Yepeng Ding, Lingfeng Zhang, Hiroyuki Sato, Takefumi Ogawa |
| 2025 | Spectrum-Assisted Mamba for Infrared Small Target Detection. Yongji Li, Luping Wang |
| 2025 | SpeechPrune: Context-Aware Token Pruning for Speech Information Retrieval. Yueqian Lin, Yuzhe Fu, Jingyang Zhang, Yudong Liu, Jianyi Zhang, Jingwei Sun, Hai Helen Li, Yiran Chen |
| 2025 | Stair-LIF: Boosting the Representation of Spiking Neural Networks with Learnable Incremental Multi-Threshold Neurons. Jilong Luo, Yinsheng Chen, Yue Liu, Jinghai Wang, Zhiyi Yu, Shanlin Xiao |
| 2025 | StegOT: Trade-offs in Steganography via Optimal Transport. Chengde Lin, Xuezhu Gong, Shuxue Ding, Mingzhe Yang, Xijun Lu, Chengjun Mo |
| 2025 | Stepwise Schema-Guided Prompting Framework with Parameter Efficient Instruction Tuning for Multimedia Event Extraction. Xiang Yuan, Xinrong Chen, Haochen Li, Hang Yang, Guanyu Wang, Weiping Li, Tong Mo |
| 2025 | Structure-Guided Camouflaged Object Detection with Progressive Enhancement Strategy. Qingzheng Wang, Jiazhi Xie, Ning Li |
| 2025 | Study of Finger Biometrics on Finger Semantic Segmentation and Finger Shape Authentication. Junduan Huang, Dacan Luo, Weili Yang, Jiahui Pan, Wenxiong Kang |
| 2025 | StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture. Miaomiao Dai, Qianyu Zhou, Lizhuang Ma |
| 2025 | Subjective Quality Assessment for Point Clouds of Digital Humans with Shaded Rendering. Amar Tious, Toinon Vigier, Vincent Ricordel |
| 2025 | Supplementary Material for "NoiseActor: A Noise-Action Collaborative Framework for Privacy-Preserving Action Recognition without Privacy Labels". Xiao Li, Xiao-Ming Wu, Delong Zhang, Kun-Yu Lin, Yi-Xing Peng, Ling-An Zeng, Wei-Shi Zheng |
| 2025 | Supplementary Material for STTODE: Spatio-Temporal Transformer Ordinary Differential Equation Networks for Pedestrian Trajectory Forecasting. Yi Zou, Yingjie Liu, Jian Yang, Mingsong Chen, Xuan Tang, Xian Wei |
| 2025 | SwinCAE: Capsule Autoencoder using Shifted Windows for 3D Human Pose Estimation. Xiufeng Liu, Zhongqiu Zhao, Yi Yang, Donghui Hu, Zhao Zhang |
| 2025 | SymND: Detecting Backdoor Attacks in Self-Supervised Facial Representation Tasks. Liyue Zhu, Changchun Yin, Liming Fang, Zhen Qin |
| 2025 | Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding. Yidan Sun, Qin Chao, Yangfeng Ji, Boyang Li |
| 2025 | Synthesize Large-scale in situ Darkfield Images for Training Marine Plankton Detection Algorithms. Zhenping Li, Jianping Li |
| 2025 | T-Dreamer: Topology-Aware Text-to-3D Generation. Xiaoxuan Wu, Qiulu Li, Lin Shu, Ke Lv, Ke Chen |
| 2025 | TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection. Xixiang He, Hao Yu, Qiyao Sun, Ao Cheng, Tailai Zhang, Cong Liu, Shuxuan Guo |
| 2025 | TAD-IVR: Enhancing Temporal Action Detection via Instrumental Variable Regression. Minglin Hong, Bo Sun, Jun He, Yinghui Zhang |
| 2025 | TC-GS: Tri-plane based Compression for 3D Gaussian Splatting. Taorui Wang, Zitong Yu, Yong Xu |
| 2025 | TC-NeRF: Temporal Consistent Neural Radiance Fields with Cross-View Complementation for Occluded Object Removal. Zicheng Wu, Li-Hsuan Chang, Kuan-Wen Chen |
| 2025 | TCFI: Topology-Consistent Pruning with Fisher Information for Efficient Medical Image Segmentation. Yi Wang, Renda Han, Yihao Chen |
| 2025 | TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration. Ziying Zhang, Xiang Gao, Zhixin Wang, Qiang Hu, Xiaoyun Zhang |
| 2025 | TDE-VC: Timbre Disentanglement and Extraction Via Consistency for Zero-Shot Voice Conversion. Ying Hu, Shangkun Tu, Fan Li, Lijun He, Hai Yan, Yan Li |
| 2025 | TEVLA: Text-oriented Enhancement for Vision-Language Alignment in Relation Extraction. Junlin Chen, Qiushan Guo, Ka Chun Cheung, Mingrui Liang, Dezhi Chen |
| 2025 | TGATrack: Template-Guided Low-Rank Adaption for Robust RGB-T Tracking. Shihui Zhang, Junbin Su, Jiawei Zhang, Ziteng Xue, Zhipeng Zhang |
| 2025 | TGSR: Template-Guided Semantic Resampling against Adversarial Tracking Attacks. Xuhong Ren, Jianlang Chen, Wanli Xue, Lei Ma, Qing Guo, Jianjun Zhao, Shengyong Chen |
| 2025 | THOR: Text to Human-Object Interaction Diffusion via Relation Intervention. Qianyang Wu, Ye Shi, Xiaoshui Huang, Lan Xu, Jingyi Yu, Jingya Wang |
| 2025 | TRAMFuse: Text image Tampering Detection via Directional Residual Attention Mechanism. Xingqian Guo, Tingting Chai, Lunke Fei, JiaLing Xu, Guanglu Zhou, Xiangqian Wu, Haoxing Cao |
| 2025 | TRR-LGF: a Simple yet Efficient Classification Network. Zhen Long, Qingqing Cao, Hu Yao, Yipeng Liu, Le Zhang, Ce Zhu |
| 2025 | TSRS-Net: A Trilaterally Supervised Residual Network for Accurate Segmentation of Prostate Lesion Ablation Regions from MRI and Surgical Plan Images. Yixin Li, Haifeng Wang, Zhichao Yan, Ye Luo |
| 2025 | TSTMotion: Training-free Scene-aware Text-to-motion Generation. Ziyan Guo, Haoxuan Qu, Hossein Rahmani, De Wen Soh, Ping Hu, Qiuhong Ke, Jun Liu |
| 2025 | Tactile Information Coding for DNA Storage with Prospects for AI Applications. Rongduo Han, Cihan Ruan, Shunye Tang, Haoyu Wu, Nam Ling, Haining Zhang |
| 2025 | Take What I Need: Active Data Distillation for Federated Learning. Hongcheng Li, Yucan Zhou, Yibin Wang, Xiaoyan Gu, Bo Li, Weiping Wang |
| 2025 | TalkFashion: Intelligent Virtual Try-On Assistant Based on Multimodal Large Language Model. Yujie Hu, Xuanyu Zhang, Weiqi Li, Jian Zhang |
| 2025 | Tapping Beyond Hands: Assisting No-Handed Touch Interaction under Situational Impairments. Yawen Zheng, Jin Huang, Hao Zhang, Yang Li, Juan Liu, Yulong Bian, Chenglei Yang, Xiangxu Meng |
| 2025 | Target Distribution Agnostic Domain Adaptation for in-the-Wild Image Classification under Both Domain and Label Shifts. Aotian Zheng, Jenq-Neng Hwang, Rania Hussein, Farron Wallace, Kelsey Magrane, Lauren Shiosaka |
| 2025 | Target-oriented Multimodal Sentiment Classification with Counterfactual-enhanced Debiasing. Zhiyue Liu, Fanrong Ma, Xin Ling |
| 2025 | Task-Aware Knowledge Prompt and Distillation for Cross-Domain Few-Shot Learning. Jun Liang, Yunyu Zou, Yang Peng, Yalong Cheng, Rui Luo, Yishu Liu, Bingzhi Chen |
| 2025 | TaxAgent: How Large Language Model Designs Fiscal Policy. Jizhou Wang, Xiaodan Fang, Lei Huang, Yongfeng Huang |
| 2025 | Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning. Junsong Li, Jie Zhou, Yutao Yang, Bihao Zhan, Qianjun Pan, Yuyang Ding, Qin Chen, Jiang Bo, Xin Lin, Liang He |
| 2025 | Tell and Show: A Multimodal Guidance Method for Instructional Video Planning. Mingzhe Zhang, Yinghui Zhang, Fengxiang Ge |
| 2025 | Temporal Invariant Feature Combined with Arbitrary Enhancement for Missing Modality Emotion Recognition. Jiahao Fan, Weiting Chen, Zheming Fan, Ruizhi Yu |
| 2025 | Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations. Linrong Pan, Chenglong Jiang, Gaoze Hou, Ying Gao |
| 2025 | Text to Trajectory: Enhancing and Evaluating LLMs for Embodied Task Planning. Yihan Tang, Yong Xu, Ruotao Xu, Yan Huang, Si Wu, Patrick Le Callet |
| 2025 | Text-to-Image Diffusion Models are AI-Generated Image Quality Scorers. Xiangfei Sheng, Weidong Zou, Pengfei Chen, Li Cai, Chao He, Leida Li |
| 2025 | Texture-Aware Neural Radiance Fields Watermarking for Resisting Feature-Modulation Surrogate Model Attacks. Lei Tan, Yuliang Xue, Guobiao Li, Zhenxing Qian, Sheng Li, Xinpeng Zhang |
| 2025 | Texture-aware Intrinsic Image Decomposition with Model- and Learning-based Priors. Xiaodong Wang, Zijun He, Xin Yuan |
| 2025 | The Motion in the Details: Adapting CLIP for Action Recognition via Dual-prompt Guidance. Longjuan Sun, Xixia Xu, Dongchen Zhu, Jiamao Li |
| 2025 | Think Twice: Empowering Action Recognition Models with Human-Like Deep Reasoning. Xiangning Ruan, Baoxing Xie, Zhaohui Hou, Qixiang Yin, Fei Su, Zhicheng Zhao |
| 2025 | Time-Frequency Domain Fusion Transformer for Cross-Subject Motor Imagery Classification. Zijian Xia, Jianfeng Li, Jiahui Pan |
| 2025 | Time-Series Acoustic Network for Underwater Acoustic Target Recognition. Pengyuan Qi, Ye Tian, Guisheng Yin |
| 2025 | Time-Series Anomaly Detection Method Based on Frequency-Domain Decoupling and Correction. Di Niu, Enyuan Zhao, Jie Nie, Min Ye, Shusong Yu, Xinyue Liang |
| 2025 | Token-Driven Linkage Network: One-Shot Adaptation of SAM for Challenging Segmentation Scenarios. Yao Shen, Kaiyang Zeng, Guangyao Li |
| 2025 | TopoLayer: A Universal Neural Network Layer for Topological Feature Learning on Point Clouds using Persistent Homology. Zechao Guan, Shuai Du, Qingshan Liu |
| 2025 | Toward Uncontrolled Palmprint Recognition via Multi-View Block Diagonal Structure Learning. Shuping Zhao, Chongli Zhuang, Li Yang, Yanling Zhong, Yanping Li, Yonghan Chen |
| 2025 | Towards A Real-World Road Damage Detection Dataset. Menghao Hu, Zuogan Tang, Xiaoshan Yang, Zhe Wu, Zhouxin Yang, Shaocong Wu, Yaguang Song, Kui Hou, Qingfang Zheng, Yaowei Wang |
| 2025 | Towards Advanced Emotional Care: Embodied Emotional Care System for Humanoid Robots. Yang Chang, Aoxing Li, Yuxuan Lin, Jianan Wang, Lizheng Liu, Yang Liu, Jing Liu, Liang Cao, Yan Wang, Zhongxue Gan, Wenqiang Zhang |
| 2025 | Towards Aligned Data Forgetting via Twin Machine Unlearning. Haoxuan Ji, Zheng Lin, Yuyao Sun, Fei Gao, Yuhang Wang, Haichang Gao, Zhenxing Niu |
| 2025 | Towards End-to-End Neuromorphic Voxel-based 3D Object Reconstruction Without Physical Priors. Chuanzhi Xu, Langyi Chen, Haodong Chen, Vera Chung, Vincent Qu |
| 2025 | Towards Improved Deep Metric Learning via Unsupervised Object Location. Changxin Ye, Yushan Zhang, Xinyi Xu, Wei Huangfu, Cheng Deng |
| 2025 | Towards Practical Real-Time Low-Latency Music Source Separation. Junyu Wu, Jie Liu, Tianrui Pan, Jie Tang, Gangshan Wu |
| 2025 | Towards Robust Image Restoration: A Multi-Type Degradation Dataset for Outdoor Scenes. Yongheng Zhang, Danfeng Yan |
| 2025 | Towards Robust Time-Of-Flight Depth Denoising with Confidence-Aware Diffusion Model. Changyong He, Jin Zeng, Jiawei Zhang, Jiajie Guo |
| 2025 | Towards Robust Visual Question Answering via Causal Intervention and Contrastive Learning. Wei Li, Zhixin Li |
| 2025 | Towards Specialized and Generalizable Geometry Restoration of Compressed Point Clouds. Lixuan Meng, Qiang Xu, Shan Liu, Wei Gao, Ge Li |
| 2025 | Towards Trustworthy Model via Uncertainty Verification in Multimodal Sentiment Analysis. Chen Tang, Yangle Li, Tingrui Shen, Xinrong Gong, Tong Zhang |
| 2025 | Training Robust DNNs with Noisy Labels via Contrastive Re-Calibration Learning. Yongfeng Dong, Jiaji Wang, Zhen Wang, Guifang Wu, Hao Cheng |
| 2025 | Trans-Diff: Transformer-based Video Summarization with Diffusion. Cai Pan, Guowei Zhang, Rui Zhong |
| 2025 | Transferable Attack against Face Swapping in an Extended Space. Mingzhi Lyu, Yi Huang, Jun Xie, Zihao Zhao, Hong Xu, Adams Wai-Kin Kong |
| 2025 | TriModal Enhanced Fusion Network: Advancing Multimodal Representation and Fusion for Enhanced Multimodal Intent Recognition. Yixuan Wang, Kehan Wang, Huayu Zhang, Ming Fang, Shuhua Liu |
| 2025 | TrojFlow: Flow Models are Natural Targets for Trojan Attacks. Zhengyang Qi, Xiaohua Xu |
| 2025 | True Match: Leveraging 2D-Assisted Queries for Multi-view 3D Detection in Polar Space. Yefei Hou, Jie Tang |
| 2025 | Trustworthy Localized Corrections-guided Mutual Learning for Multi-View Learning. Qiuran Li, Yi Luo, Yan Sun, Tong Wu, Aiguo Chen |
| 2025 | Twin Progressive Generative Adversarial Network For High-Resolution Image Inpainting. Zhiying Li, Weibin Chen, Zhaoxin Fan, Kaichuan Kong, Xiaobo Jin, Guanggang Geng |
| 2025 | USGT: A Unified Syntax-Guided Transformer Framework for Sentiment Classification and Aspect Term Extraction. Xiaohong Xiang, Zhe Zhang, Yi Zhou, Xin Deng |
| 2025 | Uncertainty-Driven Weakly Supervised Dehazing Network: Integrating Dynamic Attention and Multi-Scale Feature Fusion. Jinbin Wang, Aiping Yang, Yumeng Liu, Qinghua Hu |
| 2025 | Uncertainty-Guided Iterative Architecture for Stereo Matching. Weiqing Xiao, Fengjun Zhong, Hao Zhao |
| 2025 | Uncertainty-guided Multi-modal Sequential Recommendation. Li Yin, Baigang Mi, Yi Fan |
| 2025 | Uncovering Personality Traits via Multimodal LLM for Personalized Image Emotion Analysis. Jianzhang Gao, Hao Pu, Yuchong Sun, Ruihua Song |
| 2025 | Uneven Event Modeling for Partially Relevant Video Retrieval. Sa Zhu, Huashan Chen, Wanqian Zhang, Jinchao Zhang, Zexian Yang, Xiaoshuai Hao, Bo Li |
| 2025 | Unfolding Framework with Complex-Valued Deformable Attention for High-Quality Computer-Generated Hologram Generation. Haomiao Zhang, Zhangyuan Li, Yanling Piao, Zhi Li, Xiaodong Wang, Miao Cao, Xiongfei Su, Qiang Song, Xin Yuan |
| 2025 | UniBind: Leveraging LLM-Augmented Knowledge Base for Scene Integration. Zhonghao Zhang, Ruonan Zhang, Libo Liu |
| 2025 | UniSep: Universal Target Audio Separation with Language Models at Scale. Yuanyuan Wang, Hangting Chen, Dongchao Yang, Weiqin Li, Dan Luo, Guangzhi Li, Shan Yang, Zhiyong Wu, Helen Meng, Xixin Wu |
| 2025 | UniSync: A Unified Framework for Audio-Visual Synchronization. Tao Feng, Yifan Xie, Xun Guan, Jiyuan Song, Zhou Liu, Fei Ma, F. Richard Yu |
| 2025 | UniTD: A Benchmark with Unified Text-Domain for Text-to-Image Person ReID. Ping Lai, Yihang Duan, Hao Ni, Liangcheng Fu, Hui Xu, Pengpeng Zeng |
| 2025 | UniVG: Towards UNIfied-modal Video Generation. Ludan Ruan, Lei Tian, Chuanwei Huang, Xu Zhang, Xinyan Xiao |
| 2025 | Unified Line Segment Detection and Description. Xinyu Lin, Yingjie Zhou, Zhen Long, Yipeng Liu, Lu Yang, Ce Zhu |
| 2025 | Unified-Modality Attention Network for Multimodal Sentiment Analysis. Zuocheng Li, Lishuang Li |
| 2025 | Unifying Spatio-Temporal Contexts for Advanced Text-Video Retrieval. Yanhao Huang, Baoyao Yang, Junxiang Chen, Wenbin Yao, Dixin Chen |
| 2025 | Unimodal-driven Distillation in Multimodal Emotion Recognition with Dynamic Fusion. Jiagen Li, Rui Yu, Huihao Huang, Huaicheng Yan |
| 2025 | Universal Scene Graph Generation via Semantic Feature Alignment. Xiangyu Zhang, Guoxi Qiu, Yong Xu, Jinghua Wang |
| 2025 | Unlocking Instance Semantic Awareness for Domain Adaptive Semantic Segmentation. Fan Li, Xuan Wang, Min Qi, Zhaoxiang Zhang, Chengming Xu, Yuelei Xu |
| 2025 | Unsupervised Domain Adaptation for Fetal R-peak Detection at Trans-Pregnancy Stages based on Multiview Mixing. Yiwei Lin, Yuying Bao, Tao Yu, Zhenqin Chen, Xu Cheng, Jinshan Xu |
| 2025 | Utilizing Additional Personalized Representations for Personalized Federated Learning. Shulan Yin, Yingxun Fu, Li Ma |
| 2025 | Utilizing Contrastive Learning for Locating Network Anomalies in Real-time Conferencing Applications. Teng Ma, Dongbiao He, Zhongxing Ming, Junhao Xu, Laizhong Cui, Yunpeng Chai |
| 2025 | VADMamba: Exploring State Space Models for Fast Video Anomaly Detection. Jiahao Lyu, Minghua Zhao, Jing Hu, Xuewen Huang, Yifei Chen, Shuangli Du |
| 2025 | VFFG-CL: Virtual Fusion Feature Generation with Curriculum Learning for Missing-Modality Emotion Recognition. Xiaolan Tang, Yan Xiang, Zhengtao Yu, Yuxin Huang |
| 2025 | VG-Net: Vision Transformer based Graph Fusion Representation for Multi-label Pattern Image Retrieval. Erwan Ye, Ying Li |
| 2025 | VGAT: A Cancer Survival Analysis Framework Transitioning from Generative Visual Question Answering to Genomic Reconstruction. Zizhi Chen, Minghao Han, Xukun Zhang, Shuwei Ma, Tao Liu, Xing Wei, Lihua Zhang |
| 2025 | VIP-PCQA: A Multi-Modal Framework for No-reference Point Cloud Quality Assessment. Kang Fu, Zicheng Zhang, Huiyu Duan, Xiaohong Liu, Xiongkuo Min, Jiarui Wang, Guangtao Zhai |
| 2025 | VL-UR: Vision-Language-guided Universal Restoration of Images Degraded by Adverse Weather Conditions. Ziyan Liu, Yuxu Lu, Hushan Yu, Dong Yang |
| 2025 | VLCO: A Dual-Optimization Framework for Precise Camouflaged Object Localization and Segmentation. Maosheng Su, Shuo Wang, Zhichuan Wang, Jun Luo |
| 2025 | VSD2M: Large-scale Vision-language Sticker Dataset for Multi-frame Animated Sticker Generation. Zhiqiang Yuan, Jiapei Zhang, Ying Deng, Yeshuang Zhu, Jie Zhou, Jinchao Zhang |
| 2025 | Variance-Reduction Guidance: Sampling Trajectory Optimization for Diffusion Models. Shifeng Xu, Yanzhu Liu, Adams Wai-Kin Kong |
| 2025 | VectorPainter: Advanced Stylized Vector Graphics Synthesis Using Stroke-Style Priors. Juncheng Hu, Ximing Xing, Jing Zhang, Qian Yu |
| 2025 | VidCtx: Context-aware Video Question Answering with Image Models. Andreas Goulas, Vasileios Mezaris, Ioannis Patras |
| 2025 | Video Flow as Time Series: Discovering Temporal Consistency and Variability for VideoQA. Zijie Song, Zhenzhen Hu, Yixiao Ma, Jia Li, Richang Hong |
| 2025 | Video Label Refinement for Temporal Localization. Jennifer Piane, Thiruvarangan Ramaraj, Jacob D. Furst, Daniela Raicu |
| 2025 | Video Quality Assessment for Resolution Cross-Over in Live Sports. Jingwen Zhu, Yixu Chen, Hai Wei, Sriram Sethuraman, Yongjun Wu |
| 2025 | Visibility-GS: Visibility-Wise Densification of 3D Gaussian Splatting. Haoru Deng, Jiaxiang Qian, Shuangli Du, Sha Li, Ruoling Qi, Zhenyu Xu |
| 2025 | Visual Feature Learning from Randomized EEG Trials for Object Recognition. Xiaoya Fan, Haixiao Xue, Yufan Feng, Qi Zhao, Zheng Zhao, Zhong Wang |
| 2025 | Visual Relationships Are Different: Appropriate Way To Predict Each Relationship. Zhenhua Lei, Xuemei Xie |
| 2025 | Visual Semantic Description Generation with MLLMs for Image-Text Matching. Junyu Chen, Yihua Gao, Mingyong Li |
| 2025 | Visual-Textual Feature Learning for Rare Human-Object Interactions Detection. Mingliang Xue, Chong Cao, Zhengyang Zhao, Xiaodong Duan, Shu Cao |
| 2025 | VividPose: Vividly 3D-driven Stable Pose Diffusion of High Facial Fidelity. Qilin Wang, Zhengkai Jiang, Chengming Xu, Jiangning Zhang, Yabiao Wang, Xinyi Zhang, Yun Cao, Weijian Cao, Chengjie Wang, Zhanxiong Wang, Yanwei Fu |
| 2025 | Vote & Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer. Shuai Peng, Di Fu, Baole Wei, Liangcai Gao, Zhi Tang |
| 2025 | VoxelDet: Towards Accurate 3D Object Detection with Voxel Pruning and Fine Geometric Shape. Jia Wen, Jialin Li, Ting Zhang |
| 2025 | WCG-Net: Warping Consistency Compensation Guided Multi-Feature Fusion For Stereo Matching. Yan Hong, Chao He, Zhibo Rao, Zhen Chen, Nan Li, Congxuan Zhang |
| 2025 | WDRE-NET: Wavelet-Differential Convolution and Region-Expansion to Enhance Weakly Supervised Adjacent Nuclei Segmentation. Meng Geng, Qian Huang, Yulin Chen, Xuejie Zhang |
| 2025 | WDiff: Wavelet-based Diffusion Models for Surgical Endoscopic Image Low-Light Enhancement. Zeyu Lei, Lidan Fu, Anqi Xiao, Jie Tian, Zhenhua Hu |
| 2025 | WL-MVSNet: Frequency-Aware and Regularized Learning for Multi-View Stereo. Yan Ma, Ruijie Peng, Suping Wu |
| 2025 | WSGS: A Speech-Driven Zero-Shot System for 6D Robotic Arm Grasping. Yitong Ge, Lin Zhang, Yang Chen, Ying Shen |
| 2025 | WT-BCP: Wavelet Transform based Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation. Mingya Zhang, Liang Wang, Limei Gu, Tingsheng Ling, XianPing Tao |
| 2025 | Wavelet Convolution and Multi-Scale Attention Network for Image Tampering Localization. Yun Song, Yaoyao Xu, Jiaxin Chen, He Yang, Dengyong Zhang, Miaohui Wang |
| 2025 | Wavelet-based Feature Representation Framework for Event Stream Recognition. Zihan Cheng, Xingyu Pan, Xi Chen, Shenghua Fan |
| 2025 | Wavelet-based Global-Local Interaction Network with Cross-Attention for Multi-View Diabetic Retinopathy Detection. Yongting Hu, Yuxin Lin, Chengliang Liu, Xiaoling Luo, Xiaoyan Dou, Qihao Xu, Yong Xu |
| 2025 | Weak Semantic-Guided Entropy Model for Image Compression. Yiming Ding, Jianguo Wei |
| 2025 | Weakly Supervised Object Detection Framework based on Classification-Localization Consistency. Yihuan Zhu, Simiao Wang, Mingyu Lu, Zhengxing Sun |
| 2025 | Weaponizing Tokens: Backdooring Text-to-Image Generation via Token Remapping. Jiaming He, Wenbo Jiang, Guanyu Hou, Qiyang Song, Ji Guo, Hongwei Li |
| 2025 | When Epipolar Transformers Meets Implicit Neural Super-Resolution in Multi-View Stereo. Boyang Song, Jin Xiao, Xiaoguang Hu, Guofeng Zhang, Jiaqi Shi, Hao Jiang |
| 2025 | Where's That Voice Coming? Continual Learning for Sound Source Localization. Yang Xiao, Rohan Kumar Das |
| 2025 | Zero-1-to-3DGS: a Single Image to 3D Gaussian by Consistent Multi-view Generation. Shenghao Yang, Hongtao Zhang, Jianxing Ren, Zhihao Tang, Mingbo Zhao, Yuping Liu |
| 2025 | Zero-Shot Speech Perception Decoding via Advancing Representation Consistency. Yi Xiao, Xuyi Qiao, Yu-Xuan Zhang, Xianchuan Yu |
| 2025 | Zero-shot Face Editing via ID-Attribute Decoupled Inversion. Yang Hou, Minggu Wang, Jianjun Zhao |
| 2025 | Zero-shot Quantization of Vision Transformers: Leveraging Multi-model Ensembles and Attention Mixup. Yao Li, Xinrui Chen, Zhuozhen Yu, Shunzhou Wang, Wei Gao |
| 2025 | ZeroPose: Leveraging Diffusion Models and Large Language Models for Advanced Multi-Hypothesis 3D Construction Workers' Pose Estimation. Gaowei Zhang, Wei Wang, Yi Wang |
| 2025 | k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning. Yifan Yang, Jianheng Zhuo, Zengrui Jin, Ziyang Ma, Xiaoyu Yang, Zengwei Yao, Liyong Guo, Wei Kang, Fangjun Kuang, Long Lin, Daniel Povey, Xie Chen |
| 2025 | α-SAV: Generalized Weighted Input Verification for Secure Aggregation in Federated Learning. Zhi Lu, Yuhao Long, Qirui Zhou, Mengyuan Zou, Wenjie Cai, Songfeng Lu |