| 2024 | 2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation. Ozan Unal, Dengxin Dai, Lukas Hoyer, Yigit Baran Can, Luc Van Gool |
| 2024 | 360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View. Zhifeng Teng, Jiaming Zhang, Kailun Yang, Kunyu Peng, Hao Shi, Simon Reiß, Ke Cao, Rainer Stiefelhagen |
| 2024 | 3D Face Style Transfer with a Hybrid Solution of NeRF and Mesh Rasterization. Jianwei Feng, Prateek Singhal |
| 2024 | 3D Human Pose Estimation with Two-step Mixed-Training Strategy. Yingfeng Wang, Zhengwei Wang, Muyu Li, Hong Yan |
| 2024 | 3D Reconstruction of Interacting Multi-Person in Clothing from a Single Image. Junuk Cha, Hansol Lee, Jaewon Kim, Nhat Nguyen Bao Truong, Jae Shin Yoon, Seungryul Baek |
| 2024 | 3D Super-Resolution Model for Vehicle Flow Field Enrichment. Thanh Luan Trinh, Fangge Chen, Takuya Nanri, Kei Akasaka |
| 2024 | 3D-Aware Talking-Head Video Motion Transfer. Haomiao Ni, Jiachen Liu, Yuan Xue, Sharon X. Huang |
| 2024 | 3SD: Self-Supervised Saliency Detection With No Labels. Rajeev Yasarla, Renliang Weng, Wongun Choi, Vishal M. Patel, Amir Sadeghian |
| 2024 | 4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters. Yijie Zhou, Chao Li, Jin Liang, Tianyi Xu, Xin Liu, Jun Xu |
| 2024 | A Closer Look at Robustness of Vision Transformers to Backdoor Attacks. Akshayvarun Subramanya, Soroush Abbasi Koohpayegani, Aniruddha Saha, Ajinkya Tejankar, Hamed Pirsiavash |
| 2024 | A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection. Anas Al-lahham, Nurbek Tastan, Muhammad Zaigham Zaheer, Karthik Nandakumar |
| 2024 | A Generative Multi-Resolution Pyramid and Normal-Conditioning 3D Cloth Draping. Hunor Laczkó, Meysam Madadi, Sergio Escalera, Jordi Gonzàlez |
| 2024 | A Geometry Loss Combination for 3D Human Pose Estimation. Ai Matsune, Shichen Hu, Guangquan Li, Sihan Wen, Xiantan Zhu, Zhiming Tan |
| 2024 | A Hybrid Graph Network for Complex Activity Detection in Video. Salman Khan, Izzeddin Teeti, Andrew Bradley, Mohamed Elhoseiny, Fabio Cuzzolin |
| 2024 | A Multimodal Benchmark and Improved Architecture for Zero Shot Learning. Keval Doshi, Amanmeet Garg, Burak Uzkent, Xiaolong Wang, Mohamed Omar |
| 2024 | A Neural Height-Map Approach for the Binocular Photometric Stereo Problem. Fotios Logothetis, Ignas Budvytis, Roberto Cipolla |
| 2024 | A One-Shot Learning Approach to Document Layout Segmentation of Ancient Arabic Manuscripts. Axel De Nardin, Silvia Zottin, Claudio Piciarelli, Emanuela Colombi, Gian Luca Foresti |
| 2024 | A Robust Diffusion Modeling Framework for Radar Camera 3D Object Detection. Zizhang Wu, Yunzhe Wu, Xiaoquan Wang, Yuanzhu Gan, Jian Pu |
| 2024 | A Sequential Learning-based Approach for Monocular Human Performance Capture. Jianchun Chen, Jayakorn Vongkulbhisal, Fernando De la Torre Frade |
| 2024 | A Visual Active Search Framework for Geospatial Exploration. Anindya Sarkar, Michael Lanier, Scott Alfeld, Jiarui Feng, Roman Garnett, Nathan Jacobs, Yevgeniy Vorobeychik |
| 2024 | A generic and flexible regularization framework for NeRFs. Thibaud Ehret, Roger Marí, Gabriele Facciolo |
| 2024 | A*: Atrous Spatial Temporal Action Recognition for Real Time Applications. Myeongjun Kim, Federica Spinola, Philipp Benz, Tae-Hoon Kim |
| 2024 | AFTer-SAM: Adapting SAM with Axial Fusion Transformer for Medical Imaging Segmentation. Xiangyi Yan, Shanlin Sun, Kun Han, Thanh-Tung Le, Haoyu Ma, Chenyu You, Xiaohui Xie |
| 2024 | AMEND: Adaptive Margin and Expanded Neighborhood for Efficient Generalized Category Discovery. Anwesha Banerjee, Liyana Sahir Kallooriyakath, Soma Biswas |
| 2024 | ARNIQA: Learning Distortion Manifold for Image Quality Assessment. Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo |
| 2024 | ATS: Adaptive Temperature Scaling for Enhancing Out-of-Distribution Detection Methods. Gerhard Krumpl, Henning Avenhaus, Horst Possegger, Horst Bischof |
| 2024 | AU-Aware Dynamic 3D Face Reconstruction from Videos with Transformer. Chenyi Kuang, Jeffrey O. Kephart, Qiang Ji |
| 2024 | Active Batch Sampling for Multi-label Classification with Binary User Feedback. Debanjan Goswami, Shayok Chakraborty |
| 2024 | Active Learning for Single-Stage Object Detection in UAV Images. Asma Yamani, Albandari Alyami, Hamzah Luqman, Bernard Ghanem, Silvio Giancola |
| 2024 | Active Learning with Task Consistency and Diversity in Multi-Task Networks. Aral Hekimoglu, Michael Schmidt, Alvaro Marcos-Ramiro |
| 2024 | Active Transfer Learning for Efficient Video-Specific Human Pose Estimation. Hiromu Taketsugu, Norimichi Ukita |
| 2024 | Activity-based Early Autism Diagnosis Using A Multi-Dataset Supervised Contrastive Learning Approach. Asha Rani, Yashaswi Verma |
| 2024 | Adapt Your Teacher: Improving Knowledge Distillation for Exemplar-free Continual Learning. Filip Szatkowski, Mateusz Pyla, Marcin Przewiezlikowski, Sebastian Cygert, Bartlomiej Twardowski, Tomasz Trzcinski |
| 2024 | Adaptive Deep Neural Network Inference Optimization with EENet. Fatih Ilhan, Ka-Ho Chow, Sihao Hu, Tiansheng Huang, Selim F. Tekin, Wenqi Wei, Yanzhao Wu, Myungjin Lee, Ramana Kompella, Hugo Latapie, Gaowen Liu, Ling Liu |
| 2024 | Adaptive Latent Diffusion Model for 3D Medical Image to Image Translation: Multi-modal Magnetic Resonance Imaging Study. Jonghun Kim, Hyunjin Park |
| 2024 | Adaptive manifold for imbalanced transductive few-shot learning. Michalis Lazarou, Yannis Avrithis, Tania Stathaki |
| 2024 | Adversarial Likelihood Estimation With One-Way Flows. Omri Ben-Dov, Pravir Singh Gupta, Victoria Fernández Abrevaya, Michael J. Black, Partha Ghosh |
| 2024 | Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation. Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Pradyumna YM, Akshay R. Kulkarni, Jogendra Nath Kundu, R. Venkatesh Babu |
| 2024 | Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object Detection. Weijia Zhang, Dongnan Liu, Chao Ma, Tom Weidong Cai |
| 2024 | Amodal Intra-class Instance Segmentation: Synthetic Datasets and Benchmark. Jiayang Ao, Qiuhong Ke, Krista A. Ehinger |
| 2024 | An Analysis of Initial Training Strategies for Exemplar-Free Class-Incremental Learning. Grégoire Petit, Michaël Soumm, Eva Feillet, Adrian Popescu, Bertrand Delezoide, David Picard, Céline Hudelot |
| 2024 | An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification. Prakhar Ganesh |
| 2024 | Analyzing the Domain Shift Immunity of Deep Homography Estimation. Mingzhen Shao, Tolga Tasdizen, Sarang C. Joshi |
| 2024 | Annotation-free Audio-Visual Segmentation. Jinxiang Liu, Yu Wang, Chen Ju, Chaofan Ma, Ya Zhang, Weidi Xie |
| 2024 | AnyStar: Domain randomized universal star-convex 3D instance segmentation. Neel Dey, S. Mazdak Abulnaga, Benjamin Billot, Esra Abaci Turk, Patricia Ellen Grant, Adrian V. Dalca, Polina Golland |
| 2024 | Appearance-Based Curriculum for Semi-Supervised Learning with Multi-Angle Unlabeled Data. Yuki Tanaka, Shuhei M. Yoshida, Takashi Shibata, Makoto Terao, Takayuki Okatani, Masashi Sugiyama |
| 2024 | Approximating Intersections and Differences Between Linear Statistical Shape Models Using Markov Chain Monte Carlo. Maximilian Weiherer, Finn Klein, Bernhard Egger |
| 2024 | Arbitrary-Resolution and Arbitrary-Scale Face Super-Resolution with Implicit Representation Networks. Yi-Ting Tsai, Yu Wei Chen, Hong-Han Shuai, Ching-Chun Huang |
| 2024 | ArcAid: Analysis of Archaeological Artifacts using Drawings. Offry Hayon, Stefan Münger, Ilan Shimshoni, Ayellet Tal |
| 2024 | ArcGeo: Localizing Limited Field-of-View Images using Cross-view Matching. Maxim Shugaev, Ilya Semenov, Kyle Ashley, Michael Klaczynski, Naresh P. Cuntoor, Mun Wai Lee, Nathan Jacobs |
| 2024 | Are Natural Domain Foundation Models Useful for Medical Image Classification? Joana Palés Huix, Adithya Raju Ganeshan, Johan Fredin Haslum, Magnus Söderberg, Christos Matsoukas, Kevin Smith |
| 2024 | Army of Thieves: Enhancing Black-Box Model Extraction via Ensemble based sample selection. Akshit Jindal, Vikram Goyal, Saket Anand, Chetan Arora |
| 2024 | ArtQuest: Countering Hidden Language Biases in ArtVQA. Tibor Bleidt, Sedigheh Eslami, Gerard de Melo |
| 2024 | AssemblyNet: A Point Cloud Dataset and Benchmark for Predicting Part Directions in an Exploded Layout. Jesper Gaarsdal, Joakim Bruslund Haurum, Sune Wolff, Claus Brøndgaard Madsen |
| 2024 | Assessing Neural Network Robustness via Adversarial Pivotal Tuning. Peter Ebert Christensen, Vésteinn Snæbjarnarson, Andrea Dittadi, Serge J. Belongie, Sagie Benaim |
| 2024 | Assist Is Just as Important as the Goal: Image Resurfacing to Aid Model's Robust Prediction. Abhijith Sharma, Phil Munz, Apurva Narayan |
| 2024 | Asymmetric Image Retrieval with Cross Model Compatible Ensembles. Alon Shoshan, Ori Linial, Nadav Bhonker, Elad Hirsch, Lior Zamir, Igor Kviatkovsky, Gérard G. Medioni |
| 2024 | Attention Modules Improve Image-Level Anomaly Detection for Industrial Inspection: A DifferNet Case Study. André Luiz Buarque Vieira e Silva, Francisco Simões, Danny Kowerko, Tobias Schlosser, Felipe Battisti, Veronica Teichrieb |
| 2024 | Attention-Guided Prototype Mixing: Diversifying Minority Context on Imbalanced Whole Slide Images Classification Learning. Farchan Hakim Raswa, Chun-Shien Lu, Jia-Ching Wang |
| 2024 | Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection. Deepti Hegde, Vishal M. Patel |
| 2024 | Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models. Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu |
| 2024 | Auto-BPA: An Enhanced Ball-Pivoting Algorithm with Adaptive Radius using Contextual Bandits. Houda Saffi, Naima Otberdout, Youssef Hmamouche, Amal El Fallah Seghrouchni |
| 2024 | Automated Camera Calibration via Homography Estimation with GNNs. Giacomo D'Amicantonio, Egor Bondarev, Peter H. N. de With |
| 2024 | Automated Monitoring of Ear Biting in Pigs by Tracking Individuals and Events. Anicetus Odo, Niall McLaughlin, Ilias Kyriazakis |
| 2024 | Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video Recognition. Takuro Fujii, Hayato Nakagawa, Teppei Takeshima, Yasushi Yumura, Tomoki Hamagami |
| 2024 | AvatarOne: Monocular 3D Human Animation. Akash Karthikeyan, Robert Ren, Yash Kant, Igor Gilitschenski |
| 2024 | BALF: Simple and Efficient Blur Aware Local Feature Detector. Zhenjun Zhao |
| 2024 | BEVMap: Map-Aware BEV Modeling for 3D Perception. Mincheol Chang, Seokha Moon, Reza Mahjourian, Jinkyu Kim |
| 2024 | BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation. Liyang Liu, Zihan Wang, Minh Hieu Phan, Bowen Zhang, Jinchao Ge, Yifan Liu |
| 2024 | BSRAW: Improving Blind RAW Image Super-Resolution. Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte |
| 2024 | Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation. Zhongyu Jiang, Zhuoran Zhou, Lei Li, Wenhao Chai, Cheng-Yen Yang, Jenq-Neng Hwang |
| 2024 | Bag of Tricks for Fully Test-Time Adaptation. Saypraseuth Mounsaveng, Florent Chiaroni, Malik Boudiaf, Marco Pedersoli, Ismail Ben Ayed |
| 2024 | Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness. Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Zachariah Carmichael, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug |
| 2024 | Benchmarking Out-of-Distribution Detection in Visual Question Answering. Xiangxi Shi, Stefan Lee |
| 2024 | Best of Both Worlds: Learning Arbitrary-scale Blind Super-Resolution via Dual Degradation Representations and Cycle-Consistency. Shao-Yu Weng, Hsuan Yuan, Yu-Syuan Xu, Ching-Chun Huang, Wei-Chen Chiu |
| 2024 | Beyond Active Learning: Leveraging the Full Potential of Human Interaction via Auto-Labeling, Human Correction, and Human Verification. Nathan Beck, Krishnateja Killamsetty, Suraj Kothawade, Rishabh K. Iyer |
| 2024 | Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection. Teodora Popordanoska, Aleksei Tiulpin, Matthew B. Blaschko |
| 2024 | Beyond Document Page Classification: Design, Datasets, and Challenges. Jordy Van Landeghem, Sanket Biswas, Matthew B. Blaschko, Marie-Francine Moens |
| 2024 | Beyond Fusion: Modality Hallucination-based Multispectral Fusion for Pedestrian Detection. Qian Xie, Ta Ying Cheng, Jia-Xing Zhong, Kaichen Zhou, Andrew Markham, Niki Trigoni |
| 2024 | Beyond RGB: A Real World Dataset for Multispectral Imaging in Mobile Devices. Ortal Glatt, Yotam Ater, Woo-Shik Kim, Shira Werman, Oded Berby, Yael Zini, Shay Zelinger, Sangyoon Lee, Heejin Choi, Evgeny Soloveichik |
| 2024 | Beyond SOT: Tracking Multiple Generic Objects at Once. Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova |
| 2024 | Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation. Reza Azad, Leon Niggemeier, Michael Hüttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof |
| 2024 | Bi-directional Training for Composed Image Retrieval via Text Prompt Learning. Zheyuan Liu, Weixuan Sun, Yicong Hong, Damien Teney, Stephen Gould |
| 2024 | Bias and Diversity in Synthetic-based Face Recognition. Marco Huber, Anh Thi Luu, Fadi Boutros, Arjan Kuijper, Naser Damer |
| 2024 | BigSmall: Efficient Multi-Task Learning for Disparate Spatial and Temporal Physiological Measurements. Girish Narayanswamy, Yujia Liu, Yuzhe Yang, Chengqian Ma, Xin Liu, Daniel McDuff, Shwetak N. Patel |
| 2024 | Bipartite Graph Diffusion Model for Human Interaction Generation. Baptiste Chopin, Hao Tang, Mohamed Daoudi |
| 2024 | BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping. Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jacobs |
| 2024 | Blurry Video Compression A Trade-off between Visual Enhancement and Data Compression. Dawit Mureja Argaw, Junsik Kim, In So Kweon |
| 2024 | BoostRad: Enhancing Object Detection by Boosting Radar Reflections. Yuval Haitman, Oded Bialer |
| 2024 | Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention. Jinwoo Hwang, Philipp Benz, Pete Kim |
| 2024 | Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth. Cagri Gungor, Adriana Kovashka |
| 2024 | Brainomaly: Unsupervised Neurologic Disease Detection Utilizing Unannotated T1-weighted Brain MR Images. Md Mahfuzur Rahman Siddiquee, Jay Shah, Teresa Wu, Catherine D. Chong, Todd J. Schwedt, Gina Dumkrieger, Simona Nikolova, Baoxin Li |
| 2024 | Bridging Generalization Gaps in High Content Imaging Through Online Self-Supervised Domain Adaptation. Johan Fredin Haslum, Christos Matsoukas, Karl-Johan Leuchowius, Kevin Smith |
| 2024 | Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion. Xilai Li, Xiaosong Li, Tao Ye, Xiaoqi Cheng, Wuyang Liu, Haishu Tan |
| 2024 | C Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala |
| 2024 | C-CLIP: Contrastive Image-Text Encoders to Close the Descriptive-Commentative Gap. William Theisen, Walter J. Scheirer |
| 2024 | CAD - Contextual Multi-modal Alignment for Dynamic AVQA. Asmar Nadeem, Adrian Hilton, Robert Dawes, Graham Thomas, Armin Mustafa |
| 2024 | CAILA: Concept-Aware Intra-Layer Adapters for Compositional Zero-Shot Learning. Zhaoheng Zheng, Haidong Zhu, Ram Nevatia |
| 2024 | CAMOT: Camera Angle-aware Multi-Object Tracking. Felix Limanta, Kuniaki Uto, Koichi Shinoda |
| 2024 | CARE: Counterfactual-based Algorithmic Recourse for Explainable Pose Correction. Bhat Dittakavi, Bharathi Callepalli, Aleti Vardhan, Sai Vikas Desai, Vineeth N. Balasubramanian |
| 2024 | CATS: Combined Activation and Temporal Suppression for Efficient Network Inference. Zeqi Zhu, Arash Pourtaherian, Luc Waeijen, Ibrahim Batuhan Akkaya, Egor Bondarev, Orlando Moreira |
| 2024 | CCMR: High Resolution Optical Flow Estimation via Coarse-to-Fine Context-Guided Motion Reasoning. Azin Jahedi, Maximilian Luz, Marc Rivinius, Andrés Bruhn |
| 2024 | CGAPoseNet+GCAN: A Geometric Clifford Algebra Network for Geometry-aware Camera Pose Regression. Alberto Pepe, Joan Lasenby, Sven Buchholz |
| 2024 | CHAI: Craters in Historical Aerial Images. Marvin Burges, Sebastian Zambanini, Philipp Pirker |
| 2024 | CL-MAE: Curriculum-Learned Masked Autoencoders. Neelu Madan, Nicolae-Catalin Ristea, Kamal Nasrollahi, Thomas B. Moeslund, Radu Tudor Ionescu |
| 2024 | CLID: Controlled-Length Image Descriptions with Limited Data. Elad Hirsch, Ayellet Tal |
| 2024 | CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free. Monika Wysoczanska, Michaël Ramamonjisoa, Tomasz Trzcinski, Oriane Siméoni |
| 2024 | CLIPAG: Towards Generator-Free Text-to-Image Generation. Roy Ganz, Michael Elad |
| 2024 | CLRerNet: Improving Confidence of Lane Detection with LaneIoU. Hiroto Honda, Yusuke Uchida |
| 2024 | CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting. Lei Li |
| 2024 | CSAM: A 2.5D Cross-Slice Attention Module for Anisotropic Volumetric Medical Image Segmentation. Alex Ling Yu Hung, Haoxin Zheng, Kai Zhao, Xiaoxi Du, Kaifeng Pang, Qi Miao, Steven S. Raman, Demetri Terzopoulos, Kyunghyun Sung |
| 2024 | CVTHead: One-shot Controllable Head Avatar with Vertex-feature Transformer. Haoyu Ma, Tong Zhang, Shanlin Sun, Xiangyi Yan, Kun Han, Xiaohui Xie |
| 2024 | CXR-IRGen: An Integrated Vision and Language Model for the Generation of Clinically Accurate Chest X-Ray Image-Report Pairs. Junjie Shentu, Noura Al Moubayed |
| 2024 | Camera-Independent Single Image Depth Estimation from Defocus Blur. Lahiru N. S. Wijayasingha, Homa Alemzadeh, John A. Stankovic |
| 2024 | CamoFocus: Enhancing Camouflage Object Detection with Split-Feature Focal Modulation and Context Refinement. Abbas Khan, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El Saddik, Giulia De Masi, Fakhri Karray |
| 2024 | Can CLIP Help Sound Source Localization? Sooyoung Park, Arda Senocak, Joon Son Chung |
| 2024 | Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning. Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp |
| 2024 | Can you even tell left from right? Presenting a new challenge for VQA. Sai Raam Venkataraman, Rishi Sridhar Rao, S. Balasubramanian, R. Raghunatha Sarma, Chandra Sekhar Vorugunti |
| 2024 | Causal Analysis for Robust Interpretability of Neural Networks. Ola Ahmad, Nicolas Béreux, Loïc Baret, Vahid Hashemi, Freddy Lécué |
| 2024 | Causal Feature Alignment: Learning to Ignore Spurious Background Features. Rahul Venkataramani, Parag Dutta, Vikram Melapudi, Ambedkar Dukkipati |
| 2024 | Cheating Depth: Enhancing 3D Surface Anomaly Detection via Depth Simulation. Vitjan Zavrtanik, Matej Kristan, Danijel Skocaj |
| 2024 | Classifying Cable Tendency with Semantic Segmentation by Utilizing Real and Simulated RGB Data. Pei-Chun Chien, Powei Liao, Eiji Fukuzawa, Jun Ohya |
| 2024 | ClusterFix: A Cluster-Based Debiasing Approach without Protected-Group Supervision. Giacomo Capitani, Federico Bolelli, Angelo Porrello, Simone Calderara, Elisa Ficarra |
| 2024 | Co-Speech Gesture Detection through Multi-Phase Sequence Labeling. Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim T. J. L. Pouw, Peter Uhrig, Judith Holler, Ivan Toni, Asli Özyürek, Raquel Fernández |
| 2024 | CoD: Coherent Detection of Entities from Images with Multiple Modalities. Vinay Kumar Verma, Dween Rabius Sanny, Abhishek Singh, Deepak Gupta |
| 2024 | Collage Diffusion. Vishnu Sarukkai, Linden Li, Arden Ma, Christopher Ré, Kayvon Fatahalian |
| 2024 | Common Diffusion Noise Schedules and Sample Steps are Flawed. Shanchuan Lin, Bingchen Liu, Jiashi Li, Xiao Yang |
| 2024 | Complementary-Contradictory Feature Regularization against Multimodal Overfitting. Antonio Tejero-de-Pablos |
| 2024 | Complex Organ Mask Guided Radiology Report Generation. Tiancheng Gu, Dongnan Liu, Zhiyuan Li, Weidong Cai |
| 2024 | Composite Diffusion: whole >= Σparts. Vikram Jamwal, Ramaneswaran S. |
| 2024 | Computer Vision on the Edge: Individual Cattle Identification in Real-time with ReadMyCow System. Moniek Smink, Haotian Liu, Dörte Döpfer, Yong Jae Lee |
| 2024 | Concept-Centric Transformers: Enhancing Model Interpretability through Object-Centric Concept Learning within a Shared Global Workspace. Jinyung Hong, Keun Hee Park, Theodore P. Pavlic |
| 2024 | Concurrent Band Selection and Traversability Estimation from Long-Wave Hyperspectral Imagery in Off-Road Settings. Florence Yellin, Scott McCloskey, Cole Hill, Eric Smith, Brian Clipp |
| 2024 | Conditional Velocity Score Estimation for Image Restoration. Ziqiang Shi, Rujie Liu |
| 2024 | ConeQuest: A Benchmark for Cone Segmentation on Mars. Mirali Purohit, Jacob B. Adler, Hannah Kerner |
| 2024 | ConfTrack: Kalman Filter-based Multi-Person Tracking by Utilizing Confidence Score of Detection Box. Hyeonchul Jung, Seokjun Kang, Takgen Kim, Hyeongki Kim |
| 2024 | Consistent Multimodal Generation via A Unified GAN Framework. Zhen Zhu, Yijun Li, Weijie Lyu, Krishna Kumar Singh, Zhixin Shu, Sören Pirk, Derek Hoiem |
| 2024 | Constrained Probabilistic Mask Learning for Task-specific Undersampled MRI Reconstruction. Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer |
| 2024 | Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks. Yixuan Ren, Jing Shi, Zhifei Zhang, Yifei Fan, Zhe Lin, Bo He, Abhinav Shrivastava |
| 2024 | Context in Human Action through Motion Complementarity. Eadom Dessalene, Michael Maynord, Cornelia Fermüller, Yiannis Aloimonos |
| 2024 | Context-based Interpretable Spatio-Temporal Graph Convolutional Network for Human Motion Forecasting. Edgar Medina, Leyong Loh, Namrata Gurung, Kyung Hun Oh, Niels Heller |
| 2024 | Contextual Affinity Distillation for Image Anomaly Detection. Jie Zhang, Masanori Suganuma, Takayuki Okatani |
| 2024 | Continual Learning of Unsupervised Monocular Depth from Videos. Hemang Chawla, Arnav Varma, Elahe Arani, Bahram Zonooz |
| 2024 | Continual Test-time Domain Adaptation via Dynamic Sample Selection. Yanshuo Wang, Jie Hong, Ali Cheraghian, Shafin Rahman, David Ahmedt-Aristizabal, Lars Petersson, Mehrtash Harandi |
| 2024 | Continual atlas-based segmentation of prostate MRI. Amin Ranem, Camila González, Daniel Pinto dos Santos, Andreas M. Bucher, Ahmed E. Othman, Anirban Mukhopadhyay |
| 2024 | Continuous Adaptation for Interactive Segmentation Using Teacher-Student Architecture. Barsegh Atanyan, Levon Khachatryan, Shant Navasardyan, Yunchao Wei, Humphrey Shi |
| 2024 | Contrastive Learning for Multi-Object Tracking with Transformers. Pierre-François De Plaen, Nicola Marinello, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool |
| 2024 | Contrastive Viewpoint-aware Shape Learning for Long-term Person Re-Identification. Vuong D. Nguyen, Khadija Khaldi, Dung Nguyen, Pranav Mantini, Shishir Shah |
| 2024 | Controllable Image Synthesis of Industrial Data using Stable Diffusion. Gabriele Valvano, Antonino Agostino, Giovanni De Magistris, Antonino Graziano, Giacomo Veneri |
| 2024 | Controllable Text-to-Image Synthesis for Multi-Modality MR Images. Kyuri Kim, Yoonho Na, Sung-Joon Ye, Jimin Lee, Sungsoo Ahn, Ji Eun Park, Hwiyoung Kim |
| 2024 | Controlling Character Motions without Observable Driving Source. Weiyuan Li, Bin Dai, Ziyi Zhou, Qi Yao, Baoyuan Wang |
| 2024 | Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model. Shoma Iwai, Tomo Miyazaki, Shinichiro Omachi |
| 2024 | Controlling Virtual Try-on Pipeline Through Rendering Policies. Kedan Li, Jeffrey Zhang, Shao-Yu Chang, David A. Forsyth |
| 2024 | Convolutional Masked Image Modeling for Dense Prediction Tasks on Pathology Images. Yan Yang, Liyuan Pan, Liu Liu, Eric A. Stone |
| 2024 | Correlation-aware active learning for surgery video segmentation. Fei Wu, Pablo Márquez-Neila, Mingyi Zheng, Hedyeh Rafii-Tari, Raphael Sznitman |
| 2024 | CrashCar101: Procedural Generation for Damage Assessment. Jens Parslov, Erik Riise, Dim P. Papadopoulos |
| 2024 | Critical Gap Between Generalization Error and Empirical Error in Active Learning. Yusuke Kanebako |
| 2024 | Cross-Attention Between Satellite and Ground Views for Enhanced Fine-Grained Robot Geo-Localization. Dong Yuan, Frédéric Maire, Feras Dayoub |
| 2024 | Cross-Domain Few-Shot Incremental Learning for Point-Cloud Recognition. Yuwen Tan, Xiang Xiang |
| 2024 | Cross-feature Contrastive Loss for Decentralized Deep Learning on Heterogeneous Data. Sai Aparna Aketi, Kaushik Roy |
| 2024 | CryoRL: Reinforcement Learning Enables Efficient Cryo-EM Data Collection. Quanfu Fan, Yilai Li, Yuguang Yao, John Cohn, Sijia Liu, Ziping Xu, Seychelle M. Vos, Michael A. Cianfrocco |
| 2024 | Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models. Hai Wang, Xiaoyu Xiang, Yuchen Fan, Jing-Hao Xue |
| 2024 | CycleCL: Self-supervised Learning for Periodic Videos. Matteo Destro, Michael Gygli |
| 2024 | D Lin Zhang, Linghan Xu, Saman Motamed, Shayok Chakraborty, Fernando De la Torre |
| 2024 | D4: Detection of Adversarial Diffusion Deepfakes Using Disjoint Ensembles. Ashish Hooda, Neal Mangaokar, Ryan Feng, Kassem Fawaz, Somesh Jha, Atul Prakash |
| 2024 | DDAM-PS: Diligent Domain Adaptive Mixer for Person Search. Mohammed Khaleed Almansoori, Mustansar Fiaz, Hisham Cholakkal |
| 2024 | DECDM: Document Enhancement using Cycle-Consistent Diffusion Models. Jiaxin Zhang, Joy Rimchala, Lalla Mouatadid, Kamalika Das, Kumar Sricharan |
| 2024 | DISCO: Distributed Inference with Sparse Communications. Minghai Qin, Chao Sun, Jaco Hofmann, Dejan Vucinic |
| 2024 | DPPMask: Masked Image Modeling with Determinantal Point Processes. Junde Xu, Zikai Lin, Donghao Zhou, Yaodong Yang, Xiangyun Liao, Qiong Wang, Bian Wu, Guangyong Chen, Pheng-Ann Heng |
| 2024 | DR Chenxu Zhang, Chao Wang, Yifan Zhao, Shuo Cheng, Linjie Luo, Xiaohu Guo |
| 2024 | DR10K: Transfer Learning Using Weak Labels for Grading Diabetic Retinopathy on DR10K Dataset. Mohamed ElHabebe, Shereen Elkordi, Ahmed Gamal-Eldin, Noha Adly, Marwan Torki, Ahmed Elmasry, Islam SH Ahmed |
| 2024 | DREAM: Visual Decoding from REversing HumAn Visual SysteM. Weihao Xia, Raoul de Charette, Cengiz Öztireli, Jing-Hao Xue |
| 2024 | DTrOCR: Decoder-only Transformer for Optical Character Recognition. Masato Fujitake |
| 2024 | Data Augmentation for Object Detection via Controllable Diffusion Models. Haoyang Fang, Boran Han, Shuai Zhang, Su Zhou, Cuixiong Hu, Wenming Ye |
| 2024 | Data-Centric Debugging: mitigating model failures via targeted image retrieval. Sahil Singla, Atoosa Malemir Chegini, Mazda Moayeri, Soheil Feizi |
| 2024 | DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation. Volodymyr Fedynyak, Yaroslav Romanus, Bohdan Hlovatskyi, Bohdan Sydor, Oles Dobosevych, Igor Babin, Roman Riazantsev |
| 2024 | Debiasing, calibrating, and improving Semi-supervised Learning performance via simple Ensemble Projector. Khanh-Binh Nguyen |
| 2024 | Deblur-NSFF: Neural Scene Flow Fields for Blurry Dynamic Scenes. Achleshwar Luthra, Shiva Souhith Gantha, Xiyun Song, Heather Yu, Zongfang Lin, Liang Peng |
| 2024 | Deep Image Fingerprint: Towards Low Budget Synthetic Image Detection and Model Lineage Analysis. Sergey Sinitsa, Ohad Fried |
| 2024 | Deep Metric Learning with Chance Constraints. Yeti Ziya Gürbüz, Ogul Can, A. Aydin Alatan |
| 2024 | Deep Optics for Optomechanical Control Policy Design. Justin Fletcher |
| 2024 | Deep Plug-and-play Nighttime Non-blind Deblurring with Saturated Pixel Handling Schemes. Hung-Yu Shu, Yi-Hsien Lin, Yi-Chang Lu |
| 2024 | Deep Subdomain Alignment for Cross-domain Image Classification. Yewei Zhao, Hu Han, Shiguang Shan, Xilin Chen |
| 2024 | Deep Visual-Genetic Biometrics for Taxonomic Classification of Rare Species. Tayfun Karaderi, Tilo Burghardt, Raphael Morard, Daniela N. Schmidt |
| 2024 | Defending Object Detection Models against Image Distortions. Mark Ofori-Oduro, Maria A. Amer |
| 2024 | Defense against Adversarial Cloud Attack on Remote Sensing Salient Object Detection. Huiming Sun, Lan Fu, Jinlong Li, Qing Guo, Zibo Meng, Tianyun Zhang, Yuewei Lin, Hongkai Yu |
| 2024 | Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation. Zhuoran Yu, Manchen Wang, Yanbei Chen, Paolo Favaro, Davide Modolo |
| 2024 | Density-Based Flow Mask Integration via Deformable Convolution for Video People Flux Estimation. Chang-Lin Wan, Feng-Kai Huang, Hong-Han Shuai |
| 2024 | Depth from Asymmetric Frame-Event Stereo: A Divide-and-Conquer Approach. Xihao Chen, Wenming Weng, Yueyi Zhang, Zhiwei Xiong |
| 2024 | Describe Images in a Boring Way: Towards Cross-Modal Sarcasm Generation. Jie Ruan, Yue Wu, Xiaojun Wan, Yuesheng Zhu |
| 2024 | Design Choices for Enhancing Noisy Student Self-Training. Aswathnarayan Radhakrishnan, Jim Davis, Zachary Rabin, Benjamin Lewis, Matthew Scherreik, Roman Ilin |
| 2024 | Designing a Hybrid Neural System to Learn Real-world Crack Segmentation from Fractal-based Simulation. Achref Jaziri, Martin Mundt, Andres Fernandez Rodriguez, Visvanathan Ramesh |
| 2024 | Detecting Content Segments from Online Sports Streaming Events: Challenges and Solutions. Zongyi Liu, Yarong Feng, Shunyan Luo, Yuan Ling, Shujing Dong, Shuyi Wang |
| 2024 | Detection Defenses: An Empty Promise against Adversarial Patch Attacks on Optical Flow. Erik Scheurer, Jenny Schmalfuss, Alexander Lis, Andrés Bruhn |
| 2024 | Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization. Soumik Mukhopadhyay, Saksham Suri, Ravi Teja Gadde, Abhinav Shrivastava |
| 2024 | DiffBody: Diffusion-based Pose and Shape Editing of Human Images. Yuta Okuyama, Yuki Endo, Yoshihiro Kanamori |
| 2024 | DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification. Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao Wu |
| 2024 | Differentiable JPEG: The Devil is in the Details. Christoph Reich, Biplob Debnath, Deep Patel, Srimat Chakradhar |
| 2024 | Differentially Private Video Activity Recognition. Zelun Luo, Yuliang Zou, Yijin Yang, Zane Durante, De-An Huang, Zhiding Yu, Chaowei Xiao, Li Fei-Fei, Animashree Anandkumar |
| 2024 | Diffuse and Restore: A Region-Adaptive Diffusion Model for Identity-Preserving Blind Face Restoration. Maitreya Suin, Nithin Gopalakrishnan Nair, Chun Pong Lau, Vishal M. Patel, Rama Chellappa |
| 2024 | Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation. Michal Stypulkowski, Konstantinos Vougioukas, Sen He, Maciej Zieba, Stavros Petridis, Maja Pantic |
| 2024 | Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition. Cindy M. Nguyen, Eric R. Chan, Alexander W. Bergman, Gordon Wetzstein |
| 2024 | Diffusion models meet image counter-forensics. Matías Tailanián, Marina Gardella, Álvaro Pardo, Pablo Musé |
| 2024 | Diffusion-based generation of Histopathological Whole Slide Images at a Gigapixel scale. Robert Harb, Thomas Pock, Heimo Müller |
| 2024 | Discovering and Mitigating Biases in CLIP-based Image Editing. Md. Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha, Garrison W. Cottrell |
| 2024 | Discriminator-free Unsupervised Domain Adaptation for Multi-label Image Classification. Inder Pal Singh, Enjie Ghorbel, Anis Kacem, Arunkumar Rathinam, Djamila Aouada |
| 2024 | Disentangled Pre-training for Image Matting. Yanda Li, Zilong Huang, Gang Yu, Ling Chen, Yunchao Wei, Jianbo Jiao |
| 2024 | Distortion-Disentangled Contrastive Learning. Jinfeng Wang, Sifan Song, Jionglong Su, S. Kevin Zhou |
| 2024 | Diverse Imagenet Models Transfer Better. Niv Nayman, Avram Golbert, Asaf Noy, Lihi Zelnik-Manor |
| 2024 | Do VSR Models Generalize Beyond LRS3? Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Eustache Le Bihan, Haithem Boussaid, Ebtesam Almazrouei, Mérouane Debbah |
| 2024 | Do We Still Need Non-Maximum Suppression? Accurate Confidence Estimates and Implicit Duplication Modeling with IoU-Aware Calibration. Johannes Gilg, Torben Teepe, Fabian Herzog, Philipp Wolters, Gerhard Rigoll |
| 2024 | DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction. Fangchen Yu, Yina Xie, Lei Wu, Yafei Wen, Guozhi Wang, Shuai Ren, Xiaoxin Chen, Jianfeng Mao, Wenye Li |
| 2024 | Domain Adaptive 3D Shape Retrieval from Monocular Images. Harsh Pal, Ritwik Khandelwal, Shivam Pande, Biplab Banerjee, Srikrishna Karanam |
| 2024 | Domain Aligned CLIP for Few-shot Classification. Muhammad Waleed Gondal, Jochen Gast, Iñigo Alonso Ruiz, Richard Droste, Tommaso Macrì, Suren Kumar, Luitpold Staudigl |
| 2024 | Domain Generalisation via Risk Distribution Matching. Toan Nguyen, Kien Do, Bao Duong, Thin Nguyen |
| 2024 | Domain Generalization by Rejecting Extreme Augmentations. Masih Aminbeidokhti, Fidel A. Guerrero-Peña, Heitor Rapela Medeiros, Thomas Dubail, Eric Granger, Marco Pedersoli |
| 2024 | Domain Generalization with Correlated Style Uncertainty. Zheyuan Zhang, Bin Wang, Debesh Jha, Ugur Demir, Ulas Bagci |
| 2024 | Domain-Aware Knowledge Distillation for Continual Model Generalization. Nikhil Reddy, Mahsa Baktashmotlagh, Chetan Arora |
| 2024 | Driving through the Concept Gridlock: Unraveling Explainability Bottlenecks in Automated Driving. Jessica Maria Echterhoff, An Yan, Kyungtae Han, Amr Abdelraouf, Rohit Gupta, Julian J. McAuley |
| 2024 | Dual Domain Diffusion Guidance for 3D CBCT Metal Artifact Reduction. Yongjin Choi, Doeyoung Kwon, Seung Jun Baek |
| 2024 | Dynamic Multimodal Information Bottleneck for Multimodality Classification. Yingying Fang, Shuang Wu, Sheng Zhang, Chaoyan Huang, Tieyong Zeng, Xiaodan Xing, Simon Walsh, Guang Yang |
| 2024 | Dynamic Token-Pass Transformers for Semantic Segmentation. Yuang Liu, Qiang Zhou, Jing Wang, Zhibin Wang, Fan Wang, Jun Wang, Wei Zhang |
| 2024 | EASUM: Enhancing Affective State Understanding through Joint Sentiment and Emotion Modeling for Multimodal Tasks. Yewon Hwang, Jong-Hwan Kim |
| 2024 | ECSIC: Epipolar Cross Attention for Stereo Image Compression. Matthias Wödlinger, Jan Kotera, Manuel Keglevic, Jan Xu, Robert Sablatnig |
| 2024 | ENIGMA-51: Towards a Fine-Grained Understanding of Human Behavior in Industrial Scenarios. Francesco Ragusa, Rosario Leonardi, Michele Mazzamuto, Claudia Bonanno, Rosario Scavo, Antonino Furnari, Giovanni Maria Farinella |
| 2024 | ENTED: Enhanced Neural Texture Extraction and Distribution for Reference-based Blind Face Restoration. Yuen-Fui Lau, Tianjia Zhang, Zhefan Rao, Qifeng Chen |
| 2024 | EResFD: Rediscovery of the Effectiveness of Standard Convolution for Lightweight Face Detection. Joonhyun Jeong, Beomyoung Kim, Joonsang Yu, Youngjoon Yoo |
| 2024 | Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks. Clemens J. S. Schaefer, Siddharth Joshi, Shan Li, Raúl Blázquez |
| 2024 | Effective Restoration of Source Knowledge in Continual Test Time Adaptation. Fahim Faisal Niloy, Sk Miraj Ahmed, Dripta S. Raychaudhuri, Samet Oymak, Amit K. Roy-Chowdhury |
| 2024 | Effects of Markers in Training Datasets on the Accuracy of 6D Pose Estimation. Janis Rosskamp, René Weller, Gabriel Zachmann |
| 2024 | Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning. Soumya Roy, Vinay Kumar Verma, Deepak Gupta |
| 2024 | Efficient Explainable Face Verification based on Similarity Score Argument Backpropagation. Marco Huber, Anh Thi Luu, Philipp Terhörst, Naser Damer |
| 2024 | Efficient Feature Distillation for Zero-shot Annotation Object Detection. Zhuoming Liu, Xuefeng Hu, Ram Nevatia |
| 2024 | Efficient Layout-Guided Image Inpainting for Mobile Use. Wenbo Li, Yi Wei, Yilin Shen, Hongxia Jin |
| 2024 | Efficient MAE towards Large-Scale Vision Transformers. Han Qiu, Gongjie Zhang, Jiaxing Huang, Peng Gao, Zhang Wei, Shijian Lu |
| 2024 | Efficient Semantic Matching with Hypercolumn Correlation. Seungwook Kim, Juhong Min, Minsu Cho |
| 2024 | Efficient Transferability Assessment for Selection of Pre-trained Detectors. Zhao Wang, Aoxue Li, Zhenguo Li, Qi Dou |
| 2024 | EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies. Kilian Batzner, Lars Heckler, Rebecca König |
| 2024 | Ego2HandsPose: A Dataset for Egocentric Two-hand 3D Global Pose Estimation. Fanqing Lin, Tony R. Martinez |
| 2024 | Egocentric Action Recognition by Capturing Hand-Object Contact and Object State. Tsukasa Shiota, Motohiro Takagi, Kaori Kumagai, Hitoshi Seshimo, Yushi Aono |
| 2024 | Elusive Images: Beyond Coarse Analysis for Fine-Grained Recognition. Connor Anderson, Matthew Gwilliam, Evelyn Gaskin, Ryan Farrell |
| 2024 | Embedding Task Structure for Action Detection. Michael Peven, Gregory D. Hager |
| 2024 | Embodied Human Activity Recognition. Sha Hu, Yu Gong, Greg Mori |
| 2024 | EmoStyle: One-Shot Facial Expression Editing Using Continuous Emotion Parameters. Bita Azari, Angelica Lim |
| 2024 | Empowering Unsupervised Domain Adaptation with Large-scale Pre-trained Vision-Language Models. Zhengfeng Lai, Haoping Bai, Haotian Zhang, Xianzhi Du, Jiulong Shan, Yinfei Yang, Chen-Nee Chuah, Meng Cao |
| 2024 | Enforcing Sparsity on Latent Space for Robust and Explainable Representations. Hanao Li, Tian Han |
| 2024 | Enhancing Diverse Intra-identity Representation for Visible-Infrared Person Re-Identification. Sejun Kim, Soonyong Gwon, Kisung Seo |
| 2024 | Enhancing Multi-view Pedestrian Detection Through Generalized 3D Feature Pulling. Sithu Aung, Haesol Park, Hyungjoo Jung, Junghyun Cho |
| 2024 | Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining. Ugur Sahin, Hang Li, Qadeer Khan, Daniel Cremers, Volker Tresp |
| 2024 | Estimating Blood Alcohol Level Through Facial Features for Driver Impairment Assessment. Ensiyeh Keshtkaran, Brodie von Berg, Grant Regan, David Suter, Syed Zulqarnain Gilani |
| 2024 | Estimating Fog Parameters from an Image Sequence using Non-linear Optimisation. Yining Ding, Andrew M. Wallace, Sen Wang |
| 2024 | EvDNeRF: Reconstructing Event Data with Dynamic Neural Radiance Fields. Anish Bhattacharya, Ratnesh Madaan, Fernando Cladera Ojeda, Sai Vemprala, Rogerio Bonatti, Kostas Daniilidis, Ashish Kapoor, Vijay Kumar, Nikolai Matni, Jayesh K. Gupta |
| 2024 | Evaluation of Video Masked Autoencoders' Performance and Uncertainty Estimations for Driver Action and Intention Recognition. Koen Vellenga, H. Joe Steinhauer, Göran Falkman, Tomas Björklund |
| 2024 | Evidential Uncertainty Quantification: A Variance-Based Perspective. Ruxiao Duan, Brian Caffo, Harrison X. Bai, Haris I. Sair, Craig K. Jones |
| 2024 | Evolve: Enhancing Unsupervised Continual Learning with Multiple Experts. Xiaofan Yu, Tajana Rosing, Yunhui Guo |
| 2024 | Expanding Expressiveness of Diffusion Models with Limited Data via Self-Distillation based Fine-Tuning. Jiwan Hur, Jaehyun Choi, Gyojin Han, Dong-Jae Lee, Junmo Kim |
| 2024 | Expanding Hyperspherical Space for Few-Shot Class-Incremental Learning. Yao Deng, Xiang Xiang |
| 2024 | Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels. Bo Wan, Tinne Tuytelaars |
| 2024 | Exploiting the Signal-Leak Bias in Diffusion Models. Martin Nicolas Everaert, Athanasios Fitsios, Marco Bocchio, Sami Arpa, Sabine Süsstrunk, Radhakrishna Achanta |
| 2024 | Exploring Adversarial Robustness of Vision Transformers in the Spectral Perspective. Gihyun Kim, Juyeop Kim, Jong-Seok Lee |
| 2024 | Exploring the Impact of Rendering Method and Motion Quality on Model Performance when Using Multi-view Synthetic Data for Action Recognition. Stanislav Panev, Emily Kim, Sai Abhishek Si Namburu, Desislava Nikolova, Celso De Melo, Fernando De la Torre, Jessica K. Hodgins |
| 2024 | FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation. Jianlong Yuan, Minh Hieu Phan, Liyang Liu, Yifan Liu |
| 2024 | FATE: Feature-Agnostic Transformer-based Encoder for learning generalized embedding spaces in flow cytometry data. Lisa Weijler, Florian Kowarsch, Michael Reiter, Pedro Hermosilla, Margarita Maurer-Granofszky, Michael N. Dworzak |
| 2024 | FELGA: Unsupervised Fragment Embedding for Fine-Grained Cross-Modal Association. Yaoxin Zhuo, Baoxin Li |
| 2024 | FG-Net: Facial Action Unit Detection with Generalizable Pyramidal Features. Yufeng Yin, Di Chang, Guoxian Song, Shen Sang, Tiancheng Zhi, Jing Liu, Linjie Luo, Mohammad Soleymani |
| 2024 | FIRE: Food Image to REcipe generation. Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski |
| 2024 | FIRe: Fast Inverse Rendering using Directional and Signed Distance Functions. Tarun Yenamandra, Ayush Tewari, Nan Yang, Florian Bernard, Christian Theobalt, Daniel Cremers |
| 2024 | FLORA: Fine-grained Low-Rank Architecture Search for Vision Transformer. Chi-Chih Chang, Yuan-Yao Sung, Shixing Yu, Ning-Chi Huang, Diana Marculescu, Kai-Chiang Wu |
| 2024 | FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval. Luca Barsellotti, Roberto Amoroso, Lorenzo Baraldi, Rita Cucchiara |
| 2024 | FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data. Oliver Boyne, Gwangbin Bae, James Charles, Roberto Cipolla |
| 2024 | FPGAN-Control: A Controllable Fingerprint Generator for Training with Synthetic Data. Alon Shoshan, Nadav Bhonker, Emanuel Ben Baruch, Ori Nizan, Igor Kviatkovsky, Joshua J. Engelsma, Manoj Aggarwal, Gérard G. Medioni |
| 2024 | FRoG-MOT: Fast and Robust Generic Multiple-Object Tracking by IoU and Motion-State Associations. Takuya Ogawa, Takashi Shibata, Toshinori Hosoi |
| 2024 | FacadeNet: Conditional Facade Synthesis via Selective Editing. Yiangos Georgiou, Marios Loizou, Tom Kelly, Melinos Averkiou |
| 2024 | Face Identity-Aware Disentanglement in StyleGAN. Adrian Suwala, Bartosz Wójcik, Magdalena Proszewska, Jacek Tabor, Przemyslaw Spurek, Marek Smieja |
| 2024 | Face Presentation Attack Detection by Excavating Causal Clues and Adapting Embedding Statistics. Meiling Fang, Naser Damer |
| 2024 | FarSight: A Physics-Driven Whole-Body Biometric System at Large Distance and Altitude. Feng Liu, Ryan Ashbaugh, Nicholas Chimitt, Najmul Hassan, Ali Hassani, Ajay Jaiswal, Minchul Kim, Zhiyuan Mao, Christopher Perry, Zhiyuan Ren, Yiyang Su, Pegah Varghaei, Kai Wang, Stanley H. Chan, Arun Ross, Humphrey Shi, Zhangyang Wang, Anil Jain, Xiaoming Liu |
| 2024 | Fast Diffusion EM: a diffusion model for blind inverse problems with application to deconvolution. Charles Laroche, Andrés Almansa, Eva Coupeté |
| 2024 | Fast Sun-aligned Outdoor Scene Relighting based on TensoRF. Yeonjin Chang, Yearim Kim, Seunghyeon Seo, Jung Yi, Nojun Kwak |
| 2024 | Fast and Interpretable Face Identification for Out-Of-Distribution Data Using Vision Transformers. Hai Phan, Cindy X. Le, Vu Le, Yihui He, Anh Totti Nguyen |
| 2024 | FastCLIPstyler: Optimisation-free Text-based Image Style Transfer Using Style Representations. Ananda Padhmanabhan Suresh, Sanjana Jain, Pavit Noinongyao, Ankush Ganguly, Ukrit Watchareeruetai, Aubin Samacoïts |
| 2024 | FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline. Chien-Yu Lin, Qichen Fu, Thomas Merth, Karren D. Yang, Anurag Ranjan |
| 2024 | Favoring One Among Equals - Not a Good Idea: Many-to-one Matching for Robust Transformer based Pedestrian Detection. K. N. Ajay Shastry, K. Ravi Sri Teja, Aditya Nigam, Chetan Arora |
| 2024 | Feed-Forward Latent Domain Adaptation. Ondrej Bohdal, Da Li, Shell Xu Hu, Timothy M. Hospedales |
| 2024 | Few-Shot Event Classification in Images using Knowledge Graphs for Prompting. Golsa Tahmasebzadeh, Matthias Springstein, Ralph Ewerth, Eric Müller-Budack |
| 2024 | Few-shot Shape Recognition by Learning Deep Shape-aware Features. Wenlong Shi, Changsheng Lu, Ming Shao, Yinjie Zhang, Siyu Xia, Piotr Koniusz |
| 2024 | Few-shot generative model for skeleton-based human action synthesis using cross-domain adversarial learning. Kenichiro Fukushi, Yoshitaka Nozaki, Kosuke Nishihara, Kentaro Nakahara |
| 2024 | FinderNet: A Data Augmentation Free Canonicalization aided Loop Detection and Closure technique for Point clouds in 6-DOF separation. Sudarshan S. Harithas, Gurkirat Singh, Aneesh Chavan, Sarthak Sharma, Suraj Patni, Chetan Arora, Madhava Krishna |
| 2024 | Fine-Grained Alignment for Cross-Modal Recipe Retrieval. Muntasir Wahed, Xiaona Zhou, Tianjiao Yu, Ismini Lourentzou |
| 2024 | Fingervein Verification using Convolutional Multi-Head Attention Network. Raghavendra Ramachandra, Sushma Venkatesh |
| 2024 | FishTrack23: An Ensemble Underwater Dataset for Multi-Object Tracking. Matthew Dawkins, Jack Prior, Bryon Lewis, Robin Faillettaz, Thompson Banez, Mary Salvi, Audrey K. Rollo, Julien Simon, Matthew D. Campbell, Matthew Lucero, Aashish Chaudhary, Benjamin L. Richards, Anthony Hoogs |
| 2024 | Fixed Pattern Noise Removal For Multi-View Single-Sensor Infrared Camera. Arnaud Barral, Pablo Arias, Axel Davy |
| 2024 | Fixing Overconfidence in Dynamic Neural Networks. Lassi Meronen, Martin Trapp, Andrea Pilzer, Le Yang, Arno Solin |
| 2024 | FocusTune: Tuning Visual Localization through Focus-Guided Sampling. Son Tung Nguyen, Alejandro Fontán, Michael Milford, Tobias Fischer |
| 2024 | Foundation Model Assisted Weakly Supervised Semantic Segmentation. Xiaobo Yang, Xiaojin Gong |
| 2024 | Framework-agnostic Semantically-aware Global Reasoning for Segmentation. Mir Rayat Imtiaz Hossain, Leonid Sigal, James J. Little |
| 2024 | FreMIM: Fourier Transform Meets Masked Image Modeling for Medical Image Segmentation. Wenxuan Wang, Jing Wang, Chen Chen, Jianbo Jiao, Yuanxiu Cai, Shanshan Song, Jiangyun Li |
| 2024 | Frequency Attention for Knowledge Distillation. Cuong Pham, Van-Anh Nguyen, Trung Le, Dinh Q. Phung, Gustavo Carneiro, Thanh-Toan Do |
| 2024 | From Chaos to Calibration: A Geometric Mutual Information Approach to Target-Free Camera LiDAR Extrinsic Calibration. Jack Borer, Jeremy Tschirner, Florian Ölsner, Stefan Milz |
| 2024 | From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation. Ruxue Wen, Hangjie Yuan, Dong Ni, Wenbo Xiao, Yaoyao Wu |
| 2024 | Fully-Automatic Reflection Removal for 360-Degree Images. Jonghyuk Park, HyeonA Kim, Eunpil Park, Jae-Young Sim |
| 2024 | FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions. Noam Rotstein, David Bensaïd, Shaked Brody, Roy Ganz, Ron Kimmel |
| 2024 | G-CASCADE: Efficient Cascaded Graph Convolutional Decoding for 2D Medical Image Segmentation. Md Mostafijur Rahman, Radu Marculescu |
| 2024 | GC-MVSNet: Multi-View, Multi-Scale, Geometrically-Consistent Multi-View Stereo. Vibhas K. Vats, Sripad Joshi, David J. Crandall, Md. Alimoor Reza, Soon-Heung Jung |
| 2024 | GC-VTON: Predicting Globally Consistent and Occlusion Aware Local Flows with Neighborhood Integrity Preservation for Virtual Try-on. Hamza Rawal, Muhammad Junaid Ahmad, Farooq Zaman |
| 2024 | GIPCOL: Graph-Injected Soft Prompting for Compositional Zero-Shot Learning. Guangyue Xu, Joyce Chai, Parisa Kordjamshidi |
| 2024 | GLAD: Global-Local View Alignment and Background Debiasing for Unsupervised Video Domain Adaptation with Large Domain Gap. Hyogun Lee, Kyungho Bae, Seong Jong Ha, Yumin Ko, Gyeong-Moon Park, Jinwoo Choi |
| 2024 | GRIT: GAN Residuals for Paired Image-to-Image Translation. Saksham Suri, Moustafa Meshry, Larry S. Davis, Abhinav Shrivastava |
| 2024 | GTP-ViT: Efficient Vision Transformers via Graph-based Token Propagation. Xuwei Xu, Sen Wang, Yudong Chen, Yanping Zheng, Zhewei Wei, Jiajun Liu |
| 2024 | GazeGNN: A Gaze-Guided Graph Neural Network for Chest X-ray Classification. Bin Wang, Hongyi Pan, Armstrong Aboah, Zheyuan Zhang, Elif Keles, Drew A. Torigian, Baris Turkbey, Elizabeth A. Krupinski, Jayaram K. Udupa, Ulas Bagci |
| 2024 | Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation. Joshua Niemeijer, Manuel Schwonberg, Jan-Aike Termöhlen, Nico M. Schmidt, Tim Fingscheidt |
| 2024 | Generalizing to Unseen Domains in Diabetic Retinopathy Classification. Chamuditha Jayanga Galappaththige, Gayal Kuruppu, Muhammad Haris Khan |
| 2024 | Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models. Minxing Zhang, Ning Yu, Rui Wen, Michael Backes, Yang Zhang |
| 2024 | Generation of Upright Panoramic Image from Non-upright Panoramic Image. Jingguo Liu, Heyu Chen, Shigang Li, Jianfeng Li |
| 2024 | Glance to Count: Learning to Rank with Anchors for Weakly-supervised Crowd Counting. Zheng Xiong, Liangyu Chai, Wenxi Liu, Yongtuo Liu, Sucheng Ren, Shengfeng He |
| 2024 | Global Occlusion-Aware Transformer for Robust Stereo Matching. Zihua Liu, Yizhou Li, Masatoshi Okutomi |
| 2024 | Gradient Coreset for Federated Learning. Durga Sivasubramanian, Lokesh Nagalapatti, Rishabh K. Iyer, Ganesh Ramakrishnan |
| 2024 | Gradient-Guided Knowledge Distillation for Object Detectors. Qizhen Lan, Qing Tian |
| 2024 | Gradual Source Domain Expansion for Unsupervised Domain Adaptation. Thomas Westfechtel, Hao-Wei Yeh, Dexuan Zhang, Tatsuya Harada |
| 2024 | Grafting Vision Transformers. Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo |
| 2024 | Graph Neural Networks for End-to-End Information Extraction from Handwritten Documents. Yessine Khanfir, Marwa Dhiaf, Emna Ghodhbani, Ahmed Cheikh Rouhou, Yousri Kessentini |
| 2024 | Graph(Graph): A Nested Graph-Based Framework for Early Accident Anticipation. Nupur Thakur, PrasanthSai Gouripeddi, Baoxin Li |
| 2024 | GraphFill: Deep Image Inpainting using Graphs. Shashikant Verma, Aman Sharma, Roopa Sheshadri, Shanmuganathan Raman |
| 2024 | Group-wise Contrastive Bottleneck for Weakly-Supervised Visual Representation Learning. Boon Peng Yap, Beng Koon Ng |
| 2024 | Guided Cluster Aggregation: A Hierarchical Approach to Generalized Category Discovery. Jona Otholt, Christoph Meinel, Haojin Yang |
| 2024 | Guided Distillation for Semi-Supervised Instance Segmentation. Tariq Berrada, Camille Couprie, Karteek Alahari, Jakob Verbeek |
| 2024 | HALSIE: Hybrid Approach to Learning Segmentation by Simultaneously Exploiting Image and Event Modalities. Shristi Das Biswas, Adarsh Kosta, Chamika M. Liyanagedera, Marco Paul E. Apolinario, Kaushik Roy |
| 2024 | HAMMER: Learning Entropy Maps to Create Accurate 3D Models in Multi-View Stereo. Rafael Weilharter, Friedrich Fraundorfer |
| 2024 | HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation. Jinbo Wu, Xiaobo Gao, Xing Liu, Zhengyang Shen, Chen Zhao, Haocheng Feng, Jingtuo Liu, Errui Ding |
| 2024 | HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration. Weiyi Xue, Fan Lu, Guang Chen |
| 2024 | HELA-VFA: A Hellinger Distance-Attention-based Feature Aggregation Network for Few-Shot Classification. Gao Yu Lee, Tanmoy Dam, Daniel Puiu Poenar, Vu N. Duong, Md Meftahul Ferdaus |
| 2024 | HMP: Hand Motion Priors for Pose and Shape Estimation from Video. Enes Duran, Muhammed Kocabas, Vasileios Choutas, Zicong Fan, Michael J. Black |
| 2024 | HaGRID - HAnd Gesture Recognition Image Dataset. Alexander Kapitanov, Karina Kvanchiani, Alexander Nagaev, Roman Kraynov, Andrei Makhliarchuk |
| 2024 | HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information. Heitor Rapela Medeiros, Fidel A. Guerrero-Peña, Masih Aminbeidokhti, Thomas Dubail, Eric Granger, Marco Pedersoli |
| 2024 | Handformer2T: A Lightweight Regression-based Model for Interacting Hands Pose Estimation from A Single RGB Image. Pengfei Zhang, Deying Kong |
| 2024 | Hard Sample-aware Consistency for Low-resolution Facial Expression Recognition. Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko |
| 2024 | Hard-label based Small Query Black-box Adversarial Attack. Jeonghwan Park, Paul Miller, Niall McLaughlin |
| 2024 | Hardware Aware Evolutionary Neural Architecture Search using Representation Similarity Metric. Nilotpal Sinha, Abd El Rahman Shabayek, Anis Kacem, Peyman Rostami, Carl Shneider, Djamila Aouada |
| 2024 | Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance. Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya |
| 2024 | HashReID: Dynamic Network with Binary Codes for Efficient Person Re-identification. Kshitij Nikhal, Yujunrong Ma, Shuvra S. Bhattacharyya, Benjamin S. Riggan |
| 2024 | Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes. Youssef Shoeb, Robin Chan, Gesina Schwalbe, Azarm Nowzad, Fatma Güney, Hanno Gottschalk |
| 2024 | Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation. Zeyu Lu, Chengyue Wu, Xinyuan Chen, Yaohui Wang, Lei Bai, Yu Qiao, Xihui Liu |
| 2024 | Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis. Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis |
| 2024 | High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis. Andrei-Timotei Ardelean, Tim Weyrich |
| 2024 | High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation. Arvi Jonnarth, Yushan Zhang, Michael Felsberg |
| 2024 | Holistic Representation Learning for Multitask Trajectory Anomaly Detection. Alexandros Stergiou, Brent De Weerdt, Nikos Deligiannis |
| 2024 | How Do Deepfakes Move? Motion Magnification for Deepfake Source Detection. Ilke Demir, Umur Aybars Ciftci |
| 2024 | Human Motion Aware Text-to-Video Generation with Explicit Camera Control. Taehoon Kim, Chanhee Kang, Jaehyuk Park, Daun Jeong, ChangHee Yang, Suk-Ju Kang, Kyeongbo Kong |
| 2024 | Hyb-NeRF: A Multiresolution Hybrid Encoding for Neural Radiance Fields. Yifan Wang, Yi Gong, Yuan Zeng |
| 2024 | Hybrid Neural Diffeomorphic Flow for Shape Representation and Generation via Triplane. Kun Han, Shanlin Sun, Thanh-Tung Le, Xiangyi Yan, Haoyu Ma, Chenyu You, Xiaohui Xie |
| 2024 | Hybrid Sample Synthesis-based Debiasing of Classifier in Limited Data Setting. Piyush Arora, Pratik Mazumder |
| 2024 | HyperMix: Out-of-Distribution Detection and Classification in Few-Shot Settings. Nikhil Mehta, Kevin J. Liang, Jing Huang, Fu-Jen Chu, Li Yin, Tal Hassner |
| 2024 | Hyperbolic vs Euclidean Embeddings in Few-Shot Learning: Two Sides of the Same Coin. Gabriel Moreira, Manuel Marques, João Paulo Costeira, Alexander G. Hauptmann |
| 2024 | I-AI: A Controllable & Interpretable AI System for Decoding Radiologists' Intense Focus for Accurate CXR Diagnoses. Trong-Thang Pham, Jacob Brecheisen, Anh Nguyen, Hien Nguyen, Ngan Le |
| 2024 | ICF-SRSR: Invertible scale-Conditional Function for Self-Supervised Real-world Single Image Super-Resolution. Reyhaneh Neshatavar, Mohsen Yavartanoo, Sanghyun Son, Kyoung Mu Lee |
| 2024 | IDD-AW: A Benchmark for Safe and Robust Segmentation of Drive Scenes in Unstructured Traffic and Adverse Weather. Furqan Ahmed Shaik, Abhishek Reddy Malreddy, Nikhil Reddy Billa, Kunal Chaudhary, Sunny Manchanda, Girish Varma |
| 2024 | IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024, Waikoloa, HI, USA, January 3-8, 2024 |
| 2024 | IKEA Ego 3D Dataset: Understanding furniture assembly actions from ego-view 3D Point Clouds. Yizhak Ben-Shabat, Jonathan Paul, Eviatar Segev, Oren Shrout, Stephen Gould |
| 2024 | INCODE: Implicit Neural Conditioning with Prior Knowledge Embeddings. Amirhossein Kazerouni, Reza Azad, Alireza Hosseini, Dorit Merhof, Ulas Bagci |
| 2024 | IR-FRestormer: Iterative Refinement with Fourier-Based Restormer for Accelerated MRI Reconstruction. Mohammad Zalbagi Darestani, Vishwesh Nath, Wenqi Li, Yufan He, Holger R. Roth, Ziyue Xu, Daguang Xu, Reinhard Heckel, Can Zhao |
| 2024 | ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification. Nicolas Gorlo, Kenneth Blomqvist, Francesco Milano, Roland Siegwart |
| 2024 | Identifying Label Errors in Object Detection Datasets by Loss Inspection. Marius Schubert, Tobias Riedlinger, Karsten Kahl, Daniel Kröll, Sebastian Schoenen, Sinisa Segvic, Matthias Rottmann |
| 2024 | Image Denoising and the Generative Accumulation of Photons. Alexander Krull, Hector Basevi, Benjamin Salmon, Andre Zeug, Franziska Müller, Samuel Tonks, Leela Muppala, Ales Leonardis |
| 2024 | Image Labels Are All You Need for Coarse Seagrass Segmentation. Scarlett Raine, Ross Marchant, Brano Kusy, Frédéric Maire, Tobias Fischer |
| 2024 | Implicit Neural Image Stitching With Enhanced and Blended Feature Reconstruction. Minsu Kim, Jaewon Lee, Byeonghun Lee, Sunghoon Im, Kyong Hwan Jin |
| 2024 | Implicit neural representation for change detection. Peter Naylor, Diego Di Carlo, Arianna Traviglia, Makoto Yamada, Marco Fiorucci |
| 2024 | Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths. Ximeng Sun, Rameswar Panda, Chun-Fu Richard Chen, Naigang Wang, Bowen Pan, Aude Oliva, Rogério Feris, Kate Saenko |
| 2024 | Improved Topological Preservation in 3D Axon Segmentation and Centerline Detection using Geometric Assessment-driven Topological Smoothing (GATS). Nina I. Shamsi, Alec S. Xu, Lars A. Gjesteby, Laura J. Brattain |
| 2024 | Improving Fairness in Deepfake Detection. Yan Ju, Shu Hu, Shan Jia, George H. Chen, Siwei Lyu |
| 2024 | Improving Fairness using Vision-Language Driven Image Augmentation. Moreno D'Incà, Christos Tzelepis, Ioannis Patras, Nicu Sebe |
| 2024 | Improving Graph Networks through Selection-based Convolution. David Hart, Bryan S. Morse |
| 2024 | Improving Normalization with the James-Stein Estimator. Seyedalireza Khoshsirat, Chandra Kambhamettu |
| 2024 | Improving Open-Set Semi-Supervised Learning with Self-Supervision. Erik Wallin, Lennart Svensson, Fredrik Kahl, Lars Hammarstrand |
| 2024 | Improving Vision-and-Language Reasoning via Spatial Relations Modeling. Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou |
| 2024 | Improving the Effectiveness of Deep Generative Data. Ruyu Wang, Sabrina Schmedding, Marco F. Huber |
| 2024 | Improving the Fairness of the Min-Max Game in GANs Training. Zhaoyu Zhang, Yang Hua, Hui Wang, Seán F. McLoone |
| 2024 | Improving the Leaking of Augmentations in Data-Efficient GANs via Adaptive Negative Data Augmentation. Zhaoyu Zhang, Yang Hua, Guanxiong Sun, Hui Wang, Seán F. McLoone |
| 2024 | Incorporating Physics Principles for Precise Human Motion Prediction. Yufei Zhang, Jeffrey O. Kephart, Qiang Ji |
| 2024 | Increasing biases can be more efficient than increasing weights. Carlo Metta, Marco Fantozzi, Andrea Papini, Gianluca Amato, Matteo Bergamaschi, Silvia Giulia Galfrè, Alessandro Marchetti, Michelangelo Vegliò, Maurizio Parton, Francesco Morandin |
| 2024 | Indoor Visual Localization using Point and Line Correspondences in dense colored point cloud. Yuya Matsumoto, Gaku Nakano, Kazumine Ogura |
| 2024 | IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting. Tim J. Schoonbeek, Tim Houben, Hans Onvlee, Peter H. N. de With, Fons van der Sommen |
| 2024 | InfraParis: A multi-modal and multi-task autonomous driving dataset. Gianni Franchi, Marwane Hariat, Xuanlong Yu, Nacim Belkhir, Antoine Manzanera, David Filliat |
| 2024 | Instruct Me More! Random Prompting for Visual In-Context Learning. Jiahao Zhang, Bowen Wang, Liangzhi Li, Yuta Nakashima, Hajime Nagahara |
| 2024 | Interaction Region Visual Transformer for Egocentric Action Anticipation. Debaditya Roy, Ramanathan Rajendiran, Basura Fernando |
| 2024 | Interactive Network Perturbation between Teacher and Students for Semi-Supervised Semantic Segmentation. Hyuna Cho, Injun Choi, Suha Kwak, Won Hwa Kim |
| 2024 | Interactive Segmentation for Diverse Gesture Types Without Context. Josh Myers-Dean, Yifei Fan, Brian L. Price, Wilson Chan, Danna Gurari |
| 2024 | Interpretable Object Recognition by Semantic Prototype Analysis. Qiyang Wan, Ruiping Wang, Xilin Chen |
| 2024 | Intrinsic Hand Avatar: Illumination-aware Hand Appearance and Shape Reconstruction from Monocular RGB Video. Pratik Kalshetti, Parag Chaudhuri |
| 2024 | Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection. Kyle Buettner, Adriana Kovashka |
| 2024 | Iterative Multi-granular Image Editing using Diffusion Models. K. J. Joseph, Prateksha Udhayanan, Tripti Shukla, Aishwarya Agarwal, Srikrishna Karanam, Koustava Goswami, Balaji Vasan Srinivasan |
| 2024 | JOADAA: joint online action detection and action anticipation. Mohammed Guermal, Abid Ali, Rui Dai, François Brémond |
| 2024 | Joint 3D Shape and Motion Estimation from Rolling Shutter Light-Field Images. Hermes McGriff, Renato Martins, Nicolas Andreff, Cédric Demonceaux |
| 2024 | Joint Depth Prediction and Semantic Segmentation with Multi-View SAM. Mykhailo Shvets, Dongxu Zhao, Marc Niethammer, Roni Sengupta, Alexander C. Berg |
| 2024 | Kaizen: Practical self-supervised continual learning with continual fine-tuning. Chi Ian Tang, Lorena Qendro, Dimitris Spathis, Fahim Kawsar, Cecilia Mascolo, Akhil Mathur |
| 2024 | LAVSS: Location-Guided Audio-Visual Spatial Audio Separation. Yuxin Ye, Wenming Yang, Yapeng Tian |
| 2024 | LIVENet: A novel network for real-world low-light image denoising and enhancement. Dhruv Makwana, Gayatri Deshmukh, Onkar Susladkar, Sparsh Mittal, R. Sai Chandra Teja |
| 2024 | LInKs "Lifting Independent Keypoints" - Partial Pose Lifting for Occlusion Handling with Improved Accuracy in 2D-3D Human Pose Estimation. Peter Hardy, Hansung Kim |
| 2024 | LP-OVOD: Open-Vocabulary Object Detection by Linear Probing. Chau Pham, Truong Vu, Khoi Nguyen |
| 2024 | Label Augmentation as Inter-class Data Augmentation for Conditional Image Synthesis with Imbalanced Data. Kai Katsumata, Duc Minh Vo, Hideki Nakayama |
| 2024 | Label Shift Estimation for Class-Imbalance Problem: A Bayesian Approach. Changkun Ye, Russell Tsuchida, Lars Petersson, Nick Barnes |
| 2024 | Label-Free Synthetic Pretraining of Object Detectors. Hei Law, Jia Deng |
| 2024 | Late to the party? On-demand unlabeled personalized federated learning. Ohad Amosy, Gal Eyal, Gal Chechik |
| 2024 | Latent Feature-Guided Diffusion Models for Shadow Removal. Kangfu Mei, Luis Figueroa, Zhe Lin, Zhihong Ding, Scott Cohen, Vishal M. Patel |
| 2024 | Latent-Guided Exemplar-Based Image Re-Colorization. Wenjie Yang, Ning Xu, Yifei Fan |
| 2024 | LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration. Ran Liu, Sahil Khose, Jingyun Xiao, Lakshmi Sathidevi, Keerthan Ramnath, Zsolt Kira, Eva L. Dyer |
| 2024 | LatentPaint: Image Inpainting in Latent Space with Diffusion Models. Ciprian A. Corneanu, Raghudeep Gadde, Aleix M. Martínez |
| 2024 | LaughTalk: Expressive 3D Talking Head Generation with Laughter. Sung-Bin Kim, Lee Hyun, Da Hye Hong, Suekyeong Nam, Janghoon Ju, Tae-Hyun Oh |
| 2024 | Layer-wise Auto-Weighting for Non-Stationary Test-Time Adaptation. Junyoung Park, Jin Kim, Hyeongjun Kwon, Ilhoon Yoon, Kwanghoon Sohn |
| 2024 | Learn to Unlearn for Deep Neural Networks: Minimizing Unlearning Interference with Gradient Projection. Tuan Hoang, Santu Rana, Sunil Gupta, Svetha Venkatesh |
| 2024 | Learnable Cube-based Video Encryption for Privacy-Preserving Action Recognition. Yuchi Ishikawa, Masayoshi Kondo, Hirokatsu Kataoka |
| 2024 | Learning Better Keypoints for Multi-Object 6DoF Pose Estimation. Yangzheng Wu, Michael A. Greenspan |
| 2024 | Learning Class and Domain Augmentations for Single-Source Open-Domain Generalization. Prathmesh Bele, Valay Bundele, Avigyan Bhattacharya, Ankit Jha, Gemma Roig, Biplab Banerjee |
| 2024 | Learning Generalizable Perceptual Representations for Data-Efficient No-Reference Image Quality Assessment. Suhas Srinath, Shankhanil Mitra, Shika Rao, Rajiv Soundararajan |
| 2024 | Learning Intra-class Multimodal Distributions with Orthonormal Matrices. Jumpei Goto, Yohei Nakata, Kiyofumi Abe, Yasunori Ishii, Takayoshi Yamashita |
| 2024 | Learning Low-Rank Latent Spaces with Simple Deterministic Autoencoder: Theoretical and Empirical Insights. Alokendu Mazumder, Tirthajit Baruah, Bhartendu Kumar, Rishab Sharma, Vishwajeet Pattanaik, Punit Rathore |
| 2024 | Learning Quality Labels for Robust Image Classification. Xiaosong Wang, Ziyue Xu, Dong Yang, Leo K. Tam, Holger Roth, Daguang Xu |
| 2024 | Learning Residual Elastic Warps for Image Stitching under Dirichlet Boundary Condition. Minsu Kim, Yongjun Lee, Woo Kyoung Han, Kyong Hwan Jin |
| 2024 | Learning Robust Deep Visual Representations from EEG Brain Recordings. Prajwal Singh, Dwip Dalal, Gautam Vashishtha, Krishna P. Miyapuram, Shanmuganathan Raman |
| 2024 | Learning Saliency From Fixations. Yasser Abdelaziz Dahou Djilali, Kevin McGuinness, Noel E. O'Connor |
| 2024 | Learning Transferable Representations for Image Anomaly Localization Using Dense Pretraining. Haitian He, Sarah M. Erfani, Mingming Gong, Qiuhong Ke |
| 2024 | Learning Visual Body-shape-Aware Embeddings for Fashion Compatibility. Kaicheng Pang, Xingxing Zou, Waikeung Wong |
| 2024 | Learning the What and How of Annotation in Video Object Segmentation. Thanos Delatolas, Vicky Kalogeiton, Dim P. Papadopoulos |
| 2024 | Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation. Xueting Hu, Ce Zhang, Yi Zhang, Bowen Hai, Ke Yu, Zhihai He |
| 2024 | Learning to Compose SuperWeights for Neural Parameter Allocation Search. Piotr Teterwak, Soren Nelson, Nikoli Dryden, Dina Bashkirova, Kate Saenko, Bryan A. Plummer |
| 2024 | Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation. JuneHyoung Kwon, Eunju Lee, Yunsung Cho, YoungBin Kim |
| 2024 | Learning to Read Analog Gauges from Synthetic Data. Juan Leon Alcazar, Yazeed Alnumay, Cheng Zheng, Hassane Trigui, Sahejad Patel, Bernard Ghanem |
| 2024 | Learning to Recognize Occluded and Small Objects with Partial Inputs. Hasib Zunair, A. Ben Hamza |
| 2024 | Learning to generate training datasets for robust semantic segmentation. Marwane Hariat, Olivier Laurent, Rémi Kazmierczak, Shihao Zhang, Andrei Bursuc, Angela Yao, Gianni Franchi |
| 2024 | Learning-based Spotlight Position Optimization for Non-Line-of-Sight Human Localization and Posture Classification. Sreenithy Chandran, Tatsuya Yatagawa, Hiroyuki Kubo, Suren Jayasuriya |
| 2024 | LensNeRF: Rethinking Volume Rendering based on Thin-Lens Camera Model. Min-Jung Kim, Gyojung Gu, Jaegul Choo |
| 2024 | Let the Beat Follow You - Creating Interactive Drum Sounds From Body Rhythm. Xiulong Liu, Kun Su, Eli Shlizerman |
| 2024 | Let's Observe Them Over Time: An Improved Pedestrian Attribute Recognition Approach. Kamalakar Vijay Thakare, Debi Prosad Dogra, Heeseung Choi, Haksub Kim, Ig-Jae Kim |
| 2024 | Letting 3D Guide the Way: 3D Guided 2D Few-Shot Image Classification. Jiajing Chen, Minmin Yang, Senem Velipasalar |
| 2024 | Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement. Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava |
| 2024 | Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos. Sanket Kumar Thakur, Cigdem Beyan, Pietro Morerio, Vittorio Murino, Alessio Del Bue |
| 2024 | Leveraging Synthetic Data to Learn Video Stabilization Under Adverse Conditions. Abdulrahman Kerim, Washington L. S. Ramos, Leandro Soriano Marcolino, Erickson R. Nascimento, Richard Jiang |
| 2024 | Leveraging Task-Specific Pre-Training to Reason across Images and Videos. Arka Sadhu, Ram Nevatia |
| 2024 | Leveraging the Power of Data Augmentation for Transformer-based Tracking. Jie Zhao, Johan Edstedt, Michael Felsberg, Dong Wang, Huchuan Lu |
| 2024 | LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis. Di Chang, Yufeng Yin, Zongjian Li, Minh Tran, Mohammad Soleymani |
| 2024 | LidarCLIP or: How I Learned to Talk to Point Clouds. Georg Hess, Adam Tonderski, Christoffer Petersson, Kalle Åström, Lennart Svensson |
| 2024 | Lightweight Delivery Detection on Doorbell Cameras. Pirazh Khorramshahi, Zhe Wu, Tianchen Wang, Luke Deluccia, Hongcheng Wang |
| 2024 | Lightweight Portrait Matting via Regional Attention and Refinement. Yatao Zhong, Ilya Zharkov |
| 2024 | Lightweight Thermal Super-Resolution and Object Detection for Robust Perception in Adverse Weather Conditions. Pranjay Shyam, Hyunjin Yoo |
| 2024 | Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders. Srijan Das, Tanmay Jain, Dominick Reilly, Pranav Balaji, Soumyajit Karmakar, Shyam Marjit, Xiang Li, Abhijit Das, Michael S. Ryoo |
| 2024 | Link Prediction for Flow-Driven Spatial Networks. Bastian Wittmann, Johannes C. Paetzold, Chinmay Prabhakar, Daniel Rueckert, Bjoern H. Menze |
| 2024 | Linking convolutional kernel size to generalization bias in face analysis CNNs. Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan |
| 2024 | LipAT: Beyond Style Transfer for Controllable Neural Simulation of Lipstick using Cosmetic Attributes. Amila Silva, Olga Moskvyak, Alexander Long, Ravi Garg, Stephen Gould, Gil Avraham, Anton van den Hengel |
| 2024 | Localization and Manipulation of Immoral Visual Cues for Safe Text-to-Image Generation. Seongbeom Park, Suhong Moon, Seunghyun Park, Jinkyu Kim |
| 2024 | Location-Aware Self-Supervised Transformers for Semantic Segmentation. Mathilde Caron, Neil Houlsby, Cordelia Schmid |
| 2024 | LongFormer: Longitudinal Transformer for Alzheimer's Disease Classification with Structural MRIs. Qiuhui Chen, Qiang Fu, Hao Bai, Yi Hong |
| 2024 | Lost Your Style? Navigating with Semantic-Level Approach for Text-to-Outfit Retrieval. JunKyu Jang, Eugene Hwang, Sung-Hyuk Park |
| 2024 | M Muhammad Abdullah Jamal, Omid Mohareri |
| 2024 | MACP: Efficient Model Adaptation for Cooperative Perception. Yunsheng Ma, Juanwu Lu, Can Cui, Sicheng Zhao, Xu Cao, Wenqian Ye, Ziran Wang |
| 2024 | MAELi: Masked Autoencoder for Large-Scale LiDAR Point Clouds. Georg Krispel, David Schinagl, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof |
| 2024 | MAdVerse: A Hierarchical Dataset of Multi-Lingual Ads from Diverse Sources and Categories. Amruth Sagar, Rishabh Srivastava, Rakshitha R. T, Venkata Kesav Venna, Ravi Kiran Sarvadevabhatla |
| 2024 | MEGANet: Multi-Scale Edge-Guided Attention Network for Weak Boundary Polyp Segmentation. Nhat-Tan Bui, Dinh-Hieu Hoang, Quang-Thuc Nguyen, Minh-Triet Tran, Ngan Le |
| 2024 | MFT: Long-Term Tracking of Every Pixel. Michal Neoral, Jonás Serých, Jirí Matas |
| 2024 | MGM-AE: Self-Supervised Learning on 3D Shape Using Mesh Graph Masked Autoencoders. Zhangsihao Yang, Kaize Ding, Huan Liu, Yalin Wang |
| 2024 | MICS: Midpoint Interpolation to Learn Compact and Separated Representations for Few-Shot Class-Incremental Learning. Solang Kim, Yuho Jeong, Joon Sung Park, Sung Whan Yoon |
| 2024 | MIDAS: Mixing Ambiguous Data with Soft Labels for Dynamic Facial Expression Recognition. Ryosuke Kawamura, Hideaki Hayashi, Noriko Takemura, Hajime Nagahara |
| 2024 | MIST: Medical Image Segmentation Transformer with Convolutional Attention Mixing (CAM) Decoder. Md Motiur Rahman, Shiva Shokouhmand, Smriti Bhatt, Miad Faezipour |
| 2024 | MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition. Ruiqi Xian, Xijun Wang, Dinesh Manocha |
| 2024 | MIVC: Multiple Instance Visual Component for Visual-Language Models. Wenyi Wu, Qi Li, Wenliang Zhong, Junzhou Huang |
| 2024 | MOPA: Modular Object Navigation with PointGoal Agents. Sonia Raychaudhuri, Tommaso Campari, Unnat Jain, Manolis Savva, Angel X. Chang |
| 2024 | MPT: Mesh Pre-Training with Transformers for Human Pose and Mesh Reconstruction. Kevin Lin, Chung-Ching Lin, Lin Liang, Zicheng Liu, Lijuan Wang |
| 2024 | MS-EVS: Multispectral event-based vision for deep learning based face detection. Saad Himmi, Vincent Parret, Ajad Chhatkuli, Luc Van Gool |
| 2024 | MSCC: Multi-Scale Transformers for Camera Calibration. Xu Song, Hao Kang, Atsunori Moteki, Genta Suzuki, Yoshie Kobayashi, Zhiming Tan |
| 2024 | MagneticPillars: Efficient Point Cloud Registration through Hierarchized Birds-Eye-View Cell Correspondence Refinement. Kai Fischer, Martin Simon, Stefan Milz, Patrick Mäder |
| 2024 | MarsLS-Net: Martian Landslides Segmentation Network and Benchmark Dataset. Sidike Paheding, Abel A. Reyes, A. Rajaneesh, K. S. Sajinkumar, Thomas Oommen |
| 2024 | MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation. Abdullah Rashwan, Jiageng Zhang, Ali Taalimi, Fan Yang, Xingyi Zhou, Chaochao Yan, Liang-Chieh Chen, Yeqing Li |
| 2024 | Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation. Fangwen Wu, Jingxuan He, Yufei Yin, Yanbin Hao, Gang Huang, Lechao Cheng |
| 2024 | Masked Event Modeling: Self-Supervised Pretraining for Event Cameras. Simon Klenk, David Bonello, Lukas Koestler, Nikita Araslanov, Daniel Cremers |
| 2024 | Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where. Zhi-Yi Chin, Chieh-Ming Jiang, Ching-Chun Huang, Pin-Yu Chen, Wei-Chen Chiu |
| 2024 | Maximum Knowledge Orthogonality Reconstruction with Gradients in Federated Learning. Feng Wang, Senem Velipasalar, Mustafa Cenk Gursoy |
| 2024 | Med-DANet V2: A Flexible Dynamic Architecture for Efficient Medical Volumetric Segmentation. Haoran Shen, Yifu Zhang, Wenxuan Wang, Chen Chen, Jing Liu, Shanshan Song, Jiangyun Li |
| 2024 | Membership Inference Attack Using Self Influence Functions. Gilad Cohen, Raja Giryes |
| 2024 | Meta-Learned Attribute Self-Interaction Network for Continual and Generalized Zero-Shot Learning. Vinay Kumar Verma, Nikhil Mehta, Kevin J. Liang, Aakansha Mishra, Lawrence Carin |
| 2024 | Meta-Learned Kernel For Blind Super-Resolution Kernel Estimation. Royson Lee, Rui Li, Stylianos I. Venieris, Timothy M. Hospedales, Ferenc Huszár, Nicholas D. Lane |
| 2024 | MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation. Beoungwoo Kang, Seunghun Moon, Yubin Cho, Hyunwoo Yu, Suk-Ju Kang |
| 2024 | MetaVers: Meta-Learned Versatile Representations for Personalized Federated Learning. Jin Hyuk Lim, SeungBum Ha, Sung Whan Yoon |
| 2024 | Mini but Mighty: Finetuning ViTs with Mini Adapters. Imad Eddine Marouf, Enzo Tartaglione, Stéphane Lathuilière |
| 2024 | Minimizing Layerwise Activation Norm Improves Generalization in Federated Learning. M. Yashwanth, Gaurav Kumar Nayak, Harsh Rangwani, Arya Singh, R. Venkatesh Babu, Anirban Chakraborty |
| 2024 | Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation. Bin Duan, Hao Tang, Changchang Sun, Ye Zhu, Yan Yan |
| 2024 | Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation. Harsh Maheshwari, Yen-Cheng Liu, Zsolt Kira |
| 2024 | Mitigate Domain Shift by Primary-Auxiliary Objectives Association for Generalizing Person ReID. Qilei Li, Shaogang Gong |
| 2024 | Mixing Gradients in Neural Networks as a Strategy to Enhance Privacy in Federated Learning. Shaltiel Eloul, Fran Silavong, Sanket Kamthe, Antonios Georgiadis, Sean J. Moran |
| 2024 | MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters. Chau Pham, Piotr Teterwak, Soren Nelson, Bryan A. Plummer |
| 2024 | MoP-CLIP: A Mixture of Prompt-Tuned CLIP Models for Domain Incremental Learning. Julien Nicolas, Florent Chiaroni, Imtiaz Masud Ziko, Ola Ahmad, Christian Desrosiers, Jose Dolz |
| 2024 | MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video. Renat Bashirov, Alexey Larionov, Evgeniya Ustinova, Mikhail Sidorenko, David Svitov, Ilya Zakharkin, Victor Lempitsky |
| 2024 | MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device. Ties van Rozendaal, Tushar Singhal, Hoang Le, Guillaume Sautière, Amir Said, Krishna Buska, Anjuman Raha, Dimitris Kalatzis, Hitarth Mehta, Frank Mayer, Liang Zhang, Markus Nagel, Auke J. Wiggers |
| 2024 | Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval. Eunyi Lyou, Doyeon Lee, Jooeun Kim, Joonseok Lee |
| 2024 | MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty. Rémi Marsal, Florian Chabot, Angelique Loesch, William Grolleau, Hichem Sahbi |
| 2024 | Monocular 3D Object Detection with LiDAR Guided Semi Supervised Active Learning. Aral Hekimoglu, Michael Schmidt, Alvaro Marcos-Ramiro |
| 2024 | Motion Matters: Neural Motion Transfer for Better Camera Physiological Measurement. Akshay Paruchuri, Xin Liu, Yulu Pan, Shwetak N. Patel, Daniel McDuff, Soumyadip Sengupta |
| 2024 | MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network. Soroush Mehraban, Vida Adeli, Babak Taati |
| 2024 | MotionGPT: Human Motion Synthesis with Improved Diversity and Realism via GPT-3 Prompting. José Ribeiro-Gomes, Tianhui Cai, Zoltán Ádám Milacski, Chen Wu, Aayush Prakash, Shingo Takagi, Amaury Aubel, Daeil Kim, Alexandre Bernardino, Fernando De la Torre |
| 2024 | Movie Genre Classification by Language Augmentation and Shot Sampling. Zhongping Zhang, Yiwen Gu, Bryan A. Plummer, Xin Miao, Jiayi Liu, Huayan Wang |
| 2024 | MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis. Xuqian Ren, Wenjia Wang, Dingding Cai, Tuuli Tuominen, Juho Kannala, Esa Rahtu |
| 2024 | Multi-Class Segmentation from Aerial Views using Recursive Noise Diffusion. Benedikt Kolbeinsson, Krystian Mikolajczyk |
| 2024 | Multi-Modal Gaze Following in Conversational Scenarios. Yuqi Hou, Zhongqun Zhang, Nora Horanyi, Jaewon Moon, Yihua Cheng, Hyung Jin Chang |
| 2024 | Multi-Source Domain Adaptation for Object Detection with Prototype-based Mean Teacher. Atif Belal, Akhil Meethal, Francisco Perdigon Romero, Marco Pedersoli, Eric Granger |
| 2024 | Multi-level Attention Aggregation for Aesthetic Face Relighting. Hemanth Pidaparthy, Abhay Chauhan, Pavan Sudheendra |
| 2024 | Multi-view 3D Object Reconstruction and Uncertainty Modelling with Neural Shape Prior. Ziwei Liao, Steven L. Waslander |
| 2024 | Multi-view Classification Using Hybrid Fusion and Mutual Distillation. Samuel Black, Richard Souvenir |
| 2024 | Multimodal Channel-Mixing: Channel and Spatial Masked AutoEncoder on Facial Action Unit Detection. Xiang Zhang, Huiyuan Yang, Taoyue Wang, Xiaotian Li, Lijun Yin |
| 2024 | Multimodal Deep Learning for Remote Stress Estimation Using CCT-LSTM. Sayyedjavad Ziaratnia, Tipporn Laohakangvalvit, Midori Sugaya, Peeraya Sripian |
| 2024 | Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion. Hanyu Wang, Pengxiang Wu, Kevin Dela Rosa, Chen Wang, Abhinav Shrivastava |
| 2024 | Multispectral Imaging for Differential Face Morphing Attack Detection: A Preliminary Study. Raghavendra Ramachandra, Sushma Venkatesh, Naser Damer, Narayan Vetrekar, Rajendra S. Gad |
| 2024 | Multitask Vision-Language Prompt Tuning. Sheng Shen, Shijia Yang, Tianjun Zhang, Bohan Zhai, Joseph E. Gonzalez, Kurt Keutzer, Trevor Darrell |
| 2024 | NCIS: Neural Contextual Iterative Smoothing for Purifying Adversarial Perturbations. Sungmin Cha, Naeun Ko, Heewoong Choi, Youngjoon Yoo, Taesup Moon |
| 2024 | NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction. Thorsten Hempel, Magnus Jung, Ahmed A. Abdelrahman, Ayoub Al-Hamadi |
| 2024 | NOMAD: A Natural, Occluded, Multi-scale Aerial Dataset, for Emergency Response Scenarios. Arturo Miguel Russell Bernal, Walter J. Scheirer, Jane Cleland-Huang |
| 2024 | NVAutoNet: Fast and Accurate 360° 3D Visual Perception For Self Driving. Trung Pham, Mehran Maghoumi, Wanli Jiang, Bala Siva Sashank Jujjavarapu, Mehdi Sajjadi, Xin Liu, Hsuan-Chu Lin, Bor-Jeng Chen, Giang Truong, Chao Fang, Junghyun Kwon, Minwoo Park |
| 2024 | Natural Light Can Also be Dangerous: Traffic Sign Misinterpretation Under Adversarial Natural Light Attacks. Teng-Fang Hsiao, Bo-Lun Huang, Zi-Xiang Ni, Yan-Ting Lin, Hong-Han Shuai, Yung-Hui Li, Wen-Huang Cheng |
| 2024 | NeRFEditor: Differentiable Style Decomposition for 3D Scene Editing. Chunyi Sun, Yanbin Liu, Junlin Han, Stephen Gould |
| 2024 | Nested Diffusion Processes for Anytime Image Generation. Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad |
| 2024 | Neural Echos: Depthwise Convolutional Filters Replicate Biological Receptive Fields. Zahra Babaiee, Peyman M. Kiasari, Daniela Rus, Radu Grosu |
| 2024 | Neural Image Compression Using Masked Sparse Visual Representation. Wei Jiang, Wei Wang, Yue Chen |
| 2024 | Neural Style Protection: Counteracting Unauthorized Neural Style Transfer. Yaxin Li, Jie Ren, Han Xu, Hui Liu |
| 2024 | Neural Textured Deformable Meshes for Robust Analysis-by-Synthesis. Angtian Wang, Wufei Ma, Alan L. Yuille, Adam Kortylewski |
| 2024 | OE-CTST: Outlier-Embedded Cross Temporal Scale Transformer for Weakly-supervised Video Anomaly Detection. Snehashis Majhi, Rui Dai, Quan Kong, Lorenzo Garattoni, Gianpiero Francesca, François Brémond |
| 2024 | OOD Aware Supervised Contrastive Learning. Soroush Seifi, Daniel Olmeda Reino, Nikolay Chumerin, Rahaf Aljundi |
| 2024 | OTAS: Unsupervised Boundary Detection for Object-Centric Temporal Action Segmentation. Yuerong Li, Zhengrong Xue, Huazhe Xu |
| 2024 | OVeNet: Offset Vector Network for Semantic Segmentation. Stamatis Alexandropoulos, Christos Sakaridis, Petros Maragos |
| 2024 | Object Aware Contrastive Prior for Interactive Image Segmentation. Praful Mathur, Shashi Kumar Parwani, Mrinmoy Sen, Roopa Sheshadri, Aman Sharma |
| 2024 | Object Re-Identification from Point Clouds. Benjamin Thérien, Chengjie Huang, Adrian Chow, Krzysztof Czarnecki |
| 2024 | Object-centric Video Representation for Long-term Action Anticipation. Ce Zhang, Changcheng Fu, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, Chen Sun |
| 2024 | Occlusion Sensitivity Analysis with Augmentation Subspace Perturbation in Deep Feature Space. Pedro H. V. Valois, Koichiro Niinuma, Kazuhiro Fukui |
| 2024 | Offline-to-Online Knowledge Distillation for Video Instance Segmentation. Hojin Kim, Seunghun Lee, Hyeon Kang, Sunghoon Im |
| 2024 | OmniVec: Learning robust representations with cross modal sharing. Siddharth Srivastava, Gaurav Sharma |
| 2024 | On Manipulating Scene Text in the Wild with Diffusion Models. Joshua Santoso, Christian Simon, Williem |
| 2024 | On the Fly Neural Style Smoothing for Risk-Averse Domain Generalization. Akshay Mehra, Yunbei Zhang, Bhavya Kailkhura, Jihun Hamm |
| 2024 | On the Importance of Large Objects in CNN Based Object Detection Algorithms. Ahmed Ben Saad, Gabriele Facciolo, Axel Davy |
| 2024 | On the Quantification of Image Reconstruction Uncertainty without Training Data. Jiaxin Zhang, Sirui Bi, Victor Fung |
| 2024 | One Style is All You Need to Generate a Video. Sandeep Manandhar, Auguste Genovesio |
| 2024 | Online Class-Incremental Learning For Real-World Food Image Classification. Siddeshwar Raghavan, Jiangpeng He, Fengqing Zhu |
| 2024 | Open-NeRF: Towards Open Vocabulary NeRF Decomposition. Hao Zhang, Fang Li, Narendra Ahuja |
| 2024 | Open-Set Object Detection By Aligning Known Class Representations. Hiran Sarkar, Vishal M. Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth N. Balasubramanian |
| 2024 | Opinion Unaware Image Quality Assessment via Adversarial Convolutional Variational Autoencoder. Ankit Shukla, Avinash Upadhyay, Swati Bhugra, Manoj Sharma |
| 2024 | OptFlow: Fast Optimization-based Scene Flow Estimation without Supervision. Rahul Ahuja, Chris L. Baker, Wilko Schwarting |
| 2024 | Optical Flow Domain Adaptation via Target Style Transfer. Jeongbeen Yoon, Sanghyun Kim, Suha Kwak, Minsu Cho |
| 2024 | Optimizing Long-Term Robot Tracking with Multi-Platform Sensor Fusion. Giuliano Albanese, Arka Mitra, Jan-Nico Zaech, Yupeng Zhao, Ajad Chhatkuli, Luc Van Gool |
| 2024 | Ordinal Classification with Distance Regularization for Robust Brain Age Prediction. Jay Shah, Md Mahfuzur Rahman Siddiquee, Yi Su, Teresa Wu, Baoxin Li |
| 2024 | Out-of-Distribution Detection with Logical Reasoning. Konstantin Kirchheim, Tim Gonschorek, Frank Ortmeier |
| 2024 | Overcoming Catastrophic Forgetting for Multi-Label Class-Incremental Learning. Xiang Song, Kuang Shu, Songlin Dong, Jie Cheng, Xing Wei, Yihong Gong |
| 2024 | P-Age: Pexels Dataset for Robust Spatio-Temporal Apparent Age Classification. Abid Ali, Ashish Marisetty, François Brémond |
| 2024 | P2D: Plug and Play Discriminator for accelerating GAN frameworks. Min Jin Chong, Krishna Kumar Singh, Yijun Li, Jingwan Lu, David A. Forsyth |
| 2024 | PAIR : Perception Aided Image Restoration for Natural Driving Conditions. Pranjay Shyam, Hyunjin Yoo |
| 2024 | PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks. Shiwei Ding, Lan Zhang, Miao Pan, Xiaoyong Yuan |
| 2024 | PDA-RWSR: Pixel-Wise Degradation Adaptive Real-World Super-Resolution. Andreas Aakerberg, Majed El Helou, Kamal Nasrollahi, Thomas B. Moeslund |
| 2024 | PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment. Amirhossein Dadashzadeh, Shuchao Duan, Alan L. Whone, Majid Mirmehdi |
| 2024 | PETIT-GAN: Physically Enhanced Thermal Image-Translating Generative Adversarial Network. Omri Berman, Navot Oz, David Mendlovic, Nir A. Sochen, Yafit Cohen, Iftach Klapp |
| 2024 | PGVT: Pose-Guided Video Transformer for Fine-Grained Action Recognition. Haosong Zhang, Mei Chee Leong, Liyuan Li, Weisi Lin |
| 2024 | PHG-Net: Persistent Homology Guided Medical Image Classification Yaopeng Peng, Hongxiao Wang, Milan Sonka, Danny Z. Chen |
| 2024 | PIDiffu: Pixel-aligned Diffusion Model for High-Fidelity Clothed Human Reconstruction. Jungeun Lee, Sanghun Kim, Hansol Lee, Tserendorj Adiya, Hwasup Lim |
| 2024 | PMI Sampler: Patch Similarity Guided Frame Selection For Aerial Action Recognition. Ruiqi Xian, Xijun Wang, Divya Kothandaraman, Dinesh Manocha |
| 2024 | PMVC: Promoting Multi-View Consistency for 3D Scene Reconstruction. Chushan Zhang, Jinguang Tong, Tao Jun Lin, Chuong Nguyen, Hongdong Li |
| 2024 | POISE: Pose Guided Human Silhouette Extraction under Occlusions. Arindam Dutta, Rohit Lal, Dripta S. Raychaudhuri, Calvin-Khang Ta, Amit K. Roy-Chowdhury |
| 2024 | POP-VQA - Privacy preserving, On-device, Personalized Visual Question Answering. Pragya Paramita Sahu, Abhishek Raut, Jagdish Singh Samant, Mahesh Gorijala, Vignesh Lakshminarayanan, Pinaki Bhaskar |
| 2024 | Painterly Image Harmonization via Adversarial Residual Learning. Xudong Wang, Li Niu, Junyan Cao, Yan Hong, Liqing Zhang |
| 2024 | Panelformer: Sewing Pattern Reconstruction from 2D Garment Images. Cheng-Hsiu Chen, Jheng-Wei Su, Min-Chun Hu, Chih-Yuan Yao, Hung-Kuo Chu |
| 2024 | Partial Binarization of Neural Networks for Budget-Aware Efficient Learning. Udbhav Bamba, Neeraj Anand, Saksham Aggarwal, Dilip K. Prasad, Deepak K. Gupta |
| 2024 | ParticleNeRF: A Particle-Based Encoding for Online Neural Radiance Fields. Jad Abou-Chakra, Feras Dayoub, Niko Sünderhauf |
| 2024 | Patch-based Selection and Refinement for Early Object Detection. Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-sun Seo, Yu Cao |
| 2024 | PatchRefineNet: Improving Binary Segmentation by Incorporating Signals from Optimal Patch-wise Binarization. Savinay Nagendra, Daniel Kifer |
| 2024 | PathLDM: Text conditioned Latent Diffusion Model for Histopathology. Srikar Yellapragada, Alexandros Graikos, Prateek Prasanna, Tahsin M. Kurç, Joel H. Saltz, Dimitris Samaras |
| 2024 | Permutation-Aware Activity Segmentation via Unsupervised Frame-to-Segment Alignment. Quoc-Huy Tran, Ahmed Mehmood, Muhammad Ahmed, Muhammad Naufil, Anas Zafar, Andrey Konin, M. Zeeshan Zia |
| 2024 | Personalized Face Inpainting with Diffusion Models by Parallel Visual Attention. Jianjin Xu, Saman Motamed, Praneetha Vaddamanu, Chen Henry Wu, Christian Häne, Jean-Charles Bazin, Fernando De la Torre |
| 2024 | PhISH-Net: Physics Inspired System for High Resolution Underwater Image Enhancement. Aditya Chandrasekar, Manogna Sreenivas, Soma Biswas |
| 2024 | Physical-space Multi-body Mesh Detection Achieved by Local Alignment and Global Dense Learning. Haoye Dong, Tiange Xiang, Sravan Chittupalli, Jun Liu, Dong Huang |
| 2024 | Pixel Matching Network for Cross-Domain Few-Shot Segmentation. Hao Chen, Yonghan Dong, Zheming Lu, Yunlong Yu, Jungong Han |
| 2024 | Pixel-Grounded Prototypical Part Networks. Zachariah Carmichael, Suhas Lohit, Anoop Cherian, Michael J. Jones, Walter J. Scheirer |
| 2024 | PlantPlotGAN: A Physics-Informed Generative Adversarial Network for Plant Disease Prediction. Felipe A. Lopes, Vasit Sagan, Flavio Esposito |
| 2024 | Plasticity-Optimized Complementary Networks for Unsupervised Continual Learning. Alex Gomez-Villa, Bartlomiej Twardowski, Kai Wang, Joost van de Weijer |
| 2024 | Point-DynRF: Point-based Dynamic Radiance Fields from a Monocular Video. Byeongjun Park, Changick Kim |
| 2024 | PointCT: Point Central Transformer Network for Weakly-supervised Point Cloud Semantic Segmentation. Anh-Thuan Tran, Hoanh-Su Le, Suk-Hwan Lee, Ki-Ryong Kwon |
| 2024 | Polarimetric PatchMatch Multi-View Stereo. Jinyu Zhao, Jumpei Oishi, Yusuke Monno, Masatoshi Okutomi |
| 2024 | PolyMaX: General Dense Prediction with Mask Transformer. Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen |
| 2024 | PoseDiff: Pose-conditioned Multimodal Diffusion Model for Unbounded Scene Synthesis from Sparse Inputs. Seoyoung Lee, Joonseok Lee |
| 2024 | PreciseDebias: An Automatic Prompt Engineering Approach for Generative AI to Mitigate Image Demographic Biases. Colton Clemmer, Junhua Ding, Yunhe Feng |
| 2024 | Preserving Image Properties Through Initializations in Diffusion Models. Jeffrey Zhang, Shao-Yu Chang, Kedan Li, David A. Forsyth |
| 2024 | PressureVision++: Estimating Fingertip Pressure from Diverse RGB Images. Patrick Grady, Jeremy A. Collins, Chengcheng Tang, Christopher D. Twigg, Kunal Aneja, James Hays, Charles C. Kemp |
| 2024 | PrivObfNet: A Weakly Supervised Semantic Segmentation Model for Data Protection. Chiat-Pin Tay, Vigneshwaran Subbaraju, Thivya Kandappu |
| 2024 | Privacy-Enhancing Person Re-identification Framework - A Dual-Stage Approach. Kajal Kansal, Yongkang Wong, Mohan S. Kankanhalli |
| 2024 | ProS: Facial Omni-Representation Learning via Prototype-based Self-Distillation. Xing Di, Yiyu Zheng, Xiaoming Liu, Yu Cheng |
| 2024 | ProcSim: Proxy-based Confidence for Robust Similarity Learning. Oriol Barbany, Xiaofan Lin, Muhammet Bastan, Arnab Dhua |
| 2024 | Progressive Hypothesis Transformer for 3D Human Mesh Recovery. Huang-Ru Liao, Jen-Chun Lin, Chun-Yi Lee |
| 2024 | PromptAD: Zero-shot Anomaly Detection using Text Prompts. Yiting Li, Adam David Goodge, Fayao Liu, Chuan-Sheng Foo |
| 2024 | Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation. Balamurali Murugesan, Rukhshanda Hussain, Rajarshi Bhattacharya, Ismail Ben Ayed, Jose Dolz |
| 2024 | PromptonomyViT: Multi-Task Prompt Learning Improves Video Transformers using Synthetic Scene Data. Roei Herzig, Ofir Abramovich, Elad Ben-Avraham, Assaf Arbelle, Leonid Karlinsky, Ariel Shamir, Trevor Darrell, Amir Globerson |
| 2024 | Prototype Learning for Explainable Brain Age Prediction. Linde S. Hesse, Nicola K. Dinsdale, Ana I. L. Namburete |
| 2024 | Prototypical Contrastive Network for Imbalanced Aerial Image Segmentation. Keiller Nogueira, Mayara Maezano Faita Pinheiro, Ana Paula Marques Ramos, Wesley Nunes Gonçalves, José Marcato Junior, Jefersson A. dos Santos |
| 2024 | ProxEdit: Improving Tuning-Free Real Image Editing with Proximal Guidance. Ligong Han, Song Wen, Qi Chen, Zhixing Zhang, Kunpeng Song, Mengwei Ren, Ruijiang Gao, Anastasis Stathopoulos, Xiaoxiao He, Yuxiao Chen, Di Liu, Qilong Zhangli, Jindong Jiang, Zhaoyang Xia, Akash Srivastava, Dimitris N. Metaxas |
| 2024 | Pruning from Scratch via Shared Pruning Module and Nuclear norm-based Regularization. Donghyeon Lee, Eunho Lee, Youngbae Hwang |
| 2024 | PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait. Adrian Cosma, Ion Emilian Radoi |
| 2024 | Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch. Aditay Tripathi, Anand Mishra, Anirban Chakraborty |
| 2024 | RADIO: Reference-Agnostic Dubbing Video Synthesis. Dongyeun Lee, Chaewon Kim, Sangjoon Yu, Jaejun Yoo, Gyeong-Moon Park |
| 2024 | REALM: Robust Entropy Adaptive Loss Minimization for Improved Single-Sample Test-Time Adaptation. Skyler Seto, Barry-John Theobald, Federico Danieli, Navdeep Jaitly, Dan Busbridge |
| 2024 | RGB-D Mapping and Tracking in a Plenoxel Radiance Field. Andreas Langeland Teigen, Yeonsoo Park, Annette Stahl, Rudolf Mester |
| 2024 | RGB-X Object Detection via Scene-Specific Fusion Modules. Sri Aditya Deevi, Connor Lee, Lu Gan, Sushruth Nagesh, Gaurav Pandey, Soon-Jo Chung |
| 2024 | RGBT-Dog: A Parametric Model and Pose Prior For Canine Body Analysis Data Creation. Jake Deane, Sinead Kearney, Kwang In Kim, Darren Cosker |
| 2024 | RIMeshGNN: A Rotation-Invariant Graph Neural Network for Mesh Classification. Bahareh Shakibajahromi, Edward Kim, David E. Breen |
| 2024 | RMFER: Semi-supervised Contrastive Learning for Facial Expression Recognition with Reaction Mashup Video. Yunseong Cho, Chanwoo Kim, Hoseong Cho, Yunhoe Ku, Eunseo Kim, Muhammadjon Boboev, Joonseok Lee, Seungryul Baek |
| 2024 | RPCANet: Deep Unfolding RPCA Based Infrared Small Target Detection. Fengyi Wu, Tianfang Zhang, Lei Li, Yian Huang, Zhenming Peng |
| 2024 | RS2G: Data-Driven Scene-Graph Extraction and Embedding for Robust Autonomous Perception and Scenario Understanding. Junyao Wang, Arnav Vaibhav Malawade, Junhong Zhou, Shih-Yuan Yu, Mohammad Abdullah Al Faruque |
| 2024 | RSMPNet: Relationship Guided Semantic Map Prediction. Jingwen Sun, Jing Wu, Ze Ji, Yu-Kun Lai |
| 2024 | Random Walks for Temporal Action Segmentation with Timestamp Supervision. Roy Hirsch, Regev Cohen, Tomer Golany, Daniel Freedman, Ehud Rivlin |
| 2024 | Randomized Adversarial Style Perturbations for Domain Generalization. Taehoon Kim, Bohyung Han |
| 2024 | Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning. Enna Sachdeva, Nakul Agarwal, Suhas Chundi, Sean Roelofs, Jiachen Li, Mykel J. Kochenderfer, Chiho Choi, Behzad Dariush |
| 2024 | RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training. Chen Feng, Duolikun Danier, Fan Zhang, David Bull |
| 2024 | Ray Deformation Networks for Novel View Synthesis of Refractive Objects. Weijian Deng, Dylan Campbell, Chunyi Sun, Shubham Kanitkar, Matthew E. Shaffer, Stephen Gould |
| 2024 | Re-Evaluating LiDAR Scene Flow. Nathaniel Chodosh, Deva Ramanan, Simon Lucey |
| 2024 | Re-VoxelDet: Rethinking Neck and Head Architectures for High-Performance Voxel-based 3D Detection. Jae-Keun Lee, Jin-Hee Lee, Joohyun Lee, Soon Kwon, Heechul Jung |
| 2024 | ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation. Xuefeng Hu, Ke Zhang, Lu Xia, Albert Chen, Jiajia Luo, Yuyin Sun, Ken Wang, Nan Qiao, Xiao Zeng, Min Sun, Cheng-Hao Kuo, Ram Nevatia |
| 2024 | ReConPatch : Contrastive Patch Representation Learning for Industrial Anomaly Detection. Jeeho Hyun, Sangyun Kim, Giyoung Jeon, Seung Hwan Kim, Kyunghoon Bae, Byung Jun Kang |
| 2024 | Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings. Sudheer Achary, Rohit Girmaji, Adhiraj Anil Deshmukh, Vineet Gandhi |
| 2024 | Real-Time Polyp Detection in Colonoscopy using Lightweight Transformer. Youngbeom Yoo, Jae Young Lee, Dong-Jae Lee, Jiwoon Jeon, Junmo Kim |
| 2024 | Real-Time User-guided Adaptive Colorization with Vision Transformer. Gwanghan Lee, Saebyeol Shin, Taeyoung Na, Simon S. Woo |
| 2024 | Real-Time Weakly Supervised Video Anomaly Detection. Hamza Karim, Keval Doshi, Yasin Yilmaz |
| 2024 | Real-time 6-DoF Pose Estimation by an Event-based Camera using Active LED Markers. Gerald Ebmer, Adam Loch, Minh Nhat Vu, Roberto Mecca, Germain Haessig, Christian Hartl-Nesic, Markus Vincze, Andreas Kugi |
| 2024 | Recognition of Unseen Bird Species by Learning from Field Guides. Andrés C. Rodríguez, Stefano D'Aronco, Rodrigo Caye Daudt, Jan D. Wegner, Konrad Schindler |
| 2024 | RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement. Gregor Köhler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jaeger, Jörg K. H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein |
| 2024 | Reducing the Side-Effects of Oscillations in Training of Quantized YOLO Networks. Kartik Gupta, Akshay Asthana |
| 2024 | Reference-based Restoration of Digitized Analog Videotapes. Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo |
| 2024 | Refine and Redistribute: Multi-Domain Fusion and Dynamic Label Assignment for Unbiased Scene Graph Generation. Yujie Zang, Yaochen Li, Yuan Gao, Yimou Guo, Wenneng Tang, Yanxue Li, Meklit Atlaw |
| 2024 | Registered and Segmented Deformable Object Reconstruction from a Single View Point Cloud. Pit Henrich, Balázs Gyenes, Paul Maria Scheikl, Gerhard Neumann, Franziska Mathis-Ullrich |
| 2024 | Removing the Quality Tax in Controllable Face Generation. Yiwen Huang, Zhiqiu Yu, Xinjie Yi, Yue Wang, James Tompkin |
| 2024 | Repetitive Action Counting with Motion Feature Learning. Xinjie Li, Huijuan Xu |
| 2024 | Residual Graph Convolutional Network for Bird's-Eye-View Semantic Segmentation. Qiuxiao Chen, Xiaojun Qi |
| 2024 | Restoring Degraded Old Films with Recursive Recurrent Transformer Networks. Shan Lin, Edgar Simo-Serra |
| 2024 | Rethink Cross-Modal Fusion in Weakly-Supervised Audio-Visual Video Parsing. Yating Xu, Conghui Hu, Gim Hee Lee |
| 2024 | Rethinking Knowledge Distillation with Raw Features for Semantic Segmentation. Tao Liu, Chenshu Chen, Xi Yang, Wenming Tan |
| 2024 | Rethinking Multimodal Content Moderation from an Asymmetric Angle with Mixed-modality. Jialin Yuan, Ye Yu, Gaurav Mittal, Matthew Hall, Sandra Sajeev, Mei Chen |
| 2024 | Rethinking Visibility in Human Pose Estimation: Occluded Pose Reasoning via Transformers. Pengzhan Sun, Kerui Gu, Yunsong Wang, Linlin Yang, Angela Yao |
| 2024 | Reverse Knowledge Distillation: Training a Large Model using a Small One for Retinal Image Matching on Limited Data. Sahar Almahfouz Nasser, Nihar Gupte, Amit Sethi |
| 2024 | Revisiting Latent Space of GAN Inversion for Robust Real Image Editing. Kai Katsumata, Duc Minh Vo, Bei Liu, Hideki Nakayama |
| 2024 | Revisiting Pixel-Level Contrastive Pre-Training on Scene Images. Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara |
| 2024 | Revisiting Token Pruning for Object Detection and Instance Segmentation. Yifei Liu, Mathias Gehrig, Nico Messikommer, Marco Cannici, Davide Scaramuzza |
| 2024 | Revolutionize the Oceanic Drone RGB Imagery with Pioneering Sun Glint Detection and Removal Techniques. Jiangying Qin, Ming Li, Jie Zhao, Jiageng Zhong, Hanqi Zhang |
| 2024 | Robust Category-Level 3D Pose Estimation from Diffusion-Enhanced Synthetic Data. Jiahao Yang, Wufei Ma, Angtian Wang, Xiaoding Yuan, Alan L. Yuille, Adam Kortylewski |
| 2024 | Robust Eye Blink Detection Using Dual Embedding Video Vision Transformer. Jeongmin Hong, Joseph Shin, Juhee Choi, Minsam Ko |
| 2024 | Robust Feature Learning and Global Variance-Driven Classifier Alignment for Long-Tail Class Incremental Learning. Jayateja Kalla, Soma Biswas |
| 2024 | Robust Learning via Conditional Prevalence Adjustment. Minh Nguyen, Alan Q. Wang, Heejong Kim, Mert R. Sabuncu |
| 2024 | Robust Object Detection in Challenging Weather Conditions. Himanshu Gupta, Oleksandr Kotlyar, Henrik Andreasson, Achim J. Lilienthal |
| 2024 | Robust Source-Free Domain Adaptation for Fundus Image Segmentation. Lingrui Li, Yanfeng Zhou, Ge Yang |
| 2024 | Robust TRISO-fueled Pebble Identification by Digit Recognition. Roshan Kenia, Jihane Mendil, Ahmed Jasim, Muthanna Al-Dahhan, Zhaozheng Yin |
| 2024 | Robust Unsupervised Domain Adaptation through Negative-View Regularization. Joonhyeok Jang, Sunhyeok Lee, Seonghak Kim, Jung-Un Kim, Seonghyun Kim, Daeshik Kim |
| 2024 | RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-centric Learning. Nathan Drenkow, Mathias Unberath |
| 2024 | Rotation-Constrained Cross-View Feature Fusion for Multi-View Appearance-based Gaze Estimation. Yoichiro Hisadome, Tianyi Wu, Jiawei Qin, Yusuke Sugano |
| 2024 | S Robert Johanson, Christian Wilms, Ole Johannsen, Simone Frintrop |
| 2024 | SAM Fewshot Finetuning for Anatomical Segmentation in Medical Images. Weiyi Xie, Nathalie Willems, Shubham Patil, Yang Li, Mayank Kumar |
| 2024 | SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers. Xiangyong Lu, Masanori Suganuma, Takayuki Okatani |
| 2024 | SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology. Dinkar Juyal, Siddhant Shingi, Syed Ashar Javed, Harshith Padigela, Chintan Shah, Anand Sampat, Archit Khosla, John Abel, Amaro Taylor-Weiner |
| 2024 | SCUNet++: Swin-UNet and CNN Bottleneck Hybrid Architecture with Multi-Fusion Dense Skip Connection for Pulmonary Embolism CT Image Segmentation. Yifei Chen, Binfeng Zou, Zhaoxin Guo, Yiyu Huang, Yifan Huang, Feiwei Qin, Qinhai Li, Changmiao Wang |
| 2024 | SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data. Ziyan Yang, Kushal Kafle, Zhe Lin, Scott Cohen, Zhihong Ding, Vicente Ordonez |
| 2024 | SDNet: An Extremely Efficient Portrait Matting Model via Self-Distillation. Ziwen Li, Bo Xu, Jiake Xie, Yong Tang, Cheng Lu |
| 2024 | SEMA: Semantic Attention for Capturing Long-Range Dependencies in Egocentric Lifelogs. Pravin Nagar, K. N. Ajay Shastry, Jayesh Chaudhari, Chetan Arora |
| 2024 | SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction. Sebastian Koch, Pedro Hermosilla, Narunas Vaskevicius, Mirco Colosi, Timo Ropinski |
| 2024 | SICKLE: A Multi-Sensor Satellite Imagery Dataset Annotated with Multiple Key Cropping Parameters. Depanshu Sani, Sandeep Mahato, Sourabh Saini, Harsh Kumar Agarwal, Charu Chandra Devshali, Saket Anand, Gaurav Arora, Thiagarajan Jayaraman |
| 2024 | SLoSH: Set Locality Sensitive Hashing via Sliced-Wasserstein Embeddings. Yuzhe Lu, Xinran Liu, Andrea Soltoggio, Soheil Kolouri |
| 2024 | SOAP: Cross-sensor Domain Adaptation for 3D Object Detection Using Stationary Object Aggregation Pseudo-labelling. Chengjie Huang, Vahdat Abdelzad, Sean Sedwards, Krzysztof Czarnecki |
| 2024 | SSP: Semi-signed prioritized neural fitting for surface reconstruction from unoriented point clouds. Runsong Zhu, Di Kang, Ka-Hei Hui, Yue Qian, Shi Qiu, Zhen Dong, Linchao Bao, Pheng-Ann Heng, Chi-Wing Fu |
| 2024 | SSVOD: Semi-Supervised Video Object Detection with Sparse Annotations. Tanvir Mahmud, Chun-Hao Liu, Burhaneddin Yaman, Diana Marculescu |
| 2024 | STEP - Towards Structured Scene-Text Spotting. Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol |
| 2024 | Salient Object Detection for Images Taken by People With Vision Impairments. Jarek Reynolds, Chandra Kanth Nagesh, Danna Gurari |
| 2024 | Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution. Zhewei Huang, Ailin Huang, Xiaotao Hu, Chen Hu, Jun Xu, Shuchang Zhou |
| 2024 | ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes. Ahmed Abdelreheem, Kyle Olszewski, Hsin-Ying Lee, Peter Wonka, Panos Achlioptas |
| 2024 | Scene Text Image Super-resolution based on Text-conditional Diffusion Models. Chihiro Noguchi, Shun Fukuda, Masao Yamanaka |
| 2024 | SciOL and MuLMS-Img: Introducing A Large-Scale Multimodal Scientific Dataset and Models for Image-Text Tasks in the Scientific Domain. Tim Tarsi, Heike Adel, Jan Hendrik Metzen, Dan Zhang, Matteo Finco, Annemarie Friedrich |
| 2024 | SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification. Lukás Adam, Vojtech Cermák, Kostas Papafitsoros, Lukás Picek |
| 2024 | Second-Order Graph ODEs for Multi-Agent Trajectory Forecasting. Song Wen, Hao Wang, Di Liu, Qilong Zhangli, Dimitris N. Metaxas |
| 2024 | Seeing Stars: Learned Star Localization for Narrow-Field Astrometry. Violet Felt, Justin Fletcher |
| 2024 | Segment anything, from space? Simiao Ren, Francesco Luzi, Saad Lahrichi, Kaleb Kassaw, Leslie M. Collins, Kyle Bradbury, Jordan M. Malof |
| 2024 | Self-Annotated 3D Geometric Learning for Smeared Points Removal. Miaowei Wang, Daniel D. Morris |
| 2024 | Self-Sampling Meta SAM: Enhancing Few-shot Medical Image Segmentation with Meta-Learning. Tianang Leng, Yiming Zhang, Kun Han, Xiaohui Xie |
| 2024 | Self-Supervised Denoising Transformer with Gaussian Process. Rajeev Yasarla, Jeya Maria Jose Valanarasu, Vishwanath S, Vishal M. Patel |
| 2024 | Self-Supervised Edge Detection Reconstruction for Topology-Informed 3D Axon Segmentation and Centerline Detection. Alec S. Xu, Nina I. Shamsi, Lars A. Gjesteby, Laura J. Brattain |
| 2024 | Self-Supervised Learning for Place Representation Generalization across Appearance Changes. Mohamed Adel Musallam, Vincent Gaudillière, Djamila Aouada |
| 2024 | Self-Supervised Learning for Visual Relationship Detection through Masked Bounding Box Reconstruction. Zacharias Anastasakis, Dimitrios Mallis, Markos Diomataris, George Alexandridis, Stefanos Kollias, Vassilis Pitsikalis |
| 2024 | Self-Supervised Learning with Masked Autoencoders for Teeth Segmentation from Intra-oral 3D Scans. Amani Almalki, Longin Jan Latecki |
| 2024 | Self-Supervised Relation Alignment for Scene Graph Generation. Bicheng Xu, Renjie Liao, Leonid Sigal |
| 2024 | Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features. Zheng Gao, Chen Feng, Ioannis Patras |
| 2024 | Self-supervised Learning of Semantic Correspondence Using Web Videos. Donghyeon Kwon, Minsu Cho, Suha Kwak |
| 2024 | SemST: Semantically Consistent Multi-Scale Image Translation via Structure-Texture Alignment. Ganning Zhao, Wenhui Cui, Suya You, C.-C. Jay Kuo |
| 2024 | Semantic Fusion Augmentation and Semantic Boundary Detection: A Novel Approach to Multi-Target Video Moment Retrieval. Cheng Huang, Yi-Lun Wu, Hong-Han Shuai, Ching-Chun Huang |
| 2024 | Semantic Generative Augmentations for Few-Shot Counting. Perla Doubinsky, Nicolas Audebert, Michel Crucianu, Hervé Le Borgne |
| 2024 | Semantic Labels-Aware Transformer Model for Searching over a Large Collection of Lecture-Slides. K. V. Jobin, Anand Mishra, C. V. Jawahar |
| 2024 | Semantic Transfer from Head to Tail: Enlarging Tail Margin for Long-Tailed Visual Recognition. Shan Zhang, Yao Ni, Jinhao Du, Yanxia Liu, Piotr Koniusz |
| 2024 | Semantic-aware Video Representation for Few-shot Action Recognition. Yutao Tang, Benjamín Béjar, René Vidal |
| 2024 | Semi-Supervised Scene Change Detection by Distillation from Feature-metric Alignment. Seonhoon Lee, Jong-Hwan Kim |
| 2024 | Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation. Md Awsafur Rahman, Shaikh Anowarul Fattah |
| 2024 | Separable Self and Mixed Attention Transformers for Efficient Object Tracking. Goutam Yelluru Gopal, Maria A. Amer |
| 2024 | SequenceMatch Revisiting the design of weak-strong augmentations for Semi-supervised learning. Khanh-Binh Nguyen |
| 2024 | Sequential Transformer for End-to-End Video Text Detection. Junbo Zhang, Mengbiao Zhao, Fei Yin, Cheng-Lin Liu |
| 2024 | ShARc: Shape and Appearance Recognition for Person Identification In-the-wild. Haidong Zhu, Wanrong Zheng, Zhaoheng Zheng, Ram Nevatia |
| 2024 | ShadowSense: Unsupervised Domain Adaptation and Feature Fusion for Shadow-Agnostic Tree Crown Detection from RGB-Thermal Drone Imagery. Rudraksh Kapil, Seyed Mojtaba Marvasti-Zadeh, Nadir Erbilgin, Nilanjan Ray |
| 2024 | Shape from Shading for Robotic Manipulation. Arkadeep Narayan Chaudhury, Leonid Keselman, Christopher G. Atkeson |
| 2024 | Shape-Guided Diffusion with Inside-Outside Attention. Dong Huk Park, Grace Luo, Clayton Toste, Samaneh Azadi, Xihui Liu, Maka Karalashvili, Anna Rohrbach, Trevor Darrell |
| 2024 | Shape-biased CNNs are Not Always Superior in Out-of-Distribution Robustness. Xinkuan Qiu, Meina Kan, Yongbin Zhou, Yanchao Bi, Shiguang Shan |
| 2024 | Sharp-NeRF: Grid-based Fast Deblurring Neural Radiance Fields using Sharpness Prior. Byeonghyeon Lee, Howoong Lee, Usman Ali, Eunbyung Park |
| 2024 | Show Your Face: Restoring Complete Facial Images from Partial Observations for VR Meeting. Zheng Chen, Zhiqi Zhang, Junsong Yuan, Yi Xu, Lantao Liu |
| 2024 | SigmML: Metric meta-learning for Writer Independent Offline Signature Verification in the Space of SPD Matrices. Alexios Giazitzis, Elias N. Zois |
| 2024 | Sign Language Production with Latent Motion Transformer. Pan Xie, Taiying Peng, Yao Du, Qipeng Zhang |
| 2024 | SimA: Simple Softmax-free Attention for Vision Transformers. Soroush Abbasi Koohpayegani, Hamed Pirsiavash |
| 2024 | Simple Post-Training Robustness using Test Time Augmentations and Random Forest. Gilad Cohen, Raja Giryes |
| 2024 | Simple Token-Level Confidence Improves Caption Correctness. Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach |
| 2024 | SimpliMix: A Simplified Manifold Mixup for Few-shot Point Cloud Classification. Minmin Yang, Weiheng Chai, Jiyang Wang, Senem Velipasalar |
| 2024 | Single Domain Generalization via Normalised Cross-correlation Based Convolutions. Weiqin Chuah, Ruwan B. Tennakoon, Reza Hoseinnezhad, David Suter, Alireza Bab-Hadiashar |
| 2024 | Single Frame Semantic Segmentation Using Multi-Modal Spherical Images. Suresh Guttikonda, Jason R. Rambach |
| 2024 | Single-Image Deblurring, Trajectory and Shape Recovery of Fast Moving Objects with Denoising Diffusion Probabilistic Models. Radim Spetlík, Denys Rozumnyi, Jirí Matas |
| 2024 | Sketch-based Video Object Localization. Sangmin Woo, So-Yeong Jeon, Jinyoung Park, Minji Son, Sumin Lee, Changick Kim |
| 2024 | Slice and Conquer: A Planar-to-3D Framework for Efficient Interactive Segmentation of Volumetric Images. Wonwoo Cho, Dongmin Choi, Hyesu Lim, Jinho Choi, Saemee Choi, Hyunseok Min, Sungbin Lim, Jaegul Choo |
| 2024 | Small Objects Matters in Weakly-supervised Semantic Segmentation. Cheolhyun Mun, Sanghuk Lee, Youngjung Uh, Junsuk Choe, Hyeran Byun |
| 2024 | So you think you can track? Derek Gloudemans, Gergely Zachár, Yanbing Wang, Junyi Ji, Matthew Nice, Matt Bunting, William Barbour, Jonathan Sprinkle, Benedetto Piccoli, Maria Laura Delle Monache, Alexandre M. Bayen, Benjamin Seibold, Daniel B. Work |
| 2024 | Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data. Kai Katsumata, Duc Minh Vo, Tatsuya Harada, Hideki Nakayama |
| 2024 | Solving the Plane-Sphere Ambiguity in Top-Down Structure-from-Motion. Lars Haalck, Benjamin Risse |
| 2024 | Sound3DVDet: 3D Sound Source Detection using Multiview Microphone Array and RGB Images. Yuhang He, Sangyun Shin, Anoop Cherian, Niki Trigoni, Andrew Markham |
| 2024 | Source-Guided Similarity Preservation for Online Person Re-Identification. Hamza Rami, Jhony H. Giraldo, Nicolas Winckler, Stéphane Lathuilière |
| 2024 | Sparse Convolutional Networks for Surface Reconstruction from Noisy Point Clouds. Tao Wang, Jing Wu, Ze Ji, Yu-Kun Lai |
| 2024 | Spatio-temporal Filter Analysis Improves 3D-CNN For Action Classification. Takumi Kobayashi, Jiaxing Ye |
| 2024 | SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective. Zipeng Xu, Songlong Xing, Enver Sangineto, Nicu Sebe |
| 2024 | Spectroformer: Multi-Domain Query Cascaded Transformer Network For Underwater Image Enhancement. Md Raqib Khan, Priyanka Mishra, Nancy Mehta, Shruti S. Phutke, Santosh Kumar Vipparthi, Sukumar Nandi, Subrahmanyam Murala |
| 2024 | Specular Object Reconstruction Behind Frosted Glass by Differentiable Rendering. Takafumi Iwaguchi, Hiroyuki Kubo, Hiroshi Kawasaki |
| 2024 | SphereCraft: A Dataset for Spherical Keypoint Detection, Matching and Camera Pose Estimation. Christiano Couto Gava, Yunmin Cho, Federico Raue, Sebastian Palacio, Alain Pagani, Andreas Dengel |
| 2024 | Spiking Denoising Diffusion Probabilistic Models. Jiahang Cao, Ziqing Wang, Hanzhong Guo, Hao Cheng, Qiang Zhang, Renjing Xu |
| 2024 | Spiking Neural Networks for Active Time-Resolved SPAD Imaging. Yang Lin, Edoardo Charbon |
| 2024 | Steering Prototypes with Prompt-tuning for Rehearsal-free Continual Learning. Zhuowei Li, Long Zhao, Zizhao Zhang, Han Zhang, Di Liu, Ting Liu, Dimitris N. Metaxas |
| 2024 | Stereo Conversion with Disparity-Aware Warping, Compositing and Inpainting. Lukas Mehl, Andrés Bruhn, Markus Gross, Christopher Schroers |
| 2024 | Stereo Matching in Time: 100+ FPS Video Stereo Matching for Extended Reality. Ziang Cheng, Jiayu Yang, Hongdong Li |
| 2024 | Stochastic Binary Network for Universal Domain Adaptation. Saurabh Kumar Jain, Sukhendu Das |
| 2024 | StreamMapNet: Streaming Mapping Network for Vectorized Online HD Map Construction. Tianyuan Yuan, Yicheng Liu, Yue Wang, Yilun Wang, Hang Zhao |
| 2024 | StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization. Shirsha Bose, Ankit Jha, Enrico Fini, Mainak Singha, Elisa Ricci, Biplab Banerjee |
| 2024 | StyleAvatar: Stylizing Animatable Head Avatars. Juan C. Pérez, Thu Nguyen-Phuoc, Chen Cao, Artsiom Sanakoyeu, Tomas Simon, Pablo Arbeláez, Bernard Ghanem, Ali K. Thabet, Albert Pumarola |
| 2024 | StyleGAN-Fusion: Diffusion Guided Domain Adaptation of Image Generators. Kunpeng Song, Ligong Han, Bingchen Liu, Dimitris N. Metaxas, Ahmed Elgammal |
| 2024 | StyleGenes: Discrete and Efficient Latent Distributions for GANs. Evangelos Ntavelis, Mohamad Shahbazi, Iason Kastanis, Martin Danelljan, Luc Van Gool |
| 2024 | SupeRVol: Super-Resolution Shape and Reflectance Estimation in Inverse Volume Rendering. Mohammed Brahimi, Bjoern Haefner, Tarun Yenamandra, Bastian Goldluecke, Daniel Cremers |
| 2024 | Synergizing Contrastive Learning and Optimal Transport for 3D Point Cloud Domain Adaptation. Siddharth Katageri, Arkadipta De, Chaitanya Devaguptapu, V. S. S. V. Prasad, Charu Sharma, Manohar Kaul |
| 2024 | SynergyNet: Bridging the Gap between Discrete and Continuous Representations for Precise Medical Image Segmentation. Vandan Gorade, Sparsh Mittal, Debesh Jha, Ulas Bagci |
| 2024 | SynthProv: Interpretable Framework for Profiling Identity Leakage. Jaisidh Singh, Harshil Bhatia, Mayank Vatsa, Richa Singh, Aparna Bharati |
| 2024 | SyntheWorld: A Large-Scale Synthetic Dataset for Land Cover Mapping and Building Change Detection. Jian Song, Hongruixuan Chen, Naoto Yokoya |
| 2024 | Synthesizing Anyone, Anywhere, in Any Pose. Håkon Hukkelås, Frank Lindseth |
| 2024 | Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models. Xichen Pan, Pengda Qin, Yuhong Li, Hui Xue, Wenhu Chen |
| 2024 | TAMPAR: Visual Tampering Detection for Parcel Logistics in Postal Supply Chains. Alexander Naumann, Felix Hertlein, Laura Dörr, Kai Furmans |
| 2024 | TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning. Shiyao Li, Xuefei Ning, Shanghang Zhang, Lidong Guo, Tianchen Zhao, Huazhong Yang, Yu Wang |
| 2024 | TEGLO: High Fidelity Canonical Texture Mapping from Single-View Images. Vishal Vinod, Tanmay Shah, Dmitry Lagun |
| 2024 | THInImg: Cross-modal Steganography for Presenting Talking Heads in Images. Lin Zhao, Hongxuan Li, Xuefei Ning, Xinru Jiang |
| 2024 | TIAM - A Metric for Evaluating Alignment in Text-to-Image Generation. Paul Grimal, Hervé Le Borgne, Olivier Ferret, Julien Tourille |
| 2024 | TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain. Shen Zheng, Changjie Lu, Srinivasa G. Narasimhan |
| 2024 | TSA Zeyu Xiao, Yurui Zhu, Xueyang Fu, Zhiwei Xiong |
| 2024 | TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding. Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao |
| 2024 | Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering. Xiulong Liu, Zhikang Dong, Peng Zhang |
| 2024 | Taming Normalizing Flows. Shimon Malnick, Shai Avidan, Ohad Fried |
| 2024 | Task-Oriented Human-Object Interactions Generation with Implicit Neural Representations. Quanzhou Li, Jingbo Wang, Chen Change Loy, Bo Dai |
| 2024 | Temporal Context Enhanced Referring Video Object Segmentation. Xiao Hu, Basavaraj Hampiholi, Heiko Neumann, Jochen Lang |
| 2024 | Temporally-Consistent Video Semantic Segmentation with Bidirectional Occlusion-guided Feature Propagation. Razieh Kaviani Baghbaderani, Yuanxin Li, Shuangquan Wang, Hairong Qi |
| 2024 | Text-Guided Face Recognition using Multi-Granularity Cross-Modal Contrastive Learning. Md Mahedi Hasan, Shoaib Meraj Sami, Nasser M. Nasrabadi |
| 2024 | Text-to-Image Models for Counterfactual Explanations: a Black-Box Approach. Guillaume Jeanneret, Loïc Simon, Frédéric Jurie |
| 2024 | Text-to-image Editing by Image Information Removal. Zhongping Zhang, Jian Zheng, Jacob Zhiyuan Fang, Bryan A. Plummer |
| 2024 | Textron: Weakly Supervised Multilingual Text Detection through Data Programming. Dhruv Kudale, Badri Vishal Kasuba, Venkatapathy Subramanian, Parag Chaudhuri, Ganesh Ramakrishnan |
| 2024 | Textual Alchemy: CoFormer for Scene Text Understanding. Gayatri Deshmukh, Onkar Susladkar, Dhruv Makwana, Sparsh Mittal, R. Sai Chandra Teja |
| 2024 | The Background Also Matters: Background-Aware Motion-Guided Objects Discovery. Sandra Kara, Hejer Ammar, Florian Chabot, Quoc-Cuong Pham |
| 2024 | The Growing Strawberries Dataset: Tracking Multiple Objects with Biological Development over an Extended Period. Junhan Wen, Camiel R. Verschoor, Chengming Feng, Irina-Mona Epure, Thomas Abeel, Mathijs de Weerdt |
| 2024 | The Paleographer's Eye ex machina: Using Computer Vision to Assist Humanists in Scribal Hand Identification. Samuel Grieggs, C. E. M. Henderson, Sebastian Sobecki, Alexandra Gillespie, Walter J. Scheirer |
| 2024 | Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering. Adam Ishay, Zhun Yang, Joohyung Lee, Ilgu Kang, Dongjae Lim |
| 2024 | Time to Shine: Fine-Tuning Object Detection Models with Synthetic Adverse Weather Images. Thomas Rothmeier, Werner Huber, Alois C. Knoll |
| 2024 | Token Fusion: Bridging the Gap between Token Pruning and Token Merging. Minchul Kim, Shangqian Gao, Yen-Chang Hsu, Yilin Shen, Hongxia Jin |
| 2024 | Top-Down Beats Bottom-Up in 3D Instance Segmentation. Maksim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich |
| 2024 | Torque based Structured Pruning for Deep Neural Network. Arshita Gupta, Tien Bau, Joonsoo Kim, Zhe Zhu, Sumit Jha, Hrishikesh Garud |
| 2024 | Toward Planet-Wide Traffic Camera Calibration. Khiem Vuong, Robert Tamburo, Srinivasa G. Narasimhan |
| 2024 | Towards Accurate Disease Segmentation in Plant Images: A Comprehensive Dataset Creation and Network Evaluation. Komuravelli Prashanth, Jaladi Sri Harsha, Sivapuram Arun Kumar, Jaladi Srilekha |
| 2024 | Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding. Joshua Feinglass, Yezhou Yang |
| 2024 | Towards Better Structured Pruning Saliency by Reorganizing Convolution. Xinglong Sun, Humphrey Shi |
| 2024 | Towards Diverse and Consistent Typography Generation. Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi |
| 2024 | Towards More Realistic Membership Inference Attacks on Large Diffusion Models. Jan Dubinski, Antoni Kowalczuk, Stanislaw Pawlak, Przemyslaw Rokita, Tomasz Trzcinski, Pawel Morawiecki |
| 2024 | Towards Realistic Generative 3D Face Models. Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando De la Torre |
| 2024 | Towards Visual Saliency Explanations of Face Verification. Yuhang Lu, Zewei Xu, Touradj Ebrahimi |
| 2024 | Towards a Dynamic Vision Sensor-based Insect Camera Trap. Eike Gebauer, Sebastian Thiele, Pierre Ouvrard, Adrien Sicard, Benjamin Risse |
| 2024 | Tracking Skiers from the Top to the Bottom. Matteo Dunnhofer, Luca Sordi, Niki Martinel, Christian Micheloni |
| 2024 | Tracking Tiny Insects in Cluttered Natural Environments using Refinable Recurrent Neural Networks. Lars Haalck, Sebastian Thiele, Benjamin Risse |
| 2024 | Training Ensembles with Inliers and Outliers for Semi-supervised Active Learning. Vladan Stojnic, Zakaria Laskar, Giorgos Tolias |
| 2024 | Training-Based Model Refinement and Representation Disagreement for Semi-Supervised Object Detection. Seyed Mojtaba Marvasti-Zadeh, Nilanjan Ray, Nadir Erbilgin |
| 2024 | Training-Free Layout Control with Cross-Attention Guidance. Minghao Chen, Iro Laina, Andrea Vedaldi |
| 2024 | Training-free Content Injection using h-space in Diffusion Models. Jaeseok Jeong, Mingi Kwon, Youngjung Uh |
| 2024 | Training-free Object Counting with Prompts. Zenglin Shi, Ying Sun, Mengmi Zhang |
| 2024 | TransFed: A way to epitomize Focal Modulation using Transformer-based Federated Learning. Tajamul Ashraf, Fuzayil Bin Afzal Mir, Iqra Altaf Gillani |
| 2024 | TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation. Yahia Dalbah, Jean Lahoud, Hisham Cholakkal |
| 2024 | TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval. Yue Ruan, Han-Hung Lee, Yiming Zhang, Ke Zhang, Angel X. Chang |
| 2024 | TriPlaneNet: An Encoder for EG3D Inversion. Ananta R. Bhattarai, Matthias Nießner, Artem Sevastopolsky |
| 2024 | Triplet Attention Transformer for Spatiotemporal Predictive Learning. Xuesong Nie, Xi Chen, Haoyuan Jin, Zhihang Zhu, Yunfeng Yan, Donglian Qi |
| 2024 | Tunable Hybrid Proposal Networks for the Open World. Matthew Inkawhich, Nathan Inkawhich, Hai Li, Yiran Chen |
| 2024 | U3DS Jiaxu Liu, Zhengdi Yu, Toby P. Breckon, Hubert P. H. Shum |
| 2024 | UGPNet: Universal Generative Prior for Image Restoration. Hwayoon Lee, Kyoungkook Kang, Hyeongmin Lee, Seung-Hwan Baek, Sunghyun Cho |
| 2024 | UNSPAT: Uncertainty-Guided SpatioTemporal Transformer for 3D Human Pose and Shape Estimation on Videos. Minsoo Lee, Hyunmin Lee, Bumsoo Kim, Seunghwan Kim |
| 2024 | UOW-Vessel: A Benchmark Dataset of High-Resolution Optical Satellite Images for Vessel Detection and Segmentation. Ly Bui, Son Lam Phung, Yang Di, Hoang Thanh Le, Tran Thanh Phong Nguyen, Sandy Burden, Abdesselam Bouzerdoum |
| 2024 | USDN: A Unified Sample-wise Dynamic Network with Mixed-Precision and Early-Exit. Ji-Ye Jeon, Xuan Truong Nguyen, Soojung Ryu, Hyuk-Jae Lee |
| 2024 | Uncertainty Estimation in Instance Segmentation with Star-convex Shapes. Qasim M. K. Siddiqui, Sebastian Starke, Peter Steinbach |
| 2024 | Uncertainty-weighted Loss Functions for Improved Adversarial Attacks on Semantic Segmentation. Kira Maag, Asja Fischer |
| 2024 | Understanding Dark Scenes by Contrasting Multi-Modal Observations. Xiaoyu Dong, Naoto Yokoya |
| 2024 | Understanding Hyperbolic Metric Learning through Hard Negative Sampling. Yun Yue, Fangzhou Lin, Guanyi Mou, Ziming Zhang |
| 2024 | Unified Concept Editing in Diffusion Models. Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna Materzynska, David Bau |
| 2024 | United We Stand, Divided We Fall: UnityGraph for Unsupervised Procedure Learning from Videos. Siddhant Bansal, Chetan Arora, C. V. Jawahar |
| 2024 | Universal Semi-supervised Model Adaptation via Collaborative Consistency Training. Zizheng Yan, Yushuang Wu, Yipeng Qin, Xiaoguang Han, Shuguang Cui, Guanbin Li |
| 2024 | Universal Test-time Adaptation through Weight Ensembling, Diversity Weighting, and Prior Correction. Robert A. Marsden, Mario Döbler, Bin Yang |
| 2024 | Unsupervised 3D Pose Estimation with Non-Rigid Structure-from-Motion Modeling. Haorui Ji, Hui Deng, Yuchao Dai, Hongdong Li |
| 2024 | Unsupervised Co-generation of Foreground-Background Segmentation from Text-to-Image Synthesis. Yeruru Asrar Ahmed, Anurag Mittal |
| 2024 | Unsupervised Domain Adaptation for Semantic Segmentation with Pseudo Label Self-Refinement. Xingchen Zhao, Niluthpol Chowdhury Mithun, Abhinav Rajvanshi, Han-Pang Chiu, Supun Samarasekera |
| 2024 | Unsupervised Domain Adaptation of MRI Skull-stripping Trained on Adult Data to Newborns. Abbas Omidi, Aida Mohammadshahi, Neha Gianchandani, Regan King, Lara Leijser, Roberto Souza |
| 2024 | Unsupervised Event-Based Video Reconstruction. Gereon Fox, Xingang Pan, Ayush Tewari, Mohamed A. Elgharib, Christian Theobalt |
| 2024 | Unsupervised Exemplar-Based Image-to-Image Translation and Cascaded Vision Transformers for Tagged and Untagged Cardiac Cine MRI Registration. Meng Ye, Mikael Kanski, Dong Yang, Leon Axel, Dimitris N. Metaxas |
| 2024 | Unsupervised Graphic Layout Grouping with Transformers. Jialiang Zhu, Danqing Huang, Chunyu Wang, Mingxi Cheng, Ji Li, Han Hu, Xin Geng, Baining Guo |
| 2024 | Unsupervised Model-based Learning for Simultaneous Video Deflickering and Deblotching. Anuj Fulari, Satish Mulleti, Ajit Rajwade |
| 2024 | Unsupervised and semi-supervised co-salient object detection via segmentation frequency statistics. Souradeep Chakraborty, Shujon Naha, Muhammet Bastan, Amit Kumar K. C, Dimitris Samaras |
| 2024 | Using Early Readouts to Mediate Featural Bias in Distillation. Rishabh Tiwari, Durga Sivasubramanian, Anmol Reddy Mekala, Ganesh Ramakrishnan, Pradeep Shenoy |
| 2024 | VCISR: Blind Single Image Super-Resolution with Video Compression Synthetic Data. Boyang Wang, Bowen Liu, Shiyu Liu, Fengyu Yang |
| 2024 | VD-GR: Boosting Visual Dialog with Cascaded Spatial-Temporal Multi-Modal GRaphs. Adnen Abdessaied, Lei Shi, Andreas Bulling |
| 2024 | VEATIC: Video-based Emotion and Affect Tracking in Context Dataset. Zhihang Ren, Jefferson Ortega, Yifan Wang, Zhimin Chen, Yunhui Guo, Stella X. Yu, David Whitney |
| 2024 | VMFormer: End-to-End Video Matting with Transformer. Jiachen Li, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Yunchao Wei, Humphrey Shi |
| 2024 | Video Instance Matting. Jiachen Li, Roberto Henschel, Vidit Goel, Marianna Ohanyan, Shant Navasardyan, Humphrey Shi |
| 2024 | Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation. Inkyu Shin, Dahun Kim, Qihang Yu, Jun Xie, Hong-Seok Kim, Bradley Green, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen |
| 2024 | VideoFACT: Detecting Video Forgeries Using Attention, Scene Context, and Forensic Traces. Tai D. Nguyen, Shengbang Fang, Matthew C. Stamm |
| 2024 | Vikriti-ID: A Novel Approach For Real Looking Fingerprint Data-set Generation. Rishabh Shukla, Aditya Sinha, Vansh Singh, Harkeerat Kaur |
| 2024 | Vision Transformer for Multispectral Satellite Imagery: Advancing Landcover Classification Ryan Rad |
| 2024 | Visual Narratives: Large-scale Hierarchical Classification of Art-historical Images. Matthias Springstein, Stefanie Schneider, Javad Rahnama, Julian Stalter, Maximilian Kristen, Eric Müller-Budack, Ralph Ewerth |
| 2024 | Visually Guided Audio Source Separation with Meta Consistency Learning. Md. Amirul Islam, Seyed Shahabeddin Nabavi, Irina Kezele, Yang Wang, Yuanhao Yu, Jin Tang |
| 2024 | Volumetric Disentanglement for 3D Scene Manipulation. Sagie Benaim, Frederik Warburg, Peter Ebert Christensen, Serge J. Belongie |
| 2024 | WATCH: Wide-Area Terrestrial Change Hypercube. Connor Greenwell, Jon Crall, Matthew Purri, Kristin J. Dana, Nathan Jacobs, Armin Hadzic, Scott Workman, Matthew J. Leotta |
| 2024 | Wakening Past Concepts without Past Data: Class-Incremental Learning from Online Placebos. Yaoyao Liu, Yingying Li, Bernt Schiele, Qianru Sun |
| 2024 | WalkFormer: Point Cloud Completion via Guided Walks. Mohang Zhang, Yushi Li, Rong Chen, Yushan Pan, Jia Wang, Yunzhe Wang, Rong Xiang |
| 2024 | Watch Where You Head: A View-biased Domain Gap in Gait Recognition and Unsupervised Adaptation. Gavriel Habib, Noa Barzilay, Or Shimshi, Rami Ben-Ari, Nir Darshan |
| 2024 | WaveMixSR: Resource-efficient Neural Network for Image Super-resolution. Pranav Jeevan, Akella Srinidhi, Pasunuri Prathiba, Amit Sethi |
| 2024 | Weakly-Supervised Representation Learning for Video Alignment and Analysis. Guy Bar-Shalom, George Leifman, Michael Elad |
| 2024 | Weakly-supervised deepfake localization in diffusion-generated images. Dragos-Constantin Tântaru, Elisabeta Oneata, Dan Oneata |
| 2024 | What Decreases Editing Capability? Domain-Specific Hybrid Refinement for Improved GAN Inversion. Pu Cao, Lu Yang, Dongxv Liu, Xiaoya Yang, Tianrui Huang, Qing Song |
| 2024 | What's Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU. Maximilian Bernhard, Roberto Amoroso, Yannic Kindermann, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp, Matthias Schubert |
| 2024 | What's in the Flow? Exploiting Temporal Motion Cues for Unsupervised Generic Event Boundary Detection. Sourabh Vasant Gothe, Vibhav Agarwal, Sourav Ghosh, Jayesh Rajkumar Vachhani, Pranay Kashyap, Barath Raj Kandur Raja |
| 2024 | When 3D Bounding-Box Meets SAM: Point Cloud Instance Segmentation with Weak-and-Noisy Supervision. Qingtao Yu, Heming Du, Chen Liu, Xin Yu |
| 2024 | WildlifeDatasets: An open-source toolkit for animal re-identification. Vojtech Cermák, Lukás Picek, Lukás Adam, Kostas Papafitsoros |
| 2024 | Wino Vidi Vici: Conquering Numerical Instability of 8-bit Winograd Convolution for Accurate Inference Acceleration on Edge. Pierpaolo Morì, Lukas Frickenstein, Shambhavi Balamuthu Sampath, Moritz Thoma, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Christian Unger, Walter Stechele, Daniel Mueller-Gritschneder, Claudio Passerone |
| 2024 | You Can Run but not Hide: Improving Gait Recognition with Intrinsic Occlusion Type Awareness. Ayush Gupta, Rama Chellappa |
| 2024 | ZEETAD: Adapting Pretrained Vision-Language Model for Zero-Shot End-to-End Temporal Action Detection. Thinh Phan, Khoa Vo, Duy Le, Gianfranco Doretto, Donald A. Adjeroh, Ngan Le |
| 2024 | ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields. Kanghyeok Ko, Minhyeok Lee |
| 2024 | ZRG: A Dataset for Multimodal 3D Residential Rooftop Understanding. Isaac Corley, Jonathan Lwowski, Peyman Najafirad |
| 2024 | Zero-Shot Video Moment Retrieval from Frozen Vision-Language Models. Dezhao Luo, Jiabo Huang, Shaogang Gong, Hailin Jin, Yang Liu |
| 2024 | Zero-shot Building Attribute Extraction from Large-Scale Vision and Language Models. Fei Pan, Sangryul Jeon, Brian Wang, Frank McKenna, Stella X. Yu |
| 2024 | dacl10k: Benchmark for Semantic Bridge Damage Segmentation. Johannes Flotzinger, Philipp Jonas Rösch, Thomas Braml |
| 2024 | iBARLE: imBalance-Aware Room Layout Estimation. Taotao Jing, Lichen Wang, Naji Khosravan, Zhiqiang Wan, Zachary Bessinger, Zhengming Ding, Sing Bing Kang |
| 2024 | pSTarC: Pseudo Source Guided Target Clustering for Fully Test-Time Adaptation. Manogna Sreenivas, Goirik Chakrabarty, Soma Biswas |