BMVC A

264 papers

Year Title / Authors
2024 35th British Machine Vision Conference, BMVC 2024, Glasgow, UK, November 25-28, 2024
2024 3D Blur Kernel on Gaussian Splatting.
Yongchao Lin, Xiangdong Su, Yuhan Yang
2024 3D Point Cloud Network Pruning: When Some Weights Do not Matter.
Amrijit Biswas, Md. Ismail Hossain, Mirza M. Lutfe Elahi, Ali Cheraghian, Fuad Rahman, Nabeel Mohammed, Shafin Rahman
2024 A Deep Belief Network Approach to Scalable Compression of Light Field Data for Auto-Stereoscopic Displays.
Sally Khaidem, Mansi Sharma
2024 A Learnable Color Correction Matrix for RAW Reconstruction.
Anqi Liu, Shiyi Mu, Shugong Xu
2024 A Multimodal Network on Handwritten Chinese Character Error Correction.
Haizhao Sun, Yu Ning, Xu Ji, Chuang Zhang, Ming Wu
2024 A Novel Divide and Merge Approach for Improved Classification of Functional Data.
Wei Zhao, Xiao-Jun Zeng, Chengdong Shi, Ching-Hsun Tseng, Yue Chang
2024 A Prototype Unit for Image De-raining using Time-Lapse Data.
Jaehoon Cho, Minjung Yoo, Jini Yang, Sunok Kim
2024 A Revisit to the Decoder for Camouflaged Object Detection.
Seung Woo Ko, Joopyo Hong, Suyoung Kim, Seungjai Bang, Sungzoon Cho, Nojun Kwak, Hyung-Sin Kim, Joonseok Lee
2024 A Super-pixel-based Approach to the Stable Interpretation of Neural Networks.
Shizhan Gong, Jingwei Zhang, Qi Dou, Farzan Farnia
2024 A self-supervised and adversarial approach to hyperspectral demosaicking and RGB reconstruction in surgical imaging.
Peichao Li, Oscar MacCormac, Jonathan Shapey, Tom Vercauteren
2024 A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstruction.
Dragos Costea, Alina Marcu, Marius Leordeanu
2024 ACIL: Active Class Incremental Learning for Image Classification.
Aditya R. Bhattacharya, Debanjan Goswami, Shayok Chakraborty
2024 AISE: Adaptive Input Sampling for Explanation of Black-box Models.
Evgeny Tsykunov, Wonju Lee, Minje Park
2024 APTPose: Anatomy-aware Pre-Training for 3D Human Pose Estimation.
Qing-Wen Yang, Kai-Wen Duan, Ting-Yi Lu, Kevin Lin, Cheng-Yen Yang, Lijuan Wang, Jenq-Neng Hwang, Shang-Hong Lai
2024 AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation.
Damian Sójka, Bartlomiej Twardowski, Tomasz Trzcinski, Sebastian Cygert
2024 ATLANTIS: A Framework for Automated Targeted Language-guided Augmentation Training for Robust Image Search.
Inderjeet Singh, Roman Vainshtein, Alon Zolfi, Asaf Shabtai, Tu Bui, Jonathan Brokman, Omer Hofman, Fumiyoshi Kasahara, Kentaro Tsuji, Hisashi Kojima
2024 AUPIMO: Redefining Anomaly Localization Benchmarks with High Speed and Low Tolerance.
João P. C. Bertoldo, Dick Ameln, Ashwin Vaidya, Samet Akcay
2024 Acoustic-based 3D Human Pose Estimation Robust to Human Position.
Yusuke Oumi, Yuto Shibata, Go Irie, Akisato Kimura, Yoshimitsu Aoki, Mariko Isogawa
2024 Adapting MIMO video restoration networks to low latency constraints.
Valéry Dewil, Zhe Zheng, Arnaud Barral, Lara Raad, Nao Nicolas, Ioannis Cassagne, Jean-Michel Morel, Gabriele Facciolo, Bruno Galerne, Pablo Arias
2024 Adaptive Weighted Co-Learning for Cross-Domain Few-Shot Learning.
Abdullah Alchihabi, Marzi Heidari, Yuhong Guo
2024 Advancing Anomaly Detection: The IDW dataset and MC algorithm.
Alexander D. J. Taylor, Jonathan James Morrison, Phillip Tregidgo, Neill D. F. Campbell
2024 Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer.
Sungmin Kang, Jaeha Song, Jihie Kim
2024 AggSS: An Aggregated Self-Supervised Approach for Class Incremental Learning.
Jayateja Kalla, Soma Biswas
2024 Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss.
Zhi Cai, Songtao Liu, Guodong Wang, Zeming Li, Zheng Ge, Xiangyu Zhang, Di Huang
2024 Alignment-aware Patch-level Routing for Dynamic Video Frame Interpolation.
Ban Chen, Xin Jin, Longhai Wu, Jie Chen, Ilhyun Cho, Cheul-Hee Hahm
2024 Anchor-Based Masked Generative Distillation for Pixel-Level Prediction Tasks.
Xie Yu, Wentao Zhang
2024 Annotation by Clicks: A Point-Supervised Contrastive Variance Method for Medical Semantic Segmentation.
Qing En, Yuhong Guo
2024 Anomaly Detection Based on Semi-Formula Driven Pre-training Dataset to Represent Subtle Difference and Anomaly Score.
Hiroki Kobayashi, Naoki Murakami, Naoto Hiramatsu, Takahiro Suzuki, Manabu Hashimoto
2024 Are Sparse Neural Networks Better Hard Sample Learners?
Qiao Xiao, Boqian Wu, Lu Yin, Christopher Neil Gadzinski, Tianjin Huang, Mykola Pechenizkiy, Decebal Constantin Mocanu
2024 As Firm As Their Foundations: Creating Transferable Adversarial Examples Across Downstream Tasks with CLIP.
Anjun Hu, Jindong Gu, Francesco Pinto, Konstantinos Kamnitsas, Philip Torr
2024 AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field.
Rong Liu, Rui Xu, Yue Hu, Meida Chen, Andrew Feng
2024 AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains.
Krzysztof Baron-Lis, Matthias Rottmann, Annika Mütze, Sina Honari, Pascal Fua, Mathieu Salzmann
2024 AutoDOM: Automated Dimension Overlay for Enhanced Measurement-Guidance.
Pushpendu Ghosh, Aniket Joshi, Soumyajit Chowdhury, Promod Yenigalla
2024 Backdoor Defense through Self-Supervised and Generative Learning.
Ivan Sabolic, Ivan Grubisic, Sinisa Segvic
2024 Balancing Calibration and Performance: Stochastic Depth in Segmentation BNNs.
Linghong Yao, Denis Hadjivelichkov, Andromachi Maria Delfaki, Yuanchang Liu, Brooks Paige, Dimitrios Kanoulas
2024 BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation.
Kieran Saunders, Luis J. Manso, George Vogiatzis
2024 Benchmarking and Optimizing Federated Learning with Hardware-related Metrics.
Kai Pan, Yapeng Tian, Yinhe Han, Yiming Gan
2024 Beyond Face Matching: A Facial Traits based Privacy Score for Synthetic Face Datasets.
Robero Leyva, Praveen Selvaraj, Andrew Elliott, Gregory Epiphaniou, Carsten Maple
2024 Beyond Static and Dynamic Quantization - Hybrid Quantization of Vision Transformers.
Piotr Kluska, Florian Scheidegger, A. Cristiano I. Malossi, Enrique S. Quintana-Ortí
2024 Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models.
Bin Fu, Qiyang Wan, Jialin Li, Ruiping Wang, Xilin Chen
2024 Boundary Contrastive Learning for Label-Efficient Medical Image Segmentation.
Satoshi Kamiya, Kota Yamashita, Kazuhiro Hotta
2024 Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning.
Hoàng-Ân Lê, Paul Berg, Minh-Tan Pham
2024 Budget-aware Dynamic Spatially Adaptive Inference.
Georgios Zampokas, Christos-Savvas Bouganis, Dimitrios Tzovaras
2024 CLIP Adaptation by Intra-Modal Overlap Reduction.
Alexey Kravets, Vinay P. Namboodiri
2024 CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning.
Emanuele Frascaroli, Aniello Panariello, Pietro Buzzega, Lorenzo Bonicelli, Angelo Porrello, Simone Calderara
2024 COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation.
Munish Monga, Sachin Kumar Giroh, Ankit Jha, Mainak Singha, Biplab Banerjee, Jocelyn Chanussot
2024 CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement.
Yijie Li, Hewei Wang, Aggelos K. Katsaggelos
2024 CSAD: Unsupervised Component Segmentation for Logical Anomaly Detection.
Yu Hsuan Hsieh, Shang-Hong Lai
2024 CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation.
Jianyu Zhao, Wei Quan, Bogdan J. Matuszewski
2024 Calibration of 2D LiDAR sensors using cylindrical target.
Tamás Tófalvi, Bandó Kovács, Levente Hajder
2024 Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution.
Dinh Phu Tran, Dao Duy Hung, Daeyoung Kim
2024 Complete the Feature Space: Diffusion-Based Fictional ID Generation for Face Recognition.
Myeong-Yeon Yi, Dongjae Lee, Naeun Ko, Yonghyun Jeong, Sang-goo Lee, Seunggyu Chang
2024 Content and Style Aware Audio-Driven Facial Animation.
Qingju Liu, Hyeongwoo Kim, Gaurav Bharaj
2024 ControlDreamer: Blending Geometry and Style in Text-to-3D.
Yeongtak Oh, Jooyoung Choi, Yongsung Kim, Minjun Park, Chaehun Shin, Sungroh Yoon
2024 ControlEdit: A MultiModal Local Clothing Image Editing Method.
Di Cheng, Yingjie Shi, ShiXin Sun, JiaFu Zhang, WeiJing Wang, Yu Liu
2024 CosFairNet: A Parameter-Space based Approach for Bias Free Learning.
Rajeev Ranjan Dwivedi, Priyadarshini Kumari, Vinod K. Kurmi
2024 Cost-Sensitive Learning for Long-Tailed Temporal Action Segmentation.
Zhanzhong Pang, Fadime Sener, Shrinivas Ramasubramanian, Angela Yao
2024 DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference.
Ahmet Serdar Karadeniz, Dimitrios Mallis, Nesryne Mejri, Kseniya Cherenkova, Anis Kacem, Djamila Aouada
2024 DRAFT: Direct Radiance Fields Editing with Composable Operations.
Zhihan Cai, Kailu Wu, Dapeng Cao, Feng Chen, Kaisheng Ma
2024 Decoupling Forgery Semantics for Generalizable Deepfake Detection.
Wei Ye, Xinan He, Feng Ding
2024 Deep Learning for GPS-Denied SAR Image Focusing and Vehicle Trajectory Estimation.
Christopher Beam, Andrew R. Willis, Kevin M. Brink
2024 Deep Unfolding Network with Spatial-spectral Perception Enhanced for Pan-sharpening.
Mengjiao Zhao, Mengting Ma, Xiangdong Li, Ao Gao, Siyang Song, Wei Zhang
2024 Depth-Guided Privacy-Preserving Visual Localization Using 3D Sphere Clouds.
HeeJoon Moon, Jongwoo Lee, Jeong-Gon Kim, Je Hyeong Hong
2024 Detecting Audio-Visual Deepfakes with Fine-Grained Inconsistencies.
Marcella Astrid, Enjie Ghorbel, Djamila Aouada
2024 Difflare: Removing Image Lens Flare with Latent Diffusion Models.
Tianwen Zhou, Qihao Duan, Zitong Yu
2024 DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation.
Raquel Vidaurre, Elena Garces, Dan Casas
2024 Direct-Sum Approach to Integrate Losses Via Classifier Subspace.
Takumi Kobayashi
2024 DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial Learning.
Dino Ienco, Cássio Fraga Dantas
2024 Discovering an Image-Adaptive Coordinate System for Photography Processing.
Ziteng Cui, Lin Gu, Tatsuya Harada
2024 Disparity Estimation Using a Quad-Pixel Sensor.
Zhuofeng Wu, Doehyung Lee, Zihua Liu, Kazunori Yoshizaki, Yusuke Monno, Masatoshi Okutomi
2024 Distribution-Aware Calibration for Object Detection with Noisy Bounding Boxes.
Donghao Zhou, Jialin Li, Jinpeng Li, Jiancheng Huang, Qiang Nie, Yong Liu, Bin-Bin Gao, Qiong Wang, Pheng-Ann Heng, Guangyong Chen
2024 Drawing Insights: Sequential Representation Learning in Comics.
Sam Titarsolej, Neil Cohn, Nanne van Noord
2024 Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty.
Saining Zhang, Baijun Ye, Xiaoxue Chen, Yuantao Chen, Zongzheng Zhang, Cheng Peng, Yongliang Shi, Hao Zhao
2024 D³Nav: Data-Driven Driving Agents for Autonomous Vehicles in Unstructured Traffic.
Aditya Nalgunda Ganesh, Gowri Srinivasa
2024 EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse Principles.
Zicheng Pan, Xiaohan Yu, Yongsheng Gao
2024 Effective Message Hiding with Order-Preserving Mechanisms.
Yu Gao, Xuchong Qiu, Zihan Ye
2024 Efficiency-preserving Scene-adaptive Object Detection.
Zekun Zhang, Vu Quang Truong, Minh Hoai
2024 Efficient Data Source Relevance Quantification for Multi-Source Neural Networks.
Jakob Gawlikowski, Nina Maria Gottschling
2024 Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis.
Theodoros Kouzelis, Emmanouil Plitsis, Mihalis Nicolaou, Yannis Panagakis
2024 Enhancing 3D Hand Pose Estimation via Dense Ordinal Regression Network.
Yamin Mao, Zhihua Liu, Weiming Li, SoonYong Cho, Qiang Wang, Xiaoshuai Hao
2024 Enhancing Cardiovascular Disease Prediction through Multi-Modal Self-Supervised Learning.
Francesco Girlanda, Olga V. Demler, Bjoern H. Menze, Neda Davoudi
2024 Enhancing Radiology Report Generation: The Impact of Locally Grounded Vision and Language Training.
Sergio Sánchez Santiesteban, Muhammad Awais, Yi-Zhe Song, Josef Kittler
2024 Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning.
Masane Fuchi, Tomohiro Takagi
2024 Examining the Threat Landscape: Foundation Models and Model Stealing.
Ankita Raj, Deepankar Varma, Chetan Arora
2024 Explaining Multi-modal Large Language Models by Analyzing their Vision Perception.
Loris Giulivi, Giacomo Boracchi
2024 Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes.
Dmitry Demidov, Abduragim Shtanchaev, Mihail Mihaylov, Mohammad Almansoori
2024 FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model.
Yuanwei Li, Elizaveta Ivanova, Martins Bruveris
2024 FFR-UNet: Feature Filter-Refinement UNet for Medical Image Segmentation.
Weixin Xu
2024 FILS: Self-Supervised Video Feature Prediction In Semantic Language Space.
Mona Ahmadian, Frank Guerin, Andrew Gilbert
2024 FLARE up your data: Diffusion-based Augmentation Method in Astronomical Imaging.
Mohammed Talha Alam, Raza Imam, Mohsen Guizani, Fakhri Karray
2024 FastForensics: Efficient Two-Stream Design for Real-Time Image Manipulation Detection.
Yangxiang Zhang, Yuezun Li, Ao Luo, Jiaran Zhou, Junyu Dong
2024 Feature Splatting for Better Novel View Synthesis with Low Overlap.
Tomás Berriel Martins, Javier Civera
2024 Federated Learning for Face Recognition via Intra-subject Self-supervised Learning.
Hansol Kim, Hoyeol Choi, Youngjun Kwak
2024 Few-Shot Classification of Interactive Activities of Daily Living (InteractADL).
Zane Durante, Robathan Harries, Edward Vendrow, Zelun Luo, Yuta Kyuragi, Kazuki Kozuka, Li Fei-Fei, Ehsan Adeli
2024 Few-shot Multispectral Segmentation with Representations Generated by Reinforcement Learning.
Dilith Jayakody, Thanuja D. Ambegoda
2024 Flexible Graph Convolutional Network for 3D Human Pose Estimation.
Abu Taib Mohammed Shahjahan, Abdessamad Ben Hamza
2024 Frequency Decomposition to Tap the Potential of Single Domain for Generalization.
Hongjing Niu, Qingyue Yang, Pengfei Xia, Wei Zhang, Bin Li, Feng Zhao
2024 From Black-box to Label-only: a Plug-and-Play Attack Network for Model Inversion.
Huan Bao, Kaimin Wei, Yao Chen, Hanting Hou, Jinpeng Chen, Yongdong Wu
2024 Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences.
Rui Yu, Runkai Zhao, Cong Nie, Heng Wang, Siyu Li, Songhao Zhu
2024 G3FA: Geometry-guided GAN for Face Animation.
Alireza Javanmardi, Alain Pagani, Didier Stricker
2024 GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP Adaptation.
Shuo Wang, Xieenlong, Jinda Lu, Jinghan Li, Yanbin Hao
2024 GLPI: A Global Layered Prompt Integration approach for Explicit Visual Prompt.
Yufei Gao, Bin Fu, Lei Shi, Chengming Liu, Yucheng Shi
2024 GN-FR: Generalizable Neural Radiance Fields for Flare Removal.
Gopi Raju Matta, Rahul Siddartha, Rongali Simhachala Venkata Girish, Sumit Sharma, Kaushik Mitra
2024 Gaussian Splatting in Mirrors: Reflection-aware Rendering via Virtual Camera Optimization.
Zihan Wang, Shuzhe Wang, Matias Turkulainen, Junyuan Fang, Juho Kannala
2024 GazeHELL: Gaze Estimation with Hybrid Encoders and Localised Losses with weighing.
Shubham Dokania, Vasudev Singh, Shuaib Ahmed
2024 Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures.
Kuluhan Binici, Weiming Wu, Tulika Mitra
2024 GeoFormer: A Multi-Polygon Segmentation Transformer.
Maxim Khomiakov, Michael Riis Andersen, Jes Frellsen
2024 Group Activity Recognition via Spatio-Temporal Reasoning of Key Instances.
Haoting He, Yaochen Li, Yutong Wang, Gaojie Li, Wei Guo, Runlin Zou
2024 Guidance-base Diffusion Models for Improving Photoacoustic Image Quality.
Tatsuhiro Eguchi, Shumpei Takezaki, Mihoko Shimano, Takayuki Yagi, Ryoma Bise
2024 Guided Attention for Interpretable Motion Captioning.
Karim Radouane, Julien Lagarde, Sylvie Ranwez, Andon Tchechmedjiev
2024 HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images.
Shreyas Singh, Aryan Garg, Kaushik Mitra
2024 HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction.
Haoyu Zhao, Xingyue Zhao, Lingting Zhu, Weixi Zheng, Yongchao Xu
2024 Hierarchical Prompt Learning for Scene Graph Generation.
Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan
2024 Horospherical Learning with Smart Prototypes.
Paul Berg, Björn Michele, Minh-Tan Pham, Laetitia Chapel, Nicolas Courty
2024 Hybrid-CSR: Coupling Explicit and Implicit Reconstruction of Cortical Surface.
Shanlin Sun, Tung Le, Pooya Khosravi, Chenyu You, Kun Han, Haoyu Ma, Deying Kong, Xiangyi Yan, Xiaohui Xie
2024 ICAF-4: An Integrated Framework of Category-level Articulated Object Perception and Manipulation for Embodied Intelligence.
Wenbo Xu, Li Zhang, Qiankun Li, Qi Wu, Lin Yuanbo Wu, Liu Liu
2024 Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN.
Jiawei Yao, Tong Wu, Xiaofeng Zhang
2024 Improving Multimodal Learning with Multi-Loss Gradient Modulation.
Konstantinos Kontras, Christos Chatzichristos, Matthew B. Blaschko, Maarten De Vos
2024 Improving Object Detection via Local-global Contrastive Learning.
Danai Triantafyllidou, Sarah Parisot, Ales Leonardis, Steven McDonagh
2024 InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth.
Cho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen, Ulrich Neumann
2024 IncreLM: Incremental 3D Line Mapping.
Xulong Bai, Hainan Cui, Shuhan Shen
2024 Infrared and Visible Image Fusion Using Multi-level Adaptive Fractional Differential.
Kang Zhang, Xinnian Guo
2024 Interactive Image Segmentation with Temporal Information Augmented.
Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong
2024 Interpretable Long-term Action Quality Assessment.
Xu Dong, Xinran Liu, Wanqing Li, Anthony Adeyemi-Ejeye, Andrew Gilbert
2024 Interpretable Representation Learning from Videos using Nonlinear Priors.
Marian Longa, João F. Henriques
2024 InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning.
Babak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian
2024 Into the Fog: Evaluating Robustness of Multiple Object Tracking.
Nadezda Kirillova, Muhammad Jehanzeb Mirza, Horst Bischof, Horst Possegger
2024 JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation.
Sai Tanmay Reddy Chakkera, Aggelina Chatziagapi, Dimitris Samaras
2024 Kernel Representation for Dynamic Networks.
Yichen Zhou, Teck Khim Ng
2024 Key-point Guided Deformable Image Manipulation Using Diffusion Model.
SeokHwan Oh, Guil Jung, Myeong-Gee Kim, Sang-Yun Kim, Young-Min Kim, Hyeon-Jik Lee, Hyuksool Kwon, Hyeon-Min Bae
2024 Knowledge Distillation with Global Filters for Efficient Human Pose Estimation.
Kaushik Bhargav Sivangi, Fani Deligianni
2024 LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps.
Andrey Palaev, Adil Khan, Syed M. Ahsan Kazmi
2024 Label Smoothing++: Enhanced Label Regularization for Training Neural Networks.
Sachin Chhabra, Hemanth Venkateswara, Baoxin Li
2024 Layer-wise Learning of CNNs by Self-tuning Learning Rate and Early Stopping at Each Layer.
Melika Sadeghi Tabrizi, Ali Karimi, Ahmad Kalhor, Babak Nadjar Araabi, Mona Ahmadian
2024 Layout Free Scene Graph to Image Generation.
Rameshwar Mishra, A. Venkata Subramanyam
2024 Learning Object Placement via Convolution Scoring Attention.
Yibin Wang, Yuchao Feng, Jianwei Zheng
2024 Learning Scene-Goal-Aware Motion Representation for Trajectory Prediction.
Ziyang Ren, Ping Wei, Haowen Tang, Huan Li, Jin Yang
2024 Learning conditionally untangled latent spaces using Fixed Point Iteration.
Victor Enescu, Hichem Sahbi
2024 Learning to Project for Cross-Task Knowledge Distillation.
Dylan Auty, Roy Miles, Benedikt Kolbeinsson, Krystian Mikolajczyk
2024 Learning to Segment Publicly Accessible Green Spaces with Visual and Semantic Data.
Jian Gao, Niall McLaughlin, Joanna Sara Valson, Neil Anderson, Ruth F. Hunter
2024 Leveraging Inductive Bias in ViT for Medical Image Diagnosis.
Jungmin Ha, Euihyun Yoon, Sungsik Kim, Jinkyu Kim, Jaekoo Lee
2024 Lightweight Human Pose Estimation with Enhanced Knowledge Review.
Hao Xu, Shengye Yan, Wei Zheng
2024 Linear Calibration Approach to Knowledge-free Group Robust Classification.
Ryota Ishizaki, Shunya Yamagami, Yuta Goto, Go Irie
2024 Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution.
Minghong Duan, Linhao Qu, Shaolei Liu, Manning Wang
2024 MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion.
Angel Villar-Corrales, Moritz Austermann, Sven Behnke
2024 ML-2SN: A Hybrid Two-Stream System for Sitting Posture Detection.
Kehang Jia, Gaorui Zhang, Yixuan Yang, Guangwei Huang, Penghuan Wang, Cheng Cheng
2024 MMPrune4U: Regularizing Multimodal Feature Distortion in Weight Pruning for Deep Neural Network Compression.
Sudip Das, Kaixin Xu, Nushrat Hussain, Ziyuan Zhao, Arindam Das, Weisi Lin, Ujjwal Bhattacharya
2024 MSA2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation.
Sina Ghorbani Kolahi, Seyed Kamal Chaharsooghi, Toktam Khatibi, Afshin Bozorgpour, Reza Azad, Moein Heidari, Ilker Hacihaliloglu, Dorit Merhof
2024 MV-Match: Multi-View Matching for Domain-Adaptive Identification of Plant Nutrient Deficiencies.
Jinhui Yi, Yanan Luo, Marion Deichmann, Gabriel Schaaf, Juergen Gall
2024 May the Forgetting Be with You: Alternate Replay for Learning with Noisy Labels.
Monica Millunzi, Lorenzo Bonicelli, Angelo Porrello, Jacopo Credi, Petter N. Kolm, Simone Calderara
2024 MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation.
Kim Yu-Ji, Hyunwoo Ha, Kim Youwang, Jaeheung Surh, Hyowon Ha, Tae-Hyun Oh
2024 Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation.
Nathan Louis, Mahzad Khoshlessan, Jason J. Corso
2024 MixMask: Revisiting Masking Strategy for Siamese ConvNets.
Kirill Vishniakov, Eric P. Xing, Zhiqiang Shen
2024 Mixstyle-Entropy: Whole Process Domain Generalization with Causal Intervention and Perturbation.
Luyao Tang, Yuxuan Yuan, Chaoqi Chen, Xinghao Ding, Yue Huang
2024 MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds.
Ziqiang Dang, Tianxing Fan, Boming Zhao, Xujie Shen, Lei Wang, Guofeng Zhang, Zhaopeng Cui
2024 MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM.
Renwu Li, Wenjing Ke, Dong Li, Lu Tian, Emad Barsoum
2024 Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion.
Zeyu Zhang, Yiran Wang, Biao Wu, Shuo Chen, Zhiyuan Zhang, Shiya Huang, Wenbo Zhang, Meng Fang, Ling Chen, Yang Zhao
2024 Motion Tracking with Rotated Bounding Boxes on Overhead Fisheye Imagery.
Jordan Lam
2024 MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders.
Haosen Yang, Deng Huang, Bin Wen, Jiannan Wu, Hongxun Yao, Yi Jiang, Xiatian Zhu, Zehuan Yuan
2024 Multi-Modal Information Bottleneck Attribution with Cross-Attention Guidance.
Pauline Bourigault, Emmanuelle Bourigault, Danilo P. Mandic
2024 Multi-Scale Semantic Enrichment and Dual Angular Margin Contrast for Few-Shot Class Incremental Learning.
Riya Verma, Sukhendu Das
2024 Multi-Scope Representation Learning for Causal Relation Discovery with new Challenging Datasets.
Jiageng Zhu, Hanchen Xie, Jianhua Wu, Mohamed E. Hussein, Mahyar Khayatkhoei, Jiazhi Li, Wael AbdAlmageed
2024 Multi-modal Crowd Counting via Modal Emulation.
Chenhao Wang, Xiaopeng Hong, Zhiheng Ma, Yupeng Wei, Yabin Wang, Xiaopeng Fan
2024 Multimodal base distributions in conditional flow matching generative models.
Shane Josias, Willie Brink
2024 Mumpy: Multilateral Temporal-view Pyramid Transformer for Video Inpainting Detection.
Ying Zhang, Yuezun Li, Bo Peng, Jiaran Zhou, Huiyu Zhou, Junyu Dong
2024 MxT: Mamba x Transformer for Image Inpainting.
Shuang Chen, Amir Atapour-Abarghouei, Haozheng Zhang, Hubert P. H. Shum
2024 NCA-Morph: Medical Image Registration with Neural Cellular Automata.
Amin Ranem, John Kalkhof, Anirban Mukhopadhyay
2024 NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning.
Sree Rama Vamsidhar S., Gorthi Rama Krishna Sai Subrahmanyam
2024 Neural Collapse Inspired Contrastive Continual Learning.
Antoine Montmaur, Nicolas Larue, Ngoc-Son Vu
2024 No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs.
Cristian Sbrolli, Matteo Matteucci
2024 Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models.
Eman Ali, Muhammad Haris Khan
2024 On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models.
Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan
2024 On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods.
Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten
2024 Open-Vocabulary Temporal Action Localization using Multimodal Guidance.
Akshita Gupta, Aditya Arora, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Graham W. Taylor
2024 Open-World Semi-Supervised Learning under Compound Distribution Shifts.
Shijia Xu, Lin Zhao, Jialiang Tang, Guangyu Li, Chen Gong
2024 Optimising Diffusion Models for Histopathology Image Synthesis.
Victoria Porter, Richard Gault, Stephanie G. Craig, Jacqueline A. James
2024 Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General Framework.
Liuyuan Wen
2024 Outlier detection by ensembling uncertainty with negative objectness.
Anja Delic, Matej Grcic, Sinisa Segvic
2024 PEEKABOO: Hiding Parts of an Image for Unsupervised Object Localization.
Hasib Zunair, Abdessamad Ben Hamza
2024 PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images.
Yiheng Xiong, Angela Dai
2024 PV-SLAM: Panoptic Visual SLAM with Loop Closure and Online Bundle Adjustment.
Ashok Bandyopadhyay, Pranjal Baranwal, Arijit Sur, U. P. Rajeev
2024 Painterly Image Harmonization via Bi-Transformation with Dynamic Kernels.
Zhangliang Sun, Hui Zhang
2024 PatchRot: Self-Supervised Training of Vision Transformers by Rotation Prediction.
Sachin Chhabra, Hemanth Venkateswara, Baoxin Li
2024 PawFACS: Leveraging Semi-Supervised Learning for Pet Facial Action Recognition.
Anandavardhan Hegde, Sudha Velusamy, Narayan Kothari, Aman Bahuguna, Apnesh Rawat, Hema Sathiamurthy, Ankit Raja
2024 PhysFlow: Skin tone transfer for remote heart rate estimation through conditional normalizing flows.
Joaquim Comas, Antònia Alomar, Adria Ruiz, Federico Sukno
2024 PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition.
Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley
2024 Privacy-preserving datasets by capturing feature distributions with Conditional VAEs.
Francesco Di Salvo, David Tafler, Sebastian Doerrich, Christian Ledig
2024 Projected Stochastic Gradient Descent with Quantum Annealed Binary Gradients.
Maximilian Krahn, Michele Sasdelli, Frances Fengyi Yang, Vladislav Golyanik, Juho Kannala, Tat-Jun Chin, Tolga Birdal
2024 Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers.
Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano
2024 Prompt-guided Multi-modal contrastive learning for Cross-compression-rate Deepfake Detection.
Ching-Yi Lai, Chiou-Ting Hsu, Chih-Chung Hsu, Chia-Wen Lin
2024 Prompting Diffusion Representations for Cross-Domain Semantic Segmentation.
Rui Gong, Martin Danelljan, Han Sun, Julio Delgado Mangas, Nikolay Marin, Luc Van Gool
2024 Pseudo Labelling for Enhanced Masked Auto Encoders.
Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais
2024 Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance.
Oliver Mills, Nishant Ravikumar, Philip G. Conaghan, Samuel D. Relton
2024 QUD: Unsupervised Knowledge Distillation for Deep Face Recognition.
Jan Niklas Kolf, Naser Damer, Fadi Boutros
2024 RETRO: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning.
Khanh-Binh Nguyen, Chae Jung Park
2024 RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance.
Avideep Mukherjee, Soumya Banerjee, Piyush Rai, Vinay P. Namboodiri
2024 RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields.
Mihnea Bogdan Jurca, Remco Royen, Ion Giosan, Adrian Munteanu
2024 Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization.
Róisín Luo, Alexandru Drimbarean, James McDermott, Colm O'Riordan
2024 Reconstructing Spheres by Fitting Planes.
Erol Ozgur, Mohammad Alkhatib, Youcef Mezouar, Adrien Bartoli
2024 Recovering Global Data Distribution Locally in Federated Learning.
Ziyu Yao
2024 Recovering SLAM Tracking Lost by Trifocal Pose Estimation using GPU-HC++.
Chiang-Heng Chien, Ahmad Abdelfattah, Benjamin B. Kimia
2024 Rectifying Shortcut Learning through Cellular Differentiation in Deep Learning Neurons.
Hongjing Niu, Hanting Li, Guoping Wu, Bin Li, Feng Zhao
2024 Region-based Entropy Separation for One-shot Test-Time Adaptation.
Kodai Kawamura, Shunya Yamagami, Go Irie
2024 Rethinking Domain Adaptive Optic Disc and Cup Segmentation in Fundus Image through Dynamic Diffusion Flow.
Canran Li, Dongnan Liu, Weidong Cai
2024 Retinex-Inspired Cooperative Game Through Multi-Level Feature Fusion for Robust, Universal Image Enhancement.
Ruiqi Mao, Rongxin Cui
2024 Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization.
Nicholas Moratelli, Davide Caffagni, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara
2024 Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation.
Zhaowei Gao, Mingyang Song, Christopher Schroers, Yang Zhang
2024 SAE: Single Architecture Ensemble Neural Networks.
Martin Ferianc, Hongxiang Fan, Miguel R. D. Rodrigues
2024 SAM Helps SSL: Mask-guided Attention Bias for Self-supervised Learning.
Kensuke Taguchi, Takehiko Kawai, Wataru Imaeda, Hironobu Fujiyoshi
2024 SAM-EG: Segment Anything Model with Edge Guidance framework for efficient Polyp Segmentation.
Quoc-Huy Trinh, Hai-Dang Nguyen, Bao-Tram Nguyen Ngoc, Debesh Jha, Ulas Bagci, Minh-Triet Tran
2024 SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line Queries.
Sebastian Janampa, Marios Pattichis
2024 SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction.
Evgeney Bogatyrev, Ivan Molodetskikh, Dmitriy S. Vatolin
2024 STPose: 6D object pose estimation network based on sparse attention and cross-layer connection.
Shihao Chen, Xiaobing Li, Keduo Yan, Yong Li, Dongxu Gao
2024 SagaGAN: Style Applied using Gram matrix Attribution based on StarGAN v2.
Yongseon Yoo, Seonggyu Kim, Jong-Min Lee
2024 Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space.
Junho Lee, Jeongwoo Shin, Seung Woo Ko, Seongsu Ha, Joonseok Lee
2024 SceneSAM: Integrating 2D Labels for Weakly Supervised 3D Scene Understanding.
Julius Körner, Dogu Tamgac, Dávid Rozenberszki
2024 SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters.
Shohei Tanaka, Hao Wang, Yoshitaka Ushiku
2024 Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural Networks.
Debjyoti Mondal, Rahul Mishra, Chandan Kumar Pandey
2024 Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs.
Sadra Safadoust, Fabio Tosi, Fatma Güney, Matteo Poggi
2024 Self-Supervised Real-World Denoising by Jointly Learning Visible and Invisible Noise.
Shaoyu Wang, Changze Zhou, Bolin Song, Yiyang Wang
2024 Semantic Image Synthesis of Anime Characters Based on Conditional Generative Adversarial Networks.
Xuhui Zhu, Feng Jiang, Jing Wen, Yi Wang, Qiang Gao
2024 Separated and Independent Contrastive Learning on Labeled and Unlabeled Samples: Boosting Performance on Long-tail Semi-supervised Learning.
Dongyoung Kim, Jeong-Gun Lee, Wonsook Lee
2024Sequential Amodal Segmentation via Cumulative Occlusion Learning.
Jiayang Ao, Qiuhong Ke, Krista A. Ehinger
2024Sign Stitching: A Novel Approach to Sign Language Production.
Harry Walsh, Ben Saunders, Richard Bowden
2024SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning.
Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng
2024Spatial-Temporal NAS for Fast Surgical Segmentation.
Matthew Lee, Felix John Samuel Bragman, Ricardo Sanchez-Matilla, Imanol Luengo, Danail Stoyanov
2024Spatio-Temporal Transformer with Rotary Position Embedding and Bone Priors for 3D Human Pose Estimation.
Cheng Chen, Jiang Liu, Liaoyuan Zeng, Fang Duan, Sean McGrath, Tian Dan
2024Spatiotemporal Vision Transformer for Weakly Supervised Dense Prediction of Dynamic Brain Maps.
Behnam Kazemivash, Armin Iraji, Sergey M. Plis, Vince D. Calhoun
2024Spike-SLR: An Energy-efficient Parallel Spiking Transformer for Event-based Sign Language Recognition.
Xinxu Lin, Mingxuan Liu, Kezhuo Liu, Hong Chen
2024SuperLoRA: Parameter-Efficient Unified Adaptation of Large Foundation Models.
Xiangyu Chen, Jing Liu, Ye Wang, Pu Perry Wang, Matthew Brand, Guanghui Wang, Toshiaki Koike-Akino
2024Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection.
Yunsong Wang, Na Zhao, Gim Hee Lee
2024Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds.
Yuyang Zhao, Na Zhao, Gim Hee Lee
2024S³-Match: Common-View Aligned Image Matching via Self-Supervised Keypoint Selection.
Shizhen Li, Jingcheng Liu, Jianwu Fang, Dezheng Gao, Jianru Xue
2024TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation.
Jack R. Saunders, Vinay P. Namboodiri
2024Taming the Tail: Leveraging Asymmetric Loss and Padé Approximation to Overcome Long-Tailed Class Imbalance.
Pankhi Kashyap, Pavni Tandon, Sunny Gupta, Abhishek Tiwari, Ritwik Kulkarni, Kshitij Sharad Jadhav
2024Task-Related Feature Enhancement Network for Neuronal Morphology Classification.
Chunli Sun, Feng Zhao
2024Text Removal In E-Commerce Images: A Comparison Of Inpainting Methods.
Hiya Roy, Björn Stenger
2024Text-Guided Mixup Towards Long-Tailed Image Categorization.
Richard Franklin, Jiawei Yao, Deyang Zhong, Qi Qian, Juhua Hu
2024Textual Attention RPN for Open-Vocabulary Object Detection.
Tae-Min Choi, Inug Yoon, Jong-Hwan Kim, Juyoun Park
2024The Attempt on Combining Three Talents by KD with Enhanced Boundary in Co-Salient Object Detection.
Ziyi Cao, Shengye Yan, Wei Zheng
2024Time-conditioned Illumination for Inverse Rendering of Outdoor Scenes.
Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang
2024Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation.
Xiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao, Sheng Tang, Peng Li, Yang Liu
2024Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection.
Xin Feng, Junxian Zeng, Siping Wang, Zhenwei He
2024Towards Better Zero-Shot Anomaly Detection under Distribution Shift with CLIP.
Jiyao Gao, Chengxin He, Lei Duan, Jie Zuo
2024Towards Generative Class Prompt Learning for Fine-grained Visual Recognition.
Soumitri Chattopadhyay, Sanket Biswas, Emanuele Vivoli, Josep Lladós
2024TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training.
Li Li, Tanqiu Qiao, Hubert P. H. Shum, Toby P. Breckon
2024Training-Free Zero-Shot Semantic Segmentation with LLM Refinement.
Yuantian Huang, Satoshi Iizuka, Kazuhiro Fukui
2024TrakAthlete4D: Multi-View On-Field Player Position Tracking in Sports.
Nitish Agarwal, Steven Cadavid
2024TransHuPR: Cross-View Fusion Transformer for Human Pose Estimation Using mmWave Radar.
Niraj Prakash Kini, Ruey-Horng Shiue, Ryan Chandra, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang
2024Transferable Learned Image Compression-Resistant Adversarial Perturbations.
Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen
2024Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning.
Muhammad Salman Ali, Maryam Qamar, Sung-Ho Bae, Enzo Tartaglione
2024UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters.
Kovvuri Sai Gopal Reddy, Bodduluri Saran, A. Mudit Adityaja, Saurabh J. Shigwan, Nitin Kumar, Snehasis Mukherjee
2024Uni-Mlip: Unified Self-Supervision for Medical Vision Language Pre-training.
Ameera Ali Bawazir, Kebin Wu, Wenbin Li
2024Unified Compositional Query Machine with Multimodal Consistency for Video-based Human Activity Recognition.
Tuyen Tran, Thao Minh Le, Duy Hung Tran, Truyen Tran
2024Unsupervised Domain Adaptation for Tubular Structure Segmentation Across Different Anatomical Sources.
Yuxiang An, Dongnan Liu, Weidong Cai
2024Unsupervised Hashing Network with Hyper Quantization Tree.
Sungeun Kim, Jongbin Ryu
2024Unsupervised Point Cloud Registration with Self-Distillation.
Christian Löwens, Thorben Funke, André Wagner, Alexandru Paul Condurache
2024VEMIC: View-aware Entropy model for Multi-view Image Compression.
Susmija Jabbireddy, Davit Soselia, Max Ehrlich, Christopher A. Metzler, Amitabh Varshney
2024VLAVAD: Vision-Language Models Assisted Unsupervised Video Anomaly Detection.
Changkang Li, Yalong Jiang
2024Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection.
Christian Fruhwirth-Reisinger, Wei Lin, Dusan Malic, Horst Bischof, Horst Possegger
2024When Text and Images Don't Mix: Bias-Correcting Language-Image Similarity Scores for Anomaly Detection.
Adam Goodge, Bryan Hooi, Wee Siong Ng
2024iHAST: Integrating Hybrid Attention for Super-Resolution in Spatial Transcriptomics.
Xi Li, Jing Zhang, Ziheng Duan, Yi Dai, Siwei Xu
2024topK dice loss for medical image segmentation.
Seyed Mohsen Hosseini