ICIP B

594 papers

YearTitle / Authors
20243D Clothed Human Reconstruction From One In-the-Wild RGB Image.
Liangjing Shao, Benshuang Chen, Xinrong Chen
20243D Semantic Scene Completion From A Depth Map With Unsupervised Learning For Semantics Prioritisation.
Mona Alawadh, Mahesan Niranjan, Hansung Kim
20243D-COCO: Extension of MS-COCO Dataset for Scene Understanding and 3D Reconstruction.
Bideaux Maxence, Phe Alice, Mohamed Chaouch, Luvison Bertrand, Quoc-Cuong Pham
20243Dlaneformer: Rethinking Learning Views for 3D Lane Detection.
Kun Dong, Jian Xue, Xing Lan, Ke Lu
20243F-PNP: Compressive Sensing Using Nonlocal Self-Similarity and Deep Learning Priors.
Karen O. Egiazarian, Vladimir Katkovnik
2024A 1D Plug-and-Play Synthetic Data Deep Learning For Undersampled Magnetic Resonance Image Reconstruction.
Min Xiao, Zi Wang, Jiefeng Guo, Di Guo, Xiaobo Qu
2024A Benchmark of Variance of Opinion Scores in Image Quality Assessment.
Jianxun Lou, Xinbo Wu, Yingying Wu, Padraig Corcoran, Gualtiero Colombo, Roger M. Whitaker, Hantao Liu
2024A Cnn-Transformer Network Based Snr Guided High Frequency Reconstruction for Low Light Image Enhancement.
Jin Zhang, Haiyan Jin, Haonan Su, Yuanlin Zhang, Zhaolin Xiao, Bin Wang
2024A Comparative Study of Perceptual Quality Metrics For Audio-Driven Talking Head Videos.
Weixia Zhang, Chengguang Zhu, Jingnan Gao, Yichao Yan, Guangtao Zhai, Xiaokang Yang
2024A Confidence-Aware Matching Strategy For Generalized Multi-Object Tracking.
Kyujin Shim, Jubi Hwang, Kangwook Ko, Changick Kim
2024A Context-Oriented Multi-Scale Neural Network for Fire Segmentation.
Tony Zhang, Robert P. Dick
2024A Cross Domain Generative Network for Accelerated MRI.
Vazim Ibrahim, Joseph Suresh Paul
2024A Dataset for Understanding Open UGC Video Datasets.
Pierre R. Lebreton, Patrick Le Callet, Neil Birkbeck, Yilin Wang, Balu Adsumilli
2024A Decoding Scheme With Successive Aggregation of Multi-Level Features For Light-Weight Semantic Segmentation.
Jiwon Yoo, Jangwon Lee, Gyeonghwan Kim
2024A Dictionary Based Approach for Removing Out-of-Focus Blur.
Uditangshu Aurangabadkar, Anil C. Kokaram
2024A Dual-Domain Collaboration Network for VCS Reconstruction.
Jiahui Liu, Chunling Yang
2024A Fusion-Based Approach for Blind Contrast-Enhanced Image Ranking.
Wael Suliman, Mohamed Deriche, Naoufel Werghi, Azeddine Beghdadi
2024A Hard Convex-Shape Constraint In Dnns For Object Segmentation.
Jimut B. Pal, Suyash P. Awate
2024A Hue-Preserving Contrast Enhancement Method Using Histogram Specification for Each RGB Component.
Ryushiro Matsumoto, Mashiho Mukaida, Takanori Koga, Noriaki Suetake
2024A Large-Capacity Data Hiding Scheme in Encrypted VVC Video.
Chen Chen, Xingjun Wang
2024A Learnable Radar Imaging Paradigm Driven by Deep Generative Model.
Shuang Li, Ganggang Dong
2024A Modular and Robust Physics-Based Approach for Lensless Image Reconstruction.
Yohann Perron, Eric Bezzam, Martin Vetterli
2024A Multi-Modality Feature Enhancement Method Based On Feature Disentanglement For Sar Image Target Detection.
Jiayue He, Nan Su, Yanping Liao, Yiming Yan, Shou Feng, Chunhui Zhao
2024A Multi-Scale Feature Fusion Network for Chip Surface Defect Detection.
Haoang Ren, Mengke Tian, Guanwen Zhang, Wei Zhou
2024A Needle In A (Medical) Haystack: Detecting A Biopsy Needle In Ultrasound Images Using Vision Transformers.
Agata M. Wijata, Bartlomiej Pycinski, Jakub Nalepa
2024A Neuroimaging Yolov8-Based Cad Framework for Anosmia Grading in Covid-19.
Hossam Magdy Balaha, Mayada Elgendy, Ahmed Alksas, Mohamed Shehata, Norah Saleh Alghamdi, Fatma Taher, Mohammed Ghazal, Mahitab Ghoneim, Eslam Hamed, Fatma Sherif, Ahmed Elgarayhi, Mohammed Sallah, Mohamed Abdelbadie Salem, Elsharawy Kamal, Harpal Sandhu, Ayman El-Baz
2024A New Efficient Split & Merge Algorithm for Embedded Systems.
Nathan Maurice, Julien Sopena, Lionel Lacassagne
2024A New Fingerprinting Technique for Engraved Binary Matrix Authentication.
Léo Nicollier, Marc Michel Pic, Enric Meinhardt-Llopis, Gabriele Facciolo
2024A New People-Object Interaction Dataset and NVS Benchmarks.
Shuai Guo, Houqiang Zhong, Qiuwen Wang, Ziyu Chen, Yijie Gao, Jiajing Yuan, Chenyu Zhang, Rong Xie, Li Song
2024A Novel Approach for 3D Renal Segmentation Using a Modified GAN Model and Texture Analysis.
Israa Sharaby, Ahmed Alksas, Hossam Magdy Balaha, Ali Mahmoud, Mohammed Ali Badawy, Mohamed Abou El-Ghar, Ashraf Khalil, Mohammed Ghazal, Sohail Contractor, Ayman El-Baz
2024A Novel Architecture for Image Vectorization with Increasing Granularity.
Junhao Huang, Fang Zhang, Meiliang Liu, Zhengye Si, Zhiwen Zhao
2024A Practical Calibration Method for Cameras and Multiple Line-Lasers in Light Sectioning Systems for Underwater Environments.
Takaki Ikeda, Takafumi Iwaguchi, Diego Thomas, Hiroshi Kawasaki
2024A Preconditioning Approach To Optimizing Sensing Matrix For Improved Compressed Sensing CT Reconstruction.
Prasad Theeda, Chee-Ming Ting, Arghya Pal, Hernando Ombao
2024A Real-World Satellite Video Subjective QOE Database.
Bowen Chen, Zaixi Shang, Alan C. Bovik, Jae Won Chung, David Lerner
2024A Self-Supervised Diffusion Framework For Facial Emotion Recognition.
Saif Hassan, Mohib Ullah, Ali Shariq Imran, Ghulam Mujtaba, Muhammad Mudassar Yamin, Ehtesham Hashmi, Faouzi Alaya Cheikh, Azeddine Beghdadi
2024A Single Graph Convolution is All You Need: Efficient Grayscale Image Classification.
Jacob Fein-Ashley, Sachini Wickramasinghe, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna
2024A Sparse Graph Formulation for Efficient Spectral Image Segmentation.
Rahul Palnitkar, Jeová Farias Sales Rocha Neto
2024A Spatio-Temporal Aligned SUNet Model For Low-Light Video Enhancement.
Ruirui Lin, Nantheera Anantrasirichai, Alexandra Malyugina, David Bull
2024A Statistical Image Realism Score For Deepfake Detection.
Yunzhuo Chen, Naveed Akhtar, Nur Al Hasan Haldar, Jordan Vice, Ajmal Mian
2024A Study on the Effect of Color Spaces in Learned Image Compression.
Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, Siegfried Foessel, André Kaup
2024A Subjective Quality Evaluation of 3D Mesh With Dynamic Level of Detail in Virtual Reality.
Duc V. Nguyen, Tran Thuy Hien, Truong Thu Huong
2024A Text Detector Based on the Specific Text Prompt.
Xingtao Lin, Chuanyang Gong, Lanxiao Wang, Heqian Qiu, Shengyu Tong, Hongliang Li
2024A Toolkit to Benchmark Point Cloud Quality Metrics with Multi-Track Evaluation Criteria.
Ali Ak, Emin Zerman, Maurice Quach, Aladine Chetouani, Giuseppe Valenzise, Patrick Le Callet
2024A Trustworthy Authentication Against Visual Master Face Dictionary Attacks (Trauma).
Muhammad Mohzary, Baek-Young Choi, Sejun Song
2024AAGF: An Efficient Transformer With Mix-Features For Visual Place Recognition.
Kuan Zhou, Zhenyu Xu, Qieshi Zhang, Jun Cheng, Ziliang Ren, Xiangyang Gao
2024ACML: Attention-Based Cross-Modality Learning For Cloth-Changing and Occluded Person Re-Identification.
Vuong D. Nguyen, Pranav Mantini, Shishir K. Shah
2024AI-Generated Image Detection With Wasserstein Distance Compression and Dynamic Aggregation.
Zihang Lyu, Jun Xiao, Cong Zhang, Kin-Man Lam
2024AIGCOIQA2024: Perceptual Quality Assessment of AI Generated Omnidirectional Images.
Liu Yang, Huiyu Duan, Long Teng, Yucheng Zhu, Xiaohong Liu, Menghan Hu, Xiongkuo Min, Guangtao Zhai, Patrick Le Callet
2024ATAC-NET: Zoomed View Works Better for Anomaly Detection.
Shaurya Gupta, Neil Gautam, Anurag Malyala
2024ATU-NET: An Adaptive Transformation-Based U-NET for Medical Image Segmentation.
Qianyu Du, Baojiang Zhong, Kai-Kuang Ma
2024Accelerating Cascade Classifier Training with Genetic Algorithms for Edge ML Applications.
Abhishek Saini, Sajjad Moazeni
2024Accurate Colon Segmentation Using 2D Convolutional Neural Networks With 3D Contextual Information.
Samir Harb, A. Elsayed, Mohamed Yousuf, Islam Alkabbany, Asem M. Ali, Salwa Elshazley, Aly A. Farag
2024AdaViPro: Region-Based Adaptive Visual Prompt For Large-Scale Models Adapting.
Mengyu Yang, Ye Tian, Lanshan Zhang, Xiao Liang, Xuming Ran, Wendong Wang
2024Adaprompt: Prompt Tuning with Adaptive Neighbours for Generalized Category Discovery.
Liyana Sahir, Anwesha Banerjee, Soma Biswas
2024Adaptative Context Normalization: A Boost for Deep Learning in Image Processing.
Bilal Faye, Hanane Azzag, Mustapha Lebbah, Djamel Bouchaffra
2024Adapting Learned Image Codecs To Screen Content Via Adjustable Transformations.
H. Burak Dogaroglu, Ahmet Burakhan Koyuncu, Atanas Boev, Elena Alshina, Eckehard G. Steinbach
2024Adaptive Adversarial Cross-Entropy Loss for Sharpness-Aware Minimization.
Tanapat Ratchatorn, Masayuki Tanaka
2024Adaptive Downsampling and Spatial Upconversion for Point Cloud Compression.
Yichen Zhou, Xinfeng Zhang, Yingzhan Xu, Kai Zhang, Li Zhang
2024Adaptive Sampling Method for Whole-Body Low-Dose Pet Reconstruction Based on Reconstruction Difficulty.
Yanyi Li, Jianping Yin
2024Adaptive Spatial-Temporal Modelling For Human Motion Prediction.
Jianhua Zhang, Huiyu Zhou, Na Lv
2024Adaptive Tilt-Series Alignment With Feature Resampling in Cryo-Electron Tomography.
Ranhao Zhang, Mingtao Huang, Xueming Li, Yuan Shen
2024Adaptively Hierarchical Quantization Variational Autoencoder Based on Feature Decoupling and Semantic Consistency for Image Generation.
Ying Zhang, Hyunhee Park, Hanchao Jia, Fan Wang, Jianxing Zhang, Xiangyu Kong
2024Adaptrack: Adaptive Thresholding-Based Matching for Multi-Object Tracking.
Kyujin Shim, Kangwook Ko, Jubi Hwang, Changick Kim
2024Adaptxray: Vision Transformer And Adapter In X-Ray Images For Prohibited Items Detection.
Yaobin Huang, Hongxia Gao, Xiaomeng Li
2024AdvART: Adversarial Art for Camouflaged Object Detection Attacks.
Amira Guesmi, Ioan Marius Bilasco, Muhammad Shafique, Ihsen Alouani
2024Advanced Object Detection in Multibeam Forward-Looking Sonar Images Using Linear Cross-Attention Techniques.
Gangqi Chen, Zhaoyong Mao, Junge Shen
2024Advancing Colorectal Polyp Segmentation With Watershed Algorithm-Enhanced Parallel Self-Supervised Learning.
Khalil Chikhaoui, Motaz Alfarraj
2024Adversarial Detection Transformer For Kuzushiji Recognition.
Pengfeng Lu, Sei-ichiro Kamata, Mengyunqiu Zhang, Weilian Zhou
2024Adversarial EM For Partially-Supervised Image-Quality Enhancement: Application To Low-Dose Pet Imaging.
Vatsala Sharma, Suyash P. Awate
2024Adversarial Robustness for Deep Metric Learning.
Ezgi Paket, Inci M. Baytas
2024Adversarially Robust Continual Learning with Anti-Forgetting Loss.
Koki Mukai, Soichiro Kumano, Nicolas Michel, Ling Xiao, Toshihiko Yamasaki
2024Aerial View River Landform Video Segmentation: A Weakly Supervised Context-Aware Temporal Consistency Distillation Approach.
Chi-Han Chen, Chieh-Ming Chen, Wen-Huang Cheng, Ching-Chun Huang
2024Agent-Guided Gaze Estimation Network by Two-Eye Asymmetry Exploration.
Yichen Shi, Feifei Zhang, Wenming Yang, Guijin Wang, Nan Su
2024Alignface: Enhancing Face Verification Models Through Adaptive Alignment Of Pose, Expression, and Illumination.
Sahar Husseini, Jean-Luc Dugelay
2024All Skeletons are Created Equal! A Domain Adaptation Transformer to Handle Multiple Topologies.
Giulia Martinelli, Nicola Garau, Niccolò Bisagno, Nicola Conci
2024An Anchor-Free Contour-Based Method For Instance Segmentation.
Tzu-Han Huang, Wen-Jiin Tsai
2024An Explainable Spectral Analysis For Light Field Image Quality Assessment.
Shengyang Zhao, Xin Jin
2024An Image Decomposition-Guided Network for Image Interpolation.
Jiahuan Ji, Baojiang Zhong, Kai-Kuang Ma, Fuhui Zhou, Qihui Wu
2024An Indoor Scene Localization Method Using Graphical Summary of Multi-View RGB-D Images.
Preeti Meena, Himanshu Kumar, Sandeep Kumar Yadav
2024An International Standard For Assessing Trustworthiness In Media.
Deepayan Bhowmik, Sabrina B. Caldwell, Jaime Delgado, Touradj Ebrahimi, Nikolaos Fotos, Xiaojun Gu, Ziyuan Hu, Xin Kang, Fernando Pereira, Leonard Rosenthol, Frederik Temmermans, Haibo Zhou
2024An Interpretable Deep Graph Neural Network Based On Attentional Multi-Scale Feature Fusion for FMRI Analysis.
Likai Wang, Tao Zhu, Yipu Zhang
2024An Optimal Transport-Based Method For Medical Image Generation.
Bohan Lei, Yueting Zhuang, Xiaoyin Xu, Min Zhang
2024An α-Divergence Approach To Robust Canonical Correlation Analysis.
Wenjing Yang, Abd-Krim Seghouane, Pavel Krupskiy
2024Analyzing Visible Articulatory Movements in Speech Production For Speech-Driven 3D Facial Animation.
Hyung Kyu Kim, Sangmin Lee, Hak Gu Kim
2024Anomaly Detection for the Identification of Volcanic Unrest in Satellite Imagery.
Robert Gabriel Popescu, Nantheera Anantrasirichai, Juliet Biggs
2024Anomaly Unveiled: Securing Image Classification against Adversarial Patch Attacks.
Nandish Chattopadhyay, Amira Guesmi, Muhammad Shafique
2024Apnet: Generating Precise Anomaly Prior Information for Mixed-Supervised Defect Detection.
Guanji Li, Hongxia Gao
2024Are Objective Explanatory Evaluation Metrics Trustworthy? An Adversarial Analysis.
Prithwijit Chowdhury, Mohit Prabhushankar, Ghassan AlRegib, Mohamed Deriche
2024Assessing Video Shakiness: A Novel Data And Protocols Framework.
Borhen-Eddine Dakkar, Azeddine Beghdadi, Stefania Colonnese, Naveed Iqbal, Azzedine Zerguine
2024Attention Down-Sampling Transformer, Relative Ranking and Self-Consistency For Blind Image Quality Assessment.
Mohammed Alsaafin, Musab Alsheikh, Saeed Anwar, Muhammad Usman
2024Attention Enhancement With Parallel Groups for Remote Sensing Object Detection.
Zhigang Yang, Yiming Liu, Zehao Gao, Jiayue He, Tao Chen, Wei Emma Zhang
2024Attention-Based Few-Shot Diagnosis of Chest X-Rays Using Semantic Signatures.
Devi Prasad Maharathy, Prabhala Sandhya Gayatri, Angshuman Paul
2024Automated Segmentation of Lung Regions in 3D CT Scans Using Hybrid Unsupervised-Supervised Models.
Ahmed Sharafeldeen, Adel Khelifi, Mohammed Ghazal, Maha Yaghi, Sohail Contractor, Ayman El-Baz
2024B-Walk: Bernoulli Principle Guided Biased Random Walk for Curve Connection.
Zhuang Sun, Li Chen, Zhida Feng, Xiaoming Liu
2024Bayesian Blind Image Deconvolution using an Hyperbolic-Secant prior.
Francisco M. Castro-Macías, Fernando Pérez-Bueno, Miguel Vega, Javier Mateos, Rafael Molina, Aggelos K. Katsaggelos
2024Bi-Directional Tracklet Embedding for Multi-Object Tracking.
H. Çagriota Bilgi, A. Aydiotan Alatan
2024Bi-Predictive Intra Block Copy for Enhanced Video Coding Beyond VVC.
Yoshitaka Kidani, Haruhisa Kato, Kei Kawamura
2024Bidfuse: Harnessing Bi-Directional Attention with Modality-Specific Encoders for Infrared-Visible Image Fusion.
Wangzhi Xing, Diqi Chen, Mohammad Aminul Islam, Jun Zhou
2024Binary-Decomposed Vision Transformer: Compressing and Accelerating Vision Transformer by Binary Decomposition.
Ryota Kondo, Hiroaki Minoura, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi
2024Blend & Predict: Domain-Adaptable Few-Shot Learning for Microscopy Imaging.
Ayush Somani, Anshul Gupta, Arif Ahmed Sekh, Krishna Agarwal, Dilip K. Prasad
2024Bmt-Bench: A Benchmark Sports Dataset For Video Generation.
Ziang Shi, Yang Xiao, Da Yan, Min-Te Sun, Wei-Shinn Ku, Bo Hui
2024Box-Level Class-Balanced Sampling For Active Object Detection.
Jingyi Liao, Xun Xu, Chuan-Sheng Foo, Lile Cai
2024Bri3L: A Brightness Illusion Image Dataset for Identification and Localization of Regions of Illusory Perception.
Aniket Roy, Anirban Roy, Soma Mitra, Kuntal Ghosh
2024Burnsnet: Burn Region Segmentation Network From Color Images With Two-Way CNN.
Joohi Chauhan, Paul L. Rosin, Puneet Goyal
2024CAPTIV8: A Comprehensive Large Scale Capsule Endoscopy Dataset For Integrated Diagnosis.
Anuja Vats, Bilal Ahmad, Pål Anders Floor, Ahmed Kedir Mohammed, Marius Pedersen, Øistein Hovde
2024CLIFS: Clip-Driven Few-Shot Learning for Baggage Threat Classification.
Abdelfatah Hassan Ahmed, Divya Velayudhan, Mahmoud Elmezain, Muaz Al Radi, Abderrahmene Boudiaf, Taimur Hassan, Mohamed Deriche, Mohammed Bennamoun, Naoufel Werghi
2024CM
Ruoyu Wang, Chen Cai, Wenqian Wang, Jianjun Gao, Dan Lin, Wenyang Liu, Kim-Hui Yap
2024CST-Yolo: A Novel Method For Blood Cell Detection Based On Improved Yolov7 And CNN-Swin Transformer.
Ming Kang, Chee-Ming Ting, Fung Fung Ting, Raphaël C.-W. Phan
2024Cafct-Net: A Cnn-Transformer Hybrid Network With Contextual And Attentional Feature Fusion For Liver Tumor Segmentation.
Ming Kang, Chee-Ming Ting, Fung Fung Ting, Raphaël C.-W. Phan
2024Camera Calibration Through Geometric Constraints from Rotation and Projection Matrices.
Muhammad Waleed, Abdul Rauf, Murtaza Taj
2024Camouflaged Object Detection Via Style Transfer-Based Data Augmentation.
Dongni Lu, Jiaxuan Chen, Haiyan Chen, Ziyi Peng, Rong Quan, Jie Qin
2024Cascading Unknown Detection With Known Classification For Open Set Recognition.
Daniel Brignac, Abhijit Mahalanobis
2024Caseg: Clip-Based Action Segmentation With Learnable Text Prompt.
Suyuan Huang, Haoxin Zhang, Yanyu Xu, Yan Gao, Yao Hu, Zengchang Qin
2024Category-Agnostic Pose Estimation for Point Clouds.
Bowen Liu, Wei Liu, Siang Chen, Pengwei Xie, Guijin Wang
2024Cell Cycle State Prediction Using Graph Neural Networks.
Sayan Acharya, Aditya Ganguly, Ram Sarkar, Abin Jose
2024CenterRadarNet: Joint 3D Object Detection and Tracking Framework Using 4D FMCW Radar.
Jen-Hao Cheng, Sheng-Yao Kuan, Hou-I Liu, Hugo Latapie, Gaowen Liu, Jenq-Neng Hwang
2024Characterization Of Dim Light Response In DVS Pixel: Discontinuity of Event Triggering Time.
Xiao Jiang, Fei Zhou
2024Chatgpt and Biometrics: an Assessment of Face Recognition, Gender Detection, and Age Estimation Capabilities.
Ahmad Hassanpour, Yasamin Kowsari, Hatef Otroshi-Shahreza, Bian Yang, Sébastien Marcel
2024Class-Specific Channel Attention For Few Shot Learning.
Yi-Kuan Hsieh, Jun-Wei Hsieh, Ying-Yu Chen
2024ClearDepth: Addressing Depth Distortions Caused By Eyelashes For Accurate Geometric Gaze Estimation On Mobile Devices.
Jamie Koerner, Vivienne Sze
2024Clip-Based Composition-Aware Image Cropping.
Shuo Zhang, Xinyu Yang, Xiwen Bai, Yu Li
2024Clip-Medfake: Synthetic Data Augmentation With AI-Generated Content for Improved Medical Image Classification.
Honghui Chen, Baoquan Zhao, Guanghui Yue, Weide Liu, Chenlei Lv, Ruomei Wang, Fan Zhou
2024Clouds and Haze Co-Removal Based on Weight-Tuned Overlap Refinement Diffusion Model for Remote Sensing Images.
Jingxuan Zhang, Libao Zhang
2024Co2Wounds-V2: Extended Chronic Wounds Dataset from Leprosy Patients.
Karen Sanchez, Carlos Hinojosa, Olinto Mieles, Chen Zhao, Bernard Ghanem, Henry Arguello
2024Coarse-Fine Spectral-Aware Deformable Convolution for Hyperspectral Image Reconstruction.
Jincheng Yang, Lishun Wang, Miao Cao, Huan Wang, Yinping Zhao, Xin Yuan
2024Coarse-To-Fine Spatio-Temporal Luminance-Aware Reconstruction For High-Speed Motion Scene.
Zhangke Wang, Na Qi, Xiyuan Zhao, Wei Xu, Jingzhong Qi, Qing Zhu
2024Codamal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes.
Ishan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen, Mubarak Shah
2024Collaborative Intelligence For Vision Transformers: A Token Sparsity-Driven Edge-Cloud Framework.
Monikka Roslianna Busto, Shohei Enomoto, Takeharu Eda
2024Combining Raft-Based Stereo Disparity and Optical Flow Models For Scene Flow Estimation.
Huizhu Pan, Ling Li, Senjian An, Hui Xie
2024Comparison of Crowdsourcing And Laboratory Settings for Subjective Assessment of Video Quality and Acceptability & Annoyance.
Ali Ak, Abhishek Gera, Denise Noyes, Hassene Tmar, Ioannis Katsavounidis, Patrick Le Callet
2024Competitive Learning For Achieving Content-Specific Filters In Video Coding For Machines.
Honglei Zhang, Jukka I. Ahonen, Nam Le, Ruiying Yang, Francesco Cricri
2024Compression-Aware Tuning for Compressing Volumetric Radiance Fields.
Luyang Tang, Yongqi Zhai, Ronggang Wang
2024Computationally Efficient Kalman Filter Framework for Intra-Frame Image Reconstruction with a Rolling Shutter Camera.
Sabeethan Kanagasingham, Andrew R. Mills, Visakan Kadirkamanathan
2024Conditional Optimal Filter Selection For Multispectral Object Classification.
Katja Kossira, David Schön, Jürgen Seiler, André Kaup
2024Conditional Past Experience Generation for Dark Continual Learning.
Cheng Feng, Chaoliang Zhong, Jie Wang, Jun Sun, Yasuto Yokota
2024Confidence Aware Stereo Matching for Realistic Cluttered Scenario.
Junhong Min, Youngpil Jeon
2024Constructing an Interpretable Deep Denoiser by Unrolling Graph Laplacian Regularizer.
Seyed Alireza Hosseini, Tam Thuc Do, Gene Cheung, Yuichi Tanaka
2024Content-Aware Supervision For Diffusion-Based Restoration of Extremely Compressed Background For VCM.
Le Thi Hue Dao, An Gia Vien, Jooyoung Lee, Seyoon Jeong, Naeun Yang, Chul Lee
2024Context-Adaptive Entropy Model With Adapters For Lossless Point Cloud Geometry Compression.
Yutong Zhang, Wenbo Zhao, Daxin Li, Junjun Jiang, Xianming Liu
2024Contextuality Helps Representation Learning for Generalized Category Discovery.
Tingzhang Luo, Mingxuan Du, Jiatao Shi, Xinxiang Chen, Bingchen Zhao, Shaoguang Huang
2024Continual Road-Scene Semantic Segmentation Via Feature-Aligned Symmetric Multi-Modal Network.
Francesco Barbato, Elena Camuffo, Simone Milani, Pietro Zanuttigh
2024Contour-Weighted Loss For Class-Imbalanced Image Segmentation.
Zhengyong Huang, Yao Sui
2024Contrast-Guided Wireframe Parsing.
Xueyuan Chen, Baojiang Zhong
2024Controllable Unsupervised Event-Based Video Generation.
Yaping Zhao, Pei Zhang, Chutian Wang, Edmund Y. Lam
2024Convex-Hull Estimation using Xpsnr for Versatile Video Coding.
Vignesh V. Menon, Christian R. Helmrich, Adam Wieckowski, Benjamin Bross, Detlev Marpe
2024Convolutional Neural Network With Learnable Masks For EIT Based Tactile Sensing.
Ibrar Amin, Ruiyuan Kang, Hasan Al-Marzouqi, Zeyar Aung, Panos Liatsis
2024Correlation-Aware Joint Pruning-Quantization using Graph Neural Networks.
Muhammad Nor Azzafri Nor-Azman, Usman Ullah Sheikh, Mohammed Sultan Mohammed, Jeevan Sirkunan, Muhammad Nadzir Marsono
2024Counting Repetitive Actions in Event Stream.
Yuelong Zhuo, Weiling Li, Beibei Yang, Yan Fang, Huaqiang Yuan
2024Crocos-V1: Enhancing Mask Leakage and Bounding Box Localization for Real-Time Crop/Weed Instance Segmentation.
Jesus Franco-Robles, Jorge E. Avilés-Mejia, Ouiddad Labbani-Igbida
2024Cross-Action Cross-Subject Skeleton Action Recognition Via Simultaneous Action-Subject Learning With Two-Step Feature Removal.
Yu Mitsuzumi, Akisato Kimura, Go Irie, Atsushi Nakazawa
2024Cross-Domain Few-Shot In-Context Learning For Enhancing Traffic Sign Recognition.
Yaozong Gan, Guang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024Cross-Fusion of Band-Specific Spectral Features For Multi-Band NIR Colorization.
Gyeong-Eun Youm, Tae-Sung Park, Jong-Ok Kim
2024Cross-Modal Alignment of Local and Global Features for Zero-Shot Chinese Character Recognition.
Hongyi Cai, Anna Zhu
2024Crowdassign: A Label Assignment Scheme for Pedestrian Detection in Crowded Scenes.
Zihao Li, Ning Luo, Xiwen Zhang, Ziliang Guo, Xingqi Fang, Yu Qiao
2024DALSM: A Direction-Aware Line Segment Matching Method.
Zhiyu Liu, Baojiang Zhong
2024DAPlankton: Benchmark Dataset For Multi-Instrument Plankton Recognition Via Fine-Grained Domain Adaptation.
Daniel Batrakhanov, Tuomas Eerola, Kaisa Kraft, Lumi Haraguchi, Lasse Lensu, Sanna Suikkanen, María Teresa Camarena-Gómez, Jukka Seppälä, Heikki Kälviäinen
2024DCCM: Dual Data Consistency Guided Consistency Model for Inverse Problems.
Jiahao Tian, Ziyang Zheng, Xinyu Peng, Yong Li, Wenrui Dai, Hongkai Xiong
2024DTSN: No-Reference Image Quality Assessment via Deformable Transformer and Semantic Network.
Long Tang, Liang Yuan, Guoquan Zheng, Zesheng Wang, Guangtao Zhai
2024Dcctnet: Kidney Tumors Segmentation Based On Dual-Level Combination Of Cnn And Transformer.
Bingzhen Hou, Guimei Zhang, Huiqun Liu, Yipeng Qin, Ying Chen
2024Declouding of Satellite Images for Crop Growth Monitoring Via Unrolling of Gradient Graph Laplacian Regularizer.
Parham Eftekhar, Gene Cheung, Tim Eadie
2024Decompl: Decompositional Learning with Attention Pooling for Group Activity Recognition from a Single Volleyball Image.
Berker Demirel, Huseyin Ozkan
2024Decoupling Domain Invariance and Variance With Tailored Prompts for Open-Set Domain Adaptation.
Shihao Zeng, Xinghong Liu, Yi Zhou
2024Deep Convolutional Neural Network Prediction For Glaucoma Detection Using OCT and OCT-Angiography Disc-and Macula-Centered Images and Their Combined Power.
Gouverneur François, Pourjavan Sayeh, Macq Benoit
2024Deep Fusion of Visible and Near Infrared Images for Registration and Defogging Using Cross Modal Transformer.
Mengyao Ji, Cheolkon Jung
2024Deep Learning Approach for Renal Cell Carcinoma Detection, Subtyping, And Grading.
Maroof Abdul Aziz, Fatemeh Javadian, Sherin Susheel Mathew, Avinash Gopal, Johannes Stegmaier, Sonit Singh, Abin Jose
2024Deep Learning-Based Leaf Image Analysis for Tomato Plant Disease Detection and Classification.
Ammar Chouchane, Abdelmalik Ouamane, El Ouanas Belabbaci, Yassine Himeur, Abbes Amira
2024Deep Multi-Graph Embedded Clustering for Community Detection in FMRI Functional Brain Networks Across Individuals.
Kai-Jun See, Chee-Ming Ting, Fuad Noman, Junn Yong Loo, Yee-Fan Tan, Hernando Ombao, Raphaël C.-W. Phan
2024Deep Optical Flow Learning With Deformable Large-Kernel Cross-Attention.
Xuezhi Xiang, Yiming Chen, Denis Ombati, Lei Zhang, Xiantong Zhen
2024Deep Regularization For Scale-Agnostic Superresolution of MR Images.
K. Pavan Kumar Reddy, Kunal N. Chaudhury
2024Deep Spectral Siamese Network For Heterogeneous Object Verification In Amazon Robotic Warehouse.
Maryam Rahnemoonfar
2024Deep-Learning-Based Magnetic Resonance Simultaneous Multislice Imaging Using Holographic Image Decoding.
Satoshi Ito, Yuki Sato, Naoya Endo, Shohei Ouchi
2024Deepfake Detection Via Separable Self-Consistency Learning.
Lin Lu, Yunhong Wang, Wenqi Zhuo, Liang Zhang, Guangshuai Gao, Yuanfang Guo
2024Deepfake Detection With Combined Unsupervised-Supervised Contrastive Learning.
Junshuai Zheng, Yichao Zhou, Xiyuan Hu, Zhenmin Tang
2024Deepskinformer: Skin Lesion Segmentation Using Hierarchical Transformers And Edge Enhancement.
Ufaq Khan, Umair Nawaz, Mustaqeem Khan, Wail Gueaieb, Abdulmotaleb El Saddik
2024Defending Against Physical Adversarial Patch attacks On Infrared Human Detection.
Lukas Strack, Futa Waseda, Huy H. Nguyen, Yinqiang Zheng, Isao Echizen
2024Delving into the Explainability of Prototype-Based CNN for Biological Cell Analysis.
Martin Blanchard, Olivier Delézay, Christophe Ducottet, Damien Muselet
2024Density-Guided Dense Pseudo Label Selection for Semi-Supervised Oriented Object Detection.
Tong Zhao, Qiang Fang, Shuohao Shi, Xin Xu
2024Detectability of Defects in the Presence of Linear Nuisance Parameters and Images Signal-Dependent Noise.
Rémi Cogranne
2024Detecting Biomedical Copy-Move Forgery by Attention-Based Multiscale Deep Descriptors.
Hao-Chiang Shao, Tse-Yu Tseng, Yuan-Rong Liao, Chi-Chun Chen, Chung-Yang Hung, Ming-hsin Liang
2024Directional And Topological Transformer With Topology Priors For 4D Cellular Image Segmentation.
Zelin Li, Zhaoke Huang, Zhen Zhu, Sicheng You, Zhongying Zhao, Hong Yan
2024Directional Antenna Systems for Long-Range Through-Wall Human Activity Recognition.
Julian Strohmayer, Martin Kampel
2024Disentangled Knowledge Distillation for Unified Multi-Class Anomaly Detection.
Jiyong Jang, Hayeon Lee, Younkwan Lee
2024Distinctive Image Captioning: Leveraging Ground Truth Captions in Clip Guided Reinforcement Learning.
Antoine Chaffin, Ewa Kijak, Vincent Claveau
2024Diversified Task Augmentation with Redundancy Reduction for Cross-Domain Few-Shot Learning.
Ling Yue, Lin Feng, Qiuping Shuai, Lingxiao Xu, Zihao Li
2024Diversifying Deep Ensembles: A Saliency Map Approach for Enhanced OOD Detection, Calibration, and Accuracy.
Stanislav Dereka, Ivan Karpukhin, Maksim Zhdanov, Sergey Kolesnikov
2024Domain Dilation for Single Domain Generalization.
Yuehui Fan, Baoyao Yang, Meng Shen, Fei Lyu
2024Draft - Distilled Recurrent All-Pairs Field Transforms For Optical Flow.
Yanick Christian Tchenko, Hicham Hadj-Abdelkader, Hedi Tabia
2024Driving Through Graphs: a Bipartite Graph for Traffic Scene Analysis.
Aditya Humnabadkar, Arindam Sikdar, Huaizhong Zhang, Tanveer Hussain, Ardhendu Behera
2024Dtpose: Learning Disentangled Token Representation For Effective Human Pose Estimation.
Shiyang Ye, Yuan Fang, Hong Liu, Hu Chen, Wenchao Du, Hongyu Yang
2024Dual Attention Enhanced Transformer for Image Defocus Deblurring.
Yuhang He, Senmao Tian, Jian Zhang, Shunli Zhang
2024Dual Multi-Modal Feature Fusion Network for the Evaluation of Osteosarcoma.
Zequn Song, Lingfeng Wang
2024Dual-Path Coupled Image Deraining Network Via Spatial-Frequency Interaction.
Yuhong He, Aiwen Jiang, Lingfang Jiang, Long Peng, Zhifeng Wang, Lu Wang
2024Dynamic Activation Function Based on the Branching Process and its Application in Image Classification.
Wanting Zhang, Libao Zhang
2024Dynamic MRI Reconstruction Using Low-Rank Plus Sparse Decomposition With Smoothness Regularization.
Chee-Ming Ting, Fuad Noman, Raphaël C.-W. Phan, Hernando Ombao
2024E2GS: Event Enhanced Gaussian Splatting.
Hiroyuki Deguchi, Mana Masuda, Takuya Nakabayashi, Hideo Saito
2024E2SIFT: Neuromorphic SIFT via Direct Feature Pyramid Recovery from Events.
Chris Henry, Paras Maharjan, Zhu Li, George York
2024ECAP: Extensive Cut-and-Paste Augmentation for Unsupervised Domain Adaptive Semantic Segmentation.
Erik Brorsson, Knut Åkesson, Lennart Svensson, Kristofer Bengtsson
2024ET: Explain to Train: Leveraging Explanations to Enhance the Training of A Multimodal Transformer.
Meghna P. Ayyar, Jenny Benois-Pineau, Akka Zemmari
2024Early Prediction Of The Transferability Of Bovine Embryos From Videomicroscopy.
Yasmine Hachani, Patrick Bouthemy, Elisa Fromont, Sylvie Ruffini, Ludivine Laffont, Alline de Paula Reis
2024EarthquakeNet: A High-Resolution UAV-Based Dataset for Earthquake Damage Assessment.
Shenlu Jiang, Yuxin Bian, Yiran Wang, Xufeng Li, Zhankeng Liu, Yi Ren, Yunxuan Zhao
2024Edge-Guided Pixel Level Connected Component Assisted Camouflaged Object Detection.
Qingwang Wang, Xin Qu, Liyao Zhou, Pengcheng Jin, Chengbiao Fu, Tao Shen
2024Edge-Reserved Knowledge Distillation for Image Matting.
Jiasheng Wang, Zhenhua Wang, Jifeng Ning
2024Efficient Black-Box Adversarial Attack on Deep Clustering Models.
Nan Yang, Zihan Li, Zhen Long, Xiaolin Huang, Ce Zhu, Yipeng Liu
2024Efficient Circular and Confocal Non-Line-Of-Sight Imaging With Transient Sinogram Super Resolution.
Dixin Yang, Mariko Isogawa
2024Efficient Learned Wavelet Image and Video Coding.
Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup
2024Efficient Semantic Segmentation For Aerial Imagery Using Query Points and Superpixel Supervision.
Santiago Rivier, Carlos Hinojosa, Silvio Giancola, Bernard Ghanem
2024Efficient Visual Question Answering on Embedded Devices: Cross-Modality Attention With Evolutionary Quantization.
Aakansha Mishra, Aditya Agarwala, Utsav Tiwari, Vikram Nelvoy Rajendiran, Srinivas Soumitri Miriyala
2024Embedding Attention Blocks For Answer Grounding.
Seyedalireza Khoshsirat, Chandra Kambhamettu
2024Empirical Research On Quantization For 3D Multi-Modal Vit Models.
Zicong Hu, Jian Cao, Weichen Xu, Ruilong Ren, Tianhao Fu, Xinxin Xu, Xing Zhang
2024End-to-End Learned Lossy Dynamic Point Cloud Attribute Compression.
Dat Thanh Nguyen, Daniel Zieger, Marc Stamminger, André Kaup
2024End-to-End Learned Scalable Multilayer Feature Compression For Machine Vision Tasks.
Qiaoxi Chen, Changsheng Gao, Dong Liu
2024Energy Reduction Opportunities in HDR Video Encoding.
Christian Herglotz, Steven Le Moan, Alexandre Mercat
2024Enhanced Detection of Small Objects in Aerial Imagery: A High-Resolution Neural Network Approach With Amplified Feature Pyramid and Sigmoid Re-Weighting.
Chanyeong Park, Junbo Jang, Heegwang Kim, Joonki Paik
2024Enhanced Facial Restoration with Misinformation-Filtered Guide-Denoising Diffusion Probabilistic Models.
Wendi Liang, Yihan Wen, Zewei Wang, Jianuo Jiang, Tat-Ming Lok, Guanchong Niu
2024Enhanced Prototypical Part Network (EPPNet) For Explainable Image Classification Via Prototypes.
Bhushan Atote, Victor Sanchez
2024Enhancing Intubation Accuracy: Advanced Tracheal Segmentation Techniques In Video Endoscopy.
Adel Oulefki, Abbes Amira, Fatih Kurugollu, Thaweesak Trongtirakul, Sos S. Agaian, Menen Kassim Mohammed, Mohammad Alshoweky
2024Enhancing Perceptual Quality Assessment for 360-Degree Images Based on Adaptive Patch Labeling and Multi-Label Learning.
Abderrezzaq Sendjasni, Mohamed-Chaker Larabi
2024Enhancing TMIV Performance Through Proximity-Aware Grouping and Preservation of Small Clusters.
Mahshad MahdaviMoghadam, Stéphane Coulombe, Carlos Vázquez, Mohammadreza Jamali, Ahmad Vakili
2024Ensemble of Deep Variational Mixture Models for Unsupervised Clustering.
Xu Tan, Junqi Chen, Jiawei Yang, Sylwan Rahardja, Mou Wang, Susanto Rahardja
2024Estate: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly Detection.
Bingke Zhu, Hao Li, Changlin Chen, Liujie Hua, Jinqiao Wang
2024Estimating Indoor Scene Depth Maps From Ultrasonic Echoes.
Junpei Honma, Akisato Kimura, Go Irie
2024Evaluating 3D Human Pose Estimation in Occluded Multi-Sensor Scenarios: Dataset and Annotation Approach.
Kévin Riou, Kaiwen Dong, Yujie Huang, Kévin Subrin, Patrick Le Callet, Yanjing Sun
2024Event-Specific EEG-FNIRS Feature Fusion FOR Alzheimer's Disease Classification.
Sung-Hyeon Kim, Tae-Min Choi, Sun-Kyung Lee, Minhee Kim, Jae Gwan Kim, Jong-Hwan Kim
2024Explaining 3D Object Detection Through Shapley Value-Based Attribution Map.
Michihiro Kuroki, Toshihiko Yamasaki
2024Explaining Representation Learning With Perceptual Components.
Yavuz Yarici, Kiran Kokilepersaud, Mohit Prabhushankar, Ghassan AlRegib
2024Exploiting Change Blindness to Reduce Bitrate and Display Luminance in Video Streaming.
Steven Le Moan, Mitra Amiri, Christian Herglotz
2024Exploring Attention Mechanisms in Integration of Multi-Modal Information for Sign Language Recognition and Translation.
Zaber Ibn Abdul Hakim, Rasman Mubtasim Swargo, Muhammad Abdullah Adnan
2024Exploring Saliency Bias in Manipulation Detection.
Joshua Krinsky, Alan Bettis, Qiuyu Tang, Daniel Moreira, Aparna Bharati
2024Exploring the Impact of Moire Pattern on Deepfake Detectors.
Razaib Tariq, Shahroz Tariq, Simon S. Woo
2024Exploring the Potential of Recurrence Quantification Analysis for Video Analysis and Motion Detection.
Theodora Kyprianidi, Effrosyni Doutsi, George Tzagkarakis, Panagiotis Tsakalides
2024Exploring the Potential of Synthetic Data to Replace Real Data.
Hyungtae Lee, Yan Zhang, Heesung Kwon, Shuvra S. Bhattacharyya
2024Exposing the Limits of Deepfake Detection using novel Facial mole attack: A Perceptual Black- Box Adversarial Attack Study.
Qurat Ul Ain, Ali Javed, Khalid Mahmood Malik, Aun Irtaza
2024Extended Multiple Cross-Component Linear Models With Adaptive Thresholding and Overlapped Averaging Beyond VVC.
Haruhisa Kato, Yoshitaka Kidani, Kei Kawamura
2024Extending Segment Anything Model into Auditory and Temporal Dimensions for Audio-Visual Segmentation.
Juhyeong Seon, Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon
2024FAWN: Floor-and-Walls Normal Regularization for Direct Neural TSDF Reconstruction.
Anna Sokolova, Anna Vorontsova, Bulat Gabdullin, Alexander Limonov
2024FC3DNET: A Fully Connected Encoder-Decoder for Efficient Demoiréing.
Zhibo Du, Long Peng, Yang Wang, Yang Cao, Zheng-Jun Zha
2024FEDMI: A Federated Learning Framewoek for Secure Sharing of Medical Images.
Zhongyuan Jing, Hongyan Xiang, Ruyan Wang
2024FREQ-MIP-AA: Frequency Mip Representation for Anti-Aliasing Neural Radiance Fields.
Youngin Park, Seungtae Nam, Cheul-Hee Hahm, Eunbyung Park
2024Face Drawing GAN by Channel Attention and Matrix Product Attention.
Hideyuki Ogura, Shinya Ezumi, Masaaki Ikehara
2024Face Morphing Detection in Social Media Content.
Akshay Agarwal, Nalini K. Ratha
2024Factorized Embedding Graph Matching Network For Learning Lawler's Quadratic Assignment Problem.
Yirui Yang, Xubin Lin, Li He, Yisheng Guan, Hong Zhang
2024Fanet: Feature Amplification Network for Semantic Segmentation in Cluttered Background.
Muhammad Ali, Mamoona Javaid, Mubashir Noman, Mustansar Fiaz, Salman Khan
2024Fantom: Federated Adversarial Network for Training Multi-Sequence Magnetic Resonance Imaging in Semantic Segmentation.
Anupam Borthakur, Apoorva Srivastava, Avik Kar, Dipayan Dewan, Debdoot Sheet
2024Fast Coding Mode Prediction for Intra Prediction in VVC SCC.
Dayong Wang, Junyi Yu, Xin Lu, Frédéric Dufaux, Hongwei Guo, Hui Guo, Ce Zhu
2024Fast Constant-Quality Video Encoding Using VVENC With Rate Capping Based On Pre-Analysis Statistics.
Christian R. Helmrich, Valeri George, Vignesh V. Menon, Adam Wieckowski, Benjamin Bross, Detlev Marpe
2024Fast Edge-Aware Occlusion Detection In The Context of Multispectral Camera Arrays.
Frank Sippel, Jürgen Seiler, André Kaup
2024Fast Inter Mode Decision with Resolution Sampling For VVC 360-Degree Video Coding.
Yifan Qiang, Naian Liu
2024Fast Template Matching-Based Reference Picture Padding for Video Coding.
Nicolas Neumann, Priyanka Das, Tim Classen, Mathias Wien
2024Fast Unsupervised Tensor Restoration via Low-Rank Deconvolution.
David Reixach, Josep Ramon Morros
2024Feature Decomposition Transformers for Infrared and Visible Image Fusion.
Gahyeon Kim, An Gia Vien, Duong Hai Nguyen, Chul Lee
2024Feature Enhanced Learning Image Compression With Recurrent Criss-Cross Attention.
Xue Wu, Tong Tang, Zhiyuan Zhu, Hong Zou
2024Features Disentanglement For Explainable Convolutional Neural Networks.
Pasquale Coscia, Angelo Genovese, Fabio Scotti, Vincenzo Piuri
2024FedAWA: Aggregation Weight Adjustment in Federated Domain Generalization.
Yiming Chen, Nan He, Lifeng Sun
2024Fine-Detailed Neural Indoor Scene Reconstruction Using Multi-Level Importance Sampling And Multi-View Consistency.
Xinghui Li, Yuchen Ji, Xiansong Lai, Wanting Zhang, Long Zeng
2024Fine-Tuning Text-To-Image Diffusion Models for Class-Wise Spurious Feature Generation.
AprilPyone MaungMaung, Huy H. Nguyen, Hitoshi Kiya, Isao Echizen
2024Fisheye Stereo Camera Using Fisheye Vertical Stereo Method.
Hikaru Chikugo, Kento Arai, Sarthak Pathak, Kazunori Umeda
2024FlexAE: A Self-Conditioned Detector To Prevent Model Overfitting For Unsupervised Video Anomaly Detection.
Junqi Chen, Xu Tan, Jiawei Yang, Sylwan Rahardja, Susanto Rahardja
2024Food: Facial Authentication And Out-Of-Distribution Detection With Short-Range FMCW Radar.
Sabri Mustafa Kahya, Boran Hamdi Sivrikaya, Muhammet Sami Yavuz, Eckehard G. Steinbach
2024Footbots: A Transformer-Based Architecture for Motion Prediction in Soccer.
Guillem Capellera, Luis Ferraz, Antonio Rubio, Antonio Agudo, Francesc Moreno-Noguer
2024Fourier Ptychography Microscopy With Integrated Positional Misalignment Correction.
Juliana Do Nascimento Damurie Da Silva, Patrick Horain
2024Fourier Ptychography With Information Entropy Based No-Reference Image Quality Assessment Learning.
Qijun Yang, Hujun Yin
2024Frequency-Spatial Domain Information Fusion Network for Pan-Sharpening.
Mengjiao Zhao, Mengting Ma, Ao Gao, Wei Zhang
2024Full-Reference Point Cloud Quality Assessment Using Spectral Graph Wavelets.
Ryosuke Watanabe, Keisuke Nonaka, Eduardo Pavez, Tatsuya Kobayashi, Antonio Ortega
2024Fusion of Independent and Interactive Features for Human-Object Interaction Detection.
Zehai Wu, Lijie Sheng, Songnian Zhang, Qiguang Miao
2024GEEG-YOLOv8: Gaussian Enhanced Euclidean Norm Ghost Attention for Real-Time Polyp Detection.
Phuong Thao Nguyen, Hiroshi Watanabe
2024Gabic: Graph-Based Attention Block for Image Compression.
Gabriele Spadaro, Alberto Presta, Enzo Tartaglione, Jhony H. Giraldo, Marco Grangetto, Attilio Fiandrotti
2024Gabor Feature Network for Transformer-Based Building Change Detection Model in Remote Sensing.
Priscilla Indira Osa, Josiane Zerubia, Zoltan Kato
2024Gaitgs: Temporal Feature Learning in Granularity And Span Dimension for Gait Recognition.
Haijun Xiong, Yunze Deng, Bin Feng, Xinggang Wang, Wenyu Liu
2024Generalized Nested Latent Variable Models For Lossy Coding Applied To Wind Turbine Scenarios.
Raül Pérez-Gonzalo, Andreas Espersen, Antonio Agudo
2024Generate DSLR-Like Image With Global Information and Prior Guided ISP.
Lu Xu, Chao Zhang, Yasi Wang, Qiang Wang
2024Generative Visual Compression: A Review.
Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Ye
2024Gengmm: Generalized Gaussian-Mixture-Based Domain Adaptation Model for Semantic Segmentation.
Nazanin Moradinasab, Hassan Jafarzadeh, Donald E. Brown
2024Giraffe: A Genetic Programming Algorithm To Build Deep Learning Ensembles For Ecg Arrhythmia Classification.
Damian Kucharski, Agata M. Wijata, Lu Fu, Weidong Lin, Yumei Xue, Jacek Kawa, Yalin Zheng, Gregory Yoke Hong Lip, Jakub Nalepa
2024Gradtrans: Transformer-Based Gradient Guidance for Image Generation.
Yiwei Chen, Jiaqian Yu, Siyang Pan, Sangil Jung, Wu Bi, Seung-In Park, Qiang Wang, ByungIn Yoo
2024Graph Convolutional Networks With Minimal Appearance Information For Action Recognition.
Hiroaki Tani
2024Graphic - Graph-Based Representation for Analyzing People's High-Level Interactions in Crowds.
Francesco Longobardi, Daniel Riccio
2024Guided Context Gating: Learning To Leverage Salient Lesions in Retinal Fundus Images.
Teja Krishna Cherukuri, Nagur Shareef Shaik, Dong Hye Ye
2024Gumbel-NeRF: Representing Unseen Objects as Part-Compositional Neural Radiance Fields.
Yusuke Sekikawa, Chingwei Hsu, Satoshi Ikehata, Rei Kawakami, Ikuro Sato
2024Hand-Object Reconstruction Via Interaction-Aware Graph Attention Mechanism.
Taeyun Woo, Tae-Kyun Kim, Jinah Park
2024Hdplifter: Hierarchical Dynamics Perception For 2D-to-3D Human Pose Lifting.
Ye Lu, Jianjun Gao, Chen Cai, Ruoyu Wang, Duc Tri Phan, Kim-Hui Yap
2024Hierarchical Vertex-Wise Intensification Graph Convolution for Skeleton-Based Activity Recognition.
Yun Li, Hao Xie, Jun Xiao, Cong Zhang, Tianshan Liu, Kin-Man Lam
2024Highly Constrained Coded Aperture Imaging Systems Design Via a Knowledge Distillation Approach.
Leon Suarez-Rodriguez, Roman Jacome, Henry Arguello
2024Histohdr-Net: Histogram Equalization for Single LDR to HDR Image Translation.
Hrishav Bakul Barua, Ganesh Krishnasamy, KokSheik Wong, Abhinav Dhall, Kalin Stefanov
2024HoloGesture: A Multimodal Dataset For Hand Gesture Recognition Robust To Hand Textures On Head-Mounted Mixed-Reality Devices.
Jeongwoo Park, Je Hyeong Hong
2024How to Train Your VAE.
Mariano Rivera
2024Hybrid Single Input and Multiple Output Method For Compressing Features Towards Machine Vision Tasks.
Zifu Zhang, Shengxi Li, Tie Liu, Mai Xu, Tao Xu, Zhenyu Guan, Zhuoyi Lv
2024Hyperspectral Image Classification With Fuzzy Spatial-Spectral Class Discriminate Information.
Muhammad Ahmad, Muhammad Usama, Salvatore Distefano, Manuel Mazzara
2024IEEE International Conference on Image Processing, ICIP 2024, Abu Dhabi, United Arab Emirates, october 27-30, 2024
2024IMU-Assisted Target-Free Extrinsic Calibration of Heterogeneous Lidars Based on Continuous-Time Optimization.
Zehao Yan, Lin Zhang, Zhong Wang, Shenjie Zhao
2024IN-Loop Filter for Object Mask Coding in Versatile Video Coding.
Sebastian Schwarz, Miska M. Hannuksela, Döne Bugdayci Sansli
2024Illumination-Enhanced Infrared and Low-Light Visible Image Fusion.
Guohua Lv, Xinyue Fu, Chaoqun Sima, Yanlong Xu, Baodong Zhang, Hanju Bao
2024Image Coding For Machine Via Analytics-Driven Appearance Redundancy Reduction.
Xuelin Shen, Haoqiao Ou, Wenhan Yang
2024Image Coding For Machines With Edge Information Learning Using Segment Anything.
Takahiro Shindo, Kein Yamada, Taiju Watanabe, Hiroshi Watanabe
2024Imbalanced Data Robust Online Continual Learning Based on Evolving Class Aware Memory Selection and Built-In Contrastive Representation Learning.
Rui Yang, Emmanuel Dellandréa, Matthieu Grard, Liming Chen
2024Improvement of Image Reconstruction for MRI Using Phase-Scrambling Fourier Transform and Dual-Domain Strategy.
Kazuki Yamato, Satoshi Ito
2024Improving Automatic Target Recognition With Infrared Imagery Using Vision Transformers and Focused Data Augmentation.
Nada Baili, Hichem Frigui
2024Improving Image Coding for Machines Through Optimizing Encoder Via Auxiliary Loss.
Kei Iino, Shunsuke Akamatsu, Hiroshi Watanabe, Shohei Enomoto, Akira Sakamoto, Takeharu Eda
2024Improving Image De-Raining Using Reference-Guided Transformers.
Zihao Ye, Jaehoon Cho, Changjae Oh
2024Improving Real-Time Near-Infrared Face Alignment With a Paired VIS-NIR Dataset and Data Augmentation Through Image-to-Image Translation.
Langning Miao, Ryo Kakimoto, Kaoru Ohishi, Yoshihiro Watanabe
2024Improving Self-Supervised Vision Transformers for Visual Control.
Wonil Song, Kwanghoon Sohn, Dongbo Min
2024Increasing Trust in Image Analysis by Detecting Trellis Quantization in JPEG Images.
Nora Hofer
2024Instance-Aware Uncertainty for Active Learning in Object Detection.
Zhipeng Zhang, Wenting Ma, Xiaohang Yuan, Yuan Hao, Meng Guo, Hongyi Tang, Zhiheng Zhou, Zhenjie Yao
2024Integrating Vision-Language Supervision for Uniform Appearance Tracking.
Mohamad Alansari, Ahmed Abughali, Obadah Habash, Khaled AlNuaimi, Sajid Javed, Naoufel Werghi
2024Intelligent Multi-View Test Time Augmentation.
Efe Ozturk, Mohit Prabhushankar, Ghassan AlRegib
2024Interactive Teaching For Fine-Granular Few-Shot Object Recognition Using Vision Transformers.
Philip Keller, Daniel Jost, Arne Roennau, Rüdiger Dillmann
2024Interpreting the Fraudulence Level of Different Finger Photo Presentation Attack Instruments.
Anudeep Vurity, Emanuela Marasco, Raghavendra Ramachandra, Duoduo Liao
2024Intrinsic Image Decomposition Based on Quantized Prior Codebook.
Fangzheng Yuan, Xiaoyue Jiang, Xiaoyi Feng, Moncef Gabbouj
2024Investigating Self-Supervised Methods for Label-Efficient Learning.
Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais
2024Investigating and Reducing the Impairment of Point Spread Effect For Spatiotemporal Fusion Of Remote Sensing Imagery.
Yunfei Li, Jun Li
2024JOINTRF: End-To-End Joint Optimization for Dynamic Neural Radiance Field Representation and Compression.
Zihan Zheng, Houqiang Zhong, Qiang Hu, Xiaoyun Zhang, Li Song, Ya Zhang, Yanfeng Wang
2024JPEG Image Ciphering Based on Chaotic Encryption.
Meha Hachani, Azza Ouled Zaid
2024Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-Onns.
Yuxin Xie, Li Yu, Farhad Pakdaman, Moncef Gabbouj
2024Joint Image Restoration For Domain Adaptive Object Detection In Foggy Weather Condition.
Jing Ma, Meng Lin, Gang Zhou, Zhenhong Jia
2024Knowledge-Infused Learning for Fine-Grained Plant Disease Recognition.
Jamil Ahmad, Wail Gueaieb, Abdulmotaleb El Saddik, Giulia De Masi, Fakhri Karray
2024Koopcon: A new approach towards smarter and less complex learning.
Vahid Jebraeeli, Bo Jiang, Derya Cansever, Hamid Krim
2024LFGN: Low-Level Feature-Guided Network For Adversarial Defense.
Chih-Chung Hsu, Ming-Hsuan Wu, En-Chao Liu
2024LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network For Multifeatures Segmentation.
Tariq M. Khan, Shahzaib Iqbal, Syed Saud Naqvi, Imran Razzak, Erik Meijering
2024LSDM-PCB: A Lightweight Small Defect Detection Model for Printed Circuit Board.
Qi Zeng, Chongren Zhao, Pengfei He, Hongchao Gao
2024LWIRPOSE: A Novel Long Wave Infrared Thermal Image Pose Dataset and Benchmark.
Avinash Upadhyay, Bhipanshu Dhupar, Manoj Sharma, Ankit Shukla, Ajith Abraham
2024Land Use Classification Via Multi-Modal Complementary Feature Fusion and Context Information Enhancement For Optical and Sar Images.
Xinyue Fan, Libao Zhang
2024Latent Enhancing Autoencoder for Occluded Image Classification.
Ketan Kotwal, Tanay Deshmukh, Preeti Gopal
2024Learn By An Example Transformer For Domain Generalization In Video Object Segmentation.
Islam I. Osman, Mohamed S. Shehata
2024Learned Compression of Encoding Distributions.
Mateen Ulhaq, Ivan V. Bajic
2024Learned Image Compression Using A Long and Short Attention Module.
Zenghui Duan, Cheolkon Jung, Yang Liu, Ming Li
2024Learned Image Compression With Text Quality Enhancement.
Chih-Yu Lai, Dung N. Tran, Kazuhito Koishida
2024Learned Image Compression for Both Humans and Machines via Dynamic Adaptation.
Lingyu Zhu, Binzhe Li, Riyu Lu, Peilin Chen, Qi Mao, Zhao Wang, Wenhan Yang, Shiqi Wang
2024Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression.
Tam Thuc Do, Philip A. Chou, Gene Cheung
2024Learning A Rain-Invariant Network For Instance Segmentation In The Rain.
Zhiwen Chen, Wei Wu, Zhengfeng Chen
2024Learning Orthonormal Features in Self-Supervised Learning using Functional Maximal Correlation.
Bo Hu, Yuheng Bu, José C. Príncipe
2024Learning Temporal Cues for Fine-Grained Action Recognition.
Zhihao Liu, Yi Zhang, Wenhui Huang, Yan Liu, Mengyang Pu, Chao Deng, Junlan Feng
2024Learning With Instance-Dependent Noisy Labels By Anchor Hallucination And Hard Sample Label Correction.
Po-Hsuan Huang, Chia-Ching Lin, Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen
2024Learning-Based Point Cloud Decoding with Independent and Scalable Reduced Complexity.
Mohammadreza Ghafari, André F. R. Guarda, Nuno M. M. Rodrigues, Fernando Pereira
2024Learning-Based Video Compression with Continuously Variable Bitrate Coding.
Mingyi Yang, Xionghui Mao, Yujie Yin, Zhiwei Zhu, Defa Wang, Shuai Wan, Fuzheng Yang
2024Legit: Text Legibility For User-Generated Media.
Maniratnam Mandal, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
2024Lensless Phase Retrieval With Regularization By Blind Noise Map Estimation and Denoising.
Igor Shevkunov, Mykola Ponomarenko, Jere Heimo, Karen O. Egiazarian
2024Lercpose: Learned Ranking and Contrastive Loss for Robust Head Pose Estimation.
Aratrik Chattopadhyay, Harshita Soni, Shuaib Ahmed
2024Leveraging Generated Image Captions for Visual Commonsense Reasoning.
Subham Das, C. Chandra Sekhar
2024LiSD: An Efficient Multi-Task Learning Framework For Lidar Segmentation and Detection.
Jiahua Xu, Si Zuo, Chenfeng Wei, Wei Zhou
2024Licaf: Lidar-Camera Asymmetric Fusion For Gait Recognition.
Yunze Deng, Haijun Xiong, Bin Feng
2024Lidar Depth Map Guided Image Compression Model.
Alessandro Gnutti, Stefano Della Fiore, Mattia Savardi, Yi-Hsin Chen, Riccardo Leonardi, Wen-Hsiao Peng
2024Light-Weight Self-Supervised Contrastive Learning Network For Small Sample Hyperspectral Image Classification.
Gan Yang, Zhaohui Wang
2024Lightweight Recurrent Neural Network for Image Super-Resolution.
Mir Sazzat Hossain, AKM Mahbubur Rahman, Md. Ashraful Amin, Amin Ahsan Ali
2024Lightweight Underwater Image Enhancement via Impulse Response of Low-Pass Filter Based Attention Network.
May Thet Tun, Yosuke Sugiura, Tetsuya Shimamura
2024Lipface: Lipschitz-Conditioned For Resolution Robust Face Recognition.
Yu Wei Chen, Huu-Phu Do, Chia-Wei Kuo, Hsuan-Tung Liu, Ching-Chun Huang
2024Localization of Image Splicing Under Segment Anything Model With Integrated Compression and Edge Artifacts.
Ruhao Zhao, Xian Zhong, Liang Liao, Wenxuan Liu, Wenxin Huang, Zheng Wang
2024Localizing Moments of Actions in Untrimmed Videos of Infants with Autism Spectrum Disorder.
Halil Ismail Helvaci, Chen-Nee Chuah, Sally Ozonoff, Sen-ching Samson Cheung
2024Long-Term Geo-Positioned Re-Identification Dataset of Urban Elements.
Paula Moral, Álvaro García-Martín, José M. Martínez
2024Low-Rank Matrix and Tensor Decomposition Using Randomized Two-Sided Subspace Iteration With Application to Video Reconstruction.
Maboud F. Kaloorazi, Salman Ahmadi-Asl, Susanto Rahardja
2024Lrdif: Diffusion Models For Under-Display Camera Emotion Recognition.
Zhifeng Wang, Kaihao Zhang, Ramesh S. Sankaranarayana
2024Luminate: Linguistic Understanding and Multi-Granularity Interaction for Video Object Segmentation.
Rahul Tekchandani, Ritik Maheshwari, Praful Hambarde, Satya Narayan Tazi, Santosh Kumar Vipparthi, Subrahmanyam Murala
2024M3T: Multi-Modal Medical Transformer To Bridge Clinical Context With Visual Insights For Retinal Image Medical Description Generation.
Nagur Shareef Shaik, Teja Krishna Cherukuri, Dong Hye Ye
2024MAVAD: Audio-Visual Dataset and Method for Anomaly Detection in Traffic Videos.
Blazej Leporowski, Arian Bakhtiarnia, Nicole Bonnici, Adrian Muscat, Luca Zanella, Yiming Wang, Alexandros Iosifidis
2024MCT-Net: a Lightweight Multiscale Convolutional Transformer Network for Polyp Segmentation.
Niladri Chakraborti, Deepak Ranjan Nayak
2024MFLFC: Multi-Frame Fusion Based Low-Resolution Feature Compression For Object Tracking.
Yi Peng, Zixiang Zhang, Li Yu
2024MGRQ: Post-Training Quantization For Vision Transformer With Mixed Granularity Reconstruction.
Lianwei Yang, Zhikai Li, Junrui Xiao, Haisong Gong, Qingyi Gu
2024MMAQ: A Multi-Modal Self-Supervised Approach For Estimating Air Quality From Remote Sensing Data.
Georgios-Fotios Angelis, Alexandros Emvoliadis, Anastasios Drosou, Dimitrios Tzovaras
2024MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLO.
Shubhabrata Mukherjee, Cory C. Beard, Zhu Li
2024MSD-CRFS: Multi-Scale Dual Aggregation Conditional Random Fields for Monocular Depth Estimation.
Xidan Zhang, Jianing Wei, Atsunori Moteki, Yoshie Kobayashi, Genta Suzuki, Zhiming Tan
2024MSGAT: Multi-Stage Graph Attention Network For Human Motion Prediction.
Ziyang Zheng, Ziliang Ren, Zhanhao Liang, Gulin Wang, Qieshi Zhang
2024MSSPG-AL: Few-Shot Hyperspectral Image Classification with Active Learning Updated Multi-Scale Superpixel Graph Fusion.
Long Yu, Jun Li, Li Zhuo
2024MTA-PS: Towards Practical Person Search in Videos.
Tiancheng Ying, Rong Quan, Peng Zheng, Yichao Yan, Jie Qin
2024MVAFormer: RGB-Based Multi-View Spatio-Temporal Action Recognition with Transformer.
Taiga Yamane, Satoshi Suzuki, Ryo Masumura, Shotaro Tora
2024MVCrackViT: Robust Multi-View Crack Detection For Point Cloud Segmentation Using View Attention.
Christian Benz, Volker Rodehorst
2024MWIRSTD: A MWIR Small Target Detection Dataset.
Nikhil Kumar, Avinash Upadhyay, Shreya Sharma, Manoj Sharma, Pravendra Singh
2024Mamba-PCGC: Mamba-Based Point Cloud Geometry Compression.
Monyneath Yim, Jui-Chiu Chiang
2024Mask-Based Invisible Backdoor Attacks on Object Detection.
Jeongjin Shin
2024Masked Momentum Contrastive Learning for Semantic Understanding by Observation.
Jiantao Wu, Shentong Mo, Sara Atito, Zhenhua Feng, Josef Kittler, Syed Sameed Husain, Muhammad Awais
2024Masked Signal Modeling for Plastic Waste Resin Classification.
S. Ebrahimkhani, J. Zheng, A. C. Y. Ngo, Ngai-Man Cheung
2024Mdbfusion: A Visible And Infrared Image Fusion Framework Capable For Motion Deblurring.
Jun Chen, Wei Yu, Xin Tian, Jun Huang, Jiayi Ma
2024Medea: Multi-View Efficient Depth Adjustment.
Mikhail Artemyev, Anna Vorontsova, Anna Sokolova, Alexander Limonov
2024Medical Knowledge-Guided Semi-Supervised Bi-Ventricular Segmentation.
Behnam Rahmati, Shahram Shirani, Zahra Keshavarz-Motamed
2024Memsvd: Long-Range Temporal Structure Capturing Using Incremental SVD.
Ioanna Ntinou, Enrique Sanchez, Georgios Tzimiropoulos
2024Meta-DM: Applications of Diffusion Models on Few-Shot Learning.
Wentao Hu, Jiarun Liu, Jiawei Wang, Hui Tian
2024Metaheuristic Camera Calibration for Optical Tomographic Imaging in Industrial Environments.
Andreas Unterberger, Cheau Tyan Foo, Zachary Adrian Emuang, Fabio J. W. A. Martins, Khadijeh Mohri
2024Micro-Expression Recognition Based On 3DCNN Combined With GRU and New Attention Mechanism.
Chun-Ting Fang, Tsung-Jung Liu, Kuan-Hsien Liu
2024Minimization of Submesh Boundary Errors In Dynamic Mesh Coding.
Koki Kishimoto, Kei Kawamura, Haruhisa Kato
2024Mix-Domain Contrastive Learning For Unpaired H&E-to-IHC Stain Translation.
Song Wang, Zhong Zhang, Huan Yan, Ming Xu, Guanghui Wang
2024Motion-Adaptive Inference for Flexible Learned B-Frame Compression.
Mustafa Akin Yilmaz, O. Ugur Ulas, Ahmet Bilican, A. Murat Tekalp
2024Motion-Lie Transformer: Geometric Attention For 3D Human Pose Motion Prediction.
Mayssa Zaier, Hazem Wannous, Hassen Drira, Jacques Boonaert
2024Multi-Attribute Vision Transformers are Efficient and Robust Learners.
Hanan Gani, Nada Saadi, Noor Hussein, Karthik Nandakumar
2024Multi-Modal Medical Image Fusion for Non-Small Cell Lung Cancer Classification.
Salma Hassan, Hamad Al Hammadi, Ibrahim Mohammed, Muhammad Haris Khan
2024Multi-Path Interference Mitigation For Indirect Time-of-Flight Camera By the Distortion of Coding Curve.
Wenbin Luo, Takafumi Iwaguchi, Ryusuke Sagawa, Hiroshi Kawasaki
2024Multi-Reference Flow-Guided Cross-Domain Reconstruction For General Object 6D Pose Estimation.
Jaewoo Park, Jaeguk Kim, Nam Ik Cho
2024Multi-Task Affinity Propagation Based Natural Image Matting.
Renkai Zhang, Nong Sang
2024Multi-View Multi-Focus Image Fusion: A Novel Benchmark Dataset and Method.
Zhilong Li, Kejun Wu, Junhao Liu, Qiong Liu, You Yang
2024Multi-View Network for Colorectal Polyps Detection in CT Colonography.
Mohamed Yousuf, Samir Harb, Islam Alkabbany, Asem M. Ali, Salwa Elshazley, Aly A. Farag
2024Multiclassification Of Vocal Folds Disorders From Videos By Spatio-Temporal Deep Features.
Dhouha Attia, Amel Benazza-Benyahia
2024Multimodal Transformer Using Cross-Channel Attention For Object Detection In Remote Sensing Images.
Bissmella Bahaduri, Zuheng Ming, Fangchen Feng, Anissa Mokraoui
2024Multimodal-Enhanced Objectness Learner For Corner Case Detection In Autonomous Driving.
Lixing Xiao, Ruixiao Shi, Xiaoyang Tang, Yi Zhou
2024NN-Based In-Loop Filtering With Inputs Transformed.
Du Liu, Jacob Ström, Mitra Damghanian, Per Wennersten
2024Navigating Limitations With Precision: A Fine-Grained Ensemble Approach To Wrist Pathology Recognition On A Limited X-Ray Dataset.
Ammar Ahmed, Ali Shariq Imran, Mohib Ullah, Zenun Kastrati, Sher Muhammad Daudpota
2024Neural Mesh Fusion: Unsupervised 3D Planar Surface Understanding.
Farhad G. Zanjani, Hong Cai, Yinhao Zhu, Leyla Mirvakhabova, Fatih Porikli
2024Neural Radiance Field-Assisted Static-Scene Video Coding.
Runyu Yang, Dong Liu, Feng Wu, Wen Gao
2024Non-Separablewavelet Transform Using Learnable Convolutional Lifting Steps.
Joao O. Parracho, Eduardo A. B. da Silva, Lucas A. Thomaz, Luis M. N. Tavora, Sérgio M. M. Faria
2024Norm-Integrated Softmax Loss For Deep Face Recognition.
Jun Chen, Yiwei Wang, Haiyan Zhang
2024Novel Meta Attention Guided Framework for Breast Abnormality Classification With Combination of FSL and DA.
Anindita Mohanta, Sourav Dey Roy, Niharika Nath, Mrinal Kanti Bhowmik
2024Nyctale: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction.
Sadaf Khademi, Anastasia Oikonomou, Konstantinos N. Plataniotis, Arash Mohammadi
2024ODVISTA: An Omnidirectional Video Dataset for Super-Resolution and Quality Enhancement Tasks.
Ahmed Telili, Ibrahim Farhat, Wassim Hamidouche, Hadi Amirpour
2024ON Annotation-Free Optimization of Video Coding for Machines.
Marc Windsheimer, Fabian Brand, André Kaup
2024Object Detection Framework Using Multiple Tone Mappings on High-Dynamic-Range Images.
Takumi Watanabe, Rei Kawakami, Masayuki Tanaka, Masatoshi Okutomi
2024Object-Aware Adaptive Image Retargeting Via Importance Map Fusion.
Ziyad Alswaidan, M. Hashem Shullar, Khalil Chikhaoui, Motaz Alfarraj
2024Omra: Online Motion Resolution Adaptation To Remedy Domain Shift in Learned Hierarchical B-Frame Coding.
Zong-Lin Gao, Sang NguyenQuang, Wen-Hsiao Peng, Xiem HoangVan
2024On Efficient Neural Network Architectures for Image Compression.
Yichi Zhang, Zhihao Duan, Fengqing Zhu
2024On The Detection Of Images Generated From Text.
Yuqing Yang, Charuka Moremada, Nikos Deligiannis
2024On the Cloud Detection from Backscattered Images Generated from a Lidar-Based Ceilometer: Current State and Opportunities.
Alessio Barbaro Chisari, Alessandro Ortis, Luca Guarnera, Wladimiro Carlo Patatu, Rosaria Ausilia Giandolfo, Emanuele Spampinato, Sebastiano Battiato, Mario Valerio Giuffrida
2024On the Exploitation of DCT-Traces in the Generative-AI Domain.
Orazio Pontorno, Luca Guarnera, Sebastiano Battiato
2024One-Hot Logistic Regression for Radiomics-Based Classification.
Baptiste Schall, Rodolphe Anty, Lionel Fillatre
2024One-Shot Multi-Rate Pruning Of Graph Convolutional Networks For Skeleton-Based Recognition.
Hichem Sahbi
2024Online Anchor-Based Training For Image Classification Tasks.
Maria Tzelepi, Vasileios Mezaris
2024Open World Object Detection Via Cooperative Foundation Models for Driving Scenes.
Sheng Luo, Yi Zhou
2024Open-Vocabulary Panoptic Segmentation Using Bert Pre-Training of Vision-Language Multiway Transformer Model.
Yi-Chia Chen, Wei-Hua Li, Chu-Song Chen
2024OpenAnimalTracks: A Dataset for Animal Track Recognition.
Risa Shinoda, Kaede Shiohara
2024Optimized Decoupled Structure with Non-Local Attention for Deep Image Compression.
Xuanye Zhang, Zhaobin Zhang, Yaojun Wu, Semih Esenlik, Xiaoyan Sun, Kai Zhang, Li Zhang
2024Optimizing Learned Image Compression On Scalar and Entropy-Constraint Quantization.
Florian Borzechowski, Michael Schäfer, Heiko Schwarz, Jonathan Pfaff, Detlev Marpe, Thomas Wiegand
2024PCA-UNET for Object Segmentation.
Cheng Long, Sayantika Nag, Adrian Barbu
2024PUAD: Frustratingly Simple Method for Robust Anomaly Detection.
Shota Sugawara, Ryuji Imamura
2024PVDN-Urban - A Dataset for Provident Vehicle Detection at Night in Urban Scenarios.
Lukas Ewecker, Florian Schiffel, Robin Schwager, Tim Brühl, Tin Stribor Sohn, Thomas Villmann
2024PWISeg: Weakly-Supervised Surgical Instrument Instance Segmentation.
Zhen Sun, Huan Xu, Jinlin Wu, Zhen Chen, Hongbin Liu, Zhen Lei
2024Paon: A New Neuron Model Using Padé Approximants.
Onur Keles, A. Murat Tekalp
2024Parallel Task-Prompts ICM: A Versatile Feature Codec for Machine Vision.
Tianma Shen, Ying Liu
2024Partial Inter-Frame Coding for Dynamic Meshes.
Xudong Jin, Jianfeng Xu, Kei Kawamura
2024Perceptual Learned Image Compression via End-to-End JND-Based Optimization.
Farhad Pakdaman, Sanaz Nami, Moncef Gabbouj
2024Personatalk: Preserving Personalized Dynamic Speech Style In Talking Face Generation.
Qianxi Lu, Yi He, Shilin Wang
2024Physiological Modeling With Multispectral Imaging for Heart Rate Estimation.
Kosuke Kurihara, Yoshihiro Maeda, Daisuke Sugimura, Takayuki Hamamoto
2024Picture Partitioning Design of Neural Network-Based Intra Coding For Video Coding For Machines.
Keiichi Chono, Naoya Niwa, Hiroe Iwasaki
2024Pilot-Free Semantic Communication Over Multi-User Mimo Fading Channels.
Weixuan Chen, Qianqian Yang, Zhaohui Yang, Yiping Duan, Zhaoyang Zhang
2024Pixel-Wise Color Constancy Via Smoothness Techniques In Multi-Illuminant Scenes.
Umut Cem Entok, Firas Laakom, Farhad Pakdaman, Moncef Gabbouj
2024Point Cloud Geometry Scalable Coding with a Quality-Conditioned Latents Probability Estimator.
Daniele Mari, André F. R. Guarda, Nuno M. M. Rodrigues, Simone Milani, Fernando Pereira
2024Pose-Invariant Learning for Efficient Person Identification from Hyperspectral Hand Images.
Keigo Kunikata, Amane Kashino, Yota Yamamoto, Yukinobu Taniguchi, Yoko Sogabe, Ayumi Matsumoto, Masaki Kitahara, Go Irie
2024Power-Llava: Large Language and Vision Assistant for Power Transmission Line Inspection.
Jiahao Wang, Mingxuan Li, Haichen Luo, Jinguo Zhu, Aijun Yang, Mingzhe Rong, Xiaohua Wang
2024Priorformer: A UGC-VQA Method With Content and Distortion Priors.
Yajing Pei, Shiyu Huang, Yiting Lu, Xin Li, Zhibo Chen
2024Privacy-Preserving Visual Cues Communication for Hearing-Impaired People Using Deep Learning.
Fatima Zaidi, Hira Hameed, Muhammad Farooq, Aisha Fatima, Kamran Arshad, Khaled Assaleh, Qammer H. Abbasi
2024Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression.
Shiyu Qin, Yi-Min Zhou, Jin-Peng Wang, Bin Chen, Baoyi An, Tao Dai, Shu-Tao Xia
2024Project, Skate, and Refresh: Improved Schrödinger Bridge Sampler for Image Restoration.
Ziqiang Shi, Rujie Liu
2024Prompt Performance Prediction For Image Generation.
Nicolas Bizzozzero, Ihab Bendidi, Olivier Risser-Maroix
2024Prune Channel And Distill: Discriminative Knowledge Distillation For Semantic Segmentation.
Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko
2024Pyramid Coder: Hierarchical Code Generator for Compositional Visual Question Answering.
Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda
2024Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse Annotations.
Yufan Liu, Ziyang Wang, Tianxiang Chen, Zi Ye
2024Quality of Experience of Viewport Adaptive Omnidirectional Video Streaming.
Xuelin Liu, Haoyun Zhang, Jiebin Yan, Hao Zhang, Yuming Fang, Shiqi Wang
2024Quantization After Inter Prediction in Displacement Coding of Dynamic Meshes.
Hitoshi Nishimura, Haruhisa Kato, Kei Kawamura
2024RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds.
Remco Royen, Kostas Pataridis, Ward van der Tempel, Adrian Munteanu
2024RFG-HDR: Representative Feature-Guided Transformer For Multi-Exposure High Dynamic Range Imaging.
Keuntek Lee, Jaehyun Park, Gu Yong Park, Nam Ik Cho
2024RFNET: Refined Fusion Three-Branch RGB-D Salient Object Detection Network.
Kexuan Wang, Chenhua Liu, Huiguang Wei, Li Jing, Rongfu Zhang
2024ROI-DVC: A Region-of-Interest Based Deep Video Coding Framework.
Xiaojie Wu, Ping Wang, Xinhong Wang
2024Rafmnet: Reinforced Attention Fusion and Multiscale Network For Noisy Infrared and Visible Image Fusion.
Guohua Lv, Xiyan Wang, Yongbiao Gao, Yi Zhai, Guixin Zhao, Guangxiao Ma
2024Rage for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications.
Christian D. Rask, Daniel E. Lucani
2024Rate-Complexity Optimization in Lossless Neural-Based Image Compression.
Lucas S. Lopes, Ricardo L. de Queiroz, Philip A. Chou
2024Rate-Quality or Energy-Quality Pareto Fronts for Adaptive Video Streaming?
Angeliki Katsenou, Xinyi Wang, Daniel Schien, David Bull
2024Rdssd: 3D Single Stage Object Detector For Roadside Lidar Sensors.
Conghao Lv, Ping Jiang, Meng Wang, Lixin Lin, Xuechen Chen, Xiaoheng Deng
2024Reading is Believing: Revisiting Language Bottleneck Models for Image Classification.
Honori Udo, Takafumi Koshinaka
2024Real-Time Monocular Depth Estimation on Embedded Systems.
Cheng Feng, Congxuan Zhang, Zhen Chen, Weiming Hu, Liyue Ge
2024Real-Time Semantic Video Communication of General Scenes.
Cem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard G. Steinbach
2024Real-Time Video Prediction With Fast Video Interpolation Model and Prediction Training.
Shota Hirose, Kazuki Kotoyori, Kasidis Arunruangsirilert, Fangzheng Lin, Heming Sun, Jiro Katto
2024Real-Time and Resource-Efficient Multi-Scale Adaptive Robotics Vision for Underwater Object Detection and Domain Generalization.
Lyes Saad Saoud, Zhenwei Niu, Lakmal D. Seneviratne, Irfan Hussain
2024Real-World Atmospheric Turbulence Correction Via Domain Adaptation.
Xijun Wang, Santiago López-Tapia, Aggelos K. Katsaggelos
2024Reconstruct Dynamic Scene for Spike Camera Based on 3D Space Time Similarity.
Yuanlin Wang, Ruiqin Xiong, Jing Zhao, Tiejun Huang
2024Recurrent 3-D Multi-Level Visual Transformer For Joint Classification of Heterogeneous 2-d AND 3-D Radiographic Data.
Muhammad Owais, Muhammad Zubair, Taimur Hassan, Divya Velayudhan, Irfan Hussain, Naoufel Werghi
2024Redefining Cystoscopy With AI: Bladder Cancer Diagnosis Using an Efficient Hybrid CNN-Transformer Model.
Meryem Amaouche, Ouassim Karrakchou, Mounir Ghogho, Anouar El Ghazzaly, Mohamed Alami, Ahmed Ameur
2024Redefining Visual Quality: The Impact of Loss Functions on INR-Based Image Compression.
Lorenzo Catania, Dario Allegra
2024Reducing Motion Artifacts in Brain MRI Using Vision Transformers and Self-Supervised Learning.
Lei Zhang, Xiaoke Wang, Edward H. Herskovits, Elias R. Melhem, Linda Chang, Ze Wang, Thomas Ernst
2024Referring Image Segmentation with Two-Stage Multi-Modal Interaction.
Zhenhua Wang, Linwei Ye
2024Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification.
Muhammad Uzair Zahid, Aysen Degerli, Fahad Sohrab, Serkan Kiranyaz, Tahir Hamid, Rashid Mazhar, Moncef Gabbouj
2024Reinforcement Learning-Based Secure Video Transmission For IOV Systems.
Lixin Liu, Zhibo Liu, Xiaozhen Lu, Yanling Bu, Bin Han, Liang Xiao
2024Reinforcing Pre-Trained Models Using Counterfactual Images.
Xiang Li, Ren Togo, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama
2024Remote Sensing Image Uneven Haze Removal Based On Haze Density Estimation and Saliency-Driven Dual Channel Fusion.
Yanmeng Liu, Libao Zhang
2024Removing Reflective Flare in Real-World Conditions.
Fengbo Lan, Chang Wen Chen
2024Res-NeRV: Residual Blocks For A Practical Implicit Neural Video Decoder.
Marwa Tarchouli, Thomas Guionnet, Marc Rivière, Wassim Hamidouche, Meriem Outtas, Olivier Déforges
2024ResNeRF-PCAC: Super Resolving Residual Learning NeRF for High Efficiency Point Cloud Attributes Coding.
Sajid Umair, Birendra Kathariya, Zhu Li, Anique Akhtar, Geert Van der Auwera
2024Reset: A Residual Set-Transformer Approach to Tackle the Ugly-Duckling Sign in Melanoma Detection.
Jules Collenne, Rabah Iguernaissi, Séverine Dubuisson, Djamal Merad
2024Rethinking Domain Adaptation and Generalization in the ERA Of Clip.
Ruoyu Feng, Tao Yu, Xin Jin, Xiaoyuan Yu, Lei Xiao, Zhibo Chen
2024Rethinking Temporal Self-Similarity For Repetitive Action Counting.
Yanan Luo, Jinhui Yi, Yazan Abu Farha, Moritz Wolter, Juergen Gall
2024Robust 3D Semantic Segmentation With Incomplete Point Clouds Based on Sequential Frame Sampling.
Masahiro Yamaguchi, Kyota Higa, Toshinori Hosoi, Takashi Shibata
2024Robust Representation Learning With Self-Distillation For Domain Generalization.
Ankur Singh, Senthilnath Jayavelu
2024Robust Skin Color Driven Privacy-Preserving Face Recognition Via Function Secret Sharing.
Dong Han, Yufan Jiang, Yong Li, Ricardo Mendes, Joachim Denzler
2024Robustness of Tensor Decomposition-Based Neural Network Compression.
Théo Rudkiewicz, Mohamed Ouerfelli, Riccardo Finotello, Zakariya Chaouai, Mohamed Tamaazousti
2024Rotated R-CNN: A Two-Stage Object Detection Method Adapted To Oriented Bounding Boxes.
Chengdao Pu, Jun Yu, Wen Su, Tianyu Liu
2024Rsud20K: a Dataset for Road Scene Understanding in Autonomous Driving.
Hasib Zunair, Shakib Khan, A. Ben Hamza
2024S
Yuxi Lu, Zhuming Zhang, Shiming Lin, Dengpan Zhang, Haibin Ma, Zengchang Qin
2024SANERV: Scene-Adaptive Neural Representation for Videos.
Hochang Rhee, Haesoo Chung, Junho Jo, Eunji Lee, Nam Ik Cho
2024SE3D: A Framework for Saliency Method Evaluation in 3D Imaging.
Mariusz Wisniewski, Loris Giulivi, Giacomo Boracchi
2024SFD: Similar Frame Dataset for Content-Based Video Retrieval.
Chaowei Han, Gaofeng Meng, Chunlei Huo
2024SFNet - A Spatial-Frequency Domain Neural Network For Image Lens Flare Removal.
Florin-Alexandru Vasluianu, Zongwei Wu, Radu Timofte
2024SG-JND: Semantic-Guided Just Noticeable Distortion Predictor for Image Compression.
Linhan Cao, Wei Sun, Xiongkuo Min, Jun Jia, Zicheng Zhang, Zijian Chen, Yucheng Zhu, Lizhou Liu, Qiubo Chen, Jing Chen, Guangtao Zhai
2024SINO-CT-Fusion-Net: A Lightweight Deep Learning Framework for Detection and Classification of Intracranial Hemorrhages.
Chitimireddy Sindhura, Phaneendra K. Yalavarthy, Subrahmanyam Gorthi
2024SKETCH2MANGA: Shaded Manga Screening from Sketch with Diffusion Models.
Jian Lin, Xueting Liu, Chengze li, Minshan Xie, Tien-Tsin Wong
2024SLNL: Soft Label Regularization For Semi-Supervised Facial Expression Recognition With Negative Label Learning.
Youwei Zhang, Jing Jiang, Yuying Zhao, Kongming Liang
2024SMO-CLIP: Enhancing Anomalous Smoke Density Assessment Using A Hybrid LLM-VLM Approach.
Pengfei Li, Muaz Al Radi, Mahmoud Said Elmezain, Abdelfatah Hassan Ahmed, Abderrahmene Boudiaf, Said Boumaraf, Jorge Dias, Hamad Karki, Sajid Javed, Khalid Yousef Al Awadhi, Naoufel Werghi
2024SN-NET: Semismooth Newton Driven Lightweight Network for Real-World Image Denoising.
Chenxiao Zhang, Xin Deng, Hongpeng Sun, Jingyi Xu, Mai Xu
2024SODA: A Dataset for Small Object Detection in UAV Captured Imagery.
Daniel Pisani, Dylan Seychell, Carl James Debono, Michael Schembri
2024SS-CXR: Self-Supervised Pretraining Using Chest X-Rays Towards A Domain Specific Foundation Model.
Syed Muhammad Anwar, Abhijeet Parida, Sara Atito, Muhammad Awais, Gustavo Nino, Josef Kittler, Marius George Linguraru
2024Saliency As A Schedule: Intuitive Image Attribution.
Aniket Singh, Anoop M. Namboodiri
2024Saliency-Aware End-to-End Learned Variable-Bitrate 360-Degree Image Compression.
Oguzhan Güngördü, A. Murat Tekalp
2024Salient Guided Text Detection in E-Commerce Images.
Boon Yin Yin, Nurul Japar
2024Sample Domain Prediction and Transform Skip for Region Adaptive Hierarchical Transform in Geometric Point Cloud Compression.
Bharath Vishwanath, Wenyi Wang, Yingzhan Xu, Kai Zhang, Li Zhang
2024Scalable Hypersphere Embedding For Semantic Metric Learning.
Lovre Antonio Budimir, Marko Subasic, Zoran Kalafatic, Sven Loncaric
2024Scene Generalized Multi-View Pedestrian Detection with Rotation-Based Augmentation and Regularization.
Satoshi Suzuki, Shotaro Tora, Ryo Masumura
2024Scene Text Recognition Using Progressive Rectification Network And Spelling Error Correction Language Model.
Ming-Zheng Peng, Hao-Chung Cheng, Phuong-Thi Le, Cheng-Chun Wang, Chien-Yao Wang, Jia-Ching Wang
2024SegGuard: Defending Scene Segmentation Against Adversarial Patch Attack.
Thomas Gittings, Steve Schneider, John P. Collomosse
2024Segment Any Object Model (SAOM): Real-To-Simulation Fine-Tuning Strategy For Multi-Class Multi-Instance Segmentation.
Mariia Khan, Yue Qiu, Yuren Cong, Bodo Rosenhahn, Jumana Abu-Khalaf, David Suter
2024Segmentation of Hard Exudates And Hemorrhages from Diabetic Retinopathy Images Using Residual U-Net with Squeeze and Excite Blocks.
Avinash Gaikwad, Anjali Gautam
2024Self-Supervised Anomaly Detection and a New Benchmark for X-Ray Cargo Images.
Bipin Gaikwad, Abani Patra, Carl R. Crawford, Eric L. Miller
2024Self-Supervised Multi-View Stereo with Adaptive Depth Priors.
Lintao Xiang, Hujun Yin
2024Semantic Enhanced Few-Shot Object Detection.
Zheng Wang, Yingjie Gao, Qingjie Liu, Yunhong Wang
2024Semantic-Enhanced Point-Box Joint Prompting for Video Object Segmentation.
Quan Zhao, Siying Wu, Yueyi Zhang, Xiaoyan Sun
2024Semantic-Region Specific Lookup Tables for Image Enhancement Via Unpaired Learning.
Zheng-Hui Huang, Tse-Yan Lee, Li-Jen Chang, Yong-Wei Chen, Ping-Jui Chiang, Jo-Fan Wu, Yung-Yu Chuang
2024Semi-Supervised 3D Object Detection With Channel Augmentation Using Transformation Equivariance.
Minju Kang, Taehun Kong, Tae-Kyun Kim
2024Semi-Supervised Action Recognition From Newborn Resuscitation Videos.
Syed Tahir Hussain Rizvi, Øyvind Meinich-Bache, Vilde Kolstad, Siren Rettedal, Sara Brunner, Kjersti Engan
2024Semi-Supervised Graphical Deep Dictionary Learning for Hyperspectral Image Classification From Limited Samples.
Anurag Goel, Angshul Majumdar
2024Set-Nas: Sample-Efficient Training For Neural Architecture Search With Strong Predictor And Stratified Sampling.
Yu-Ming Zhang, Jun-Wei Hsieh, Yu-Hsiu Chang, Xin Li, Ming-Ching Chang, Chun-Chieh Lee, Kuo-Chin Fan
2024Shadow-Aware Makeup Transfer with Lighting Adaptation.
Hao-Yun Chang, Wen-Jiin Tsai
2024Similarity-Weighted IoU (sIOU): A Comprehensive Metric for Evaluating Model Performance Through Similarity-Weighted Class Overlaps.
Umamaheswaran Raman Kumar, Patrick Vandewalle
2024Simple Image Signal Processing using Global Context Guidance.
Omar Elezabi, Marcos V. Conde, Radu Timofte
2024Simsam: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation.
Chanda Grover Kamra, Indra Deep Mastan, Nitin Kumar, Debayan Gupta
2024Single-Panorama Classification of 3D Objects Using Horizontally Stacked Dilated Convolutions.
Rômulo Marconato Stringhini, Thiago S. Lermen, Thiago L. T. da Silveira, Cláudio R. Jung
2024Some Can Be Better than All: Multimodal Star Transformer for Visual Dialog.
Qiangqiang He, Jie Zhang, Shuwei Qian, Chongjun Wang
2024Source-Free Continual Adaptive Learning With Limited Labels on Evolving Data Drifts.
Amrutha Machireddy, Ranganath Krishnan, Athmanarayanan Lakshmi Narayanan, Omesh Tickoo
2024SovaSeg-Net: Scale Invariant Ovarian Tumors Segmentation from Ultrasound Images.
Huu-Phong Luong, Hoang-Son Bui, Nam-Khanh Nguyen, Thi-Loan Pham, Gia-Minh Pham, Sy-Hoang Tran, Thanh-Hai Tran, Thi-Lan Le
2024Sparse Transformer Refinement Similarity Map for Aerial Tracking.
Xi Tao, Ke Qi, Peijia Chen, Wenhao Xu, Yutao Qi
2024Spatial Plaid Attention Decoder for Semantic Segmentation.
Abolfazl Meyarian, Xiaohui Yuan, Zhinan Qiao
2024Spatial-Channel Collaborated Attention for Cross-Scale Crowd Counting.
Yongpeng Chang, Guangchun Gao
2024Spatiality-Aware Prompt Tuning for Few-Shot Small Object Detection.
Takumi Karasawa, Nakamasa Inoue, Rei Kawakami
2024Spatio-Temporal Adaptation With Dilated Neighbourhood Attention For Accident Anticipation.
Patrik Patera, Yie-Tarng Chen, Wen-Hsien Fang
2024Standard Compliant Video Coding Using Low Complexity, Switchable Neural Wrappers.
Yueyu Hu, Chenhao Zhang, Onur G. Guleryuz, Debargha Mukherjee, Yao Wang
2024Start-Tv: A Closed-Form Initialization For Total Variation Models.
Yuanhao Gong, Guanghui Yue
2024Statistics-Aware Audio-Visual Deepfake Detector.
Marcella Astrid, Enjie Ghorbel, Djamila Aouada
2024Stay Focus on Object: Cross-Domain Detection Using Domain-Invariant Object Representation.
Taehoon Kim, Jaemin Na, Joong-Won Hwang, Wonjun Hwang
2024Streaming Neural Images.
Marcos V. Conde, Andy Bigos, Radu Timofte
2024Streamlined Hybrid Annotation Framework Using Scalable Codestream for Bandwidth-Restricted UAV Object Detection.
Karim El Khoury, Tiffanie Godelaine, Simon Delvaux, Sébastien Lugan, Benoît Macq
2024Structured Pruning and Quantization for Learned Image Compression.
Md Adnan Faisal Hossain, Fengqing Zhu
2024Subblock-Based Combined Inter and Intra Prediction Beyond VVC.
Lei Zhao, Kai Zhang, Li Zhang
2024Subgroups For Detection Transformer.
Tharsan Senthivel, Ngoc-Son Vu
2024Subjective Portrait Region Cropping On Landscape Video Study.
Cheng-Han Lee, Maniratnam Mandal, Neil Birkbeck, Yilin Wang, Balu Adsumilli, Alan C. Bovik
2024Subjective Quality Assessment of Thermal Infrared Images.
Guanghui Yue, Lixin Zhang, Jinxia Zhang, Zhaofei Xu, Shuigen Wang, Tianwei Zhou, Yuanhao Gong, Wei Zhou
2024Super-Resolution for Near-Eye Light Field Display in Fourier Space.
Yu-Hsiang Huang, Wei Wang, Homer H. Chen
2024Super: Selfie Undistortion and Head Pose Editing with Identity Preservation.
Polina Karpikova, Andrei Spiridonov, Anna Vorontsova, Anastasia Yaschenko, Ekaterina Radionova, Igor Medvedev, Alexander Limonov
2024Superpixel Mixing: A Data Augmentation Technique For Robust Deep Visual Recognition Models.
Danyang Sun, Fadi Dornaika, Vinh Truong Hoang, Nagore Barrena
2024Surface Anomaly Detection With Anomalous Feature Restriction And Difference-Aware Enhancement.
Jinhui Zhao, Hongxia Gao, Tongtong Liu
2024Synthmanticlidar: A Synthetic Dataset For Semantic Segmentation On Lidar Imaging.
Javier Montalvo, Pablo Carballeira, Álvaro García-Martín
2024TCA-NET: Triplet Concatenated-Attentional Network for Multimodal Engagement Estimation.
Hongyuan He, Daming Wang, Md. Rakibul Hasan, Tom Gedeon, Md. Zakir Hossain
2024TDAD: Trident Distillations for Anomaly Detection.
Wenrui Hu, Yuan Xie, Wei Yu
2024TSF-NET3D: TSF-NET for 3D Point Cloud Attribute Compression Artifacts Removal.
Birendra Kathariya, Zhu Li, Geert Van der Auwera
2024Talking-Head Video Compression With Motion Semantic Enhancement Model.
Haobo Lei, Zhisong Bie, Zhao Jing, Hongxia Bie
2024Taxes are All You Need: Integration Of Taxonomical Hierarchy Relationships Into the Contrastive Loss.
Kiran Kokilepersaud, Yavuz Yarici, Mohit Prabhushankar, Ghassan AlRegib
2024Temporal Clustering and Temporal Reference Based Specular Detection For 1-MS Visual Feedback System.
Tingting Hu, Ryuji Fuchikami, Shigekiyo Nosaka
2024Temporal Regularization for Robust Motion Compensation in Reduced Dose Cardiac-Gated Spect Images.
Xirang Zhang, Yongyi Yang, Jovan G. Brankov, P. Hendrik Pretorius, Michael A. King
2024Temporal Scalable Coding For Dynamic Meshes.
Jianfeng Xu, Haruhisa Kato, Kei Kawamura
2024Temporal Transformer Encoder for Video Class Incremental Learning.
Nattapong Kurpukdee, Adrian G. Bors
2024Temporal-Spatial SPDAGG Network For Skeleton-Based Human Action Recognition From Aerial Perspectives.
Mohamed Sanim Akremi, Najett Neji, Hedi Tabia
2024Thermal Videodiff (TVD): A Diffusion Architecture For Thermal Video Synthesis.
Tayeba Qazi, Brejesh Lall
2024Thqa: A Perceptual Quality Assessment Database for Talking Heads.
Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang, Guangtao Zhai
2024Through-Wall Imaging Based On WiFi Channel State Information.
Julian Strohmayer, Rafael Sterzinger, Christian Stippel, Martin Kampel
2024Toward Efficient Deep Blind Raw Image Restoration.
Marcos V. Conde, Florin-Alexandru Vasluianu, Radu Timofte
2024Toward Low Artifact Virtual Try-On Via Pre-Warping Partitioned Clothing Alignment.
Wei-Chian Liang, Chieh-Yun Chen, Hong-Han Shuai
2024Towards Better Control Of Latent Spaces For Face Editing.
Savas Özkan, Mete Özay
2024Towards Generalizable Referring Image Segmentation Via Target Prompt And Visual Coherence.
Yajie Liu, Pu Ge, Haoxiang Ma, Shichao Fan, Qingjie Liu, Di Huang, Yunhong Wang
2024Towards Privacy-Enhancing Provenance Annotations for Images.
Nikolaos Fotos, Jaime Delgado
2024Towards Robust Person Re-Identification Via Efficient and Generalized Adversarial Training.
Huiwang Liu, Yan Huang, Linlin Zeng, Ya Li
2024Towards Robust Visual Localization Using Multi-View Images and HD Vector Map.
Lili Zhao, Zhili Liu, Qian Yin, Lei Yang, Meng Guo
2024Towards Unifying Anatomy Segmentation: Automated Generation of a Full-Body CT Dataset.
Alexander Jaus, Constantin Seibold, Kelsey Hermann, Negar Shahamiri, Alexandra Walter, Kristina Giske, Johannes Haubold, Jens Kleesiek, Rainer Stiefelhagen
2024Towards the Detection of AI-Synthesized Human Face Images.
Yuhang Lu, Touradj Ebrahimi
2024Transformer-Based Clipped Contrastive Quantization Learning For Unsupervised Image Retrieval.
Ayush Dubey, Shiv Ram Dubey, Satish Kumar Singh, Wei-Ta Chu
2024Trustworthy Sr: Resolving Ambiguity In Image Super-Resolution Via Diffusion Models And Human Feedback.
Cansu Korkmaz, Ege Çirakman, A. Murat Tekalp, Zafer Dogan
2024Two Heads Better Than One: Dual Degradation Representation for Blind Super-Resolution.
Hsuan Yuan, Shao-Yu Weng, I-Hsuan Lo, Wei-Chen Chiu, Yu-Syuan Xu, Hao-Chien Hsueh, Jen-Hui Chuang, Ching-Chun Huang
2024Two-Level Intra Prediction Using High-Order Macropixel Neighbors For Plenoptic Video Coding.
Vinh Van Duong, Thuc Nguyen Huu, Jonghoon Yim, Byeungwoo Jeon
2024Two-Stage Tripletnet: Light Weight Remote Sensing Scene Classification.
Xianbin Hu, Wei Wu, Zhu Li, Xueliang Luo, Zhengfeng Chen
2024U-Convnext Network for Infrared Small Target Detection.
Jian Ma, Xiuhong Li, Yuye Zhang, Boyuan Li, Dangxuan Wu, Zhenhong Jia
2024U-Tell: Unsupervised Task Expert Lifelong Learning.
Indu Solomon, Aye Phyu Phyu Aung, Uttam Kumar, Senthilnath Jayavelu
2024UTrCGAN: Uncertainty-Driven Cycle-Consistent Generative Adversarial Network for Low-Light Image Enhancement.
Jingshuo Guan, Na Qi, Qing Zhu, Liang Chen
2024Uimt: A Framework for Improving Unimodal Inference via Multimodal Training.
Kateryna Chumachenko, Alexandros Iosifidis, Moncef Gabbouj
2024Uncalibrated and Unsupervised Photometric Stereo with Piecewise Regularizer.
Alejandro Casanova, Antonio Agudo
2024Uncertainty-Aware AB3DMOT by Variational 3D Object Detection.
Illia Oleksiienko, Alexandros Iosifidis
2024Uncovering Communities Of Pipelines in the Task-FMRI Analytical Space.
Elodie Germani, Elisa Fromont, Camille Maumet
2024Underwater Change Detection Using Multiple Sampling-Based Probabilistic Learner and Feature Preservance Discriminator.
Mehvish Nissar, Badri Narayan Subudhi, Vinit Jakhetiya, Amit Kumar Mishra
2024Unicrowd Simulator: Visual and Behavioral Fidelity For The Generation of Crowd Datasets.
Niccolò Bisagno, Antonio Luigi Stefani, Nicola Garau, Francesco G. B. De Natale, Nicola Conci
2024Universal Black-Box Adversarial Patch Attack with Optimized Genetic Algorithm.
Qun Zhao, Yuan-Gen Wang
2024Unleashing Fine-Coarse Curve Perception Via Trunk-Branch Perturbation.
Yunxiang Cao, Li Chen, Yubo Wang, Zhida Feng, Xiaoming Liu
2024Unleashing the Power of Generalized Iterative Closest Point for Swift and Effective Point Cloud Registration.
Efthymios Koukoulis, Gerasimos Arvanitis, Konstantinos Moustakas
2024Unrolled Projected Gradient Algorithm For Stain Separation In Digital Histopathological Images.
Aymen Sadraoui, Astrid Laurent-Bellue, Mounir Kaaniche, Amel Benazza-Benyahia, Catherine Guettier, Jean-Christophe Pesquet
2024Unsupervised Coordinate-Based Video Denoising.
Mary Damilola Aiyetigbo, Dineshchandar Ravichandran, Reda Chalhoub, Peter Kalivas, Feng Luo, Nianyi Li
2024Unsupervised Domain Adaptive Semantic Segmentation Based on Clip-Guided Prototypical Contrastive Learning.
Kebin Liu, Chuang Zhu
2024VAG: Voxel Attenuation Grid For Sparse-View CBCT Reconstruction.
Jinhao Qiao, Jiang Liu, Heng Yu, Yi Xiao, Hongshan Yu, Yan Zheng, Sihan Li
2024VCDSet: A New Vehicle Collision Dataset In Asia Countries For Anticipating Accidents.
Chih-Chung Hsu, Yun-Zhong Jiang, Wei-Hao Huang
2024VF-Net: Robustness Via Understanding Distortions and Transformations.
Fatemeh Amerehi, Patrick Healy
2024VR-Based Generation of Photorealistic Synthetic Data for Training Hand-Object Tracking Models.
Chengyan Zhang, Rahul Chaudhari
2024Video Class-Incremental Learning With Clip Based Transformer.
Shuyun Lu, Jian Jiao, Lanxiao Wang, Heqian Qiu, Xingtao Lin, Hefei Mei, Hongliang Li
2024Vito: Vision Transformer Optimization Via Knowledge Distillation On Decoders.
Giovanni Bellitto, Renato Sortino, Paolo Spadaro, Simone Palazzo, Federica Proietto Salanitri, Giuseppe Fiameni, Efstratios Gavves, Concetto Spampinato
2024Vizecgnet: Visual ECG Image Network for Cardiovascular Diseases Classification With Multi-Modal Training and Knowledge Distillation.
Ju-Hyeon Nam, Seo-Hyung Park, Su Jung Kim, Sang-Chul Lee
2024Wavelet-Enhanced CNN for Depression Classification Based on MRI Images.
Yawei Zhang, Bo Li, Xin Li, Yuhan Huang, Hui Ding
2024Weather-Aware Drone-View Object Detection Via Environmental Context Understanding.
Hyunjun Kim, Dahye Lee, Sungjune Park, Yong Man Ro
2024When Self-Supervised Pre-Training Meets Single Image Denoising.
Hamadi Chihaoui, Paolo Favaro
2024WrappingNet: Mesh Autoencoder Via Deep Sphere Deformation.
Eric Lei, Muhammad Asad Lodhi, Jiahao Pang, Junghyun Ahn, Dong Tian
2024YOLO-Feder Fusionnet: A Novel Deep Learning Architecture for Drone Detection.
Tamara R. Lenhard, Andreas Weinmann, Stefan Jäger, Tobias Koch
2024Youtube SFV+HDR Quality Dataset.
Yilin Wang, Joong Gon Yim, Neil Birkbeck, Balu Adsumilli
2024Zero-Shot Composed Image Retrieval Considering Query-Target Relationship Leveraging Masked Image-Text Pairs.
Huaying Zhang, Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama