| 2025 | 3GPP PDU Set Framework: Release 19 Updates. Serhan Gül, Igor D. D. Curcio |
| 2025 | A First Look at Open-GoP Streaming with Av1 S-Frames. Akram Ansari, S. Ali John Naqvi, Mea Wang, Emir Halepovic |
| 2025 | A Forearm-Worn Haptic Device for Integrated Tactile and Kinaesthetic Feedback via Skin Stretch. Selin Nur Özsert, Daniel Rodriguez-Guevara, Leonardo Franco, Wenxuan Wei, Eckehard G. Steinbach, Domenico Prattichizzo |
| 2025 | A Hybrid Noise Perturbation-Based Denoising Autoencoder for Machine Sound Anomaly Detection. Kadir Torun, Mustafa Sert |
| 2025 | A One-Class Structural Similarity-Based Autoencoder for the Detection of Malaria-Infected Cells. Moses Omondi, Yassine Belkhouche |
| 2025 | AMICO: A Semantic and Multimodal Framework for AI-Assisted Clinical Reporting. Antonio Laudante, Mariano Barone, Giuseppe Riccio, Antonio Romano, Francesco Di Serio, Antonio Scialdone, Francesco Porciello, Nicola Rainone, Vincenzo Moscato |
| 2025 | AR in HbbTV-Based Hybrid TV Services. Fernando Boronat, Lluc Simó, Rubén Prieto, Almanzor Sapena |
| 2025 | Adaptation of CDN at the Edge Using Cloud-Native Network Telemetry Across Media Scenarios. Javier Iglesias, Juan Felipe Mogollón, Iñigo Tamayo, Zaloa Fernández, Olov Danielsson, Ivan Pretel, Asier Lopez |
| 2025 | Adaptive Obfuscation for Reusing RGB Datasets for Privacy-Preserving Human Pose Estimation. Francesco Pistolesi, Matteo Mugnai, Beatrice Lazzerini |
| 2025 | An Agent-Driven Architecture for Harmful Meme Detection through Multimodal Decomposition. Gian Marco Orlando, Marco Perillo, Diego Russo, Vincenzo Moscato |
| 2025 | An Efficient Optimization Criterion for Multi-View Feature Representation Learning. Lei Gao, Kai Liu, Kevin Tang, Ling Guan |
| 2025 | An Influence Analysis of Hybrid Lectures with a Simple Setup on the Student Experience. Florian Schimanke, Robert Mertens, Felix Prankel |
| 2025 | Analysis of Multimodal LLMs in VQA in the Field of Radiology. Cristovão Pessoa Cândido Neto, Cláudio de Souza Baptista, André Luiz Firmino Alves, Vivek Swarnakar, Anselmo Cardoso de Paiva |
| 2025 | Application of Computer Vision Research (ISVP.AI) in the Development of the Comprehensive Stellis One Platform for Sports Organizations. Lukasz Gasiorowski, Jagoda Lazarek, Sebastian Purtak, Pawel Góra |
| 2025 | Attention-Enhanced Multi-Branch Spiking Neural Network for Event Stream Super-Resolution. Ahmadreza Sezavar, Catarina Brites, João Ascenso |
| 2025 | CCAFF: Object Tracking Under Heavy Occlusion. Abdul Bhutta, Naimul Khan, Ling Guan |
| 2025 | Coding Gaussian Splat Scenes with V3C/V-PCC. Patrice Rondao Alface, Lauri Ilola, Lukasz Kondrad |
| 2025 | Comparative Analysis of Face Recognition Models: Runtime Environments and Compute Units on Edge. Lukasz Grzymkowski, Tomasz P. Stefanski |
| 2025 | Comparative Evaluation of Deep Learning Methods for Wood Surface Defect Detection: A Comprehensive Study of Semantic Segmentation Approaches. Yuki Yanai, Tomokazu Ishikawa |
| 2025 | Comparison of Multimodal Fall Detection Strategies. Reema Maheshbhai Gadhia, Nasim Hajari |
| 2025 | Cost-Optimal Design of Hybrid Broadcast - Unicast Video Delivery Systems. Yuriy A. Reznik |
| 2025 | DWT Domain Precinct-Wise Scrambling for Encryption-then-Compression with JPEG XS. Takayuki Nakachi, Park Cheolhwan, Yasuhisa Kato, Mitsuru Maruyama |
| 2025 | Dialogue-Pseudo: A Speaker Pseudonymization Framework for Privacy Protection in Dialogue Speech Data. Aoi Ito, Katunobu Itou |
| 2025 | Diversity-Aware Active Learning for Object Detection Utilizing Time-of-Day Metadata. Fumiya Higashide, Akira Kubota |
| 2025 | Emotion Detection and Classification of Different Saudi Dialects. Rehab K. Qarout, Joud Y. Samkari, Rahaf M. ALFudhayl, Ruba H. ALSulami, Shada M. Basudan, Ghadi K. AlJuhani, Nuha Zamzami |
| 2025 | Empowering Access to Public Services: An Analysis on Multimodal, Retrieval-Augmented Chatbots for Indic Language Support to Farmers. Mohsina Bilal, Gopakumar G |
| 2025 | Evaluating the Emerging MPEG Video Coding for Machines in Semantic Segmentation. Khoa Dang Pham, Farhad Pakdaman, Honglei Zhang, Hamed Rezazadegan Tavakoli, Nam Le, Jukka I. Ahonen, Moncef Gabbouj |
| 2025 | Evaluation of AR-Based Blind Spot Monitoring Across Diverse Driving Scenarios Using a VR Simulator. Yohei Sakai, Tomokazu Ishikawa |
| 2025 | Evaluation of a Floating-Head Communication Prototype for Video-Conferencing. William Menz, Alexander Zoubarev, David Kutschke, Rakesh Rao Ramachandra Rao, Louay Bassbouss, Sven Bliedung von der Heide, Steve Göring, Alexander Raake |
| 2025 | Exploiting LLMs for Metadata-Based Video Quality Prediction. Steve Göring, Rakesh Rao Ramachandra Rao, Alexander Raake |
| 2025 | Exploring Privacy and Security Risks in LLMs: Data Leakage, Prompt Injection, and Membership Inference. Giancarlo Sperlí |
| 2025 | ExposureEngine: Oriented Logo Detection and Sponsor Visibility Analytics in Sports Broadcasts. Mehdi Houshmand Sarkhoosh, Frøy Øye, Henrik Nestor Sørlie, Nam Hoang Vu, Dag Johansen, Cise Midoglu, Tomas Kupka, Pål Halvorsen |
| 2025 | Extending Visual Dialog Beyond English: An Analysis of Monolingual and Multilingual Models. Milena M. Adão, Silvio Jamil Ferzoli Guimarães, Zenilton Kleber G. do Patrocínio Jr. |
| 2025 | Extracting Player Speed from Football Videos. Ole Kristian Rustebakke, Mehdi Houshmand Sarkhoosh, Cise Midoglu, Pål Halvorsen |
| 2025 | Extrinsic Calibration of RGB-D Cameras Using Depth Refinement. Peter O. Fasogbon |
| 2025 | Facial Similarity-Guided Fine-Tuning for Hand Shape Correction in AI-Generated Human Images. Yuki Ryu, Akira Kubota |
| 2025 | Fewer-Shot Self-Supervised Image Recoloring for Deutan Deficiency Based on Laplacian Pyramid. Onhi Kato, Akira Kubota |
| 2025 | Foot-Strike Pattern Recognition from Inertial Data with Machine Learning. Michele Baldassini, Francesco Pistolesi, Beatrice Lazzerini |
| 2025 | Forecasting "Neg Storms": Time-Aware Modeling of Toxic Situations in Social Media. Irien Akter, Vivek K. Singh, Pradeep K. Atrey |
| 2025 | From 2D to 3D: How Discrete Dependencies Enable Cross-Dimensional Inference in Neural Networks in Defiance of Euclidean Geometry. Gerald Friedland, Robert Mertens |
| 2025 | Graph-Based Evaluation of Visual Brain Decoding from fMRI Data. Mohammad Moradi, Morteza Moradi, Marco Grassia, Giuseppe Mangioni |
| 2025 | High-Fidelity Semantic Video Communication with Controllable Image-To-Video Diffusion Models. Cem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard G. Steinbach |
| 2025 | International Symposium on Multimedia, ISM 2025, Naples, Italy, December 8-10, 2025 |
| 2025 | Knowledge-Based Behavioral Biometrics for Secure Authentication in Virtual Reality. Numan Zafar, Priyo Ranjan Kundu Prosun, Shafique Ahmad Chaudhry |
| 2025 | LPConv: Laplacian Pyramid Convolutions for Parameter-Efficient Receptive Field Expansion. Naoki Nishiya, Akira Kubota |
| 2025 | Layout-Aware Self-Correcting Prompts for Multimodal LLM Parking Lot Monitoring. Viviana Crescitelli |
| 2025 | Lightweight High-Accuracy Tomato Detection and Classification by Efficientnet-Enhanced YOLOv8. Hayato Tsukada, Akira Kubota |
| 2025 | Low-Light Image Enhancement with Adaptive Brightness Transform Models for Video See-Through AR. Yingen Xiong, Christopher Peri |
| 2025 | MCAD: Multimodal Context-Aware Audio Description Generation for Soccer. Lipisha Chaudhary, Trisha Mittal, Subhadra Gopalakrishnan, Ifeoma Nwogu, Jaclyn Pytlarz |
| 2025 | Machine Learning Techniques for the Diagnosis and Monitoring of Nevi and Melanomas. Giulia Di Flamminio, Fabio Persia, Daniela D'Auria, Ciro Esposito, Vincenzo Coppola |
| 2025 | Metadata-Guided Hot Swapping of Specialized Super-Resolution Models in Streaming Systems. Alperen F. Zengin, Ekrem Çetinkaya, Ali C. Begen, Saba Ahsan, Serhan Gül, Kashyap Kammachi Sreedhar, Emre Aksu |
| 2025 | Multiscale RGB-Thermal Fusion for Vulnerable Road User Detection with ScaleFuse. Ibrahim Tinas, Yavuz Selim Bostanci, Müjdat Soytürk |
| 2025 | On Progressive Compressed Neural Model Storage. Hamed R. Tavakoli, Homayun Afrabadpey |
| 2025 | On the Suitability of Perceptual Quality Metrics for Learning-Based Screen Content Compression. H. Burak Dogaroglu, Hongjie You, Atanas Boev, Elena Alshina, Eckehard G. Steinbach |
| 2025 | One Size Doesn't Fit All: Age-Aware Gamification Mechanics for Multimedia Learning Environments. Sarah Kaißer, Markus Kleffmann, Kristina Schaaff |
| 2025 | Opt360: QoE Optimization for 360° Video Streaming. Reza Hedayati, Mea Wang, Logan Rakai |
| 2025 | Personalised Stress Detection: An Exploration of Temporal Multimodal Late Fusion Strategies. Misha Libman, Gelareh Mohammadi |
| 2025 | Personalized Adaptive Magnification in Gaze-Based Interaction. Florian Eggenkemper, Jana Swerew, Teresa Rehers, Manuel Hanhoff, Constantin A. Rothkopf, Robert Mertens |
| 2025 | Physics-Guided Exposure Parameter Estimation for Image Metadata Verification. Sharmilee Rajkumar Rajan, Ming-Ching Chang, Pradeep K. Atrey |
| 2025 | Prompted Vs. Organic Customer Insight: Comparing the Value of Focus Groups and Online Reviews in Product and Service Innovation. Maren Schnieder, Ana Isabel Canhoto, Ramin Behbehani, Ahmad Beltagui, Niraj Kumar, Amirreza Alizamani |
| 2025 | QoE Evaluation of BPP Packet Wash Using ROI-Based Scalable Video Coding. Mohammadreza Ghafari, Thibault Cholez, Olivier Festor |
| 2025 | Quality Assessment of Dynamic 3D Model in Virtual Reality: Effects of Level of Detail and Viewing Distance. Duc V. Nguyen, Nguyen Thi Quynh Ly, Truong Thu Huong |
| 2025 | RAG Chatbots for Educational Virtual Field Trips. Suryaprakash Reddy Kalvakolu, Heinrich Söbke, Florian Wehking, Mukesh Chandra Kumar Mamidala, Eckhard Kraft |
| 2025 | Recognition of Pitching Habits Using Multimodal Data of RGB Video and Skeleton. Satoki Hidaka, Kazuhiro Hotta |
| 2025 | Rendering Compressed Point Clouds with a Voxel-Based Method. Hyungwoo Kang, YeoJun Yoon, Joong-Hwan Baek, Byung Tae Oh |
| 2025 | Secure AI-Driven Super-Resolution for Real-Time Mixed Reality Applications. Mohammad Waquas Usmani, Sankalpa Timilsina, Michael Zink, Susmit Shannigrahi |
| 2025 | Smarter Traps: Neural Network-Driven Classification of Small Mammals. William Menz, Ralf Dittrich, Rakesh Rao Ramachandra Rao, Steve Göring, Alexander Raake |
| 2025 | Syntax-Aware Transformer for Sentiment Analysis of Japanese SNS Text. Sotaro Shiozawa, Akira Kubota |
| 2025 | SynthMed: Generating and Detecting Multimodal Deepfakes for Healthcare Communication. Mariano Barone, Francesco Di Serio, Vincenzo Moscato, Marco Postiglione, Giuseppe Riccio, Antonio Romano |
| 2025 | TARS: Temporal-Spatial Adaptation for Volumetric Video Streaming. Hadi Heidarirad, Amir Allahveran, Mea Wang |
| 2025 | The Sustainability Card: Measuring Sustainability of Multimedia AI Models. Francesco Pistolesi, Michele Baldassini, Matteo Mugnai, Beatrice Lazzerini |
| 2025 | Video Classification of Marchantia Polymorpha Using a Video Vision Transformer with Emphasized Channel Information. Haruhiko Murata, Naoki Minamino, Takashi Ueda, Yohei Kondo, Kazuhiro Hotta |