ISM C

72 papers

YearTitle / Authors
20253GPP PDU Set Framework: Release 19 Updates.
Serhan Gül, Igor D. D. Curcio
2025A First Look at Open-GoP Streaming with Av1 S-Frames.
Akram Ansari, S. Ali John Naqvi, Mea Wang, Emir Halepovic
2025A Forearm-Worn Haptic Device for Integrated Tactile and Kinaesthetic Feedback via Skin Stretch.
Selin Nur Özsert, Daniel Rodriguez-Guevara, Leonardo Franco, Wenxuan Wei, Eckehard G. Steinbach, Domenico Prattichizzo
2025A Hybrid Noise Perturbation-Based Denoising Autoencoder for Machine Sound Anomaly Detection.
Kadir Torun, Mustafa Sert
2025A One-Class Structural Similarity-Based Autoencoder for the Detection of Malaria-Infected Cells.
Moses Omondi, Yassine Belkhouche
2025AMICO: A Semantic and Multimodal Framework for AI-Assisted Clinical Reporting.
Antonio Laudante, Mariano Barone, Giuseppe Riccio, Antonio Romano, Francesco Di Serio, Antonio Scialdone, Francesco Porciello, Nicola Rainone, Vincenzo Moscato
2025AR in HbbTV-Based Hybrid TV Services.
Fernando Boronat, Lluc Simó, Rubén Prieto, Almanzor Sapena
2025Adaptation of CDN at the Edge Using Cloud-Native Network Telemetry Across Media Scenarios.
Javier Iglesias, Juan Felipe Mogollón, Iñigo Tamayo, Zaloa Fernández, Olov Danielsson, Ivan Pretel, Asier Lopez
2025Adaptive Obfuscation for Reusing RGB Datasets for Privacy-Preserving Human Pose Estimation.
Francesco Pistolesi, Matteo Mugnai, Beatrice Lazzerini
2025An Agent-Driven Architecture for Harmful Meme Detection through Multimodal Decomposition.
Gian Marco Orlando, Marco Perillo, Diego Russo, Vincenzo Moscato
2025An Efficient Optimization Criterion for Multi-View Feature Representation Learning.
Lei Gao, Kai Liu, Kevin Tang, Ling Guan
2025An Influence Analysis of Hybrid Lectures with a Simple Setup on the Student Experience.
Florian Schimanke, Robert Mertens, Felix Prankel
2025Analysis of Multimodal LLMs in VQA in the Field of Radiology.
Cristovão Pessoa Cândido Neto, Cláudio de Souza Baptista, André Luiz Firmino Alves, Vivek Swarnakar, Anselmo Cardoso de Paiva
2025Application of Computer Vision Research (ISVP.AI) in the Development of the Comprehensive Stellis One Platform for Sports Organizations.
Lukasz Gasiorowski, Jagoda Lazarek, Sebastian Purtak, Pawel Góra
2025Attention-Enhanced Multi-Branch Spiking Neural Network for Event Stream Super-Resolution.
Ahmadreza Sezavar, Catarina Brites, João Ascenso
2025CCAFF: Object Tracking Under Heavy Occlusion.
Abdul Bhutta, Naimul Khan, Ling Guan
2025Coding Gaussian Splat Scenes with V3C/V-PCC.
Patrice Rondao Alface, Lauri Ilola, Lukasz Kondrad
2025Comparative Analysis of Face Recognition Models: Runtime Environments and Compute Units on Edge.
Lukasz Grzymkowski, Tomasz P. Stefanski
2025Comparative Evaluation of Deep Learning Methods for Wood Surface Defect Detection: A Comprehensive Study of Semantic Segmentation Approaches.
Yuki Yanai, Tomokazu Ishikawa
2025Comparison of Multimodal Fall Detection Strategies.
Reema Maheshbhai Gadhia, Nasim Hajari
2025Cost-Optimal Design of Hybrid Broadcast - Unicast Video Delivery Systems.
Yuriy A. Reznik
2025DWT Domain Precinct-Wise Scrambling for Encryption-then-Compression with JPEG XS.
Takayuki Nakachi, Park Cheolhwan, Yasuhisa Kato, Mitsuru Maruyama
2025Dialogue-Pseudo: A Speaker Pseudonymization Framework for Privacy Protection in Dialogue Speech Data.
Aoi Ito, Katunobu Itou
2025Diversity-Aware Active Learning for Object Detection Utilizing Time-of-Day Metadata.
Fumiya Higashide, Akira Kubota
2025Emotion Detection and Classification of Different Saudi Dialects.
Rehab K. Qarout, Joud Y. Samkari, Rahaf M. ALFudhayl, Ruba H. ALSulami, Shada M. Basudan, Ghadi K. AlJuhani, Nuha Zamzami
2025Empowering Access to Public Services: An Analysis on Multimodal, Retrieval-Augmented Chatbots for Indic Language Support to Farmers.
Mohsina Bilal, Gopakumar G
2025Evaluating the Emerging MPEG Video Coding for Machines in Semantic Segmentation.
Khoa Dang Pham, Farhad Pakdaman, Honglei Zhang, Hamed Rezazadegan Tavakoli, Nam Le, Jukka I. Ahonen, Moncef Gabbouj
2025Evaluation of AR-Based Blind Spot Monitoring Across Diverse Driving Scenarios Using a VR Simulator.
Yohei Sakai, Tomokazu Ishikawa
2025Evaluation of a Floating-Head Communication Prototype for Video-Conferencing.
William Menz, Alexander Zoubarev, David Kutschke, Rakesh Rao Ramachandra Rao, Louay Bassbouss, Sven Bliedung von der Heide, Steve Göring, Alexander Raake
2025Exploiting LLMs for Metadata-Based Video Quality Prediction.
Steve Göring, Rakesh Rao Ramachandra Rao, Alexander Raake
2025Exploring Privacy and Security Risks in LLMs: Data Leakage, Prompt Injection, and Membership Inference.
Giancarlo Sperlí
2025ExposureEngine: Oriented Logo Detection and Sponsor Visibility Analytics in Sports Broadcasts.
Mehdi Houshmand Sarkhoosh, Frøy Øye, Henrik Nestor Sørlie, Nam Hoang Vu, Dag Johansen, Cise Midoglu, Tomas Kupka, Pål Halvorsen
2025Extending Visual Dialog Beyond English: An Analysis of Monolingual and Multilingual Models.
Milena M. Adão, Silvio Jamil Ferzoli Guimarães, Zenilton Kleber G. do Patrocínio Jr.
2025Extracting Player Speed from Football Videos.
Ole Kristian Rustebakke, Mehdi Houshmand Sarkhoosh, Cise Midoglu, Pål Halvorsen
2025Extrinsic Calibration of RGB-D Cameras Using Depth Refinement.
Peter O. Fasogbon
2025Facial Similarity-Guided Fine-Tuning for Hand Shape Correction in AI-Generated Human Images.
Yuki Ryu, Akira Kubota
2025Fewer-Shot Self-Supervised Image Recoloring for Deutan Deficiency Based on Laplacian Pyramid.
Onhi Kato, Akira Kubota
2025Foot-Strike Pattern Recognition from Inertial Data with Machine Learning.
Michele Baldassini, Francesco Pistolesi, Beatrice Lazzerini
2025Forecasting "Neg Storms": Time-Aware Modeling of Toxic Situations in Social Media.
Irien Akter, Vivek K. Singh, Pradeep K. Atrey
2025From 2D to 3D: How Discrete Dependencies Enable Cross-Dimensional Inference in Neural Networks in Defiance of Euclidean Geometry.
Gerald Friedland, Robert Mertens
2025Graph-Based Evaluation of Visual Brain Decoding from fMRI Data.
Mohammad Moradi, Morteza Moradi, Marco Grassia, Giuseppe Mangioni
2025High-Fidelity Semantic Video Communication with Controllable Image-To-Video Diffusion Models.
Cem Eteke, Alexander Griessel, Wolfgang Kellerer, Eckehard G. Steinbach
2025International Symposium on Multimedia, ISM 2025, Naples, Italy, December 8-10, 2025
2025Knowledge-Based Behavioral Biometrics for Secure Authentication in Virtual Reality.
Numan Zafar, Priyo Ranjan Kundu Prosun, Shafique Ahmad Chaudhry
2025LPConv: Laplacian Pyramid Convolutions for Parameter-Efficient Receptive Field Expansion.
Naoki Nishiya, Akira Kubota
2025Layout-Aware Self-Correcting Prompts for Multimodal LLM Parking Lot Monitoring.
Viviana Crescitelli
2025Lightweight High-Accuracy Tomato Detection and Classification by Efficientnet-Enhanced YOLOv8.
Hayato Tsukada, Akira Kubota
2025Low-Light Image Enhancement with Adaptive Brightness Transform Models for Video See-Through AR.
Yingen Xiong, Christopher Peri
2025MCAD: Multimodal Context-Aware Audio Description Generation for Soccer.
Lipisha Chaudhary, Trisha Mittal, Subhadra Gopalakrishnan, Ifeoma Nwogu, Jaclyn Pytlarz
2025Machine Learning Techniques for the Diagnosis and Monitoring of Nevi and Melanomas.
Giulia Di Flamminio, Fabio Persia, Daniela D'Auria, Ciro Esposito, Vincenzo Coppola
2025Metadata-Guided Hot Swapping of Specialized Super-Resolution Models in Streaming Systems.
Alperen F. Zengin, Ekrem Çetinkaya, Ali C. Begen, Saba Ahsan, Serhan Gül, Kashyap Kammachi Sreedhar, Emre Aksu
2025Multiscale RGB-Thermal Fusion for Vulnerable Road User Detection with ScaleFuse.
Ibrahim Tinas, Yavuz Selim Bostanci, Müjdat Soytürk
2025On Progressive Compressed Neural Model Storage.
Hamed R. Tavakoli, Homayun Afrabadpey
2025On the Suitability of Perceptual Quality Metrics for Learning-Based Screen Content Compression.
H. Burak Dogaroglu, Hongjie You, Atanas Boev, Elena Alshina, Eckehard G. Steinbach
2025One Size Doesn't Fit All: Age-Aware Gamification Mechanics for Multimedia Learning Environments.
Sarah Kaißer, Markus Kleffmann, Kristina Schaaff
2025Opt360: QoE Optimization for 360° Video Streaming.
Reza Hedayati, Mea Wang, Logan Rakai
2025Personalised Stress Detection: An Exploration of Temporal Multimodal Late Fusion Strategies.
Misha Libman, Gelareh Mohammadi
2025Personalized Adaptive Magnification in Gaze-Based Interaction.
Florian Eggenkemper, Jana Swerew, Teresa Rehers, Manuel Hanhoff, Constantin A. Rothkopf, Robert Mertens
2025Physics-Guided Exposure Parameter Estimation for Image Metadata Verification.
Sharmilee Rajkumar Rajan, Ming-Ching Chang, Pradeep K. Atrey
2025Prompted Vs. Organic Customer Insight: Comparing the Value of Focus Groups and Online Reviews in Product and Service Innovation.
Maren Schnieder, Ana Isabel Canhoto, Ramin Behbehani, Ahmad Beltagui, Niraj Kumar, Amirreza Alizamani
2025QoE Evaluation of BPP Packet Wash Using ROI-Based Scalable Video Coding.
Mohammadreza Ghafari, Thibault Cholez, Olivier Festor
2025Quality Assessment of Dynamic 3D Model in Virtual Reality: Effects of Level of Detail and Viewing Distance.
Duc V. Nguyen, Nguyen Thi Quynh Ly, Truong Thu Huong
2025RAG Chatbots for Educational Virtual Field Trips.
Suryaprakash Reddy Kalvakolu, Heinrich Söbke, Florian Wehking, Mukesh Chandra Kumar Mamidala, Eckhard Kraft
2025Recognition of Pitching Habits Using Multimodal Data of RGB Video and Skeleton.
Satoki Hidaka, Kazuhiro Hotta
2025Rendering Compressed Point Clouds with a Voxel-Based Method.
Hyungwoo Kang, YeoJun Yoon, Joong-Hwan Baek, Byung Tae Oh
2025Secure AI-Driven Super-Resolution for Real-Time Mixed Reality Applications.
Mohammad Waquas Usmani, Sankalpa Timilsina, Michael Zink, Susmit Shannigrahi
2025Smarter Traps: Neural Network-Driven Classification of Small Mammals.
William Menz, Ralf Dittrich, Rakesh Rao Ramachandra Rao, Steve Göring, Alexander Raake
2025Syntax-Aware Transformer for Sentiment Analysis of Japanese SNS Text.
Sotaro Shiozawa, Akira Kubota
2025SynthMed: Generating and Detecting Multimodal Deepfakes for Healthcare Communication.
Mariano Barone, Francesco Di Serio, Vincenzo Moscato, Marco Postiglione, Giuseppe Riccio, Antonio Romano
2025TARS: Temporal-Spatial Adaptation for Volumetric Video Streaming.
Hadi Heidarirad, Amir Allahveran, Mea Wang
2025The Sustainability Card: Measuring Sustainability of Multimedia AI Models.
Francesco Pistolesi, Michele Baldassini, Matteo Mugnai, Beatrice Lazzerini
2025Video Classification of Marchantia Polymorpha Using a Video Vision Transformer with Emphasized Channel Information.
Haruhiko Murata, Naoki Minamino, Takashi Ueda, Yohei Kondo, Kazuhiro Hotta