MMSys A

61 papers

Year Title / Authors
2026 A Comprehensive Long-duration 8K Dataset to Benchmark Hardware Encoding for Live 360° Video Tiled Streaming.
Olivier Brochu, Aris Leivadeas, Stéphane Coulombe
2026 A Procedural Generation System for Chinese Traditional Artifacts Based on Fractal Structures.
Chuanping Lyu, Zhe Li, Zhen Yu
2026 AI-Assisted Energy-Efficient Multimedia Systems.
Zoha Azimi Ourimi
2026 AR as an Evaluation Playground: Bridging Metric and Visual Perception of Computer Vision Models.
Ashkan Ganj, Yiqin Zhao, Tian Guo
2026 ARBot: A High-Fidelity Robotic Manipulator Teleoperation Framework for Human-Centered Augmented Reality Evaluation.
Harsh Chhajed, Tian Guo
2026 Accelerating Diffusion Models with One-Step Distillation for Image and Video Super-Resolution.
Alessio Bugetti, Leonardo Galteri, Marco Bertini
2026 Atom: Efficient On-Device Video-Language Pipelines Through Modular Reuse.
Kunjal Panchal, Saayan Mitra, Somdeb Sarkhel, Haoliang Wang, Ishita Dasgupta, Gang Wu, Hui Guan
2026 Audio Made Simple: A Modern Framework for Audio Processing.
Jack Geraghty, Fatemeh Golpayegani, Andrew Hines
2026 CADENCE: A Multi-Agent CDN Steering and Experimentation Platform for Dash.js.
Jashanjot Singh Sidhu, Mohammad Parsa Toopchinezhad, Abdelhak Bentaleb
2026 CADENCE: Collaborative Multi-Agent Dual-Objective Framework for Intelligent CDN Selection.
Jashanjot Singh Sidhu, Chidambar Joshi, Abdelhak Bentaleb
2026 Calliope: A TTS-based Narrated E-book Creator Ensuring Exact Synchronization, Privacy, and Layout Fidelity.
Hugo Lewi Hammer, Vajira Thambawita, Pål Halvorsen
2026 Camera-Ready? Exploring Transport-Layer Performance Limits with GigE Vision.
Malte Wehmeier, Eric Lanfer, Kathrin Elmenhorst, Nils Aschenbruck
2026 Convolutions Need Registers Too: HVS-Inspired Dynamic Attention for Video Quality Assessment.
Mayesha Maliha Rahman Mithila, Mylène C. Q. Farias
2026 CubeGS: Cube-wise Motion Residual Prediction for Gaussian Splatting Streaming.
Syed Ali John Naqvi, Mea Wang, Emir Halepovic
2026 DK-MARC: Risk-Adaptive 360° Streaming via Dual-Uncertainty Quantification and Multi-Modal Fusion.
Arman Nik Khah, Ravi Prakash
2026 Delivering Layered Object-Based Media using WebAssembly with Selective Cloud Rendering.
Barry Porter, Rajiv Ramdhany, Nicholas J. P. Race
2026 Depth-Aware Adaptive Video Streaming for Safe and Efficient Remote Autonomous Vehicle Supervision.
Ritik Vaishnav, Arani Bhattacharya
2026 ELLMPEG: An Edge-based Agentic LLM Video Processing Tool.
Zoha Azimi Ourimi, Reza Farahani, Radu Prodan, Christian Timmerer
2026 FedVSR: Towards Model-Agnostic Federated Learning in Video Super-Resolution.
Ali Mollaahmadi Dehaghi, Hossein KhademSohi, Reza Razavi, Steve Drew, Mohammad Moshirpour
2026 Fidelity as Forgetting: Rethinking Reconstruction in Neural Volumetric Media: A VR Artwork on Stereotyped Images, Situated Viewpoints, and Contemporary History.
Yin Bing, Shuyi Wang, Angela Chulei Tang, Siyun Wang, Yuan Zhang
2026 GameLab: AI-Enabled Cloud Gaming Testbed.
Haseeb Ur Rehman, Shervin Shirmohammadi, Ihab Amer, Mohamed Hefeeda
2026 GestureSync: Low-Latency Hand Gesture Control for Real-Time Audiovisual Performance.
Chengyao Li, Weihua Zheng
2026 HAPhy: AI-Assisted Content Creator Supporting Tools for Multi-sensory Haptic Experiences in Immersive Media.
Seoyong Nam, Hyunwook Jung, Yongjae Yoo
2026 ISM: Intelligent Multi-Path Scheduler for Multi-Camera Networked Systems.
Alireza Mohammadhosseini, Jacob Chakareski, Mallesham Dasari
2026 Ink Voyage: An Immersive Embodied Experience of Landscape Paintings Based on Inertial Motion Capture.
Jinfan Qian, Xin Ge, Xiaojiao Chen, Yuyang Wang
2026 JNTD-DS: A Benchmark Dataset for Just Noticeable frame rate-based Temporal Difference in Perceptual Video Coding.
Sanaz Nami, Farhad Pakdaman, Sahab Taali, Mahmoud Reza Hashemi, Shervin Shirmohammadi, Moncef Gabbouj
2026 LMG: Efficient Streaming of Layered Mesh-Gaussian 3D Scenes.
Yuan-Chun Sun, Guodong Chen, Sam Ziaie Kondori, Mallesham Dasari, Cheng-Hsin Hsu
2026 Lag-Busting WebRTC: L4S-Enabled Adaptive Video Streaming.
Yu You, Chia-Yu Chang, Koen De Schepper
2026 Learning-Augmented 360° Video Streaming: Robust Viewport Adaptation with Simple Predictors.
Tianyu Chen, Mohammad Hajiesmaili, Ramesh K. Sitaraman
2026 MARs: Multi-Scale Convolution-Attention residual Fusion for Video Summarization.
Joon-Seok Song, Juyeob Lee, Eunil Park
2026 MOQtail: Open-Source, IETF-Compliant MOQT Protocol Libraries.
Zafer Gurel, Deniz Ugur, Ali C. Begen
2026 MobileMold: A Smartphone-Based Microscopy Dataset for Food Mold Detection.
Dinh Nam Pham, Leonard Prokisch, Bennet Meyer, Jonas Thumbs
2026 MoonAnything: A Vision Benchmark with Large-Scale Lunar Supervised Data.
Clementine Grethen, Yuang Shi, Simone Gasparini, Géraldine Morin
2026 More Pixels, Less Bandwidth: A Live Demo of VSR-Bench over WebRTC.
Matin Fazel, Abdelhak Bentaleb
2026 MultiSenseVR: An open multimodal dataset for human pose estimation and perception in interactive VR.
Javad Sameri, Nabeel Nisar Bhat, Filip De Turck, Rafael Berkvens, Jeroen Famaey, Maria Torres Vega
2026 Multisensory Interactive Installation Design Based on Electronic Textiles and Procedural Generation.
Chuanping Lyu, Ruotong Zhao, Yuwen Lan, Zhe Li, Zhen Yu
2026 NAVIS: Web-Native Interactive Visualization of Dynamic Point-Cloud Video.
Jashanjot Singh Sidhu, Jeremy Ouellette, Abdelhak Bentaleb
2026 NILO: Nested Iterative Optimization for Video Bitrate Ladder Construction.
Sagar Bharadwaj Kalasibail Seetharam, Renata Teixeira, Xiaoqing Zhu, Kyle Swanson, Srinivasan Seshan
2026 Next-Generation Video QoE Optimization: Bridging Metrics and Perception in DASH ABR.
Syed Uddin
2026 P-GSVC: Layered Progressive 2D Gaussian Splatting for Scalable Image and Video.
Longan Wang, Yuang Shi, Wei Tsang Ooi
2026 PRISM: Patch-Aware Reinforcement Intelligence for Strategic Multipath.
Jyoti Shokhanda, Arani Bhattacharya
2026 Prism: A Trace-Driven Simulator for Component-Aware Adaptive Streaming of V3C Content.
Jeremy Ouellette, Abdelhak Bentaleb
2026 Proceedings of the ACM Multimedia Systems Conference 2026, MMSys 2026, Hong Kong, SAR, China, April 4-8, 2026
2026 QUEST-PCC: A Reference Dataset for Content-Aware V-PCC Streaming and Compression.
Jashanjot Singh Sidhu, Abdelhak Bentaleb, Ahmed Hamza, Srinivas Gudumasu
2026 ReLadder: A Network-Adaptive and QoE-Optimizing Bitrate Ladder Adjusting Framework.
Renfei Shang, Chen Liu, Si Chen, Lemei Huang, Tongyu Dai, Wenhao Zhang
2026 Remote Particle Trajectory Tracking using Event-Based Vision Streams.
Pina Kolling, Andrew C. Freeman, Amr Rizk
2026 Research Proposal: Non-intrusive Stress Recognition using Multimodality Deep Learning.
Chau Thi Thuy Tran, Carsten Griwodz, Kai Morgan Kjølerbakken, Çagri Erdem, Anis Yazidi
2026 Robust Adaptive Algorithms for High-Fidelity Next-Generation Video Streaming.
Tianyu Chen
2026 SPARC: Proximity-aware Scheduling of AR Mapping and Cloud-based GenAI Upsampling for Efficient Multi-User SLAM.
Shneka Muthu Kumara Swamy, Mallesham Dasari, Nicholas Mastronarde, Jacob Chakareski
2026 Scaling The Cheer: Co-Viewing with Dual-Mode MOQ Transport.
Ayse B. Demir, Mervegul Parlak, Zafer Gurel, Alperen F. Zengin, Ali C. Begen, Burak Kara
2026 SportSBD: Shot Boundary Detection in Sports Footage.
Mehdi Houshmand Sarkhoosh, Cise Midoglu, Saeed Shafiee Sabet, Tomas Kupka, Dag Johansen, Pål Halvorsen
2026 Super-Resolution Meets Compression in Live VV: Light on Bits, Rich on Quality.
Sepehr Ganji, Amir Allahveran, Mea Wang, Diwakar Krishnamurthy
2026 TAROT: Towards Optimization-Driven Adaptive FEC Parameter Tuning for Video Streaming.
Jashanjot Singh Sidhu, Aman Sahu, Abdelhak Bentaleb
2026 Time Travel in MOQ Conferencing.
Zafer Gurel, Kerem Bekmez, Ahmet Pehlivanoglu, Alperen F. Zengin, Ali C. Begen
2026 Towards a Unified Learning-based Video Compression and Streaming Pipeline.
Hannes Keunen
2026 Trinity: Exploiting Latency Sensitivity to Improve Quality of Experience on Cloud VR Gaming.
Lingzhi Zhao, Yongqiang Gui, Yanyan Suo, Sandesh Dhawaskar Sathyanarayana, Ruixiao Zhang, Klara Nahrstedt, Shu Shi
2026 Unified Compression of Point Cloud Geometry and Attributes through Variable-Rate Conditioning.
Michael Rudolph, Aron Riemenschneider, Amr Rizk
2026 V3CTK: An End-to-End V3C Content Preparation Toolkit for Tiled Dynamic Point Cloud Streaming.
Jeremy Ouellette, Abdelhak Bentaleb, Jashanjot Singh Sidhu
2026 VIDEA-Dublin Dataset: 8K 60FPS Video Sequences for Analysis and Development.
Tariq Al Shoura, Ali Mollaahmadi Dehaghi, Reza Razavi, Mohammad Moshirpour
2026 VSR-Bench: An Open-Source Platform for Browser-Native Real-Time VSR Evaluation in WebRTC.
Matin Fazel, Abdelhak Bentaleb
2026 iSR: Super-resolution for Immersive Cloud VR Gaming Platforms.
Ghazaleh Bakhtiariazad, Haseeb Ur Rehman, Shervin Shirmohammadi, Ihab Amer, Mohamed Hefeeda