| 2024 | A Power-Law Transformation Approach for Template-Based Cross-Component Prediction. Zhikai Liu, Kun Zhang, Xin-Yi Cui, Wei Sun, Fan Liang |
| 2024 | A Server-driven View-aware Point Cloud Video Streaming Framework. Tran Gia Minh, Truong Thu Huong, Duc V. Nguyen |
| 2024 | A Simulation for the Evaluation of the Mean Opinion Score (MOS) for EVS-WB and AMR-WB Audio Codecs for 5G Mobile Networks. Jussif J. Abularach Arnez, Cassio A. Tavares Alves, Wederson Medeiros Silva, Isaac Barros Gomes, Carla Lapa Nogueira, Maria G. Lima Damasceno |
| 2024 | A Study on Mental Stress Test using Cybersickness caused by Virtual Reality Contents. Nan Bu, Kakeru Nakano |
| 2024 | A technical Concept for enhancing the Student Experience in Hybrid Lecture Scenarios. Florian Schimanke, Robert Mertens, Felix Prankel |
| 2024 | AI Maintenance Techniques by Detecting Performance Degradation in Domain Shift Using Model Ensembles. Keita Yamane, Akira Kitayama, Keigo Hasegawa, Yusuke Obonai, Hiroto Sasao |
| 2024 | Appeal prediction for AI up-scaled Images. Steve Göring, Rasmus Merten, Alexander Raake |
| 2024 | Characterizing students behavior in multi-user multi-computer testing environments. Rajini Chittimalla, Sujung Choi, Madhu Sai Vineel Reka, Yassine Belkhouche |
| 2024 | Cross-Modal 3D Model Retrieval. Raphael Waltenspül, Florian Spiess, Heiko Schuldt |
| 2024 | Data Augmentation with Diffusion Model for Hand Detection. Genta Matsukawa, Atsuo Yoshitaka |
| 2024 | Disparity Correction Method of the Monocular Omnidirectional Stereo Camera. Hisayoshi Kaneda, Ryota Kawamata, Kazuyoshi Yamazaki, Kazuya Shimizu |
| 2024 | Ensuring Color Consistency in RGB-D Multi-Camera Setup. Peter O. Fasogbon |
| 2024 | Evaluating Interactive Concept Maps Produced from E-Portfolios. Alexander Gantikow, Andreas Isking, Wolfgang Müller, Paul Libbrecht, Sandra Rebholz |
| 2024 | Evaluation Framework for Novel View Synthesis. Kolja Kieslich, Louay Bassbouss, Stephan Steglich, Stefan Arbanowski |
| 2024 | Evaluation of strategies for efficient rate-distortion NeRF streaming. Pedro Martin, António Rodrigues, João Ascenso, Maria Paula Queluz |
| 2024 | Exploring Augmented Table Setup and Lighting Customization in a Simulated Restaurant to Improve the User Experience. Jana Motowilowa, Maurizio Vergari, Tanja Kojic, Maximilian Warsinke, Sebastian Möller, Jan-Niklas Voigt-Antons |
| 2024 | Flexible And Faithful Data Insights Generation. Wei Zhang, Victor Soares Bursztyn |
| 2024 | FrameCorr: Adaptive, Autoencoder-based Neural Compression for Video Reconstruction in Resource and Timing Constrained Network Settings. John Li, Deepak Nair, Klara Nahrstedt, Indranil Gupta, Shehab Sarar Ahmed |
| 2024 | Fusion-Based Human Pose Estimation Using RGB and IR Images with Transformer-Based Decoding. Viviana Crescitelli, Takashi Oshima |
| 2024 | Gender Stereotypes in the Creation of Educational Cases with ChatGPT. Gabriel Valerio-Ureña, Giomara Sevilla-Campoverde, Soledad Ortúzar, Christian Lazcano |
| 2024 | Generating Bass Phrases from Guitar Chord Backing with NMF. Tomoo Kouzai, Junya Koguchi, Tetsuro Kitahara |
| 2024 | Generating and Evaluating Cursive Chinese Calligraphy by Semi-Classifying Style: A Case Study Using a Diffusion Model. Yi-Chieh Wu, Yu-Jung Hsu |
| 2024 | Holistic Visualization of Contextual Knowledge in Hotel Customer Reviews Using Self-Attention. Shuntaro Masuda, Toshihiko Yamasaki |
| 2024 | Homophonic Music Composition Using a GAN and LSTM Pipeline for Melody and Harmony Generation. Clément Saint-Marc, Katunobu Itou |
| 2024 | Human-in-the-loop knowledge base upkeep for retrieval augmented generation applications. Pedro Baptista de Castro, Hiroko Sukeda, Soichi Takashige |
| 2024 | IEEE International Symposium on Multimedia, ISM 2024, Tokyo, Japan, December 11-13, 2024 |
| 2024 | Influence of Display Devices and Field of View on Subjective Quality of Experience Evaluation of 8K 360° Videos. Daichi Arai, Yuichi Kondo, Yasuko Sugito, Yuichi Kusakabe |
| 2024 | Instrumentality Classification Evaluation System for Natural Sounds. Yuhuan Wang, Katunobu Itou |
| 2024 | Investigating the Impact of High Frame Rate on Video Quality: A SAMVIQ Approach. Dominik Keller, Paul Rudi Frank, Steve Göring, Alexander Raake |
| 2024 | Investigation of Feature Distribution and Network Weight Updates in the Machine Unlearning Process. Wen-Hung Liao, Yang-Jing Lin |
| 2024 | LMM-Regularized CLIP Embeddings for Image Classification. Maria Tzelepi, Vasileios Mezaris |
| 2024 | LiveSkeleton: High-Quality Real-Time Human Tracking and Pose Estimation. Hannes Fassold |
| 2024 | Low Complexity Learning-based Lossless Event-based Compression. Ahmadreza Sezavar, Catarina Brites, João Ascenso |
| 2024 | Low-latency Software-based Uncompressed Video Transmission. Takuro Yamaguchi, Yasuhiro Mochida, Hirokazu Takahashi |
| 2024 | Modeling User Quality of Experience in Adaptive Point Cloud Video Streaming. Duc V. Nguyen, Quang Long Nguyen, Tran Thuy Hien, Nguyen Ngoc Huyen, Truong Thu Huong, Pham Ngoc Nam |
| 2024 | Modelling Concurrent RTP Flows for End-to-end Predictions of QoS in Real Time Communications. Tailai Song, Paolo Garza, Michela Meo, Maurizio Matteo Munafò |
| 2024 | Multi-View Gesture Recognition in Conflict Situations. Karam Dawoud, Birgit Nierula, Farelle Toumaleu Siewe, Thomas Koch, Daniel Johannes Meyer, Andreas Bock, Marianne Heinze, Daniela Knuth, Denis Martin, Julia Schander, Anna Hilsmann, Peter Eisert, Sebastian Bosse |
| 2024 | Occlusion-Aware Real-Time Tiny Facial Alignment Model for Makeup Virtual Try-On. Kin Ching Lydia Chau, Zhi Yu, Ruowei Jiang |
| 2024 | On Multi-CDN Delivery Costs Optimization Problem. Yuriy A. Reznik, Guillem Cabrera |
| 2024 | PanoramaViewer - A Framework for Educational Collaborative Virtual Field Trips. Mario Wolf, Sebastian Hartwig, Gregor Steinhöfel, Heinrich Söbke, Eckhard Kraft |
| 2024 | Perceptual Quality Driven Point Cloud Compression for 6DoF 3D Point Cloud Streaming. Yumeka Chujo, Yusuke Tagashira, Yukiko Harada, Kenji Kanai, Jiro Katto |
| 2024 | Platform for Endangered Language Education. Greeshma Sree Parimi, Gurkirat Singh Guliani, Min Chen |
| 2024 | PlayerTV: Advanced Player Tracking and Identification for Automatic Soccer Highlight Clips. Håkon Maric Solberg, Mehdi Houshmand Sarkhoosh, Sushant Gautam, Saeed Shafiee Sabet, Pål Halvorsen, Cise Midoglu |
| 2024 | Prevention of Unexpected Object Generation in Diffusion Model-Based Inpainting. Takumi Komori, Takahiro Hayashi |
| 2024 | Real-time Multi-modal Highlight Prediction for Simultaneous Viewing of Multiple Live Streams. Yusuke Maeda, Takahiro Hayashi |
| 2024 | S2MGen: A synthetic skin mask generator for improving segmentation. Subhadra Gopalakrishnan, Trisha Mittal, Jaclyn Pytlarz, Yuheng Zhao |
| 2024 | Slide Analysis Method for Editing Lecture Materials based on Hierarchical Structures of Subject Terminologies. Itsuki Sano, Yuanyuan Wang, Yukiko Kawai, Kazutoshi Sumiya |
| 2024 | Sliding Window Check: Repairing Object Identities. Geerthan Srikantharajah, Naimul Khan |
| 2024 | SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset. Sushant Gautam, Mehdi Houshmand Sarkhoosh, Jan Held, Cise Midoglu, Anthony Cioppa, Silvio Giancola, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah |
| 2024 | Speaker Pseudonymization for Japanese Speech Using Duration Embeddings. Aoi Ito, Katunobu Itou |
| 2024 | SpotiView: Partial Face Display Method for Smooth Communication While Protecting Privacy. Ryota Kishimoto, Shuhei Tsuchida, Tsutomu Terada, Masahiko Tsukamoto |
| 2024 | StegoFusion-Net: Fusion of Convolutional Neural Networks for Spatial Image Steganalysis. Yassine Belkhouche, AlaaIdin Dwaik |
| 2024 | Synchronized Object Sharing for Augmented Reality Virtual Conferencing. John O. Murray, Michael Zink |
| 2024 | The ≪Huh?≫ Button: Improving Understanding in Educational Videos with Large Language Models. Boris Ruf, Marcin Detyniecki |
| 2024 | Two-stage instrument timbre transfer method using RAVE. Di Hu, Katunobu Itou |
| 2024 | Ultra-low-latency 8K120p-video-transmission System Parallelizing SMPTE ST 2110. Yasuhiro Mochida, Takuro Yamaguchi, Hirokazu Takahashi, Koichi Takasugi |
| 2024 | Unveiling the Potential of SSL-Generated Audio Embeddings for Cross-Lingual Speaker Recognition. Wen-Hung Liao, Po-Han Chen, Yi-Chieh Wu |
| 2024 | VEMOCLAP: A video emotion classification web application. Serkan Sulun, Paula Viana, Matthew E. P. Davies |
| 2024 | Visual Speech Recognition with Surrounding and Emotional Information. Pengcheng Zeng, Atsuo Yoshitaka |
| 2024 | Watch your back! Dynamic thumbnails for a 360-degree video player to enhance viewing experience on 2D displays. Jakub Kovác, Wolfgang Hürst |