| 2025 | A Block-Level Fine-Graining Framework for Multimodal Fusion in Federated Learning. Guozhi Zhang, Mengying Jia, Shuyan Feng, Zixuan Liu |
| 2025 | A Multifaceted Multi-Agent Framework for Zero-Shot Emotion Analysis and Recognition of Symbolic Music. Jiahao Zhao, Yunjia Li, Kazuyoshi Yoshii |
| 2025 | A Multilingual Telegram Chatbot for Mental Health Data Collection. Danila Mamontov, Alexey Karpov, Wolfgang Minker |
| 2025 | A Multilingual, Multimodal Dataset for Disinformation and Out-of-Context Analysis with Rich Supportive Information. Shuhan Cui, Hanrui Wang, Ching-Chun Chang, Huy H. Nguyen, Isao Echizen |
| 2025 | A Multimodal Classroom Video Question-Answering Framework for Automated Understanding of Collaborative Learning. Nithin Sivakumaran, Chia-Yu Yang, Abhay Zala, Shoubin Yu, Daeun Hong, Xiaotian Zou, Elias Stengel-Eskin, Dan Carpenter, Wookhee Min, Cindy E. Hmelo-Silver, Jonathan P. Rowe, James C. Lester, Mohit Bansal |
| 2025 | A Scenario-Based Design Pack for Exploring Multimodal Human-GenAI Relations. Josh Andres, Chris Danta, Andrea Bianchi, Sahar Farzanfar, Gloria Milena Fernández Nieto, Alexa Becker, Tara Capel, Frances Liddell, Shelby Hagemann, Ned Cooper, Sungyeon Hong, Li Lin, Eduardo Benítez Sandoval, Anna Brynskov, Hubert Dariusz Zajac, Zhuying Li, Tianyi Zhang, Arngeir Berge |
| 2025 | A Systematic Review of Fusion Methods for the User-Centered Design of Multimodal Interfaces. Ronja Heinrich, Chris Zimmerer, Martin Fischbach, Marc Erich Latoschik |
| 2025 | A multimodal Framework for exploring behavioural cues for automatic Stress Detection. Rebecca Valerio, Marwa Mahmoud |
| 2025 | Adaptive Gen-AI Guidance in Virtual Reality: A Multimodal Exploration of Engagement in Neapolitan Pizza-Making. Ka Hei Carrie Lau, Sema Sen, Philipp Stark, Efe Bozkir, Enkelejda Kasneci |
| 2025 | Affective and Physiological Responses to Immersive Intangible Cultural Heritage Experiences in Extended Reality. Fasih Haider, Sofia de la Fuente Garcia, Alicia Núñez García, Saturnino Luz |
| 2025 | AirSpartOne: One-Handed Distal Pointing for Large Displays on Mobile Devices and in Midair. Martin Birlouez, Yosra Rekik, Laurent Grisoni |
| 2025 | All of That in 15 Minutes? Exploring Privacy Perceptions Across Cognitive Abilities via Ad-hoc LLM-Generated Profiles Inferred from Social Media Use. Kirill Kronhardt, Sebastian Hoffmann, Fabian Adelt, Max Pascher, Jens Gerken |
| 2025 | Analyzing Character Representation in Media Content using Multimodal Foundation Model: Effectiveness and Trust. Evdoxia Taka, Debadyuti Bhattacharya, Joanne Garde-Hansen, Sanjay Sharma, Tanaya Guha |
| 2025 | Beyond Utterance: Understanding Group Problem Solving through Discussion Sequences. Zhuoxu Duan, Zhengye Yang, Brooke Foucault Welles, Richard J. Radke |
| 2025 | BiFuseNet: A Multimodal Network for Estimating Blood Alcohol Concentration via Bidirectional Hierarchical Fusion. Abdullah Tariq, Arooba Maqsood, Martin Masek, Syed Zulqarnain Gilani |
| 2025 | CCMI 2025: Cross-Cultural Multimodal Interaction. Koji Inoue, Shogo Okada, Divesh Lala, Sahba Zojaji, Nancy F. Chen, Tatsuya Kawahara |
| 2025 | Can Adaptive Interviewer Robot Based on Social Signals Make a Better Impression on Interviewees and Encourage Self-Disclosure? Fuminori Nagasawa, Shogo Okada |
| 2025 | Causal Explanation of the Quality of Parent-Child Interactions with Multimodal Behavioral Features. Katherine Guerrerio, Lujie Karen Chen, Lisa Berlin, Brenda Jones Harden |
| 2025 | Cognitive Effort Analysis in Digital Learning Environments. Shayla Sharmin |
| 2025 | Converting Spatial to Social: Using Persistent Homology to Understand Social Groups. Valerie K. Chen, Claire Liang, Julie A. Shah, Sean Andrist |
| 2025 | Decoding Affective States without Labels: Bimodal Image-brain Supervision. Vadym Gryshchuk, Maria Maistro, Christina Lioma, Tuukka Ruotsalo |
| 2025 | Decoding social interaction to understand traumatic behaviours in social dynamics. Pritesh Nalinbhai Contractor |
| 2025 | Demographic User Modeling for Social Robotics with Multimodal Pre-trained Models. Hamed Rahimi, Mouad Abrini, Jeanne Malecot, Ying Lai, Adrien Jacquet Crétides, Mahdi Khoramshahi, Mohamed Chetouani |
| 2025 | Designing Multimodal Nonverbal Communication Cues for Multirobot Supervision Through Event Detection and Policy Mapping. Richard Attfield |
| 2025 | Designing and Evaluating Gen-AI for Cultural Resilience. Ka Hei Carrie Lau |
| 2025 | Designing for Meaningful Oversight: Human and Organisational Agency in Multimodal AI Systems. Liming Zhu |
| 2025 | Developing Virtual Reality (VR) Simulations with Embedded User Analytics for Cognitive Rehabilitation in PTSD Veterans. Ravi Varman Selvakumaran |
| 2025 | Differentiating Frustration from Cognitive Workload in a Dual-task System. Heting Wang |
| 2025 | DifussionCleft: Facial Anomaly Synthesis Guided by Text. Karen Rosero, Lucas M. Harrison, Alex A. Kane, Rami R. Hallac, Carlos Busso |
| 2025 | Disentangling Cross-Modal Interactions for Enhanced Multimodal Emotion Recognition in Conversation. Jian Ding, Bo Zhang, Dailin Li, Jian Wang, Hongfei Lin |
| 2025 | Disentangling Perceptual Ambiguity in Multifunctional Nonverbal Behaviors in Conversations via Tensor Spectrum Decomposition. Issa Tamura, Momoka Tajima, Shiro Kumano, Kazuhiro Otsuka |
| 2025 | Enhancing Accessibility in Animation: A Context-Aware Audio Description System for Visually Impaired Children. Md. Fahad Bin Zamal |
| 2025 | Enhancing Gaze Prediction in Multi-Party Conversations via Speaker-Aware Multimodal Adaptation. Meng-Chen Lee, Zhigang Deng |
| 2025 | Evaluating the Efficacy of Pulse Transit Time between Palm and Forehead in Blood Pressure Estimation. Chuchu Qiu, Jing Wei Chin, Tsz Tai Chan, Kwan Long Wong, Richard Hau Yue So |
| 2025 | Exploring the Impact of Distance on XR Selection Techniques. Becky Spittle, Maite Frutos-Pascual, Chris Creed, Ian Williams |
| 2025 | Exploring the effects of force feedback on VR Keyboards with varying visual designs. Zhenxing Li, Jari Kangas, Ahmed Farooq, Roope Raisamo |
| 2025 | Few-shot Fine-grained Image Classification with Interpretable Prompt Learning through Distribution Alignment. Dongliang Guo, Handong Zhao, Ryan A. Rossi, Sungchul Kim, Nedim Lipka, Tong Yu, Sheng Li |
| 2025 | Foundation Feature-Guided Hierarchical Fusion of EEG-Physiological for Emotion Estimation. Haifeng Zhang, Von Ralph Dane Marquez Herbuela, Yukie Nagai |
| 2025 | From Lab to Wrist: Bridging Metabolic Monitoring and Consumer Wearables for Heart Rate and Oxygen Consumption Modeling. Barak Gahtan, Sanketh Vedula, Gil Samuelly Leichtag, Einat Kodesh, Alex M. Bronstein |
| 2025 | From Speech and PPG to EDA: Stress Detection Based on Cross-Modal Fine-Tuning of Foundation Models. Alia Ahmed Al Dossary, Mathieu Chollet, Alessandro Vinciarelli |
| 2025 | Functional Near-Infrared Spectroscopy (fNIRS) Analysis of Interaction Techniques in Touchscreen-Based Educational Gaming: fNIRS Analysis of Interaction Techniques in Touchscreen-Based Educational Gaming. Shayla Sharmin, Elham Bakhshipour, Mohammad Fahim Abrar, Behdokht Kiafar, Pinar Kullu, Nancy Getchell, Roghayeh Leila Barmaki |
| 2025 | HRAI 2025: The 1st Workshop on Holistic and Responsible Affective Intelligence. Yuanchao Li, Dimitrios Kollias, Guillaume Chanel, Marios A. Fanourakis, Michal Muszynski, Brandon M. Booth, Leimin Tian, Madhawa Perera, Catherine Lai, Huili Chen |
| 2025 | Human Authenticity and Flourishing in an AI-Driven World: Edmund's Journey and the Call for Mindfulness. Sebastian Zepf, Mark Colley |
| 2025 | ICMI'25 Grand Challenge: A Thermal and Spectral Multimodal Image Dataset for Contaminant Detection in Industrial Organic Food Waste. Matthew Vestal, James Ireland, Xing Wang, Ram Subramanian, Damith Herath |
| 2025 | Investigating differences in Paramedic trainees' multimodal interaction during low and high physiological synchrony. Vasundhara Joshi, Surely Akiri, Sanaz Taherzadeh, Gary Williams, Andrea Kleinsmith |
| 2025 | Knowledge Graphs and Fine-Grained Visual Features: A Potent Duo Against Cheapfakes. Tuan-Vinh La, Minh-Hieu Nguyen, Minh-Son Dao |
| 2025 | Large Language Models For Multimodal User Interaction in Virtual Environments. Ahmed Sayed, Kevin Pfeil |
| 2025 | LayLens: Improving Deepfake Understanding through Simplified Explanations. Abhijeet Narang, Parul Gupta, Liuyijia Su, Abhinav Dhall |
| 2025 | Learning Multimodal Motion Cues for Online End-of-Turn Prediction in Multi-Party Dialogue. Meng-Chen Lee, Zhigang Deng |
| 2025 | Leveraging Pre-Trained Transformers and Facial Embeddings for Multimodal Hirability Prediction in Job Interviews. Eric Fithian, Theodora Chaspari |
| 2025 | Lightweight Transformers for Isolated Sign Language Recognition. Cristina Luna Jiménez, Lennart Eing, Annalena Bea Aicher, Fabrizio Nunnari, Elisabeth André |
| 2025 | MENA: A Multimodal Framework for Analyzing Caregiver Emotions and Competencies in AR Geriatric Simulations. Behdokht Kiafar, Pavan Uttej Ravva, Salam Daher, Asif Ahmmed Joy, Roghayeh Leila Barmaki |
| 2025 | MERD-360VR: A Multimodal Emotional Response Dataset from 360° VR Videos Across Different Age Groups. Qiang Chen, Shikun Zhou, Yuming Fang, Dan Luo, Tingsong Lu |
| 2025 | MUSE: A Multimodal, Generative, and Symbolic Framework for Human Experience Modeling. Mohammad Rashedul Hasan |
| 2025 | Modeling Social Dynamics from Multimodal Cues in Natural Conversations. Kevin Hyekang Joo |
| 2025 | Motion Diffusion Autoencoders: Enabling Attribute Manipulation in Human Motion Demonstrated on Karate Techniques. Anthony Richardson, Felix Putze |
| 2025 | Multimodal AI for Transforming Industries and Empowering Social Interaction. Fang Chen |
| 2025 | Multimodal Analysis of Caregiving Interactions in Simulation-Based Training. Behdokht Kiafar |
| 2025 | Multimodal Analysis of Disagreement in Dyadic Conversations: An Approach Based on Emotion Recognition. Areej Buker, Emily Smith, Olga Perepelkina, Alessandro Vinciarelli |
| 2025 | Multimodal Behavioral Characterization of Dyadic Alliance in Support Groups. Kevin Hyekang Joo, Zongjian Li, Yunwen Wang, Yuanfeixue Nan, Mina J. Kian, Shriya Upadhyay, Maja J. Mataric, Lynn Carol Miller, Mohammad Soleymani |
| 2025 | Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning. Dongyang Guo, Yasmeen Abdrabou, Enkeleda Thaqi, Enkelejda Kasneci |
| 2025 | Multimodal Conversational Events Estimation in Complex Social Scenes. Litian Li |
| 2025 | Multimodal LLM using Federated Visual Instruction Tuning for Visually Impaired. Ankith Bala, Alina Vereshchaka |
| 2025 | Multimodal Quantitative Measures for Multiparty Behavior Evaluation. Ojas Shirekar, Wim T. J. L. Pouw, Chenxu Hao, Vrushank Phadnis, Thabo Beeler, Chirag Raman |
| 2025 | Multimodal Synthetic Data Finetuning and Model Collapse: Insights from VLMs and Diffusion Models. Zizhao Hu, Mohammad Rostami, Jesse Thomason |
| 2025 | Multimodal Task Analysis in Wearable Contexts. Julien Epps |
| 2025 | Pinching Visuo-haptic Display: Investigating Cross-Modal Effects of Visual Textures on Electrostatic Cloth Tactile Sensations. Takekazu Kitagishi, Chun Wei Ooi, Yuichi Hiroi, Jun Rekimoto |
| 2025 | Please Let Me Think: The Influence of Conversational Fillers on Transparency and Perception of Waiting Time when Interacting with a Conversational AI in Virtual Reality. David Obremski, Paula Friedrich, Carolin Wienrich |
| 2025 | PoseDoc: An Interactive Tool for Efficient Annotation in Human Pose Estimation. Chengyu Fan, Tahiya Chowdhury |
| 2025 | Predicting End-of-turn and Backchannel Based on Multimodal Voice Activity Prediction Model. Ryo Ishii, Shin'ichiro Eitoku, Ryota Yokoyama, Junichi Sawase |
| 2025 | Privileged Contrastive Pretraining for Multimodal Affect Modelling. Kosmas Pinitas, Konstantinos Makantasis, Georgios N. Yannakakis |
| 2025 | Proceedings of the 27th International Conference on Multimodal Interaction, ICMI 2025, Canberra, Australia, October 13-17, 2025 Ram Subramanian, Yukiko I. Nakano, Tom Gedeon, Mohan Kankanhalli, Tanaya Guha, Jainendra Shukla, Gelareh Mohammadi, Oya Çeliktutan |
| 2025 | Psychological and Neurophysiological Indicators of Stress and Relaxation in Immersive Virtual Reality Environments: A Multimodal Approach. Ankit Arvind Prasad, Shashank Laxmikant Bidwai, Ashutosh Jitendra Zawar, Diven Ashwani Ahuja, Apostolos Kalatzis, Vishnunarayan Girishan Prabhu |
| 2025 | Punctual or Continuous? Analyzing Depression Traces in Language and Paralanguage with Multiple Instance Learning. Rawan Alsarrani, Anna Esposito, Alessandro Vinciarelli |
| 2025 | Real-time Generation of Various Types of Nodding for Avatar Attentive Listening System. Kazushi Kato, Koji Inoue, Divesh Lala, Keiko Ochi, Tatsuya Kawahara |
| 2025 | Realtime Multimodal Emotion Estimation using Behavioral and Neurophysiological Data. Von Ralph Dane Marquez Herbuela, Yukie Nagai |
| 2025 | Seeing, Hearing, Feeling: Designing Multimodal Alerts for Critical Drone Scenarios. Nina Knieriemen, Anke Hirsch, Muhammad Moiz Sakha, Florian Daiber, Hannah Kolb, Simone Hüning, Frederik Wiehr, Antonio Krüger |
| 2025 | SignFlow: End-to-End Sign Language Generation for One-to-Many Modeling using Conditional Flow Matching. Nabeela Khan, Bowen Wu, Sihan Tan, Carlos Toshinori Ishi, Kazuhiro Nakadai |
| 2025 | Simulated Insight, Real-World Impact: Enhancing Driving Safety with CARLA-Simulated Personalized Lessons and Eye-Tracking Risk Coaching. Wenbin Gan, Minh-Son Dao, Koji Zettsu |
| 2025 | SocialWise: LLM-Agentic Conversation Therapy for Individuals with Autism Spectrum Disorder to Enhance Communication Skills. Albert Tang |
| 2025 | Speech-to-Joy: Self-Supervised Features for Enjoyment Prediction in Human-Robot Conversation. Ricardo Santana, Bahar Irfan, Erik Lagerstedt, Gabriel Skantze, André Pereira |
| 2025 | SpikEy: Preventing Drink Spiking using Wearables. Zhigang Yin, Ngoc Thi Nguyen, Agustin Zuniga, Mohan Liyanage, Petteri Nurmi, Huber Flores |
| 2025 | StoryDiffusion: How to Support UX Storyboarding With Generative-AI. Zhaohui Liang, Xiaoyu Zhang, Kevin Ma, Zhao Liu, Xipei Ren, Kosa Goucher-Lambert, Can Liu |
| 2025 | Talking-to-Build: How LLM-Assisted Interface Shapes Player Performance and Experience in Minecraft. Xin Sun, Lei Wang, Yue Li, Jie Li, Massimo Poesio, Julian Frommel, Koen V. Hindriks, Jiahuan Pei |
| 2025 | Team Dynamics in Human-AI Collaboration: Effects on Confidence, Satisfaction, and Accountability. Mamehgol Yousefi, Ahmad Shahi, Mos Sharifi, Alvaro J. Jorge Romera, Simon Hoermann, Thammathip Piumsomboon |
| 2025 | The Crock of Shh: A Whispering Water Interface for Reshaping Reality. Brandon Waylan Ables |
| 2025 | The Fifth Edition of the Automated Assessment of Pain (AAP 2025). Zakia Hammal, Steffen Walter, Nadia Bianchi-Berthouze |
| 2025 | The Human Record Needle: A Novel Interface for Embodied Music Interaction. Brandon Waylan Ables |
| 2025 | Time-channel Adaptive Fusion and Hierarchical Attention Mechanism for Dynamic Hand Gesture Recognition. Longjie Huang, Jianhai Liu, Yong Gu, Kai Jiang, Haibo Li |
| 2025 | Towards Audio Personalization for Accessible Digital Media. Dhruv Jain, Jason Miller |
| 2025 | Towards Context-sensitive Emotion Recognition. Sayak Mukherjee |
| 2025 | Towards Intelligent Adaption in Cognitive Assistance Systems through Physiological Computing. Jordan Schneider |
| 2025 | Towards Seamless Interaction: Neuroadaptive Virtual Reality Interfaces for Target Selection. Jalynn Blu Nicoly |
| 2025 | USER-VLM 360: Personalized Vision Language Models with User-aware Tuning for Social Human-Robot Interactions. Hamed Rahimi, Adil Bahaj, Mouad Abrini, Mahdi Khoramshahi, Mounir Ghogho, Mohamed Chetouani |
| 2025 | Understanding and Supporting Multimodal AI Chat Interactions of DHH College Students: an Empirical Study. Nan Zhuang, Yanni Ma, Xin Zhao, Wang Ying, Shaolong Chai, Shitong Weng, Mengru Xue, Yuxi Mao, Cheng Yao |
| 2025 | Unobtrusive Universal Acoustic Adversarial Attacks on Speech Foundation Models in the Wild. Jayden Fassett, Anjila Budathoki, Jack Morris, Qin Hu, Yi Ding |
| 2025 | Using a Secondary Channel to Display the Internal Empathic Resonance of LLM-Driven Agents for Mental Health Support. Matthias Schmidmaier, Jonathan Rupp, Sven Mayer |
| 2025 | VitaStress: A Multimodal Dataset for Stress Detection. Paul Schreiber, Simon Burbach, Beyza Cinar, Lennart Mackert, Maria Maleshkova |
| 2025 | WatchHAR: Real-time On-device Human Activity Recognition System for Smartwatches. Taeyoung Yeon, Vasco Xu, Henry Hoffmann, Karan Ahuja |
| 2025 | What makes you say yes? An investigation of mental state and personality in persuasion during a dyadic conversation. Siyuan Chen |
| 2025 | When Robots Listen: Predicting Empathy Valence from Multimodal Storytelling Data. Jiayu Wang, Himadri Shekhar Mondal, Tom Gedeon, Md. Zakir Hossain |
| 2025 | When Words Fall Short: The Case for Conversational Interfaces that Don't Listen. James Simpson, Hamish Stening, Gaurav Patil, Patrick Nalepka, Mark Dras, Rachel W. Kallen, Simon G. Hosking, Michael J. Richardson, Deborah Richards |
| 2025 | Write! Draw! Move!: Investigating the Effects of Positive and Negative Self-Reflection on Emotion through Self-Expression Modalities. Golnaz Moharrer, Kavya Rajendran, Rowena Pinto, Andrea Kleinsmith |
| 2025 | mIoG: An Evaluation Metric for Multispectral InstanceSegmentation in Robotics. Yue Peng, Yizheng Liu, Mengxuan Liang |