| 2024 | 21st International Conference on Content-Based Multimedia Indexing, CBMI 2024, Reykjavik, Iceland, September 18-20, 2024 |
| 2024 | A Behavior and Emotion Recognition Framework for Emotion-Aware Services in Physical Spaces. Sari Järvinen, Johanna Kallio, Johannes Peltola, Satu-Marja Mäkelä |
| 2024 | A Comparison of Late-Fusion Training Strategies for Quad-Modal Joint Embeddings. Domenic Luca Fürer, Abraham Bernstein, Luca Rossetto |
| 2024 | A Concept Design for a Positive Mood Supporting Application. Aurora Saibene, Riccardo Giussani, Claudia Rabaioli, Nicolò Dozio, Francesca Gasparini, Francesco Ferrise |
| 2024 | A Framework for Vision-Based 3D Inspections for Maintenance Activities and Digital Twin Integration. Panagiotis Vrachnos, Carlos Ramonell, Ilias Koulalis, Konstantinos Ioannidis, Irina Stipanovic, Stefanos Vrochidis |
| 2024 | A Hybrid AI System for Fusion of Object and Context Information: Application to the Rail Line Defect Detection. Alexey Zhukov, Jenny Benois-Pineau, Alain Rivero, Akka Zemmari, Mohamed Mosbah, Danilo Crispiani |
| 2024 | A Multi-Instance Learning Approach for Improving Knee Osteoarthritis Diagnosis From Mri Data. Mohamed Berrimi, Yun Xin Teoh, Aladine Chetouani, Lotfi Houam, Rachid Jennane |
| 2024 | A Quest Through Interconnected Datasets: Lessons From Highly-Cited ICASSP Papers. Cynthia C. S. Liem, Doga Tascilar, Andrew M. Demetriou |
| 2024 | A Survey on Graph Deep Representation Learning for Facial Expression Recognition. Théo Gueuret, Akrem Sellami, Chaabane Djeraba |
| 2024 | Coarse-To-Fine Pruning of Graph Convolutional Networks for Skeleton-Based Recognition. Hichem Sahbi |
| 2024 | Combining Image and Region Uncertainty-Based Active Learning for Melanoma Segmentation. Nicolas Martin, Jean-Pierre Chevallet, Philippe Mulhem, Georges Quénot |
| 2024 | Data-Efficient Domain Transfer for Instance Segmentation for AR Scenes. Stefanie Onsori-Wechtitsch, Hermann Fürntratt, Hannes Fassold, Werner Bailer |
| 2024 | Demo: Creating Player-Specific Soccer Highlight Clips with PlayerTV. Håkon Maric Solberg, Mehdi Houshmand Sarkhoosh, Sushant Gautam, Saeed Shafiee Sabet, Pål Halvorsen, Cise Midoglu |
| 2024 | Demo: Soccer Information Retrieval Via Natural Queries using SoccerRAG. Aleksander Theo Strand, Sushant Gautam, Cise Midoglu, Pål Halvorsen |
| 2024 | Descriptor Impact on Multimodal 3D Retrieval. Maria-Eirini Pegia, Björn Þór Jónsson, Anastasia Moumtzidou, Sotiris Diplaris, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2024 | Divexplore at Ivr4b 2024. Mario Leopold, Klaus Schoeffmann |
| 2024 | Elevating Video Retrieval Capabilities: A Cross-Modal Approach Utilizing Text and Image Generative Models. Kazuya Ueki, Yuma Suzuki, Haruki Sato, Takayuki Hori, Takumi Takada, Hiroki Takushima, Hayato Tanoue, Aiswariya Manoj Kumar, Hiroki Nishihara |
| 2024 | Emvd Dataset: a Dataset of Extreme Vocal Distortion Techniques Used in Heavy Metal. Modan Tailleur, Julien Pinquier, Laurent Millot, Corsin Vogel, Mathieu Lagrange |
| 2024 | Enabling Domain Experts to Train Efficient Few-Shot Incremental Landmark Recognition. Helmut Neuschmied, Werner Bailer |
| 2024 | Enhanced Defect Detection in Airport Runway Infrastructure Using Image-Text Pairing. Marios Krestenitis, Eftichia Badeka, Ilias Koulalis, Konstantinos Ioannidis, Stefanos Vrochidis |
| 2024 | Evaluation of Deep Audio Representations for Semantic Sound Similarity. Recep Oguz Araz, Dmitry Bogdanov, Pablo Alonso-Jiménez, Frederic Font |
| 2024 | Exploring the Plausibility of Hate and Counter Speech Detectors With Explainable Ai. Adrian Jaques Böck, Djordje Slijepcevic, Matthias Zeppelzauer |
| 2024 | Expressive Multimedia Query Formulation for Novices in Virtual Reality with Vitrivr-VR. Florian Spiess, Heiko Schuldt |
| 2024 | Exquisitor: Studying the Interplay Between Conversational Search and Relevance Feedback. Omar Shahbaz Khan, Ujjwal Sharma, Stevan Rudinac, Björn Þór Jónsson |
| 2024 | Finding Video Shots for Immersive Journalism Through Text-to-Video Search. Lyndon J. B. Nixon, Damianos Galanopoulos, Vasileios Mezaris |
| 2024 | Fine-Grained Rebalancing of Datasets for Correct Demographic Classification. Andrea Bozzitelli, Pia Cavasinni di Benedetto, Maria De Marsico, Xing Di, Vishal M. Patel |
| 2024 | Fire Detection for Emergency Responders using X. Dimitrios Stefanopoulos, Aristeidis Bozas, Georgia Christodoulou, Maria I. Maslioukova, Yiannis Kouloglou, Maria Pegia, Anastasia Moumtzidou, Ilias Gialampoukidis, Konstantinos Avgerinakis, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2024 | From Controlled to Chaotic: Disparities in Laboratory vs Real-World Stress Detection. Simão Ferreira, Fátima Rodrigues, Johanna Kallio, Filipe Coelho, Vesa Kyllönen, Nuno Rocha, Matilde A. Rodrigues, Elena Vildjiounaite |
| 2024 | HRV Based Stress Detection Using Convolutional Neural Networks (CNNs). Salomé Quoy, Dan Istrate, Mouna Benchekroun, Vincent Zalc |
| 2024 | IMSearch: An Interactive Multimedia Video-Moment Search System. Duc-Tuan Luu, Duy-Ngoc Nguyen, Khanh-Linh Bui-Le, Vinh-Tiep Nguyen, Minh-Triet Tran |
| 2024 | Improving the Flexibility of Video Events Retrieval Through Dynamic Conditional Refinement With Multilingual Capabilities. Thang-Long Nguyen-Ho, Van-Tu Ninh, Minh-Triet Tran, Graham Healy, Cathal Gurrin |
| 2024 | Invariant Audio Prints for Music Indexing and Alignment. Rémi Mignot, Geoffroy Peeters |
| 2024 | Is Clip the Main Roadblock for Fine-Grained Open-World Perception? Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Fabrizio Falchi |
| 2024 | Latent Space Exploration for Drum Samples. Jake Drysdale, Jason Hockman |
| 2024 | Learning Scene Semantics From Vehicle-Centric Data for City-Scale Digital Twins. Hermann Fürntratt, Stefanie Onsori-Wechtitsch, Werner Bailer, Isaac Agustí Ventura, Carles Sala Navarro, Aleksandar Jevtic, Jawad Haidar |
| 2024 | Leveraging Latent Diffusion Models for Training-Free in-Distribution Data Augmentation for Surface Defect Detection. Federico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso |
| 2024 | Leveraging Query Expansion and Reformulation for Image Retrieval With Large Language and Vision-Language Models. Sandrina Frunza, Stevan Rudinac, Cees G. H. Diks |
| 2024 | Lowering Barriers to Entry for Fully-Integrated Custom Payloads on a DJI Matrice. Joshua Springer, Gylfi Þór Guðmundsson, Marcel Kyas |
| 2024 | MeshConv3D: Efficient Convolution and Pooling Operators for Triangular 3D Meshes. Germain Bregeon, Marius Preda, Radu Ispas, Titus Zaharia |
| 2024 | Modeling Musical Knowledge With Quantum Bayesian Networks. Florian Krebs, Hermann Fürntratt, Roland Unterberger, Franz Graf |
| 2024 | Motion Consistency Constraint Map for Facial Expression Spotting. Ouala Ben Jemaa, Amel Aissaoui, Benjamin Allaert, Ioan Marius Bilasco |
| 2024 | Music Scope Pad: Video Selecting Application by Natural Movement in VR Space. Masatoshi Hamanaka |
| 2024 | Pgnn-Based Approach for Robust 3D Light Direction Estimation in Outdoor Images. Marcello Zanardelli, Mahyar Gohari, Riccardo Leonardi, Sergio Benini, Nicola Adami |
| 2024 | Predicting 3D Projectile Motion in Table Tennis Using Computer Vision and Physics-Informed Neural Network. Zaineb Chiha, Renaud Péteri, Laurent Mascarilla |
| 2024 | Predicting Multiple Reading Tasks Using Eye Movement Measures. Onanong Kongmeesub, Cathal Gurrin, Prapaporn Rattanatamrong |
| 2024 | Query Refinement for Non-Existing Items in Image Retrieval. Naoto Naka, Shin'ichi Satoh |
| 2024 | Real-Time Musical Collaboration With a Probabilistic Model. Karl Johannsson, Victor Shepardson, Enrique Hurtado, Thor Magnusson, Hannes Högni Vilhjálmsson |
| 2024 | SAM in the Pipeline: Transforming Axis-Aligned to Oriented Bounding Boxes for Superior Sperm Detection. Pål Andreas Hoven Bentsen, Steven Hicks, Eric Jul, Pål Halvorsen, Vajira Thambawita |
| 2024 | SoccerRAG: Multimodal Soccer Information Retrieval via Natural Queries. Aleksander Theo Strand, Sushant Gautam, Cise Midoglu, Pål Halvorsen |
| 2024 | Taxonomap: an Interactive System for the Exploration and Explanation of Unsupervised Large-Scale News Classification. Simon Ott, Daria Liakhovets, Mina Schütz, Medina Andresel, Moritz W. Rothmund-Burgwall, Armin Vogl, Heidi Scheichenbauer, Michael Suker, Alexander Schindler |
| 2024 | Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach: Use Case of Riot or Violent Context Detection. Lam Pham, Tin Nguyen, Phat Lam, Hieu Tang, Alexander Schindler |
| 2024 | Towards Advanced Wildfire Analysis: A Siamese Network-Based Change Detection Approach Through Self-Supervised Learning. Dimitris Valsamis, Alexandros Oikonomidis, Chrysoula Chatzichristaki, Anastasia Moumtzidou, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2024 | Towards Training Music Taggers on Synthetic Data. Nadine Kroher, Steven Manangu, Aggelos Pikrakis |
| 2024 | Verge: Simplifying Video Search for Novice Users. Nick Pantelidis, Maria Pegia, Damianos Galanopoulos, Konstantinos Apostolidis, Dimitris Georgalis, Klearchos Stavrothanasopoulos, Anastasia Moumtzidou, Konstantinos Gkountakos, Ilias Gialampoukidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris |
| 2024 | VidBasys: A User-Friendly Interactive Video Retrieval System for Novice Users in IVR4B. Thao-Nhu Nguyen, Quang-Linh Tran, Hoang-Bao Le, Binh T. Nguyen, Liting Zhou, Gareth J. F. Jones, Cathal Gurrin |
| 2024 | Video Shot Discovery Through Text2Video Embeddings in a News Analytics Dashboard. Lyndon J. B. Nixon, Damianos Galanopoulos, Vasileios Mezaris, Alexander Hubmann-Haidvogel, Daniel Fischl, Arno Scharl |
| 2024 | Visione 5.0: Toward Evaluation With Novice Users. Giuseppe Amato, Paolo Bolettieri, Fabio Carrara, Fabrizio Falchi, Claudio Gennaro, Nicola Messina |
| 2024 | Weakly-Supervised Autism Severity Assessment in Long Videos. Abid Ali, Mahmoud Ali, Camilla Barbini, Séverine Dubuisson, Jean-Marc Odobez, François Brémond, Susanne Thümmler |
| 2024 | Wseseg: Introducing a Dataset for the Segmentation of Winter Sports Equipment With a Baseline for Interactive Segmentation. Robin Schön, Daniel Kienzle, Rainer Lienhart |
| 2024 | XAIface: A Framework and Toolkit for Explainable Face Recognition. Nélida Mirabet-Herranz, Martin Winter, Yuhang Lu, Naima Bousnina, Jonas Pfister, Chiara Galdi, Jean-Luc Dugelay, Werner Bailer, Touradj Ebrahimi, Paulo Lobato Correia, Fernando Pereira, Felix Schmautzer, Erich Schweighofer |