CBMI C

60 papers

YearTitle / Authors
202421st International Conference on Content-Based Multimedia Indexing, CBMI 2024, Reykjavik, Iceland, September 18-20, 2024
2024A Behavior and Emotion Recognition Framework for Emotion-Aware Services in Physical Spaces.
Sari Järvinen, Johanna Kallio, Johannes Peltola, Satu-Marja Mäkelä
2024A Comparison of Late-Fusion Training Strategies for Quad-Modal Joint Embeddings.
Domenic Luca Fürer, Abraham Bernstein, Luca Rossetto
2024A Concept Design for a Positive Mood Supporting Application.
Aurora Saibene, Riccardo Giussani, Claudia Rabaioli, Nicolò Dozio, Francesca Gasparini, Francesco Ferrise
2024A Framework for Vision-Based 3D Inspections for Maintenance Activities and Digital Twin Integration.
Panagiotis Vrachnos, Carlos Ramonell, Ilias Koulalis, Konstantinos Ioannidis, Irina Stipanovic, Stefanos Vrochidis
2024A Hybrid AI System for Fusion of Object and Context Information: Application to the Rail Line Defect Detection.
Alexey Zhukov, Jenny Benois-Pineau, Alain Rivero, Akka Zemmari, Mohamed Mosbah, Danilo Crispiani
2024A Multi-Instance Learning Approach for Improving Knee Osteoarthritis Diagnosis From Mri Data.
Mohamed Berrimi, Yun Xin Teoh, Aladine Chetouani, Lotfi Houam, Rachid Jennane
2024A Quest Through Interconnected Datasets: Lessons From Highly-Cited ICASSP Papers.
Cynthia C. S. Liem, Doga Tascilar, Andrew M. Demetriou
2024A Survey on Graph Deep Representation Learning for Facial Expression Recognition.
Théo Gueuret, Akrem Sellami, Chaabane Djeraba
2024Coarse-To-Fine Pruning of Graph Convolutional Networks for Skeleton-Based Recognition.
Hichem Sahbi
2024Combining Image and Region Uncertainty-Based Active Learning for Melanoma Segmentation.
Nicolas Martin, Jean-Pierre Chevallet, Philippe Mulhem, Georges Quénot
2024Data-Efficient Domain Transfer for Instance Segmentation for AR Scenes.
Stefanie Onsori-Wechtitsch, Hermann Fürntratt, Hannes Fassold, Werner Bailer
2024Demo: Creating Player-Specific Soccer Highlight Clips with PlayerTV.
Håkon Maric Solberg, Mehdi Houshmand Sarkhoosh, Sushant Gautam, Saeed Shafiee Sabet, Pål Halvorsen, Cise Midoglu
2024Demo: Soccer Information Retrieval Via Natural Queries using SoccerRAG.
Aleksander Theo Strand, Sushant Gautam, Cise Midoglu, Pål Halvorsen
2024Descriptor Impact on Multimodal 3D Retrieval.
Maria-Eirini Pegia, Björn Þór Jónsson, Anastasia Moumtzidou, Sotiris Diplaris, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris
2024Divexplore at Ivr4b 2024.
Mario Leopold, Klaus Schoeffmann
2024Elevating Video Retrieval Capabilities: A Cross-Modal Approach Utilizing Text and Image Generative Models.
Kazuya Ueki, Yuma Suzuki, Haruki Sato, Takayuki Hori, Takumi Takada, Hiroki Takushima, Hayato Tanoue, Aiswariya Manoj Kumar, Hiroki Nishihara
2024Emvd Dataset: a Dataset of Extreme Vocal Distortion Techniques Used in Heavy Metal.
Modan Tailleur, Julien Pinquier, Laurent Millot, Corsin Vogel, Mathieu Lagrange
2024Enabling Domain Experts to Train Efficient Few-Shot Incremental Landmark Recognition.
Helmut Neuschmied, Werner Bailer
2024Enhanced Defect Detection in Airport Runway Infrastructure Using Image-Text Pairing.
Marios Krestenitis, Eftichia Badeka, Ilias Koulalis, Konstantinos Ioannidis, Stefanos Vrochidis
2024Evaluation of Deep Audio Representations for Semantic Sound Similarity.
Recep Oguz Araz, Dmitry Bogdanov, Pablo Alonso-Jiménez, Frederic Font
2024Exploring the Plausibility of Hate and Counter Speech Detectors With Explainable Ai.
Adrian Jaques Böck, Djordje Slijepcevic, Matthias Zeppelzauer
2024Expressive Multimedia Query Formulation for Novices in Virtual Reality with Vitrivr-VR.
Florian Spiess, Heiko Schuldt
2024Exquisitor: Studying the Interplay Between Conversational Search and Relevance Feedback.
Omar Shahbaz Khan, Ujjwal Sharma, Stevan Rudinac, Björn Þór Jónsson
2024Finding Video Shots for Immersive Journalism Through Text-to-Video Search.
Lyndon J. B. Nixon, Damianos Galanopoulos, Vasileios Mezaris
2024Fine-Grained Rebalancing of Datasets for Correct Demographic Classification.
Andrea Bozzitelli, Pia Cavasinni di Benedetto, Maria De Marsico, Xing Di, Vishal M. Patel
2024Fire Detection for Emergency Responders using X.
Dimitrios Stefanopoulos, Aristeidis Bozas, Georgia Christodoulou, Maria I. Maslioukova, Yiannis Kouloglou, Maria Pegia, Anastasia Moumtzidou, Ilias Gialampoukidis, Konstantinos Avgerinakis, Stefanos Vrochidis, Ioannis Kompatsiaris
2024From Controlled to Chaotic: Disparities in Laboratory vs Real-World Stress Detection.
Simão Ferreira, Fátima Rodrigues, Johanna Kallio, Filipe Coelho, Vesa Kyllönen, Nuno Rocha, Matilde A. Rodrigues, Elena Vildjiounaite
2024HRV Based Stress Detection Using Convolutional Neural Networks (CNNs).
Salomé Quoy, Dan Istrate, Mouna Benchekroun, Vincent Zalc
2024IMSearch: An Interactive Multimedia Video-Moment Search System.
Duc-Tuan Luu, Duy-Ngoc Nguyen, Khanh-Linh Bui-Le, Vinh-Tiep Nguyen, Minh-Triet Tran
2024Improving the Flexibility of Video Events Retrieval Through Dynamic Conditional Refinement With Multilingual Capabilities.
Thang-Long Nguyen-Ho, Van-Tu Ninh, Minh-Triet Tran, Graham Healy, Cathal Gurrin
2024Invariant Audio Prints for Music Indexing and Alignment.
Rémi Mignot, Geoffroy Peeters
2024Is Clip the Main Roadblock for Fine-Grained Open-World Perception?
Lorenzo Bianchi, Fabio Carrara, Nicola Messina, Fabrizio Falchi
2024Latent Space Exploration for Drum Samples.
Jake Drysdale, Jason Hockman
2024Learning Scene Semantics From Vehicle-Centric Data for City-Scale Digital Twins.
Hermann Fürntratt, Stefanie Onsori-Wechtitsch, Werner Bailer, Isaac Agustí Ventura, Carles Sala Navarro, Aleksandar Jevtic, Jawad Haidar
2024Leveraging Latent Diffusion Models for Training-Free in-Distribution Data Augmentation for Surface Defect Detection.
Federico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso
2024Leveraging Query Expansion and Reformulation for Image Retrieval With Large Language and Vision-Language Models.
Sandrina Frunza, Stevan Rudinac, Cees G. H. Diks
2024Lowering Barriers to Entry for Fully-Integrated Custom Payloads on a DJI Matrice.
Joshua Springer, Gylfi Þór Guðmundsson, Marcel Kyas
2024MeshConv3D: Efficient Convolution and Pooling Operators for Triangular 3D Meshes.
Germain Bregeon, Marius Preda, Radu Ispas, Titus Zaharia
2024Modeling Musical Knowledge With Quantum Bayesian Networks.
Florian Krebs, Hermann Fürntratt, Roland Unterberger, Franz Graf
2024Motion Consistency Constraint Map for Facial Expression Spotting.
Ouala Ben Jemaa, Amel Aissaoui, Benjamin Allaert, Ioan Marius Bilasco
2024Music Scope Pad: Video Selecting Application by Natural Movement in VR Space.
Masatoshi Hamanaka
2024Pgnn-Based Approach for Robust 3D Light Direction Estimation in Outdoor Images.
Marcello Zanardelli, Mahyar Gohari, Riccardo Leonardi, Sergio Benini, Nicola Adami
2024Predicting 3D Projectile Motion in Table Tennis Using Computer Vision and Physics-Informed Neural Network.
Zaineb Chiha, Renaud Péteri, Laurent Mascarilla
2024Predicting Multiple Reading Tasks Using Eye Movement Measures.
Onanong Kongmeesub, Cathal Gurrin, Prapaporn Rattanatamrong
2024Query Refinement for Non-Existing Items in Image Retrieval.
Naoto Naka, Shin'ichi Satoh
2024Real-Time Musical Collaboration With a Probabilistic Model.
Karl Johannsson, Victor Shepardson, Enrique Hurtado, Thor Magnusson, Hannes Högni Vilhjálmsson
2024SAM in the Pipeline: Transforming Axis-Aligned to Oriented Bounding Boxes for Superior Sperm Detection.
Pål Andreas Hoven Bentsen, Steven Hicks, Eric Jul, Pål Halvorsen, Vajira Thambawita
2024SoccerRAG: Multimodal Soccer Information Retrieval via Natural Queries.
Aleksander Theo Strand, Sushant Gautam, Cise Midoglu, Pål Halvorsen
2024Taxonomap: an Interactive System for the Exploration and Explanation of Unsupervised Large-Scale News Classification.
Simon Ott, Daria Liakhovets, Mina Schütz, Medina Andresel, Moritz W. Rothmund-Burgwall, Armin Vogl, Heidi Scheichenbauer, Michael Suker, Alexander Schindler
2024Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach: Use Case of Riot or Violent Context Detection.
Lam Pham, Tin Nguyen, Phat Lam, Hieu Tang, Alexander Schindler
2024Towards Advanced Wildfire Analysis: A Siamese Network-Based Change Detection Approach Through Self-Supervised Learning.
Dimitris Valsamis, Alexandros Oikonomidis, Chrysoula Chatzichristaki, Anastasia Moumtzidou, Ilias Gialampoukidis, Stefanos Vrochidis, Ioannis Kompatsiaris
2024Towards Training Music Taggers on Synthetic Data.
Nadine Kroher, Steven Manangu, Aggelos Pikrakis
2024Verge: Simplifying Video Search for Novice Users.
Nick Pantelidis, Maria Pegia, Damianos Galanopoulos, Konstantinos Apostolidis, Dimitris Georgalis, Klearchos Stavrothanasopoulos, Anastasia Moumtzidou, Konstantinos Gkountakos, Ilias Gialampoukidis, Stefanos Vrochidis, Vasileios Mezaris, Ioannis Kompatsiaris
2024VidBasys: A User-Friendly Interactive Video Retrieval System for Novice Users in IVR4B.
Thao-Nhu Nguyen, Quang-Linh Tran, Hoang-Bao Le, Binh T. Nguyen, Liting Zhou, Gareth J. F. Jones, Cathal Gurrin
2024Video Shot Discovery Through Text2Video Embeddings in a News Analytics Dashboard.
Lyndon J. B. Nixon, Damianos Galanopoulos, Vasileios Mezaris, Alexander Hubmann-Haidvogel, Daniel Fischl, Arno Scharl
2024Visione 5.0: Toward Evaluation With Novice Users.
Giuseppe Amato, Paolo Bolettieri, Fabio Carrara, Fabrizio Falchi, Claudio Gennaro, Nicola Messina
2024Weakly-Supervised Autism Severity Assessment in Long Videos.
Abid Ali, Mahmoud Ali, Camilla Barbini, Séverine Dubuisson, Jean-Marc Odobez, François Brémond, Susanne Thümmler
2024Wseseg: Introducing a Dataset for the Segmentation of Winter Sports Equipment With a Baseline for Interactive Segmentation.
Robin Schön, Daniel Kienzle, Rainer Lienhart
2024XAIface: A Framework and Toolkit for Explainable Face Recognition.
Nélida Mirabet-Herranz, Martin Winter, Yuhang Lu, Naima Bousnina, Jonas Pfister, Chiara Galdi, Jean-Luc Dugelay, Werner Bailer, Touradj Ebrahimi, Paulo Lobato Correia, Fernando Pereira, Felix Schmautzer, Erich Schweighofer