| 2012 | A RELIEF-based modality weighting approach for multimodal information retrieval. Turgay Yilmaz, Elvan Gulen, Adnan Yazici, Masaru Kitsuregawa |
| 2012 | A visual approach for video geocoding using bag-of-scenes. Otávio Augusto Bizetto Penatti, Lin Tzy Li, Jurandy Almeida, Ricardo da Silva Torres |
| 2012 | Active learning of custom sound taxonomies in unstructured audio data. Gerard Roma, Jordi Janer, Perfecto Herrera |
| 2012 | Advanced shape context for plant species identification using leaf image retrieval. Sofiène Mouine, Itheri Yahiaoui, Anne Verroust-Blondet |
| 2012 | All vehicles are cars: subclass preferences in container concepts. Daan T. J. Vreeswijk, Cees G. M. Snoek, Koen E. A. van de Sande, Arnold W. M. Smeulders |
| 2012 | An extensible personal photograph collection for graded relevance assessments and user simulation. David Zellhöfer |
| 2012 | Analysing Facebook features to support event detection for photo-based Facebook applications. Mohammad Rabbath, Philipp Sandhaus, Susanne Boll |
| 2012 | Bayesian approach for near-duplicate image detection. Lucas Moutinho Bueno, Eduardo Valle, Ricardo da Silva Torres |
| 2012 | Because not all displays are lists. Simone Santini |
| 2012 | Beyond audio and video retrieval: towards multimedia summarization. Duo Ding, Florian Metze, Shourabh Rawat, Peter Franz Schulam, Susanne Burger, Ehsan Younessian, Lei Bao, Michael G. Christel, Alexander G. Hauptmann |
| 2012 | Categorization of a collection of pictures into structured events. Riccardo Mattivi, Jasper R. R. Uijlings, Francesco G. B. De Natale, Nicu Sebe |
| 2012 | Classification based group photo retrieval with bag of people features. Kazuya Shimizu, Naoko Nitta, Yujiro Nakai, Noboru Babaguchi |
| 2012 | Classifier-specific intermediate representation for multimedia tasks. Zhigang Ma, Alexander G. Hauptmann, Yi Yang, Nicu Sebe |
| 2012 | Cluster-based photo browsing and tagging on the go. Symeon Papadopoulos, Juxhin Bakalli, Yiannis Kompatsiaris, Emmanouil Schinas |
| 2012 | Color CENTRIST: a color descriptor for scene categorization. Wei-Ta Chu, Chih-Hao Chen |
| 2012 | Compact hashing for mixed image-keyword query over multi-label images. Xianglong Liu, Yadong Mu, Bo Lang, Shih-Fu Chang |
| 2012 | Constrained keypoint quantization: towards better bag-of-words model for large-scale multimedia retrieval. Yang Cai, Wei Tong, Linjun Yang, Alexander G. Hauptmann |
| 2012 | Content is still king: the effect of neighbor voting schemes on tag relevance for social image retrieval. Ba Quan Truong, Aixin Sun, Sourav S. Bhowmick |
| 2012 | Contour canonical form: an efficient intrinsic embedding approach to matching non-rigid 3D objects. Xulei Wang, Hongbin Zha |
| 2012 | Cross-modal categorisation of user-generated video sequences. Sebastian Schmiedeke, Pascal Kelm, Thomas Sikora |
| 2012 | Deriving a discriminative color model for a given object class from weakly labeled training data. Christian X. Ries, Rainer Lienhart |
| 2012 | Discovering inherent event taxonomies from social media collections. Minh-Son Dao, Giulia Boato, Francesco G. B. De Natale |
| 2012 | Distributed KNN-graph approximation via hashing. Mohamed Riadh Trad, Alexis Joly, Nozha Boujemaa |
| 2012 | Efficient graffiti image retrieval. Chunlei Yang, Pak Chung Wong, William Ribarsky, Jianping Fan |
| 2012 | Efficient video copy detection via aligning video signature time series. Jennifer Ren, Fangzhe Chang, Thomas L. Wood, John R. Zhang |
| 2012 | Event-based classification of social media streams. Timo Reuter, Philipp Cimiano |
| 2012 | Exploring two spaces with one feature: kernelized multidimensional modeling of visual alphabets. Miriam Redi, Bernard Mérialdo |
| 2012 | Fusing concept detection and geo context for visual search. Xirong Li, Cees G. M. Snoek, Marcel Worring, Arnold W. M. Smeulders |
| 2012 | Geo-based automatic image annotation. Hatem Mousselly Sergieh, Gabriele Gianini, Mario Döller, Harald Kosch, Elöd Egyed-Zsigmond, Jean-Marie Pinon |
| 2012 | Hamming embedding similarity-based image classification. Mihir Jain, Rachid Benmokhtar, Hervé Jégou, Patrick Gros |
| 2012 | High-confidence near-duplicate image detection. Wei Dong, Zhe Wang, Moses Charikar, Kai Li |
| 2012 | Image exploration using online feature extraction and reranking. Jakub Lokoc, Tomás Grosup, Tomás Skopal |
| 2012 | ImageTerrier: an extensible platform for scalable high-performance image retrieval. Jonathon S. Hare, Sina Samangooei, David Dupplaw, Paul H. Lewis |
| 2012 | Indexing media by personal events. Javier Paniagua, Ivan Tankoyeu, Julian Stöttinger, Fausto Giunchiglia |
| 2012 | International Conference on Multimedia Retrieval, ICMR '12, Hong Kong, China, June 5-8, 2012 Horace Ho-Shing Ip, Yong Rui |
| 2012 | Joint audio-visual bi-modal codewords for video event detection. Guangnan Ye, I-Hong Jhuo, Dong Liu, Yu-Gang Jiang, D. T. Lee, Shih-Fu Chang |
| 2012 | Joint-rerank: a novel method for image search reranking. Gang Wang, Xin-Shun Xu |
| 2012 | Labelset anchored subspace ensemble (LASE) for multi-label annotation. Tianyi Zhou, Dacheng Tao |
| 2012 | Large vocabulary quantization for searching instances from videos. Cai-Zhi Zhu, Shin'ichi Satoh |
| 2012 | Learning to summarize web image and text mutually. Piji Li, Jun Ma, Shuai Gao |
| 2012 | Linking visual concept detection with viewer demographics. Adrian Ulges, Markus Koch, Damian Borth |
| 2012 | Making a scene: alignment of complete sets of clips based on pairwise audio match. Kai Su, Mor Naaman, Avadhut Gurjar, Mohsin Patel, Daniel P. W. Ellis |
| 2012 | Mobile image browsing on a 3D globe: demo paper. Klaus Schöffmann, Marco A. Hudelist, Manfred del Fabro, Gerald Schaefer |
| 2012 | Multi-graph multi-instance learning for object-based image and video retrieval. Fei Li, Rujie Liu |
| 2012 | Multi-modal region selection approach for training object detectors. Elisavet Chatzilari, Spiros Nikolopoulos, Yiannis Kompatsiaris, Josef Kittler |
| 2012 | Multimodal feature generation framework for semantic image classification. Amel Znaidia, Aymen Shabou, Adrian Popescu, Hervé Le Borgne, Céline Hudelot |
| 2012 | Multimodal fusion for image retrieval using matrix factorization. Juan C. Caicedo, Fabio A. González |
| 2012 | Multimodal knowledge-based analysis in multimedia event detection. Ehsan Younessian, Teruko Mitamura, Alexander G. Hauptmann |
| 2012 | Musical keys and chords recognition using unsupervised learning with infinite Gaussian mixture. Yunsheng Wang, Harry Wechsler |
| 2012 | Name that sculpture. Relja Arandjelovic, Andrew Zisserman |
| 2012 | One size does not fit all: multimodal search on mobile and desktop devices with the I-SEARCH search engine. Thomas Steiner, Marilena Lazzaro, Francesco Saverio Nucci, Vincenzo Croce, Lorenzo Sutton, Alberto Massari, Antonio Camurri, Sabine Spiller, Anne Verroust-Blondet, Laurent Joyeux |
| 2012 | Ordinal preserving projection: a novel dimensionality reduction method for image ranking. Changsheng Li, Jing Liu, Yan Liu, Changsheng Xu, Qingshan Liu, Hanqing Lu |
| 2012 | PythiaSearch: a multiple search strategy-supportive multimedia retrieval system. David Zellhöfer, Maria Bertram, Thomas Böttcher, Christoph Schmidt, Claudius Tillmann, Ingo Schmitt |
| 2012 | Quantity versus quality: the role of layout and interaction complexity in thumbnail-based video retrieval interfaces. Wolfgang Hürst, Dimitri Darzentas |
| 2012 | SUPER: towards real-time event recognition in internet videos. Yu-Gang Jiang |
| 2012 | Security-oriented picture-in-picture visual modifications. Thanh-Toan Do, Ewa Kijak, Laurent Amsaleg, Teddy Furon |
| 2012 | Semiconducting bilinear deep learning for incomplete image recognition. Sheng-hua Zhong, Yan Liu, Korris Fu-Lai Chung, Gangshan Wu |
| 2012 | Social event detection and retrieval in collaborative photo collections. Markus Brenner, Ebroul Izquierdo |
| 2012 | Social event detection using multimodal clustering and integrating supervisory signals. Georgios Petkos, Symeon Papadopoulos, Yiannis Kompatsiaris |
| 2012 | Supervised dictionary learning for music genre classification. Chin-Chia Michael Yeh, Yi-Hsuan Yang |
| 2012 | Supporting browsing of user generated video on a tablet. Frank Hopfgartner, David Scott, Jinlin Guo, Yang Yang, Cathal Gurrin, Alan F. Smeaton |
| 2012 | Texmix: an automatically generated news navigation portal. Morgan Bréhinier, Sébastien Campion, Guillaume Gravier |
| 2012 | Video retrieval by mimicking poses. Nataraj Jammalamadaka, Andrew Zisserman, Marcin Eichner, Vittorio Ferrari, C. V. Jawahar |
| 2012 | Video saliency detection with robust temporal alignment and local-global spatial contrast. Zhixiang Ren, Liang-Tien Chia, Deepu Rajan |
| 2012 | Visual pattern discovery for architecture image classification and product image search. Wei-Ta Chu, Ming-Hung Tsai |
| 2012 | Watching and talking: media content as social nexus. David A. Shamma, Lyndon Kennedy, Elizabeth F. Churchill |
| 2012 | World seer: a realtime geo-tweet photo mapping system. Keiji Yanai |
| 2012 | nepDroid: an intelligent mobile music player. Sebastian Huber, Markus Schedl, Peter Knees |