| 2022 | 3D-Augmented Contrastive Knowledge Distillation for Image-based Object Pose Estimation. Zhidan Liu, Zhen Xing, Xiangdong Zhou, Yijiang Chen, Guichun Zhou |
| 2022 | Accelerated Sign Hunter: A Sign-based Black-box Attack via Branch-Prune Strategy and Stabilized Hierarchical Search. Siyuan Li, Guangji Huang, Xing Xu, Yang Yang, Fumin Shen |
| 2022 | Adaptive Temporal Grouping for Black-box Adversarial Attacks on Videos. Zhipeng Wei, Jingjing Chen, Hao Zhang, Linxi Jiang, Yu-Gang Jiang |
| 2022 | An Effective Two-way Metapath Encoder over Heterogeneous Information Network for Recommendation. Yanbin Jiang, Huifang Ma, Xiaohui Zhang, Zhixin Li, Liang Chang |
| 2022 | Automatic Visual Recognition of Unexploded Ordnances Using Supervised Deep Learning. Georgios Begkas, Panagiotis Giannakeris, Konstantinos Ioannidis, Georgios Kalpakis, Theodora Tsikrika, Stefanos Vrochidis, Ioannis Kompatsiaris |
| 2022 | Blindfold Attention: Novel Mask Strategy for Facial Expression Recognition. Bo Fu, Yuanxin Mao, Shilin Fu, Yonggong Ren, Zhongxuan Luo |
| 2022 | CLIP4Hashing: Unsupervised Deep Hashing for Cross-Modal Video-Text Retrieval. Yaoxin Zhuo, Yikang Li, Jenhao Hsiao, Chiuman Ho, Baoxin Li |
| 2022 | Camouflaged Poisoning Attack on Graph Neural Networks. Chao Jiang, Yi He, Richard Chapman, Hongyi Wu |
| 2022 | Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval. Zhihao Fan, Zhongyu Wei, Zejun Li, Siyuan Wang, Haijun Shan, Xuanjing Huang, Jianqing Fan |
| 2022 | Cross-Modal Retrieval between Event-Dense Text and Image. Zhongwei Xie, Lin Li, Luo Zhong, Jianquan Liu, Ling Liu |
| 2022 | Cross-Pixel Dependency with Boundary-Feature Transformation for Weakly Supervised Semantic Segmentation. Yuhui Guo, Xun Liang, Tang Hui, Bo Wu, Xiangping Zheng |
| 2022 | Cross-lingual Adaptation for Recipe Retrieval with Mixup. Bin Zhu, Chong-Wah Ngo, Jingjing Chen, Wing Kwong Chan |
| 2022 | DMPCANet: A Low Dimensional Aggregation Network for Visual Place Recognition. Yinghao Wang, Haonan Chen, Jiong Wang, Yingying Zhu |
| 2022 | DiGAN: Directional Generative Adversarial Network for Object Transfiguration. Zhen Luo, Yingfang Zhang, Peihao Zhong, Jingjing Chen, Donglong Chen |
| 2022 | Disentangled Representations and Hierarchical Refinement of Multi-Granularity Features for Text-to-Image Synthesis. Pei Dong, Lei Wu, Lei Meng, Xiangxu Meng |
| 2022 | Dual-Channel Localization Networks for Moment Retrieval with Natural Language. Bolin Zhang, Bin Jiang, Chao Yang, Liang Pang |
| 2022 | Dual-Level Decoupled Transformer for Video Captioning. Yiqi Gao, Xinglin Hou, Wei Suo, Mengyang Sun, Tiezheng Ge, Yuning Jiang, Peng Wang |
| 2022 | Efficient Linear Attention for Fast and Accurate Keypoint Matching. Suwichaya Suwanwimolkul, Satoshi Komorita |
| 2022 | EmoMTB: Emotion-aware Music Tower Blocks. Alessandro B. Melchiorre, David Penz, Christian Ganhör, Oleg Lesota, Vasco Fragoso, Florian Friztl, Emilia Parada-Cabaleiro, Franz Schubert, Markus Schedl |
| 2022 | Extracting Precedence Relations between Video Lectures in MOOCs. Kui Xiao, Youheng Bai, Yan Zhang |
| 2022 | Fashion Image Search via Anchor-Free Detector. Shanchuan Gao, Fankai Zeng, Lu Cheng, Jicong Fan, Mingbo Zhao |
| 2022 | Fashion Style-Aware Embeddings for Clothing Image Retrieval. Rino Naka, Marie Katsurai, Keisuke Yanagi, Ryosuke Goto |
| 2022 | FedNKD: A Dependable Federated Learning Using Fine-tuned Random Noise and Knowledge Distillation. Shaoxiong Zhu, Qi Qi, Zirui Zhuang, Jingyu Wang, Haifeng Sun, Jianxin Liao |
| 2022 | Flexible Order Aware Sequential Recommendation. Mingda Qian, Xiaoyan Gu, Lingyang Chu, Feifei Dai, Haihui Fan, Bo Li |
| 2022 | FreqCAM: Frequent Class Activation Map for Weakly Supervised Object Localization. Runsheng Zhang |
| 2022 | GIO: A Timbre-informed Approach for Pitch Tracking in Highly Noisy Environments. Xiaoheng Sun, Xia Liang, Qiqi He, Bilei Zhu, Zejun Ma |
| 2022 | Generating Topological Structure of Floorplans from Room Attributes. Yu Yin, Will Hutchcroft, Naji Khosravan, Ivaylo Boyadzhiev, Yun Fu, Sing Bing Kang |
| 2022 | HybridVocab: Towards Multi-Modal Machine Translation via Multi-Aspect Alignment. Ru Peng, Yawen Zeng, Junbo Zhao |
| 2022 | I2-Net: Intra- and Inter-scale Collaborative Learning Network for Abdominal Multi-organ Segmentation. Chao Suo, Xuanya Li, Donghui Tan, Yuan Zhang, Xieping Gao |
| 2022 | ICDAR'22: Intelligent Cross-Data Analysis and Retrieval. Minh-Son Dao, Michael Alexander Riegler, Duc-Tien Dang-Nguyen, Cathal Gurrin, Yuta Nakashima, Mianxiong Dong |
| 2022 | ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27 - 30, 2022 Vincent Oria, Maria Luisa Sapino, Shin'ichi Satoh, Brigitte Kerhervé, Wen-Huang Cheng, Ichiro Ide, Vivek K. Singh |
| 2022 | Improve Image Captioning by Modeling Dynamic Scene Graph Extension. Minghao Geng, Qingjie Zhao |
| 2022 | Improving Image Captioning via Enhancing Dual-Side Context Awareness. Yiqi Gao, Ning Wang, Wei Suo, Mengyang Sun, Peng Wang |
| 2022 | Ingredient-enriched Recipe Generation from Cooking Videos. Jianlong Wu, Liangming Pan, Jingjing Chen, Yu-Gang Jiang |
| 2022 | Introduction to the Fifth Annual Lifelog Search Challenge, LSC'22. Cathal Gurrin, Liting Zhou, Graham Healy, Björn Þór Jónsson, Duc-Tien Dang-Nguyen, Jakub Lokoc, Minh-Triet Tran, Wolfgang Hürst, Luca Rossetto, Klaus Schöffmann |
| 2022 | Joint Modality Synergy and Spatio-temporal Cue Purification for Moment Localization. Xingyu Shen, Long Lan, Huibin Tan, Xiang Zhang, Xurui Ma, Zhigang Luo |
| 2022 | Learning Hierarchical Semantic Correspondences for Cross-Modal Image-Text Retrieval. Sheng Zeng, Changhong Liu, Jun Zhou, Yong Chen, Aiwen Jiang, Hanxi Li |
| 2022 | Learning Sample Importance for Cross-Scenario Video Temporal Grounding. Peijun Bao, Yadong Mu |
| 2022 | Lesion Localization in OCT by Semi-Supervised Object Detection. Yue Wu, Yang Zhou, Jianchun Zhao, Jingyuan Yang, Weihong Yu, Youxin Chen, Xirong Li |
| 2022 | Local Slot Attention for Vision and Language Navigation. Yifeng Zhuang, Qiang Sun, Yanwei Fu, Lifeng Chen, Xiangyang Xue |
| 2022 | M2TR: Multi-modal Multi-scale Transformers for Deepfake Detection. Junke Wang, Zuxuan Wu, Wenhao Ouyang, Xintong Han, Jingjing Chen, Yu-Gang Jiang, Ser-Nam Lim |
| 2022 | MAD '22 Workshop: Multimedia AI against Disinformation. Bogdan Ionescu, Giorgos Kordopatis-Zilos, Adrian Popescu, Luca Cuccovillo, Symeon Papadopoulos |
| 2022 | MFGAN: A Lightweight Fast Multi-task Multi-scale Feature-fusion Model based on GAN. Lijia Deng, Yu-Dong Zhang |
| 2022 | MMArt-ACM 2022: 5th Joint Workshop on Multimedia Artworks Analysis and Attractiveness Computing in Multimedia. Naoko Nitta, Anita Min-Chun Hu, Kensuke Tobitani |
| 2022 | MSSPQ: Multiple Semantic Structure-Preserving Quantization for Cross-Modal Retrieval. Lei Zhu, Liewu Cai, Jiayu Song, Xinghui Zhu, Chengyuan Zhang, Shichao Zhang |
| 2022 | Mobile Emotion Recognition via Multiple Physiological Signals using Convolution-augmented Transformer. Kangning Yang, Benjamin Tag, Yue Gu, Chaofan Wang, Tilman Dingler, Greg Wadley, Jorge Gonçalves |
| 2022 | Motor Learning based on Presentation of a Tentative Goal. Siqi Sun, Yongqing Sun, Mitsuhiro Goto, Shigekuni Kondo, Dan Mikami, Susumu Yamamoto |
| 2022 | MuLER: Multiplet-Loss for Emotion Recognition. Anwer Slimi, Mounir Zrigui, Henri Nicolas |
| 2022 | Multi-Modal Contrastive Pre-training for Recommendation. Zhuang Liu, Yunpu Ma, Matthias Schubert, Yuanxin Ouyang, Zhang Xiong |
| 2022 | MultiCLU: Multi-stage Context Learning and Utilization for Storefront Accessibility Detection and Evaluation. Xuan Wang, Jiajun Chen, Hao Tang, Zhigang Zhu |
| 2022 | Multiple Biological Granularities Network for Person Re-Identification. Shuyuan Tu, Tianzhen Guan, Li Kuang |
| 2022 | Music-to-Dance Generation with Multiple Conformer. Mingao Zhang, Changhong Liu, Yong Chen, Zhenchun Lei, Mingwen Wang |
| 2022 | Nearest Neighbor Search with Compact Codes: A Decoder Perspective. Kenza Amara, Matthijs Douze, Alexandre Sablayrolles, Hervé Jégou |
| 2022 | OCR-oriented Master Object for Text Image Captioning. Wenliang Tang, Zhenzhen Hu, Zijie Song, Richang Hong |
| 2022 | OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System. Xiaoyuan Guo, Jiali Duan, Saptarshi Purkayastha, Hari Trivedi, Judy Wawira Gichoya, Imon Banerjee |
| 2022 | Parallelism Network with Partial-aware and Cross-correlated Transformer for Vehicle Re-identification. Guangqi Jiang, Huibing Wang, Jinjia Peng, Xianping Fu |
| 2022 | Person Search by Uncertain Attributes. Tingting Dong, Jianquan Liu |
| 2022 | Phrase-level Prediction for Video Temporal Localization. Sizhe Li, Chang Li, Minghang Zheng, Yang Liu |
| 2022 | Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification. Lu Yang, Hongbang Liu, Lingqiao Liu, Jinghao Zhou, Lei Zhang, Peng Wang, Yanning Zhang |
| 2022 | Real-Time Deepfake System for Live Streaming. Yifei Fan, Modan Xie, Peihan Wu, Gang Yang |
| 2022 | Relevance-based Margin for Contrastively-trained Video Retrieval Models. Alex Falcon, Swathikiran Sudhakaran, Giuseppe Serra, Sergio Escalera, Oswald Lanz |
| 2022 | Reproducibility Companion Paper: Human Object Interaction Detection via Multi-level Conditioned Network. Yunqing He, Xu Sun, Hui Jiang, Tongwei Ren, Gangshan Wu, Maria Sinziana Astefanoaei, Andreas Leibetseder |
| 2022 | Review of Deep Learning Models for Spine Segmentation. Neng Zhou, Hairu Wen, Yi Wang, Yang Liu, Longfei Zhou |
| 2022 | Revisiting Performance Measures for Cross-Modal Hashing. Hongya Wang, Shunxin Dai, Ming Du, Bo Xu, Mingyong Li |
| 2022 | SA-NAS-BFNR: Spatiotemporal Attention Neural Architecture Search for Task-based Brain Functional Network Representation. Fenxia Duan, Chunhong Cao, Xieping Gao |
| 2022 | STAFNet: Swin Transformer Based Anchor-Free Network for Detection of Forward-looking Sonar Imagery. Xingyu Zhu, Yingshuo Liang, Jianlei Zhang, Zengqiang Chen |
| 2022 | Selective Hypergraph Convolutional Networks for Skeleton-based Action Recognition. Yiran Zhu, Guangji Huang, Xing Xu, Yanli Ji, Fumin Shen |
| 2022 | Self-Lifting: A Novel Framework for Unsupervised Voice-Face Association Learning. Guangyu Chen, Deyuan Zhang, Tao Liu, Xiaoyong Du |
| 2022 | Sequential Intention-aware Recommender based on User Interaction Graph. Jinpeng Chen, Yuan Cao, Fan Zhang, Pengfei Sun, Kaimin Wei |
| 2022 | Source-free Temporal Attentive Domain Adaptation for Video Action Recognition. Peipeng Chen, Andy J. Ma |
| 2022 | Style-woven Attention Network for Zero-shot Ink Wash Painting Style Transfer. Haochen Sun, Lei Wu, Xiang Li, Xiangxu Meng |
| 2022 | Summarizing Videos using Concentrated Attention and Considering the Uniqueness and Diversity of the Video Frames. Evlampios Apostolidis, Georgios Balaouras, Vasileios Mezaris, Ioannis Patras |
| 2022 | Supervised Contrastive Vehicle Quantization for Efficient Vehicle Retrieval. Yongbiao Chen, Kaicheng Guo, Fangxin Liu, Yusheng Huang, Zhengwei Qi |
| 2022 | Teaching a New Dog Old Tricks: Contrastive Random Walks in Videos with Unsupervised Priors. Jan Schutte, Pascal Mettes |
| 2022 | Temporal-Consistent Visual Clue Attentive Network for Video-Based Person Re-Identification. Bingliang Jiao, Liying Gao, Peng Wang |
| 2022 | The Impact of Dataset Splits on Classification Performance in Medical Videos. Markus Fox, Klaus Schoeffmann |
| 2022 | TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval. Yongbiao Chen, Sheng Zhang, Fangxin Liu, Zhigang Chang, Mang Ye, Zhengwei Qi |
| 2022 | TransPCC: Towards Deep Point Cloud Compression via Transformers. Zujie Liang, Fan Liang |
| 2022 | TriReID: Towards Multi-Modal Person Re-Identification via Descriptive Fusion Model. Yajing Zhai, Yawen Zeng, Da Cao, Shaofei Lu |
| 2022 | UF-VTON: Toward User-Friendly Virtual Try-On Network. Yuan Chang, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang |
| 2022 | Unseen Food Segmentation. Yuma Honbu, Keiji Yanai |
| 2022 | Unsupervised Contrastive Masking for Visual Haze Classification. Jingyu Li, Haokai Ma, Xiangxian Li, Zhuang Qi, Lei Meng, Xiangxu Meng |
| 2022 | VAC-Net: Visual Attention Consistency Network for Person Re-identification. Weidong Shi, Yunzhou Zhang, Shangdong Zhu, Yixiu Liu, Sonya Coleman, Dermot Kerr |
| 2022 | ViRMA: Virtual Reality Multimedia Analytics. Aaron Duane, Björn Þór Jónsson |
| 2022 | Video2Subtitle: Matching Weakly-Synchronized Sequences via Dynamic Temporal Alignment. Ben Xue, Chenchen Liu, Yadong Mu |
| 2022 | VideoCLIP: A Cross-Attention Model for Fast Video-Text Retrieval Task with Image CLIP. Yikang Li, Jenhao Hsiao, Chiuman Ho |
| 2022 | Weakly Supervised Fine-grained Recognition based on Combined Learning for Small Data and Coarse Label. Anqi Hu, Zhengxing Sun, Qian Li |
| 2022 | Weakly Supervised Pediatric Bone Age Assessment Using Ultrasonic Images via Automatic Anatomical RoI Detection. Yunyan Yan, Chuanbin Liu, Hongtao Xie, Sicheng Zhang, Zhendong Mao |
| 2022 | Weakly-supervised Cerebrovascular Segmentation Network with Shape Prior and Model Indicator. Qian Wu, Yufei Chen, Ning Huang, Xiaodong Yue |