ACM Multimedia A*

830 papers

Year Title / Authors
2022 3D Body Reconstruction Revisited: Exploring the Test-time 3D Body Mesh Refinement Strategy via Surrogate Adaptation.
Jonathan Samuel Lumentut, In Kyu Park
2022 3D Human Mesh Reconstruction by Learning to Sample Joint Adaptive Tokens for Transformers.
Youze Xue, Jiansheng Chen, Yudong Zhang, Cheng Yu, Huimin Ma, Hongbing Ma
2022 3D-CNN for Facial Micro- and Macro-expression Spotting on Long Video Sequences using Temporal Oriented Reference Frame.
Chuin Hong Yap, Moi Hoon Yap, Adrian K. Davison, Connah Kendrick, Jingting Li, Su-Jing Wang, Ryan Cunningham
2022A
Liming Zhai, Qing Guo, Xiaofei Xie, Lei Ma, Yi Estelle Wang, Yang Liu
2022A Baseline for Detecting Out-of-Distribution Examples in Image Captioning.
Gal-Lev Shalev, Gabi Shalev, Joseph Keshet
2022A Baseline for ViCo Conversational Head Generation Challenge.
Meng Liu, Shuyan Zhai, Yongqiang Li, Weili Guan, Liqiang Nie
2022A Combination of Visual-Semantic Reasoning and Text Entailment-based Boosting Algorithm for Cheapfake Detection.
Tuan-Vinh La, Minh-Son Dao, Quang-Tien Tran, Thanh-Phuc Tran, Anh-Duy Tran, Duc-Tien Dang-Nguyen
2022A Comprehensive Study of Spatiotemporal Feature Learning for Social Medial Popularity Prediction.
Chih-Chung Hsu, Pi-Ju Tsai, Ting-Chun Yeh, Xiu-Yu Hou
2022A Conversational Shopping Assistant for Online Virtual Stores.
Tiago Fornelos, Pedro Valente, Rafael Ferreira, Diogo Tavares, Diogo Silva, David Semedo, João Magalhães, Nuno Correia
2022A Deep Learning based No-reference Quality Assessment Model for UGC Videos.
Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai
2022A Dual-Masked Auto-Encoder for Robust Motion Capture with Spatial-Temporal Skeletal Token Completion.
Junkun Jiang, Jie Chen, Yike Guo
2022A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval.
Alex Falcon, Giuseppe Serra, Oswald Lanz
2022A High-resolution Image-based Virtual Try-on System in Taobao E-commerce Scenario.
Zhilong Zhou, Shiyao Wang, Tiezheng Ge, Yuning Jiang
2022A Knowledge Augmented and Multimodal-Based Framework for Video Summarization.
Jiehang Xie, Xuanbai Chen, Shao-Ping Lu, Yulu Yang
2022A Lightweight Graph Transformer Network for Human Mesh Reconstruction from 2D Human Pose.
Ce Zheng, Matías Mendieta, Pu Wang, Aidong Lu, Chen Chen
2022A Multi-Stream Approach for Video Understanding.
Lutharsanen Kunam, Luca Rossetto, Abraham Bernstein
2022A Multi-view Spectral-Spatial-Temporal Masked Autoencoder for Decoding Emotions with Self-supervised Learning.
Rui Li, Yiting Wang, Wei-Long Zheng, Bao-Liang Lu
2022A Numerical DEs Perspective on Unfolded Linearized ADMM Networks for Inverse Problems.
Weixin An, Yingjie Yue, Yuanyuan Liu, Fanhua Shang, Hongying Liu
2022A Parameter-free Multi-view Information Bottleneck Clustering Method by Cross-view Weighting.
Shizhe Hu, Ruilin Geng, Zhaoxu Cheng, Chaoyang Zhang, Guoliang Zou, Zhengzheng Lou, Yangdong Ye
2022A Platform for Deploying the TFE Ecosystem of Automatic Speech Recognition.
Yuanfeng Song, Rongzhong Lian, Yixin Chen, Di Jiang, Xuefang Zhao, Conghui Tan, Qian Xu, Raymond Chi-Wing Wong
2022A Probabilistic Model for Controlling Diversity and Accuracy of Ambiguous Medical Image Segmentation.
Wei Zhang, Xiaohong Zhang, Sheng Huang, Yuting Lu, Kun Wang
2022A Region-based Document VQA.
Xinya Wu, Duo Zheng, Ruonan Wang, Jiashen Sun, Minzhen Hu, Fangxiang Feng, Xiaojie Wang, Huixing Jiang, Fan Yang
2022A Textual-Visual-Entailment-based Unsupervised Algorithm for Cheapfake Detection.
Quang-Tien Tran, Thanh-Phuc Tran, Minh-Son Dao, Tuan-Vinh La, Anh-Duy Tran, Duc-Tien Dang-Nguyen
2022A Transformer Based Approach for Activity Detection.
Gulshan Sharma, Abhinav Dhall, Ramanathan Subramanian
2022A Tree-Based Structure-Aware Transformer Decoder for Image-To-Markup Generation.
Shuhan Zhong, Sizhe Song, Guanyao Li, S.-H. Gary Chan
2022A Unified End-to-End Retriever-Reader Framework for Knowledge-based VQA.
Yangyang Guo, Liqiang Nie, Yongkang Wong, Yibing Liu, Zhiyong Cheng, Mohan S. Kankanhalli
2022A Unified Framework against Topology and Class Imbalance.
Junyu Chen, Qianqian Xu, Zhiyong Yang, Xiaochun Cao, Qingming Huang
2022ABPN: Apex and Boundary Perception Network for Micro- and Macro-Expression Spotting.
Wenhao Leng, Sirui Zhao, Yiming Zhang, Shifeng Liu, Xinglong Mao, Hao Wang, Tong Xu, Enhong Chen
2022ADGNet: Attention Discrepancy Guided Deep Neural Network for Blind Image Quality Assessment.
Xiaoyu Ma, Yaqi Wang, Chang Liu, Suiyu Zhang, Dingguo Yu
2022AEDNet: Asynchronous Event Denoising with Spatial-Temporal Correlation among Irregular Data.
Huachen Fang, Jinjian Wu, Leida Li, Junhui Hou, Weisheng Dong, Guangming Shi
2022AGTGAN: Unpaired Image Translation for Photographic Ancient Character Generation.
Hongxiang Huang, Daihui Yang, Gang Dai, Zhen Han, Yuyi Wang, Kin-Man Lam, Fan Yang, Shuangping Huang, Yongge Liu, Mengchao He
2022AI Carpet: Automatic Generation of Aesthetic Carpet Pattern.
Ziyi Wang, Xingqi Wang, Zeyu Jin, Xiaohan Li, Shikun Sun, Jia Jia
2022AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation.
Yiyang Ma, Huan Yang, Bei Liu, Jianlong Fu, Jiaying Liu
2022AI-VQA: Visual Question Answering based on Agent Interaction with Interpretability.
Rengang Li, Cong Xu, Zhenhua Guo, Baoyu Fan, Runze Zhang, Wei Liu, Yaqian Zhao, Weifeng Gong, Endong Wang
2022ALEGORIA: Joint Multimodal Search and Spatial Navigation into the Geographic Iconographic Heritage.
Florent Geniet, Valérie Gouet-Brunet, Mathieu Brédif
2022APCCPA '22: 1st International Workshop on Advances in Point Cloud Compression, Processing and Analysis.
Wei Gao, Ge Li, Hui Yuan, Raouf Hamzaoui, Zhu Li, Shan Liu
2022APPTracker: Improving Tracking Multiple Objects in Low-Frame-Rate Videos.
Tao Zhou, Wenhan Luo, Zhiguo Shi, Jiming Chen, Qi Ye
2022ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design.
Xujie Zhang, Yu Sha, Michael C. Kampffmeyer, Zhenyu Xie, Zequn Jie, Chengwen Huang, Jianqing Peng, Xiaodan Liang
2022ARRA: Absolute-Relative Ranking Attack against Image Retrieval.
Siyuan Li, Xing Xu, Zailei Zhou, Yang Yang, Guoqing Wang, Heng Tao Shen
2022AVA-AVD: Audio-visual Speaker Diarization in the Wild.
Eric Zhongcong Xu, Zeyang Song, Satoshi Tsutsui, Chao Feng, Mang Ye, Mike Zheng Shou
2022AVQA: A Dataset for Audio-Visual Question Answering on Videos.
Pinci Yang, Xin Wang, Xuguang Duan, Hong Chen, Runze Hou, Cong Jin, Wenwu Zhu
2022Accelerating General-purpose Lossless Compression via Simple and Scalable Parameterization.
Yu Mao, Yufei Cui, Tei-Wei Kuo, Chun Jason Xue
2022Action-conditioned On-demand Motion Generation.
Qiujing Lu, Yipeng Zhang, Mingjian Lu, Vwani Roychowdhury
2022Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning.
Feifei Shao, Yawei Luo, Ping Liu, Jie Chen, Yi Yang, Yulei Lu, Jun Xiao
2022Active Patterns Perceived for Stochastic Video Prediction.
Yechao Xu, Zhengxing Sun, Qian Li, Yunhan Sun, Shoutong Luo
2022AdaMask: Enabling Machine-Centric Video Streaming with Adaptive Frame Masking for DNN Inference Offloading.
Shengzhong Liu, Tianshi Wang, Jinyang Li, Dachun Sun, Mani B. Srivastava, Tarek F. Abdelzaher
2022Adaptive Affine Transformation: A Simple and Effective Operation for Spatial Misaligned Image Generation.
Zhimeng Zhang, Yu Ding
2022Adaptive Anti-Bottleneck Multi-Modal Graph Learning Network for Personalized Micro-video Recommendation.
Desheng Cai, Shengsheng Qian, Quan Fang, Jun Hu, Changsheng Xu
2022Adaptive Camera Margin for Mask-guided Domain Adaptive Person Re-identification.
Rui Wang, Feng Chen, Jun Tang, Pu Yan
2022Adaptive Dual Motion Model for Facial Micro-Expression Generation.
Xinqi Fan, Ali Raza Shahid, Hong Yan
2022Adaptive Hierarchical Pooling for Weakly-supervised Sound Event Detection.
Lijian Gao, Ling Zhou, Qirong Mao, Ming Dong
2022Adaptive Hypergraph Convolutional Network for No-Reference 360-degree Image Quality Assessment.
Jun Fu, Chen Hou, Wei Zhou, Jiahua Xu, Zhibo Chen
2022Adaptive Mixture of Experts Learning for Generalizable Face Anti-Spoofing.
Qianyu Zhou, Ke-Yue Zhang, Taiping Yao, Ran Yi, Shouhong Ding, Lizhuang Ma
2022Adaptive Structural Similarity Preserving for Unsupervised Cross Modal Hashing.
Liang Li, Baihua Zheng, Weiwei Sun
2022Adaptive Transformer-Based Conditioned Variational Autoencoder for Incomplete Social Event Classification.
Zhangming Li, Shengsheng Qian, Jie Cao, Quan Fang, Changsheng Xu
2022Adaptively Learning Low-high Frequency Information Integration for Pan-sharpening.
Man Zhou, Jie Huang, Chongyi Li, Hu Yu, Keyu Yan, Naishan Zheng, Feng Zhao
2022Adaptively-weighted Integral Space for Fast Multiview Clustering.
Man-Sheng Chen, Tuo Liu, Chang-Dong Wang, Dong Huang, Jian-Huang Lai
2022Adjustable Memory-efficient Image Super-resolution via Individual Kernel Sparsity.
Xiaotong Luo, Mingliang Dai, Yulun Zhang, Yuan Xie, Ding Liu, Yanyun Qu, Yun Fu, Junping Zhang
2022Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation.
Xintian Wu, Hanbin Zhao, Liangli Zheng, Shouhong Ding, Xi Li
2022AdsCVLR: Commercial Visual-Linguistic Representation Modeling in Sponsored Search.
Yongjie Zhu, Chunhui Han, Yuefeng Zhan, Bochen Pang, Zhaoju Li, Hao Sun, Si Li, Boxin Shi, Nan Duan, Weiwei Deng, Ruofei Zhang, Liangjie Zhang, Qi Zhang
2022Advances in Quality Assessment Of Video Streaming Systems: Algorithms, Methods, Tools.
Yiannis Andreopoulos, Cosmin Stejerean
2022AesUST: Towards Aesthetic-Enhanced Universal Style Transfer.
Zhizhong Wang, Zhanjie Zhang, Lei Zhao, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu
2022AggCast: Practical Cost-effective Scheduling for Large-scale Cloud-edge Crowdsourced Live Streaming.
Rui-Xiao Zhang, Changpeng Yang, Xiaochan Wang, Tianchi Huang, Chenglei Wu, Jiangchuan Liu, Lifeng Sun
2022Alexa, let's work together! How Alexa Helps Customers Complete Tasks with Verbal and Visual Guidance in the Alexa Prize TaskBot Challenge.
Yoelle Maarek
2022Align and Adapt: A Two-stage Adaptation Framework for Unsupervised Domain Adaptation.
Yan Yu, Yuchen Zhai, Yin Zhang
2022Align, Reason and Learn: Enhancing Medical Vision-and-Language Pre-training with Knowledge.
Zhihong Chen, Guanbin Li, Xiang Wan
2022All is Noise: In Search of Enlightenment, a VR Experience.
Manuel Silva, Luana Santos, Luís Teixeira, José Vasco Carvalho
2022Alleviating Style Sensitivity then Adapting: Source-free Domain Adaptation for Medical Image Segmentation.
Yalan Ye, Ziqi Liu, Yangwuyong Zhang, Jingjing Li, Hengtao Shen
2022An AI Powered Re-Identification System for Real-time Contextual Multimedia Applications.
Giuseppe Becchi, Andrea Ferracani, Filippo Principi, Alberto Del Bimbo
2022An Efficient Multi-View Multimodal Data Processing Framework for Social Media Popularity Prediction.
Yunpeng Tan, Fangyu Liu, Bowei Li, Zheng Zhang, Bo Zhang
2022An End-to-End Conditional Generative Adversarial Network Based on Depth Map for 3D Craniofacial Reconstruction.
Niankai Zhang, Junli Zhao, Fuqing Duan, Zhenkuan Pan, Zhongke Wu, Mingquan Zhou, Xianfeng Gu
2022An Image-to-video Model for Real-Time Video Enhancement.
Dongyu She, Kun Xu
2022Angular Gap: Reducing the Uncertainty of Image Difficulty through Model Calibration.
Bohua Peng, Mobarakol Islam, Mei Tu
2022Anomaly Warning: Learning and Memorizing Future Semantic Patterns for Unsupervised Ex-ante Potential Anomaly Prediction.
Jiaxu Leng, Mingpi Tan, Xinbo Gao, Wen Lu, Zongyi Xu
2022Approximate Shifted Laplacian Reconstruction for Multiple Kernel Clustering.
Jiali You, Zhenwen Ren, Quansen Sun, Yuan Sun, Xingfeng Li
2022Arbitrary Bit-width Network: A Joint Layer-Wise Quantization and Adaptive Inference Approach.
Chen Tang, Haoyu Zhai, Kai Ouyang, Zhi Wang, Yifei Zhu, Wenwu Zhu
2022Asymmetric Adversarial-based Feature Disentanglement Learning for Cross-Database Micro-Expression Recognition.
Shiting Xu, Zhiheng Zhou, Junyuan Shang
2022AtHom: Two Divergent Attentions Stimulated By Homomorphic Training in Text-to-Image Synthesis.
Zhenbo Shi, Zhi Chen, Zhenbo Xu, Wei Yang, Liusheng Huang
2022Atrous Pyramid Transformer with Spectral Convolution for Image Inpainting.
Muqi Huang, Lefei Zhang
2022Attack is the Best Defense: Towards Preemptive-Protection Person Re-Identification.
Lin Wang, Wanqian Zhang, Dayan Wu, Fei Zhu, Bo Li
2022Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning.
Xin Jin, Shu Zhao, Le Zhang, Xin Zhao, Qiang Deng, Chaoen Xiao
2022Attribute-guided Dynamic Routing Graph Network for Transductive Few-shot Learning.
Chaofan Chen, Xiaoshan Yang, Ming Yan, Changsheng Xu
2022Audio Features from the Wav2Vec 2.0 Embeddings for the ACM Multimedia 2022 Stuttering Challenge.
Claude Montacié, Marie-José Caraty, Nikola Lackovic
2022Audio-driven Talking Head Generation with Transformer and 3D Morphable Model.
Ricong Huang, Weizhi Zhong, Guanbin Li
2022Augmented Dual-Contrastive Aggregation Learning for Unsupervised Visible-Infrared Person Re-Identification.
Bin Yang, Mang Ye, Jun Chen, Zesen Wu
2022Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training.
Yingwei Pan, Yehao Li, Jianjie Luo, Jun Xu, Ting Yao, Tao Mei
2022Automatic Piano Fingering from Partially Annotated Scores using Autoregressive Neural Networks.
Pedro Ramoneda, Dasaem Jeong, Eita Nakamura, Xavier Serra, Marius Miron
2022Autonomous UAV Cinematography.
Ioannis Pitas, Ioannis Mademlis
2022Backdoor Attacks on Crowd Counting.
Yuhua Sun, Tailai Zhang, Xingjun Ma, Pan Zhou, Jian Lou, Zichuan Xu, Xing Di, Yu Cheng, Lichao Sun
2022Background Layout Generation and Object Knowledge Transfer for Text-to-Image Generation.
Zhuowei Chen, Zhendong Mao, Shancheng Fang, Bo Hu
2022BadHash: Invisible Backdoor Attacks against Deep Hashing with Clean Label.
Shengshan Hu, Ziqi Zhou, Yechao Zhang, Leo Yu Zhang, Yifeng Zheng, Yuanyuan He, Hai Jin
2022Balanced Gradient Penalty Improves Deep Long-Tailed Learning.
Dong Wang, Yicheng Liu, Liangji Fang, Fanhua Shang, Yuanyuan Liu, Hongying Liu
2022Bandwidth-Efficient Multi-video Prefetching for Short Video Streaming.
Xutong Zuo, Yishu Li, Mohan Xu, Wei Tsang Ooi, Jiangchuan Liu, Junchen Jiang, Xinggong Zhang, Kai Zheng, Yong Cui
2022Bayesian based Re-parameterization for DNN Model Pruning.
Xiaotong Lu, Teng Xi, Baopu Li, Gang Zhang, Weisheng Dong, Guangming Shi
2022Beauty: Machine Microbial Interface as Artistic Experimentation.
Johnny DiBlasi, Carlos Castellanos, Bello Bello
2022Being's Spread: Mirror of Life Interconnection.
Xinrui Wang, Yulu Song, Xiaohui Wang
2022Benign Adversarial Attack: Tricking Models for Goodness.
Jitao Sang, Xian Zhao, Jiaming Zhang, Zhiyu Lin
2022Best of Both Worlds: See and Understand Clearly in the Dark.
Xinwei Xue, Jia He, Long Ma, Yi Wang, Xin Fan, Risheng Liu
2022BetterSight: Immersive Vision Training for Basketball Players.
Pin-Xuan Liu, Tse-Yu Pan, Hsin-Shih Lin, Hung-Kuo Chu, Min-Chun Hu
2022Beyond Geo-localization: Fine-grained Orientation of Street-view Images by Cross-view Matching with Satellite Imagery.
Wenmiao Hu, Yichen Zhang, Yuxuan Liang, Yifang Yin, Andrei Georgescu, An Tran, Hannes Kruppa, See-Kiong Ng, Roger Zimmermann
2022Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation.
Weili Guan, Xuemeng Song, Haoyu Zhang, Meng Liu, Chung-Hsing Yeh, Xiaojun Chang
2022Bidirectional Self-Training with Multiple Anisotropic Prototypes for Domain Adaptive Semantic Segmentation.
Yulei Lu, Yawei Luo, Li Zhang, Zheyang Li, Yi Yang, Jun Xiao
2022Bidirectionally Learning Dense Spatio-temporal Feature Propagation Network for Unsupervised Video Object Segmentation.
Jiaqing Fan, Tiankang Su, Kaihua Zhang, Qingshan Liu
2022Bipartite Graph-based Discriminative Feature Learning for Multi-View Clustering.
Weiqing Yan, Jindong Xu, Jinglei Liu, Guanghui Yue, Chang Tang
2022Blind Robust Video Watermarking Based on Adaptive Region Selection and Channel Reference.
Qinwei Chang, Leichao Huang, Shaoteng Liu, Hualuo Liu, Tianshu Yang, Yexin Wang
2022BlumNet: Graph Component Detection for Object Skeleton Extraction.
Yulu Zhang, Liang Sang, Marcin Grzegorzek, John See, Cong Yang
2022Boat in the Sky: Background Decoupling and Object-aware Pooling for Weakly Supervised Semantic Segmentation.
Jianjun Xu, Hongtao Xie, Hai Xu, Yuxin Wang, Sun'ao Liu, Yongdong Zhang
2022Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation.
Michal Balazia, Philipp Müller, Ákos Levente Tánczos, August von Liechtenstein, François Brémond
2022Boosting Single-Frame 3D Object Detection by Simulating Multi-Frame Point Clouds.
Wu Zheng, Li Jiang, Fanbin Lu, Yangyang Ye, Chi-Wing Fu
2022Boosting Video-Text Retrieval with Explicit High-Level Semantics.
Haoran Wang, Di Xu, Dongliang He, Fu Li, Zhong Ji, Jungong Han, Errui Ding
2022Box-FaceS: A Bidirectional Method for Box-Guided Face Component Editing.
Wenjing Huang, Shikui Tu, Lei Xu
2022Brain Topography Adaptive Network for Satisfaction Modeling in Interactive Information Access System.
Ziyi Ye, Xiaohui Xie, Yiqun Liu, Zhihong Wang, Xuesong Chen, Min Zhang, Shaoping Ma
2022Breaking Isolation: Multimodal Graph Fusion for Multimedia Recommendation by Edge-wise Modulation.
Feiyu Chen, Junjie Wang, Yinwei Wei, Hai-Tao Zheng, Jie Shao
2022C
Junsheng Wang, Tiantian Gong, Zhixiong Zeng, Changchang Sun, Yan Yan
2022CACOLIT: Cross-domain Adaptive Co-learning for Imbalanced Image-to-Image Translation.
Yijun Wang, Tao Liang, Jianxin Lin
2022CAIBC: Capturing All-round Information Beyond Color for Text-based Person Retrieval.
Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li
2022CALM: Commen-Sense Knowledge Augmentation for Document Image Understanding.
Qinyi Du, Qingqing Wang, Keqian Li, Jidong Tian, Liqiang Xiao, Yaohui Jin
2022CAPTCHA the Flag: Interactive Plotter Livestream.
Tiago Rorke
2022CAliC: Accurate and Efficient Image-Text Retrieval via Contrastive Alignment and Visual Contexts Modeling.
Hongyu Gao, Chao Zhu, Mengyin Liu, Weibo Gu, Hongfa Wang, Wei Liu, Xu-Cheng Yin
2022CEA++'22: 1st International Workshop on Multimedia for Cooking, Eating, and related APPlications.
Yoko Yamakata, Atsushi Hashimoto, Jingjing Chen
2022CLIPTexture: Text-Driven Texture Synthesis.
Yiren Song
2022CLOP: Video-and-Language Pre-Training with Knowledge Regularizations.
Guohao Li, Hu Yang, Feng He, Zhifan Feng, Yajuan Lyu, Hua Wu, Haifeng Wang
2022CLUT-Net: Learning Adaptively Compressed Representations of 3DLUTs for Lightweight Image Enhancement.
Fengyi Zhang, Hui Zeng, Tianjun Zhang, Lin Zhang
2022CMAL: A Novel Cross-Modal Associative Learning Framework for Vision-Language Pre-Training.
Zhiyuan Ma, Jianjun Li, Guohui Li, Kaiyan Huang
2022CRNet: Unsupervised Color Retention Network for Blind Motion Deblurring.
Suiyi Zhao, Zhao Zhang, Richang Hong, Mingliang Xu, Haijun Zhang, Meng Wang, Shuicheng Yan
2022CVNets: High Performance Library for Computer Vision.
Sachin Mehta, Farzad Abdolhosseini, Mohammad Rastegari
2022Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation.
Xiyu Wang, Yuecong Xu, Jianfei Yang, Kezhi Mao
2022Camera-specific Informative Data Augmentation Module for Unbalanced Person Re-identification.
Pingting Hong, Dayan Wu, Bo Li, Weiping Wang
2022Can Language Understand Depth?
Renrui Zhang, Ziyao Zeng, Ziyu Guo, Yafeng Li
2022Caption-Aware Medical VQA via Semantic Focusing and Progressive Cross-Modality Comprehension.
Fu'ze Cong, Shibiao Xu, Li Guo, Yinbing Tian
2022CariPainter: Sketch Guided Interactive Caricature Generation.
Xin Huang, Dong Liang, Hongrui Cai, Juyong Zhang, Jinyuan Jia
2022Cartoon-Flow: A Flow-Based Generative Adversarial Network for Arbitrary-Style Photo Cartoonization.
Jieun Lee, Hyeonwoo Kim, Jonghwa Shim, Eenjun Hwang
2022Cellular Trending: Fragmented Information Dissemination on Social Media Through Generative Lens.
Bo Shui, Xiaohui Wang
2022Certifying Better Robust Generalization for Unsupervised Domain Adaptation.
Zhiqiang Gao, Shufei Zhang, Kaizhu Huang, Qiufeng Wang, Rui Zhang, Chaoliang Zhong
2022CharFormer: A Glyph Fusion based Attentive Framework for High-precision Character Image Denoising.
Daqian Shi, Xiaolei Diao, Lida Shi, Hao Tang, Yang Chi, Chuntao Li, Hao Xu
2022ChartStamp: Robust Chart Embedding for Real-World Applications.
Jiayun Fu, Bin B. Zhu, Haidong Zhang, Yayi Zou, Song Ge, Weiwei Cui, Yun Wang, Dongmei Zhang, Xiaojing Ma, Hai Jin
2022ChebyLighter: Optimal Curve Estimation for Low-light Image Enhancement.
Jinwang Pan, Deming Zhai, Yuanchao Bai, Junjun Jiang, Debin Zhao, Xianming Liu
2022Chinese Character Recognition with Augmented Character Profile Matching.
Xinyan Zu, Haiyang Yu, Bin Li, Xiangyang Xue
2022ChoreoGraph: Music-conditioned Automatic Dance Choreography over a Style and Tempo Consistent Dynamic Graph.
Ho Yin Au, Jie Chen, Junkun Jiang, Yike Guo
2022Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations.
Qian Yang, Yunxin Li, Baotian Hu, Lin Ma, Yuxin Ding, Min Zhang
2022Class Discriminative Adversarial Learning for Unsupervised Domain Adaptation.
Lihua Zhou, Mao Ye, Xiatian Zhu, Shuaifeng Li, Yiguang Liu
2022Class Gradient Projection For Continual Learning.
Cheng Chen, Ji Zhang, Jingkuan Song, Lianli Gao
2022Cloud2Sketch: Augmenting Clouds with Imaginary Sketches.
Zhaoyi Wan, Dejia Xu, Zhangyang Wang, Jian Wang, Jiebo Luo
2022Clustering Generative Adversarial Networks for Story Visualization.
Bowen Li, Philip H. S. Torr, Thomas Lukasiewicz
2022Co-Completion for Occluded Facial Expression Recognition.
Zhen Xing, Weimin Tan, Ruian He, Yangle Lin, Bo Yan
2022CoHOZ: Contrastive Multimodal Prompt Tuning for Hierarchical Open-set Zero-shot Recognition.
Ning Liao, Yifeng Liu, Xiaobo Li, Chenyi Lei, Guoxin Wang, Xian-Sheng Hua, Junchi Yan
2022Collaboration Superpowers: The Process of Crafting an Interactive Storytelling Animation.
Sofia Hinckel Dias, Sara Rodrigues Silva, Beatriz Rodrigues Silva, Rui Nóbrega
2022Combining Vision and Language Representations for Patch-based Identification of Lexico-Semantic Relations.
Prince Jha, Gaël Dias, Alexis Lechervy, José G. Moreno, Anubhav Jangra, Sebastião Pais, Sriparna Saha
2022Complementarity-Enhanced and Redundancy-Minimized Collaboration Network for Multi-agent Perception.
Guiyang Luo, Hui Zhang, Quan Yuan, Jinglin Li
2022Complementary Graph Representation Learning for Functional Neuroimaging Identification.
Rongyao Hu, Liang Peng, Jiangzhang Gan, Xiaoshuang Shi, Xiaofeng Zhu
2022Composite Photograph Harmonization with Complete Background Cues.
Yazhou Xing, Yu Li, Xintao Wang, Ye Zhu, Qifeng Chen
2022Compound Batch Normalization for Long-tailed Image Classification.
Lechao Cheng, Chaowei Fang, Dingwen Zhang, Guanbin Li, Gang Huang
2022Comprehensive Relationship Reasoning for Composed Query Based Image Retrieval.
Feifei Zhang, Ming Yan, Ji Zhang, Changsheng Xu
2022Compute to Tell the Tale: Goal-Driven Narrative Generation.
Yongkang Wong, Shaojing Fan, Yangyang Guo, Ziwei Xu, Karen Stephen, Rishabh Sheoran, Anusha Bhamidipati, Vivek Barsopia, Jianquan Liu, Mohan S. Kankanhalli
2022Concept Propagation via Attentional Knowledge Graph Reasoning for Video-Text Retrieval.
Sheng Fang, Shuhui Wang, Junbao Zhuo, Qingming Huang, Bin Ma, Xiaoming Wei, Xiaolin Wei
2022ConceptBeam: Concept Driven Target Speech Extraction.
Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino
2022Confederated Learning: Going Beyond Centralization.
Zitai Wang, Qianqian Xu, Ke Ma, Xiaochun Cao, Qingming Huang
2022Consistency Learning based on Class-Aware Style Variation for Domain Generalizable Semantic Segmentation.
Siwei Su, Haijian Wang, Meng Yang
2022Consistency-Contrast Learning for Conceptual Coding.
Jianhui Chang, Jian Zhang, Youmin Xu, Jiguo Li, Siwei Ma, Wen Gao
2022Content and Gradient Model-driven Deep Network for Single Image Reflection Removal.
Ya-Nan Zhang, Linlin Shen, Qiufu Li
2022Content based User Preference Modeling in Music Generation.
Xichu Ma, Yuchen Wang, Ye Wang
2022Continual Multi-view Clustering.
Xinhang Wan, Jiyuan Liu, Weixuan Liang, Xinwang Liu, Yi Wen, En Zhu
2022Correct Twice at Once: Learning to Correct Noisy Labels for Robust Deep Learning.
Jingzheng Li, Hailong Sun
2022Correspondence Matters for Video Referring Expression Comprehension.
Meng Cao, Ji Jiang, Long Chen, Yuexian Zou
2022Counterexample Contrastive Learning for Spurious Correlation Elimination.
Jinqiang Wang, Rui Hu, Chaoquan Jiang, Jitao Sang
2022Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis.
Teng Sun, Wenjie Wang, Liqiang Jing, Yiran Cui, Xuemeng Song, Liqiang Nie
2022Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training Models.
Yi Zhang, Junyang Wang, Jitao Sang
2022CreaGAN: An Automatic Creative Generation Framework for Display Advertising.
Shiyao Wang, Qi Liu, Yicheng Zhong, Zhilong Zhou, Tiezheng Ge, Defu Lian, Yuning Jiang
2022Cross-Compatible Embedding and Semantic Consistent Feature Construction for Sketch Re-identification.
Yafei Zhang, Yongzeng Wang, Huafeng Li, Shuang Li
2022Cross-Domain 3D Model Retrieval Based On Contrastive Learning And Label Propagation.
Dan Song, Yue Yang, Weizhi Nie, Xuanya Li, An-An Liu
2022Cross-Domain and Cross-Modal Knowledge Distillation in Domain Adaptation for 3D Semantic Segmentation.
Miaoyu Li, Yachao Zhang, Yuan Xie, Zuodong Gao, Cuihua Li, Zhizhong Zhang, Yanyun Qu
2022Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning.
Yabing Wang, Jianfeng Dong, Tianxiang Liang, Minsong Zhang, Rui Cai, Xun Wang
2022Cross-Modal Retrieval with Heterogeneous Graph Embedding.
Dapeng Chen, Min Wang, Haobin Chen, Lin Wu, Jing Qin, Wei Peng
2022Cross-Modality Domain Adaptation for Freespace Detection: A Simple yet Effective Baseline.
Yuanbin Wang, Leyan Zhu, Shaofei Huang, Tianrui Hui, Xiaojie Li, Fei Wang, Si Liu
2022Cross-Modality High-Frequency Transformer for MR Image Super-Resolution.
Chaowei Fang, Dingwen Zhang, Liang Wang, Yulun Zhang, Lechao Cheng, Junwei Han
2022Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.
Kai Niu, Linjiang Huang, Yan Huang, Peng Wang, Liang Wang, Yanning Zhang
2022Cross-modal Knowledge Graph Contrastive Learning for Machine Learning Method Recommendation.
Xianshuai Cao, Yuliang Shi, Jihu Wang, Han Yu, Xinjun Wang, Zhongmin Yan
2022Cross-modal Semantic Alignment Pre-training for Vision-and-Language Navigation.
Siying Wu, Xueyang Fu, Feng Wu, Zheng-Jun Zha
2022CrossHuman: Learning Cross-guidance from Multi-frame Images for Human Reconstruction.
Liliang Chen, Jiaqi Li, Han Huang, Yandong Guo
2022CrossNet: Boosting Crowd Counting with Localization.
Ji Zhang, Zhi-Qi Cheng, Xiao Wu, Wei Li, Jian-Jun Qiao
2022Crossmodal Few-shot 3D Point Cloud Semantic Segmentation.
Ziyu Zhao, Zhenyao Wu, Xinyi Wu, Canyu Zhang, Song Wang
2022CubeMLP: An MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation.
Hao Sun, Hongyi Wang, Jiaqing Liu, Yen-Wei Chen, Lanfen Lin
2022CurML: A Curriculum Machine Learning Library.
Yuwei Zhou, Hong Chen, Zirui Pan, Chuanhao Yan, Fanqi Lin, Xin Wang, Wenwu Zhu
2022Curriculum-NAS: Curriculum Weight-Sharing Neural Architecture Search.
Yuwei Zhou, Xin Wang, Hong Chen, Xuguang Duan, Chaoyu Guan, Wenwu Zhu
2022Customizing GAN Using Few-shot Sketches.
Syed Muhammad Israr, Feng Zhao
2022Cycle Encoding of a StyleGAN Encoder for Improved Reconstruction and Editability.
Xudong Mao, Liujuan Cao, Aurele Tohokantche Gnanha, Zhenguo Yang, Qing Li, Rongrong Ji
2022Cycle Self-Training for Semi-Supervised Object Detection with Distribution Consistency Reweighting.
Hao Liu, Bin Chen, Bo Wang, Chunpeng Wu, Feng Dai, Peng Wu
2022Cycle-Interactive Generative Adversarial Network for Robust Unsupervised Low-Light Enhancement.
Zhangkai Ni, Wenhan Yang, Hanli Wang, Shiqi Wang, Lin Ma, Sam Kwong
2022CycleHand: Increasing 3D Pose Estimation Ability on In-the-wild Monocular Image through Cyclic Flow.
Daiheng Gao, Xindi Zhang, Xingyu Chen, Andong Tan, Bang Zhang, Pan Pan, Ping Tan
2022CyclicShift: A Data Augmentation Method For Enriching Data Patterns.
Hui Lu, Xuan Cheng, Wentao Xia, Pan Deng, Minghui Liu, Tianshu Xie, Xiaomin Wang, Ming Liu
2022Cyclical Fusion: Accurate 3D Reconstruction via Cyclical Monotonicity.
Duo Chen, Zixin Tang, Yiguang Liu
2022D
Zhuo Chen, Chaoyue Wang, Haimei Zhao, Bo Yuan, Xiu Li
2022DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming.
Si-Ze Qian, Yuhong Xie, Zipeng Pan, Yuan Zhang, Tao Lin
2022DAO: Dynamic Adaptive Offloading for Video Analytics.
Taslim Murad, Anh Nguyen, Zhisheng Yan
2022DDAM '22: 1st International Workshop on Deepfake Detection for Audio Multimedia.
Jianhua Tao, Jiangyan Yi, Cunhang Fan, Ruibo Fu, Shan Liang, Pengyuan Zhang, Haizhou Li, Helen Meng, Dong Yu, Masato Akagi
2022DDGHM: Dual Dynamic Graph with Hybrid Metric Training for Cross-Domain Sequential Recommendation.
Xiaolin Zheng, Jiajie Su, Weiming Liu, Chaochao Chen
2022DEAL: An Unsupervised Domain Adaptive Framework for Graph-level Classification.
Nan Yin, Li Shen, Baopu Li, Mengzhu Wang, Xiao Luo, Chong Chen, Zhigang Luo, Xian-Sheng Hua
2022DHHN: Dual Hierarchical Hybrid Network for Weakly-Supervised Audio-Visual Video Parsing.
Xun Jiang, Xing Xu, Zhiguo Chen, Jingran Zhang, Jingkuan Song, Fumin Shen, Huimin Lu, Heng Tao Shen
2022DOMFN: A Divergence-Orientated Multi-Modal Fusion Network for Resume Assessment.
Yang Yang, Jingshuai Zhang, Fan Gao, Xiaoru Gao, Hengshu Zhu
2022DPCNet: Dual Path Multi-Excitation Collaborative Network for Facial Expression Representation Learning in Videos.
Yan Wang, Yixuan Sun, Wei Song, Shuyong Gao, Yiwen Huang, Zhaoyu Chen, Weifeng Ge, Wenqiang Zhang
2022DS-MVSNet: Unsupervised Multi-view Stereo via Depth Synthesis.
Jingliang Li, Zhengda Lu, Yiqun Wang, Ying Wang, Jun Xiao
2022DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation.
Mengqi Huang, Zhendong Mao, Penghui Wang, Quan Wang, Yongdong Zhang
2022DTR: An Information Bottleneck Based Regularization Framework for Video Action Recognition.
Jiawei Fan, Yu Zhao, Xie Yu, Lihua Ma, Junqi Liu, Fangqiu Yi, Boxun Li
2022DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias.
Yu Zheng, Chen Gao, Jingtao Ding, Lingling Yi, Depeng Jin, Yong Li, Meng Wang
2022Data Science against COVID-19: The Valencian Experience.
Nuria Oliver
2022DavarOCR: A Toolbox for OCR and Multi-Modal Document Understanding.
Liang Qiao, Hui Jiang, Ying Chen, Can Li, Pengfei Li, Zaisheng Li, Baorui Zou, Dashan Guo, Yingda Xu, Yunlu Xu, Zhanzhan Cheng, Yi Niu
2022DeViT: Deformed Vision Transformers in Video Inpainting.
Jiayin Cai, Changlin Li, Xin Tao, Chun Yuan, Yu-Wing Tai
2022Decoupling Recognition from Detection: Single Shot Self-Reliant Scene Text Spotter.
Jingjing Wu, Pengyuan Lyu, Guangming Lu, Chengquan Zhang, Kun Yao, Wenjie Pei
2022Deep Evidential Learning with Noisy Correspondence for Cross-modal Retrieval.
Yang Qin, Dezhong Peng, Xi Peng, Xu Wang, Peng Hu
2022Deep Flexible Structure Preserving Image Smoothing.
Mingjia Li, Yuanbin Fu, Xinhui Li, Xiaojie Guo
2022Deep Learning-Based Acoustic Mosquito Detection in Noisy Conditions Using Trainable Kernels and Augmentations.
Sean Campos, Devesh Khandelwal, Shwetha C. Nagaraj, Fred Nugen, Alberto Todeschini
2022Deep Learning-based Point Cloud Coding for Immersive Experiences.
Fernando Pereira
2022Deep Multi-Resolution Mutual Learning for Image Inpainting.
Huan Zheng, Zhao Zhang, Haijun Zhang, Yi Yang, Shuicheng Yan, Meng Wang
2022Deep Video Understanding with a Unified Multi-Modal Retrieval Framework.
Chen-Wei Xie, Siyang Sun, Liming Zhao, Jianmin Wu, Dangwei Li, Yun Zheng
2022Deep-BVQM: A Deep-learning Bitstream-based Video Quality Model.
Nasim Jamshidi Avanaki, Steven Schmidt, Thilo Michael, Saman Zadtootaghaj, Sebastian Möller
2022DeepWSD: Projecting Degradations in Perceptual Space to Wasserstein Distance in Deep Feature Space.
Xingran Liao, Baoliang Chen, Hanwei Zhu, Shiqi Wang, Mingliang Zhou, Sam Kwong
2022Deepfake Video Detection with Spatiotemporal Dropout Transformer.
Daichi Zhang, Fanzhao Lin, Yingying Hua, Pengju Wang, Dan Zeng, Shiming Ge
2022Deeply Exploit Visual and Language Information for Social Media Popularity Prediction.
Jianmin Wu, Liming Zhao, Dangwei Li, Chen-Wei Xie, Siyang Sun, Yun Zheng
2022Defeating DeepFakes via Adversarial Visual Reconstruction.
Ziwen He, Wei Wang, Weinan Guan, Jing Dong, Tieniu Tan
2022Defending Physical Adversarial Attack on Object Detection via Adversarial Patch-Feature Energy.
Taeheon Kim, Youngjoon Yu, Yong Man Ro
2022Delegate-based Utility Preserving Synthesis for Pedestrian Image Anonymization.
Zhenzhong Kuang, Longbin Teng, Zhou Yu, Jun Yu, Jianping Fan, Mingliang Xu
2022Delving Globally into Texture and Structure for Image Inpainting.
Haipeng Liu, Yang Wang, Meng Wang, Yong Rui
2022Delving into the Continuous Domain Adaptation.
Yinsong Xu, Zhuqing Jiang, Aidong Men, Yang Liu, Qingchao Chen
2022Delving into the Frequency: Temporally Consistent Human Motion Transfer in the Fourier Space.
Guang Yang, Wu Liu, Xinchen Liu, Xiaoyan Gu, Juan Cao, Jintao Li
2022Demographic Feature Isolation for Bias Research using Deepfakes.
Kurtis Haut, Caleb Wohn, Victor Antony, Aidan Goldfarb, Melissa Welsh, Dillanie Sumanthiran, Md. Rafayet Ali, Ehsan Hoque
2022Depth-inspired Label Mining for Unsupervised RGB-D Salient Object Detection.
Teng Yang, Yue Wang, Lu Zhang, Jinqing Qi, Huchuan Lu
2022Design What You Desire: Icon Generation from Orthogonal Application and Theme Labels.
Yinpeng Chen, Zhiyu Pan, Min Shi, Hao Lu, Zhiguo Cao, Weicai Zhong
2022DetFusion: A Detection-driven Infrared and Visible Image Fusion Network.
Yiming Sun, Bing Cao, Pengfei Zhu, Qinghua Hu
2022Detach and Attach: Stylized Image Captioning without Paired Stylized Dataset.
Yutong Tan, Zheng Lin, Peng Fu, Mingyu Zheng, Lanrui Wang, Yanan Cao, Weiping Wang
2022Developing Embodied Conversational Agents in the Unreal Engine: The FANTASIA Plugin.
Antonio Origlia, Martina Di Bratto, Maria Di Maro, Sabrina Mennella
2022DiT: Self-supervised Pre-training for Document Image Transformer.
Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei
2022Difference Residual Graph Neural Networks.
Liang Yang, Weihang Peng, Wenmiao Zhou, Bingxin Niu, Junhua Gu, Chuan Wang, Yuanfang Guo, Dongxiao He, Xiaochun Cao
2022Differentiable Cross-modal Hashing via Multimodal Transformers.
Junfeng Tu, Xueliang Liu, Zongxiang Lin, Richang Hong, Meng Wang
2022Digging Into Normal Incorporated Stereo Matching.
Zihua Liu, Songyan Zhang, Zhicheng Wang, Masatoshi Okutomi
2022Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos.
Juncheng Li, Junlin Xie, Linchao Zhu, Long Qian, Siliang Tang, Wenqiao Zhang, Haochen Shi, Shengyu Zhang, Longhui Wei, Qi Tian, Yueting Zhuang
2022DisCo: Disentangled Implicit Content and Rhythm Learning for Diverse Co-Speech Gestures Synthesis.
Haiyang Liu, Naoya Iwamoto, Zihao Zhu, Zhengqing Li, You Zhou, Elif Bozkurt, Bo Zheng
2022Disentangled Representation Learning for Multimodal Emotion Recognition.
Dingkang Yang, Shuai Huang, Haopeng Kuang, Yangtao Du, Lihua Zhang
2022Disparity-based Stereo Image Compression with Aligned Cross-View Priors.
Yongqi Zhai, Luyang Tang, Yi Ma, Rui Peng, Ronggang Wang
2022Display of 3D Illuminations using Flying Light Specks.
Shahram Ghandeharizadeh
2022Distance Matters in Human-Object Interaction Detection.
Guangzhi Wang, Yangyang Guo, Yongkang Wong, Mohan S. Kankanhalli
2022Distilling Resolution-robust Identity Knowledge for Texture-Enhanced Face Hallucination.
Qiqi Bao, Rui Zhu, Bowen Gang, Pengyang Zhao, Wenming Yang, Qingmin Liao
2022Diverse Human Motion Prediction via Gumbel-Softmax Sampling from an Auxiliary Space.
Lingwei Dang, Yongwei Nie, Chengjiang Long, Qing Zhang, Guiqing Li
2022DoF-NeRF: Depth-of-Field Meets Neural Radiance Fields.
Zijin Wu, Xingyi Li, Juewen Peng, Hao Lu, Zhiguo Cao, Weicai Zhong
2022Domain Adaptation for Time-Series Classification to Mitigate Covariate Shift.
Felix Ott, David Rügamer, Lucas Heublein, Bernd Bischl, Christopher Mutschler
2022Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction.
Jingye Wang, Ruoyi Du, Dongliang Chang, Kongming Liang, Zhanyu Ma
2022Domain Reconstruction and Resampling for Robust Salient Object Detection.
Senbo Yan, Liang Peng, Chuer Yu, Zheng Yang, Haifeng Liu, Deng Cai
2022Domain-Specific Conditional Jigsaw Adaptation for Enhancing Transferability and Discriminability.
Qi He, Zhaoquan Yuan, Xiao Wu, Jun-Yan He
2022Domain-Specific Fusion Of Objective Video Quality Metrics.
Aaron Chadha, Ioannis Katsavounidis, Ayan Kumar Bhunia, Cosmin Stejerean, Muhammad Umar Karim Khan, Yiannis Andreopoulos
2022DomainPlus: Cross Transform Domain Learning towards High Dynamic Range Imaging.
Bolun Zheng, Xiaokai Pan, Hua Zhang, Xiaofei Zhou, Gregory G. Slabaugh, Chenggang Yan, Shanxin Yuan
2022Draw Your Art Dream: Diverse Digital Art Synthesis with Multimodal Guided Diffusion.
Nisha Huang, Fan Tang, Weiming Dong, Changsheng Xu
2022DrawMon: A Distributed System for Detection of Atypical Sketch Content in Concurrent Pictionary Games.
Nikhil Bansal, Kartik Gupta, Kiruthika Kannan, Sivani Pentapati, Ravi Kiran Sarvadevabhatla
2022Dream Painter: An Interactive Art Installation Bridging Audience Interaction, Robotics, and Creative AI.
Varvara Guljajeva, Mar Canet Sola
2022Dual Contrastive Learning for Spatio-temporal Representation.
Shuangrui Ding, Rui Qian, Hongkai Xiong
2022Dual Part Discovery Network for Zero-Shot Learning.
Jiannan Ge, Hongtao Xie, Shaobo Min, Pandeng Li, Yongdong Zhang
2022DualSign: Semi-Supervised Sign Language Production with Balanced Multi-Modal Multi-Task Dual Transformation.
Wencan Huang, Zhou Zhao, Jinzheng He, Mingmin Zhang
2022DuetFace: Collaborative Privacy-Preserving Face Recognition via Channel Splitting in the Frequency Domain.
Yuxi Mi, Yuge Huang, Jiazhen Ji, Hongquan Liu, Xingkun Xu, Shouhong Ding, Shuigeng Zhou
2022Dynamic Graph Modeling for Weakly-Supervised Temporal Action Localization.
Haichao Shi, Xiaoyu Zhang, Changsheng Li, Lixing Gong, Yong Li, Yongjun Bao
2022Dynamic Graph Reasoning for Multi-person 3D Pose Estimation.
Zhongwei Qiu, Qiansheng Yang, Jian Wang, Dongmei Fu
2022Dynamic Incomplete Multi-view Imputing and Clustering.
Xingfeng Li, Quansen Sun, Zhenwen Ren, Yinghui Sun
2022Dynamic Prototype Mask for Occluded Person Re-Identification.
Lei Tan, Pingyang Dai, Rongrong Ji, Yongjian Wu
2022Dynamic Scene Graph Generation via Temporal Prior Inference.
Shuang Wang, Lianli Gao, Xinyu Lyu, Yuyu Guo, Pengpeng Zeng, Jingkuan Song
2022Dynamic Spatio-Temporal Modular Network for Video Question Answering.
Zi Qian, Xin Wang, Xuguang Duan, Hong Chen, Wenwu Zhu
2022Dynamic Transformer for Few-shot Instance Segmentation.
Haochen Wang, Jie Liu, Yongtuo Liu, Subhransu Maji, Jan-Jakob Sonke, Efstratios Gavves
2022Dynamic Weighted Semantic Correspondence for Few-Shot Image Generative Adaptation.
Xingzhong Hou, Boxiao Liu, Shuai Zhang, Lulin Shi, Zite Jiang, Haihang You
2022Dynamically Adjust Word Representations Using Unaligned Multimodal Information.
Jiwei Guo, Jiajia Tang, Weichen Dai, Yu Ding, Wanzeng Kong
2022EASE: Robust Facial Expression Recognition via Emotion Ambiguity-SEnsitive Cooperative Networks.
Lijuan Wang, Guoli Jia, Ning Jiang, Haiying Wu, Jufeng Yang
2022ELMformer: Efficient Raw Image Restoration with a Locally Multiplicative Transformer.
Jiaqi Ma, Shengyuan Yan, Lefei Zhang, Guoli Wang, Qian Zhang
2022Early-Learning regularized Contrastive Learning for Cross-Modal Retrieval with Noisy Labels.
Tianyuan Xu, Xueliang Liu, Zhen Huang, Dan Guo, Richang Hong, Meng Wang
2022Effective Video Abnormal Event Detection by Learning A Consistency-Aware High-Level Feature Extractor.
Guang Yu, Siqi Wang, Zhiping Cai, Xinwang Liu, Chengkun Wu
2022Efficient Anchor Learning-based Multi-view Clustering - A Late Fusion Method.
Tiejian Zhang, Xinwang Liu, En Zhu, Sihang Zhou, Zhibin Dong
2022Efficient Hash Code Expansion by Recycling Old Bits.
Dayan Wu, Qinghang Su, Bo Li, Weiping Wang
2022Efficient Modeling of Future Context for Image Captioning.
Zhengcong Fei
2022Efficient Multiple Kernel Clustering via Spectral Perturbation.
Chang Tang, Zhenglai Li, Weiqing Yan, Guanghui Yue, Wei Zhang
2022EliMRec: Eliminating Single-modal Bias in Multimedia Recommendation.
Xiaohao Liu, Zhulin Tao, Jiahong Shao, Lifang Yang, Xianglin Huang
2022Eliminating Spatial Ambiguity for Weakly Supervised 3D Object Detection without Spatial Labels.
Haizhuang Liu, Huimin Ma, Yilin Wang, Bochao Zou, Tianyu Hu, Rongquan Wang, Jiansheng Chen
2022Emotional Machines: Toward Affective Virtual Environments.
Jorge Forero, Gilberto Bernardes, Mónica Mendes
2022Enabling Effective Low-Light Perception using Ubiquitous Low-Cost Visible-Light Cameras.
Igor Morawski
2022End-to-End 3D Face Reconstruction with Expressions and Specular Albedos from Single In-the-wild Images.
Qixin Deng, Binh Huy Le, Aobo Jin, Zhigang Deng
2022End-to-End Compound Table Understanding with Multi-Modal Modeling.
Zaisheng Li, Yi Li, Liang Qiao, Pengfei Li, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Xi Li
2022End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge.
Shakeel A. Sheikh, Md. Sahidullah, Slim Ouni, Fabrice Hirsch
2022Energy-Based Domain Generalization for Face Anti-Spoofing.
Zhekai Du, Jingjing Li, Lin Zuo, Lei Zhu, Ke Lu
2022Engaging Museum Visitors with Gamification of Body and Facial Expressions.
Maria Giovanna Donadio, Filippo Principi, Andrea Ferracani, Marco Bertini, Alberto Del Bimbo
2022Enhancement by Your Aesthetic: An Intelligible Unsupervised Personalized Enhancer for Low-Light Images.
Naishan Zheng, Jie Huang, Qi Zhu, Man Zhou, Feng Zhao, Zheng-Jun Zha
2022Enhancing Image Rescaling using Dual Latent Variables in Invertible Neural Network.
Min Zhang, Zhihong Pan, Xin Zhou, C.-C. Jay Kuo
2022Enhancing Semi-Supervised Learning with Cross-Modal Knowledge.
Hui Zhu, Yongchun Lü, Hongbin Wang, Xunyi Zhou, Qin Ma, Yanhong Liu, Ning Jiang, Xin Wei, Linchengxi Zeng, Xiaofang Zhao
2022Enlarging the Long-time Dependencies via RL-based Memory Network in Movie Affective Analysis.
Jie Zhang, Yin Zhao, Kai Qian
2022Enriching Existing Educational Video Datasets to Improve Slide Classification and Analysis.
Travis Seng
2022Equivariant and Invariant Grounding for Video Question Answering.
Yicong Li, Xiang Wang, Junbin Xiao, Tat-Seng Chua
2022Error Concealment of Dynamic 3D Point Cloud Streaming.
Tzu-Kuan Hung, I-Chun Huang, Samuel Rhys Cox, Wei Tsang Ooi, Cheng-Hsin Hsu
2022Estimation of Reliable Proposal Quality for Temporal Action Detection.
Junshan Hu, Chaoxu Guo, Liansheng Zhuang, Biao Wang, Tiezheng Ge, Yuning Jiang, Houqiang Li
2022EuglPollock: Rethinking Interspecies Collaboration through Art Making.
Kyungwon Lee, Yu-Kyung Jang, Jaewoo Jung, Dong Hwan Kim, Hyun-Jean Lee, Seung Ah Lee
2022Evaluating the Impact of Tiled User-Adaptive Real-Time Point Cloud Streaming on VR Remote Communication.
Shishir Subramanyam, Irene Viola, Jack Jansen, Evangelos Alexiou, Alan Hanjalic, Pablo César
2022Event-guided Video Clip Generation from Blurry Images.
Xin Ding, Tsuyoshi Takatani, Zhongyuan Wang, Ying Fu, Yinqiang Zheng
2022Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration.
Rishubh Parihar, Ankit Dhiman, Tejan Karmali, Venkatesh Babu R.
2022Evidential Reasoning for Video Anomaly Detection.
Che Sun, Yunde Jia, Yuwei Wu
2022Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation.
Jinxiang Liu, Chen Ju, Weidi Xie, Ya Zhang
2022Exploring Effective Knowledge Transfer for Few-shot Object Detection.
Zhiyuan Zhao, Qingjie Liu, Yunhong Wang
2022Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection.
Mingjin Zhang, Ke Yue, Jing Zhang, Yunsong Li, Xinbo Gao
2022Exploring High-quality Target Domain Information for Unsupervised Domain Adaptive Semantic Segmentation.
Junjie Li, Zilei Wang, Yuan Gao, Xiaoming Hu
2022Exploring Negatives in Contrastive Learning for Unpaired Image-to-Image Translation.
Yupei Lin, Sen Zhang, Tianshui Chen, Yongyi Lu, Guangping Li, Yukai Shi
2022Exploring Spherical Autoencoder for Spherical Video Content Processing.
Jin Zhou, Na Li, Yao Liu, Shuochao Yao, Songqing Chen
2022Exploring the Effectiveness of Video Perceptual Representation in Blind Video Quality Assessment.
Liang Liao, Kangmin Xu, Haoning Wu, Chaofeng Chen, Wenxiu Sun, Qiong Yan, Weisi Lin
2022Exposure-Consistency Representation Learning for Exposure Correction.
Jie Huang, Man Zhou, Yajing Liu, Mingde Yao, Feng Zhao, Zhiwei Xiong
2022Extreme-scale Talking-Face Video Upsampling with Audio-Visual Priors.
Sindhu B. Hegde, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar
2022FCL-GAN: A Lightweight and Real-Time Baseline for Unsupervised Blind Image Deblurring.
Suiyi Zhao, Zhao Zhang, Richang Hong, Mingliang Xu, Yi Yang, Meng Wang
2022FME '22: 2nd Workshop on Facial Micro-Expression: Advanced Techniques for Multi-Modal Facial Expression Analysis.
Jingting Li, Moi Hoon Yap, Wen-Huang Cheng, John See, Xiaopeng Hong, Xiaobai Li, Su-Jing Wang
2022FMNet: Frequency-Aware Modulation Network for SDR-to-HDR Translation.
Gang Xu, Qibin Hou, Le Zhang, Ming-Ming Cheng
2022Face Anthropometry Aware Audio-visual Age Verification.
Pavel Korshunov, Sébastien Marcel
2022Face Forgery Detection via Symmetric Transformer.
Luchuan Song, Xiaodan Li, Zheng Fang, Zhenchao Jin, Yuefeng Chen, Chenliang Xu
2022Facial Expression Spotting Based on Optical Flow Features.
Jun Yu, Zhongpeng Cai, Zepeng Liu, Guochen Xie, Peng He
2022Factorized and Controllable Neural Re-Rendering of Outdoor Scene for Photo Extrapolation.
Boming Zhao, Bangbang Yang, Zhenyang Li, Zuoyue Li, Guofeng Zhang, Jiashu Zhao, Dawei Yin, Zhaopeng Cui, Hujun Bao
2022Fast Hierarchical Deep Unfolding Network for Image Compressed Sensing.
Wenxue Cui, Shaohui Liu, Debin Zhao
2022FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis.
Yongqi Wang, Zhou Zhao
2022FastPR: One-stage Semantic Person Retrieval via Self-supervised Learning.
Meng Sun, Ju Ren, Xin Wang, Wenwu Zhu, Yaoxue Zhang
2022Feature and Semantic Views Consensus Hashing for Image Set Classification.
Yuan Sun, Dezhong Peng, Haixiao Huang, Zhenwen Ren
2022FedMed-ATL: Misaligned Unpaired Cross-Modality Neuroimage Synthesis via Affine Transform Loss.
Jinbao Wang, Guoyang Xie, Yawen Huang, Yefeng Zheng, Yaochu Jin, Feng Zheng
2022Feeling Without Sharing: A Federated Video Emotion Recognition Framework Via Privacy-Agnostic Hybrid Aggregation.
Fan Qi, Zixin Zhang, Xianshan Yang, Huaiwen Zhang, Changsheng Xu
2022Few-Shot Model Agnostic Federated Learning.
Wenke Huang, Mang Ye, Bo Du, Xiang Gao
2022Few-shot Image Generation Using Discrete Content Representation.
Yan Hong, Li Niu, Jianfu Zhang, Liqing Zhang
2022Few-shot Open-set Recognition Using Background as Unknowns.
Nan Song, Chi Zhang, Guosheng Lin
2022Few-shot X-ray Prohibited Item Detection: A Benchmark and Weak-feature Enhancement Network.
Renshuai Tao, Tianbo Wang, Ziyang Wu, Cong Liu, Aishan Liu, Xianglong Liu
2022Finding the Host from the Lesion by Iteratively Mining the Registration Graph.
Zijie Yang, Lingxi Xie, Xinyue Huo, Sheng Tang, Qi Tian, Yongdong Zhang
2022Fine-Grained Fragment Diffusion for Cross Domain Crowd Counting.
Huilin Zhu, Jingling Yuan, Zhengwei Yang, Xian Zhong, Zheng Wang
2022Fine-grained Action Recognition with Robust Motion Representation Decoupling and Concentration.
Baoli Sun, Xinchen Ye, Tiantian Yan, Zhihui Wang, Haojie Li, Zhiyong Wang
2022Fine-grained Micro-Expression Generation based on Thin-Plate Spline and Relative AU Constraint.
Sirui Zhao, Shukang Yin, Huaying Tang, Rijin Jin, Yifan Xu, Tong Xu, Enhong Chen
2022Fine-tuning with Multi-modal Entity Prompts for News Image Captioning.
Jingjing Zhang, Shancheng Fang, Zhendong Mao, Zhiwei Zhang, Yongdong Zhang
2022Flexible Hybrid Lenses Light Field Super-Resolution using Layered Refinement.
Song Chang, Youfang Lin, Shuo Zhang
2022Forcing the Whole Video as Background: An Adversarial Learning Strategy for Weakly Temporal Action Localization.
Ziqiang Li, Yongxin Ge, Jiaruo Yu, Zhongming Chen
2022Fragrance In Sight: Personalized Perfume Production Based on Style Recognition.
Jiaxiang You, Yinyu Chen, Xiaohui Wang
2022Free-Lunch for Cross-Domain Few-Shot Learning: Style-Aware Episodic Training with Robust Contrastive Learning.
Ji Zhang, Jingkuan Song, Lianli Gao, Hengtao Shen
2022From Abstract to Details: A Generative Multimodal Fusion Framework for Recommendation.
Fangxiong Xiao, Lixi Deng, Jingjing Chen, Houye Ji, Xiaorui Yang, Zhuoye Ding, Bo Long
2022From Token to Word: OCR Token Evolution via Contrastive Learning and Semantic Matching for Text-VQA.
Zan-Xia Jin, Mike Zheng Shou, Fang Zhou, Satoshi Tsutsui, Jingyan Qin, Xu-Cheng Yin
2022GCL: Graph Calibration Loss for Trustworthy Graph Neural Network.
Min Wang, Hao Yang, Qing Cheng
2022GSRFormer: Grounded Situation Recognition Transformer with Alternate Semantic Attention Refinement.
Zhi-Qi Cheng, Qi Dai, Siyao Li, Teruko Mitamura, Alexander Hauptmann
2022GT-MUST: Gated Try-on by Learning the Mannequin-Specific Transformation.
Ning Wang, Jing Zhang, Lefei Zhang, Dacheng Tao
2022Gait Recognition in the Wild with Multi-hop Temporal Switch.
Jinkai Zheng, Xinchen Liu, Xiaoyan Gu, Yaoqi Sun, Chuang Gan, Jiyong Zhang, Wu Liu, Chenggang Yan
2022Gaze- and Spacing-flow Unveil Intentions: Hidden Follower Discovery.
Danni Xu, Ruimin Hu, Zheng Wang, Linbo Luo, Dengshi Li, Wenjun Zeng
2022Generalized Global Ranking-Aware Neural Architecture Ranker for Efficient Image Classifier Search.
Bicheng Guo, Tao Chen, Shibo He, Haoyu Liu, Lilin Xu, Peng Ye, Jiming Chen
2022Generalized Inter-class Loss for Gait Recognition.
Weichen Yu, Hongyuan Yu, Yan Huang, Liang Wang
2022Generating Smooth and Facial-Details-Enhanced Talking Head Video: A Perspective of Pre and Post Processes.
Tian Lv, Yu-Hui Wen, Zhiyao Sun, Zipeng Ye, Yong-Jin Liu
2022Generating Transferable Adversarial Examples against Vision Transformers.
Yuxuan Wang, Jiakai Wang, Zixin Yin, Ruihao Gong, Jingyi Wang, Aishan Liu, Xianglong Liu
2022Generative Steganography Network.
Ping Wei, Sheng Li, Xinpeng Zhang, Ge Luo, Zhenxing Qian, Qing Zhou
2022Generic Image Manipulation Localization through the Lens of Multi-scale Spatial Inconsistence.
Zan Gao, Shenghao Chen, Yangyang Guo, Weili Guan, Jie Nie, Anan Liu
2022Geometric Warping Error Aware CNN for DIBR Oriented View Synthesis.
Shuai Li, Kaixin Wang, Yanbo Gao, Xun Cai, Mao Ye
2022Geometry Aligned Variational Transformer for Image-conditioned Layout Generation.
Yunning Cao, Ye Ma, Min Zhou, Chuanbin Liu, Hongtao Xie, Tiezheng Ge, Yuning Jiang
2022Geometry-Aware Reference Synthesis for Multi-View Image Super-Resolution.
Ri Cheng, Yuqi Sun, Bo Yan, Weimin Tan, Chenxi Ma
2022GetWild: A VR Editing System with AI-Generated 3D Object and Terrain.
Shing Ming Wong, Chien-Wen Chen, Tse-Yu Pan, Hung-Kuo Chu, Min-Chun Hu
2022Global Meets Local: Effective Multi-Label Image Classification via Category-Aware Weak Supervision.
Jiawei Zhan, Jun Liu, Wei Tang, Guannan Jiang, Xi Wang, Bin-Bin Gao, Tianliang Zhang, Wenlong Wu, Wei Zhang, Chengjie Wang, Yuan Xie
2022Global-Local Cross-View Fisher Discrimination for View-Invariant Action Recognition.
Lingling Gao, Yanli Ji, Yang Yang, Heng Tao Shen
2022Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production.
Shengeng Tang, Richang Hong, Dan Guo, Meng Wang
2022Graph Reasoning Transformer for Image Parsing.
Dong Zhang, Jinhui Tang, Kwang-Ting Cheng
2022Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection.
Zehui Chen, Zhenyu Li, Shiquan Zhang, Liangji Fang, Qinhong Jiang, Feng Zhao
2022Graph-based Group Modelling for Backchannel Detection.
Garima Sharma, Kalin Stefanov, Abhinav Dhall, Jianfei Cai
2022Grounding, Meaning and Foundation Models: Adventures in Multimodal Machine Learning.
Douwe Kiela
2022GroupDancer: Music to Multi-People Dance Synthesis with Style Collaboration.
Zixuan Wang, Jia Jia, Haozhe Wu, Junliang Xing, Jinghe Cai, Fanbo Meng, Guowen Chen, Yanfeng Wang
2022Grouped Adaptive Loss Weighting for Person Search.
Yanling Tian, Di Chen, Yunan Liu, Shanshan Zhang, Jian Yang
2022Guess-It-Generator: Generating in a Lewis Signaling Framework through Logical Reasoning.
Arghya Pal, Sailaja Rajanala, Raphael C.-W. Phan, KokSheik Wong
2022HCMA'22: 3rd International Workshop on Human-Centric Multimedia Analysis.
Dingwen Zhang, Chaowei Fang, Wu Liu, Xinchen Liu, Jingkuan Song, Hongyuan Zhu, Wenbing Huang, John Smith
2022HEART: Towards Effective Hash Codes under Label Noise.
Jinan Sun, Haixin Wang, Xiao Luo, Shikun Zhang, Wei Xiang, Chong Chen, Xian-Sheng Hua
2022HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding.
Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Wenqiao Zhang, Jiaxu Miao, Shiliang Pu, Fei Wu
2022HMTN: Hierarchical Multi-scale Transformer Network for 3D Shape Recognition.
Yue Zhao, Weizhi Nie, Zan Gao, Anan Liu
2022Heterogeneous Learning for Scene Graph Generation.
Yunqing He, Tongwei Ren, Jinhui Tang, Gangshan Wu
2022Hierarchical Few-Shot Object Detection: Problem, Benchmark and Method.
Lu Zhang, Yang Wang, Jiaogen Zhou, Chenbo Zhang, Yinglu Zhang, Jihong Guan, Yatao Bian, Shuigeng Zhou
2022Hierarchical Graph Embedded Pose Regularity Learning via Spatio-Temporal Transformer for Abnormal Behavior Detection.
Chao Huang, Yabo Liu, Zheng Zhang, Chengliang Liu, Jie Wen, Yong Xu, Yaowei Wang
2022Hierarchical Hourglass Convolutional Network for Efficient Video Classification.
Yi Tan, Yanbin Hao, Hao Zhang, Shuo Wang, Xiangnan He
2022Hierarchical Scene Normality-Binding Modeling for Anomaly Detection in Surveillance Videos.
Qianyue Bao, Fang Liu, Yang Liu, Licheng Jiao, Xu Liu, Lingling Li
2022Hierarchical Walking Transformer for Object Re-Identification.
Xudong Tian, Jun Liu, Zhizhong Zhang, Chengjie Wang, Yanyun Qu, Yuan Xie, Lizhuang Ma
2022High-Fidelity Variable-Rate Image Compression via Invertible Activation Transformation.
Shilv Cai, Zhijun Zhang, Liqun Chen, Luxin Yan, Sheng Zhong, Xu Zou
2022High-Quality 3D Face Reconstruction with Affine Convolutional Networks.
Zhiqian Lin, Jiangke Lin, Lincheng Li, Yi Yuan, Zhengxia Zou
2022How Much Attention Should we Pay to Mosquitoes?
Moreno La Quatra, Lorenzo Vaiani, Alkis Koudounas, Luca Cagliero, Paolo Garza, Elena Baralis
2022HyP
Chengyin Xu, Zenghao Chai, Zhengzhuo Xu, Chun Yuan, Yanbo Fan, Jue Wang
2022Hybrid Conditional Deep Inverse Tone Mapping.
Tong Shao, Deming Zhai, Junjun Jiang, Xianming Liu
2022Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression.
Jiahao Li, Bin Li, Yan Lu
2022ICNet: Joint Alignment and Reconstruction via Iterative Collaboration for Video Super-Resolution.
Jiaxu Leng, Jia Wang, Xinbo Gao, Bo Hu, Ji Gan, Chenqiang Gao
2022IDEA: Increasing Text Diversity via Online Multi-Label Recognition for Vision-Language Pre-training.
Xinyu Huang, Youcai Zhang, Ying Cheng, Weiwei Tian, Ruiwei Zhao, Rui Feng, Yuejie Zhang, Yaqian Li, Yandong Guo, Xiaobo Zhang
2022IDEAL: High-Order-Ensemble Adaptation Network for Learning with Noisy Labels.
Peng-Fei Zhang, Zi Huang, Guangdong Bai, Xin-Shun Xu
2022IMuR 2022: Introduction to the 2nd Workshop on Interactive Multimedia Retrieval.
Luca Rossetto, Werner Bailer, Jakub Lokoc, Klaus Schoeffmann
2022IVT: An End-to-End Instance-guided Video Transformer for 3D Pose Estimation.
Zhongwei Qiu, Qiansheng Yang, Jian Wang, Dongmei Fu
2022IXR '22: 1st Workshop on Interactive eXtended Reality.
Irene Viola, Hadi Amirpour, Maria Torres Vega
2022Im2Oil: Stroke-Based Oil Painting Rendering with Linearly Controllable Fineness Via Adaptive Sampling.
Zhengyan Tong, Xiaohang Wang, Shengchao Yuan, Xuanhong Chen, Junjie Wang, Xiangzhong Fang
2022Image Generation Network for Covert Transmission in Online Social Network.
Zhengxin You, Qichao Ying, Sheng Li, Zhenxing Qian, Xinpeng Zhang
2022Image Inpainting Detection via Enriched Attentive Pattern with Near Original Image Augmentation.
Wenhan Yang, Rizhao Cai, Alex C. Kot
2022Image Quality Assessment: From Mean Opinion Score to Opinion Score Distribution.
Yixuan Gao, Xiongkuo Min, Yucheng Zhu, Jing Li, Xiao-Ping Zhang, Guangtao Zhai
2022Image Understanding by Captioning with Differentiable Architecture Search.
Ramtin Hosseini, Pengtao Xie
2022Image-Signal Correlation Network for Textile Fiber Identification.
Bo Peng, Liren He, Yining Qiu, Dong Wu, Mingmin Chi
2022Image-Text Matching with Fine-Grained Relational Dependency and Bidirectional Attention-Based Generative Networks.
Jianwei Zhu, Zhixin Li, Yufei Zeng, Jiahui Wei, Huifang Ma
2022Imitated Detectors: Stealing Knowledge of Black-box Object Detectors.
Siyuan Liang, Aishan Liu, Jiawei Liang, Longkang Li, Yang Bai, Xiaochun Cao
2022Immunofluorescence Capillary Imaging Segmentation: Cases Study.
Runpeng Hou, Ziyuan Ye, Chengyu Yang, Linhao Fu, Chao Liu, Quanying Liu
2022Improved Deep Unsupervised Hashing via Prototypical Learning.
Zeyu Ma, Wei Ju, Xiao Luo, Chong Chen, Xian-Sheng Hua, Guangming Lu
2022Improving Fusion of Region Features and Grid Features via Two-Step Interaction for Image-Text Retrieval.
Dongqing Wu, Huihui Li, Cang Gu, Lei Guo, Hang Liu
2022Improving Generalization for Neural Adaptive Video Streaming via Meta Reinforcement Learning.
Nuowen Kan, Yuankun Jiang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong
2022Improving Meeting Inclusiveness using Speech Interruption Analysis.
Szu-Wei Fu, Yaran Fan, Yasaman Hosseinkashi, Jayant Gupchup, Ross Cutler
2022Improving Scalability, Sustainability and Availability via Workload Distribution in Edge-Cloud Gaming.
Iryanto Jaya, Yusen Li, Wentong Cai
2022Improving Transferability for Domain Adaptive Detection Transformers.
Kaixiong Gong, Shuang Li, Shugang Li, Rui Zhang, Chi Harold Liu, Qiang Chen
2022In-N-Out Generative Learning for Dense Unsupervised Video Segmentation.
Xiao Pan, Peike Li, Zongxin Yang, Huiling Zhou, Chang Zhou, Hongxia Yang, Jingren Zhou, Yi Yang
2022InDiD: Instant Disorder Detection via a Principled Neural Network.
Evgenia Romanenkova, Alexander Stepikin, Matvey Morozov, Alexey Zaytsev
2022Incremental Few-Shot Semantic Segmentation via Embedding Adaptive-Update and Hyper-class Representation.
Guangchen Shi, Yirui Wu, Jun Liu, Shaohua Wan, Wenhai Wang, Tong Lu
2022Inferential Visual Question Generation.
Chao Bi, Shuhui Wang, Zhe Xue, Shengbo Chen, Qingming Huang
2022Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks.
Jingbei Li, Yi Meng, Xixin Wu, Zhiyong Wu, Jia Jia, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang
2022Integrating Object-aware and Interaction-aware Knowledge for Weakly Supervised Scene Graph Generation.
Xingchen Li, Long Chen, Wenbo Ma, Yi Yang, Jun Xiao
2022Interact with Open Scenes: A Life-long Evolution Framework for Interactive Segmentation Models.
Ruitong Gan, Junsong Fan, Yuxi Wang, Zhaoxiang Zhang
2022Interaction with Immersive Cultural Heritage Environments: Using XR Technologies to Represent Multiple Perspectives on Serralves Museum.
Manuel Silva
2022Interactive Video Corpus Moment Retrieval using Reinforcement Learning.
Zhixin Ma, Chong-Wah Ngo
2022Interpretable Melody Generation from Lyrics with Discrete-Valued Adversarial Training.
Wei Duan, Zhe Zhang, Yi Yu, Keizo Oyama
2022Invariant Representation Learning for Multimedia Recommendation.
Xiaoyu Du, Zike Wu, Fuli Feng, Xiangnan He, Jinhui Tang
2022JPEG Compression-aware Image Forgery Localization.
Menglu Wang, Xueyang Fu, Jiawei Liu, Zheng-Jun Zha
2022Joint Learning Content and Degradation Aware Feature for Blind Super-Resolution.
Yifeng Zhou, Chuming Lin, Donghao Luo, Yong Liu, Ying Tai, Chengjie Wang, Mingang Chen
2022Keypoint-Guided Modality-Invariant Discriminative Learning for Visible-Infrared Person Re-identification.
Tengfei Liang, Yi Jin, Wu Liu, Songhe Feng, Tao Wang, Yidong Li
2022Keyword Spotting in the Homomorphic Encrypted Domain Using Deep Complex-Valued CNN.
Peijia Zheng, Zhiwei Cai, Huicong Zeng, Jiwu Huang
2022KnifeCut: Refining Thin Part Segmentation with Cutting Lines.
Zheng Lin, Zheng-Peng Duan, Zhao Zhang, Chun-Le Guo, Ming-Ming Cheng
2022Knowledge Guided Representation Disentanglement for Face Recognition from Low Illumination Images.
Xiangyu Miao, Shangfei Wang
2022LFBCNet: Light Field Boundary-aware and Cascaded Interaction Network for Salient Object Detection.
Mianzhao Wang, Fan Shi, Xu Cheng, Meng Zhao, Yao Zhang, Chen Jia, Weiwei Tian, Shengyong Chen
2022LS-GAN: Iterative Language-based Image Manipulation via Long and Short Term Consistency Reasoning.
Gaoxiang Cong, Liang Li, Zhenhuan Liu, Yunbin Tu, Weijun Qin, Shenyuan Zhang, Chengang Yan, Wenyu Wang, Bin Jiang
2022LVI-ExC: A Target-free LiDAR-Visual-Inertial Extrinsic Calibration Framework.
Zhong Wang, Lin Zhang, Ying Shen, Yicong Zhou
2022Label-Efficient Domain Generalization via Collaborative Exploration and Generalization.
Junkun Yuan, Xu Ma, Defang Chen, Kun Kuang, Fei Wu, Lanfen Lin
2022Layout-Aware Information Extraction for Document-Grounded Dialogue: Dataset, Method and Demonstration.
Zhenyu Zhang, Bowen Yu, Haiyang Yu, Tingwen Liu, Cheng Fu, Jingyang Li, Chengguang Tang, Jian Sun, Yongbin Li
2022LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking.
Yupan Huang, Tengchao Lv, Lei Cui, Yutong Lu, Furu Wei
2022Learn to Understand Negation in Video Retrieval.
Ziyue Wang, Aozhu Chen, Fan Hu, Xirong Li
2022Learnability Enhancement for Low-light Raw Denoising: Where Paired Real Data Meets Noise Modeling.
Hansen Feng, Lizhi Wang, Yuzhi Wang, Hua Huang
2022Learnable Privacy-Preserving Anonymization for Pedestrian Images.
Junwu Zhang, Mang Ye, Yao Yang
2022Learned Internet Congestion Control for Short Video Uploading.
Tianchi Huang, Chao Zhou, Lianchen Jia, Rui-Xiao Zhang, Lifeng Sun
2022Learning Action-guided Spatio-temporal Transformer for Group Activity Recognition.
Wei Li, Tianzhao Yang, Xiao Wu, Xian-Jun Du, Jian-Jun Qiao
2022Learning Cross-Image Object Semantic Relation in Transformer for Few-Shot Fine-Grained Image Classification.
Bo Zhang, Jiakang Yuan, Baopu Li, Tao Chen, Jiayuan Fan, Botian Shi
2022Learning Dual Convolutional Dictionaries for Image De-raining.
Chengjie Ge, Xueyang Fu, Zheng-Jun Zha
2022Learning Dynamic Prior Knowledge for Text-to-Face Pixel Synthesis.
Jun Peng, Xiaoxiong Du, Yiyi Zhou, Jing He, Yunhang Shen, Xiaoshuai Sun, Rongrong Ji
2022Learning Generalizable Latent Representations for Novel Degradations in Super-Resolution.
Fengjun Li, Xin Feng, Fanglin Chen, Guangming Lu, Wenjie Pei
2022Learning Granularity-Unified Representations for Text-to-Image Person Re-identification.
Zhiyin Shao, Xinyu Zhang, Meng Fang, Zhifeng Lin, Jian Wang, Changxing Ding
2022Learning Hierarchical Dynamics with Spatial Adjacency for Image Enhancement.
Yudong Liang, Bin Wang, Wenqi Ren, Jiaying Liu, Wenjian Wang, Wangmeng Zuo
2022Learning Hybrid Behavior Patterns for Multimedia Recommendation.
Zongshen Mu, Yueting Zhuang, Jie Tan, Jun Xiao, Siliang Tang
2022Learning Interest-oriented Universal User Representation via Self-supervision.
Qinghui Sun, Jie Gu, Xiaoxiao Xu, Renjun Xu, Ke Liu, Bei Yang, Hong Liu, Huan Xu
2022Learning Intrinsic and Extrinsic Intentions for Cold-start Recommendation with Neural Stochastic Processes.
Huafeng Liu, Liping Jing, Dahai Yu, Mingjie Zhou, Michael Ng
2022Learning Modality-Specific and -Agnostic Representations for Asynchronous Multimodal Language Sequences.
Dingkang Yang, Haopeng Kuang, Shuai Huang, Lihua Zhang
2022Learning Occlusion-aware Coarse-to-Fine Depth Map for Self-supervised Monocular Depth Estimation.
Zhengming Zhou, Qiulei Dong
2022Learning Parallax Transformer Network for Stereo Image JPEG Artifacts Removal.
Xuhao Jiang, Weimin Tan, Ri Cheng, Shili Zhou, Bo Yan
2022Learning Projection Views for Sparse-View CT Reconstruction.
Liutao Yang, Rongjun Ge, Shichang Feng, Daoqiang Zhang
2022Learning Smooth Representation for Multi-view Subspace Clustering.
Shudong Huang, Yixi Liu, Yazhou Ren, Ivor W. Tsang, Zenglin Xu, Jiancheng Lv
2022Learning Visible Surface Area Estimation for Irregular Objects.
Xu Liu, Jianing Li, Xianqi Zhang, Jingyuan Sun, Xiaopeng Fan, Yonghong Tian
2022Learning a Dynamic Cross-Modal Network for Multispectral Pedestrian Detection.
Jin Xie, Rao Muhammad Anwer, Hisham Cholakkal, Jing Nie, Jiale Cao, Jorma Laaksonen, Fahad Shahbaz Khan
2022Learning an Inference-accelerated Network from a Pre-trained Model with Frequency-enhanced Feature Distillation.
Xuesong Niu, Jili Gu, Guoxin Zhang, Pengfei Wan, Zhongyuan Wang
2022Learning for Motion Deblurring with Hybrid Frames and Events.
Wen Yang, Jinjian Wu, Jupo Ma, Leida Li, Weisheng Dong, Guangming Shi
2022Learning from Different text-image Pairs: A Relation-enhanced Graph Convolutional Network for Multimodal NER.
Fei Zhao, Chunhui Li, Zhen Wu, Shangyu Xing, Xinyu Dai
2022Learning from Label Relationships in Human Affect.
Niki Maria Foteinopoulou, Ioannis Patras
2022Learning to Estimate External Forces of Human Motion in Video.
Nathan Louis, Jason J. Corso, Tylan N. Templin, Travis D. Eliason, Daniel P. Nicolella
2022Learning to Retrieve Videos by Asking Questions.
Avinash Madasu, Junier Oliva, Gedas Bertasius
2022Learning-Based Video Coding with Joint Deep Compression and Enhancement.
Tiesong Zhao, Weize Feng, Hongji Zeng, Yiwen Xu, Yuzhen Niu, Jiaying Liu
2022Less is More: Consistent Video Depth Estimation with Masked Frames Modeling.
Yiran Wang, Zhiyu Pan, Xingyi Li, Zhiguo Cao, Ke Xian, Jianming Zhang
2022Leveraging GAN Priors for Few-Shot Part Segmentation.
Mengya Han, Heliang Zheng, Chaoyue Wang, Yong Luo, Han Hu, Bo Du
2022Leveraging Text Representation and Face-head Tracking for Long-form Multimodal Semantic Relation Understanding.
Raksha Ramesh, Vishal Anand, Zifan Chen, Yifei Dong, Yun Chen, Ching-Yung Lin
2022Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild.
Sindhu B. Hegde, K. R. Prajwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar
2022Long-Term Person Re-identification with Dramatic Appearance Change: Algorithm and Benchmark.
Mengmeng Liu, Zhi Ma, Tao Li, Yanfeng Jiang, Kai Wang
2022Long-term Leap Attention, Short-term Periodic Shift for Video Classification.
Hao Zhang, Lechao Cheng, Yanbin Hao, Chong-Wah Ngo
2022Look Before You Leap: Improving Text-based Person Retrieval by Learning A Consistent Cross-modal Common Manifold.
Zijie Wang, Aichun Zhu, Jingyi Xue, Xili Wan, Chao Liu, Tian Wang, Yifeng Li
2022Look Less Think More: Rethinking Compositional Action Recognition.
Rui Yan, Peng Huang, Xiangbo Shu, Junhao Zhang, Yonghua Pan, Jinhui Tang
2022Low Latency Live Streaming Implementation in DASH and HLS.
Abdelhak Bentaleb, Zhengdao Zhan, Farzad Tashtarian, May Lim, Saad Harous, Christian Timmerer, Hermann Hellwagner, Roger Zimmermann
2022M4MM '22: 1st International Workshop on Methodologies for Multimedia.
Xavier Alameda-Pineda, Qin Jin, Vincent Oria, Laura Toni
2022MADiMa'22: 7th International Workshop on Multimedia Assisted Dietary Management.
Stavroula G. Mougiakakou, Giovanni Maria Farinella, Keiji Yanai, Dario Allegra
2022MAFW: A Large-scale, Multi-modal, Compound Affective Database for Dynamic Facial Expression Recognition in the Wild.
Yuanyuan Liu, Wei Dai, Chuanxu Feng, Wenbin Wang, Guanghao Yin, Jiabei Zeng, Shiguang Shan
2022MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.
Xiaodong Chen, Wu Liu, Xinchen Liu, Yongdong Zhang, Jungong Han, Tao Mei
2022MAVT-FG: Multimodal Audio-Visual Transformer for Weakly-supervised Fine-Grained Recognition.
Xiaoyu Zhou, Xiaotong Song, Hao Wu, Jingran Zhang, Xing Xu
2022MC-SLT: Towards Low-Resource Signer-Adaptive Sign Language Translation.
Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng
2022MCFR'22: 1st Workshop on Multimedia Computing towards Fashion Recommendation.
Xuemeng Song, Jingjing Chen, Federico Becattini, Weili Guan, Yibing Zhan, Tat-Seng Chua
2022ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning.
Yuqian Fu, Yu Xie, Yanwei Fu, Jingjing Chen, Yu-Gang Jiang
2022MEGC2022: ACM Multimedia 2022 Micro-Expression Grand Challenge.
Jingting Li, Moi Hoon Yap, Wen-Huang Cheng, John See, Xiaopeng Hong, Xiaobai Li, Su-Jing Wang, Adrian K. Davison, Yante Li, Zizhao Dong
2022MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes.
Anton Ratnarajah, Zhenyu Tang, Rohith Aralikatti, Dinesh Manocha
2022MF-Net: A Novel Few-shot Stylized Multilingual Font Generation Method.
Yufan Zhang, Junkai Man, Peng Sun
2022MIntRec: A New Dataset for Multimodal Intent Recognition.
Hanlei Zhang, Hua Xu, Xin Wang, Qianrui Zhou, Shaojie Zhao, Jiayan Teng
2022MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10 - 14, 2022.
João Magalhães, Alberto Del Bimbo, Shin'ichi Satoh, Nicu Sebe, Xavier Alameda-Pineda, Qin Jin, Vincent Oria, Laura Toni
2022MM-ALT: A Multimodal Automatic Lyric Transcription System.
Xiangming Gu, Longshen Ou, Danielle Ong, Ye Wang
2022MM-Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing.
Jiashuo Yu, Ying Cheng, Rui-Wei Zhao, Rui Feng, Yuejie Zhang
2022MMDV: Interpreting DNNs via Building Evaluation Metrics, Manual Manipulation and Decision Visualization.
Keyang Cheng, Yu Si, Hao Zhou, Rabia Tahir
2022MMH-index: Enhancing Apache Lucene with High-Performance Multi-Modal Indexing and Searching.
Ruicheng Liu, Jialing Liang, Peiquan Jin, Yi Wang
2022MMRotate: A Rotated Object Detection Benchmark using PyTorch.
Yue Zhou, Xue Yang, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen
2022MMSports'22: 5th International ACM Workshop on Multimedia Content Analysis in Sports.
Hideo Saito, Thomas B. Moeslund, Rainer Lienhart
2022MMT: Image-guided Story Ending Generation with Multimodal Memory Transformer.
Dizhan Xue, Shengsheng Qian, Quan Fang, Changsheng Xu
2022MONOPOLY: Financial Prediction from MONetary POLicY Conference Videos Using Multimodal Cues.
Puneet Mathur, Atula Tejaswi Neerkaje, Malika Chhibber, Ramit Sawhney, Fuming Guo, Franck Dernoncourt, Sanghamitra Dutta, Dinesh Manocha
2022MVLayoutNet: 3D Layout Reconstruction with Multi-view Panoramas.
Zhihua Hu, Bo Duan, Yanfeng Zhang, Mingwei Sun, Jingwei Huang
2022MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning.
Zejun Li, Zhihao Fan, Huaixiao Tou, Jingjing Chen, Zhongyu Wei, Xuanjing Huang
2022MVSPlenOctree: Fast and Generic Reconstruction of Radiance Fields in PlenOctree from Multi-view Stereo.
Wenpeng Xing, Jie Chen
2022MaMiCo: Macro-to-Micro Semantic Correspondence for Self-supervised Video Representation Learning.
Bo Fang, Wenhao Wu, Chang Liu, Yu Zhou, Dongliang He, Weiping Wang
2022Machine Unlearning for Image Retrieval: A Generative Scrubbing Approach.
Peng-Fei Zhang, Guangdong Bai, Zi Huang, Xin-Shun Xu
2022Magic ELF: Image Deraining Meets Association Learning and Transformer.
Kui Jiang, Zhongyuan Wang, Chen Chen, Zheng Wang, Laizhong Cui, Chia-Wen Lin
2022Making The Best of Both Worlds: A Domain-Oriented Transformer for Unsupervised Domain Adaptation.
Wenxuan Ma, Jinming Zhang, Shuang Li, Chi Harold Liu, Yulin Wang, Wei Li
2022Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild.
Jiaxin Zhang, Canjie Luo, Lianwen Jin, Fengjun Guo, Kai Ding
2022Masked Modeling-based Audio Representation for ACM Multimedia 2022 Computational Paralinguistics ChallengE.
Kang You, Kele Xu, Boqing Zhu, Ming Feng, Dawei Feng, Bo Liu, Tian Gao, Bo Ding
2022Maze: A Cost-Efficient Video Deduplication System at Web-scale.
An Qin, Mengbai Xiao, Ben Huang, Xiaodong Zhang
2022Mediascape XR: A Cultural Heritage Experience in Social VR.
Ignacio Reimat, Yanni Mei, Evangelos Alexiou, Jack Jansen, Jie Li, Shishir Subramanyam, Irene Viola, Johan Oomen, Pablo César
2022Meditation in Motion: Interactive Media Art Visualization Based on Ancient Tai Chi Chuan.
Ze Gao, Anqi Wang, Pan Hui, Tristan Braud
2022MegaPortraits: One-shot Megapixel Neural Head Avatars.
Nikita Drobyshev, Jenya Chelishev, Taras Khakhulin, Aleksei Ivakhnenko, Victor Lempitsky, Egor Zakharov
2022Memory Networks.
Federico Becattini, Tiberio Uricchio
2022Meta Clustering Learning for Large-scale Unsupervised Person Re-identification.
Xin Jin, Tianyu He, Xu Shen, Tongliang Liu, Xinchao Wang, Jianqiang Huang, Zhibo Chen, Xian-Sheng Hua
2022Meta Reconciliation Normalization for Lifelong Person Re-Identification.
Nan Pu, Yu Liu, Wei Chen, Erwin M. Bakker, Michael S. Lew
2022Micro Expression Generation with Thin-plate Spline Motion Model and Face Parsing.
Jun Yu, Guochen Xie, Zhongpeng Cai, Peng He, Fang Gao, Qiang Ling
2022Micro-video Tagging via Jointly Modeling Social Influence and Tag Relation.
Xiao Wang, Tian Gan, Yinwei Wei, Jianlong Wu, Dai Meng, Liqiang Nie
2022MimCo: Masked Image Modeling Pre-training with Contrastive Teacher.
Qiang Zhou, Chaohui Yu, Hao Luo, Zhibin Wang, Hao Li
2022Mimicking the Annotation Process for Recognizing the Micro Expressions.
Bo-Kai Ruan, Ling Lo, Hong-Han Shuai, Wen-Huang Cheng
2022Mix-DANN and Dynamic-Modal-Distillation for Video Domain Adaptation.
Yuehao Yin, Bin Zhu, Jingjing Chen, Lechao Cheng, Yu-Gang Jiang
2022Mixed Supervision for Instance Learning in Object Detection with Few-shot Annotation.
Yi Zhong, Chengyao Wang, Shiyong Li, Zhu Zhou, Yaowei Wang, Wei-Shi Zheng
2022MoZuMa: A Model Zoo for Multimedia Applications.
Stéphane Massonnet, Marco Romanelli, Rémi Lebret, Niels Poulsen, Karl Aberer
2022Modality Eigen-Encodings Are Keys to Open Modality Informative Containers.
Yiyuan Zhang, Yuqi Ji
2022Modality-aware Contrastive Instance Learning with Self-Distillation for Weakly-Supervised Audio-Visual Violence Detection.
Jiashuo Yu, Jinyu Liu, Ying Cheng, Rui Feng, Yuejie Zhang
2022Model-Guided Multi-Contrast Deep Unfolding Network for MRI Super-resolution Reconstruction.
Gang Yang, Li Zhang, Man Zhou, Aiping Liu, Xun Chen, Zhiwei Xiong, Feng Wu
2022More is better: Multi-source Dynamic Parsing Attention for Occluded Person Re-identification.
Xinhua Cheng, Mengxi Jia, Qian Wang, Jian Zhang
2022MuSe 2022 Challenge: Multimodal Humour, Emotional Reactions, and Stress.
Shahin Amiriparian, Lukas Christ, Andreas König, Eva-Maria Meßner, Alan Cowen, Erik Cambria, Björn W. Schuller
2022Multi-Attention Network for Compressed Video Referring Object Segmentation.
Weidong Chen, Dexiang Hong, Yuankai Qi, Zhenjun Han, Shuhui Wang, Laiyun Qing, Qingming Huang, Guorong Li
2022Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation.
Jialei Xu, Xianming Liu, Yuanchao Bai, Junjun Jiang, Kaixuan Wang, Xiaozhi Chen, Xiangyang Ji
2022Multi-Granular Semantic Mining for Weakly Supervised Semantic Segmentation.
Meijie Zhang, Jianwu Li, Tianfei Zhou
2022Multi-Level Region Matching for Fine-Grained Sketch-Based Image Retrieval.
Zhixin Ling, Zhen Xing, Jiangtong Li, Li Niu
2022Multi-Level Spatiotemporal Network for Video Summarization.
Ming Yao, Yu Bai, Wei Du, Xuejun Zhang, Heng Quan, Fuli Cai, Hongwei Kang
2022Multi-Modal Experience Inspired AI Creation.
Qian Cao, Xu Chen, Ruihua Song, Hao Jiang, Guang Yang, Zhao Cao
2022Multi-Mode Interactive Image Segmentation.
Zheng Lin, Zhao Zhang, Linghao Han, Shao-Ping Lu
2022Multi-Scale Coarse-to-Fine Transformer for Frame Interpolation.
Chen Li, Li Song, Xueyi Zou, Jiaming Guo, Youliang Yan, Wenjun Zhang
2022Multi-directional Knowledge Transfer for Few-Shot Learning.
Shuo Wang, Xinyu Zhang, Yanbin Hao, Chengbing Wang, Xiangnan He
2022Multi-modal Learning Algorithms and Network Architectures for Information Extraction and Retrieval.
Maurits J. R. Bleeker
2022Multi-view Gait Video Synthesis.
Weilai Xiang, Hongyu Yang, Di Huang, Yunhong Wang
2022Multi-view Layout Design for VR Concert Experience.
Minju Kim, Yuhyun Lee, Jungjin Lee
2022MultiMediate'22: Backchannel Detection and Agreement Estimation in Group Interactions.
Philipp Müller, Michael Dietz, Dominik Schiller, Dominike Thomas, Hali Lindsay, Patrick Gebhard, Elisabeth André, Andreas Bulling
2022Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification.
Zan Gao, Hongwei Wei, Weili Guan, Weizhi Nie, Meng Liu, Meng Wang
2022Multimedia Content Understanding in Harsh Environments.
Zheng Wang, Dan Xu, Zhedong Zheng, Kui Jiang
2022Multimedia Event Extraction From News With a Unified Contrastive Learning Framework.
Jian Liu, Yufeng Chen, Jinan Xu
2022Multimodal Analysis for Deep Video Understanding with Video Language Transformer.
Beibei Zhang, Yaqun Fang, Tongwei Ren, Gangshan Wu
2022Multimodal Hate Speech Detection via Cross-Domain Knowledge Transfer.
Chuanpeng Yang, Fuqing Zhu, Guihua Liu, Jizhong Han, Songlin Hu
2022Multimodal In-bed Pose and Shape Estimation under the Blankets.
Yu Yin, Joseph P. Robinson, Yun Fu
2022Multiple Kernel Clustering with Dual Noise Minimization.
Junpu Zhang, Liang Li, Siwei Wang, Jiyuan Liu, Yue Liu, Xinwang Liu, En Zhu
2022Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization.
Xiaochen Cai, Hengxing Cai, Boqing Zhu, Kele Xu, Weiwei Tu, Dawei Feng
2022Multiview Contrastive Learning for Completely Blind Video Quality Assessment of User Generated Content.
Shankhanil Mitra, Rajiv Soundararajan
2022Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation.
Juze Zhang, Jingya Wang, Ye Shi, Fei Gao, Lan Xu, Jingyi Yu
2022NarSUM '22: 1st Workshop on User-centric Narrative Summarization of Long Videos.
Mohan S. Kankanhalli, Jianquan Liu, Yongkang Wong, Karen Stephen, Rishabh Sheoran, Anusha Bhamidipati
2022NeRF-SR: High Quality Neural Radiance Fields using Supersampling.
Chen Wang, Xian Wu, Yuan-Chen Guo, Song-Hai Zhang, Yu-Wing Tai, Shi-Min Hu
2022Neighbor Correspondence Matching for Flow-based Video Frame Synthesis.
Zhaoyang Jia, Yan Lu, Houqiang Li
2022Neural Network Model Protection with Piracy Identification and Tampering Localization Capability.
Cheng Xiong, Guorui Feng, Xinran Li, Xinpeng Zhang, Chuan Qin
2022No-Reference Image Quality Assessment Using Dynamic Complex-Valued Neural Model.
Zihan Zhou, Yong Xu, Ruotao Xu, Yuhui Quan
2022No-reference Omnidirectional Image Quality Assessment Based on Joint Network.
Chaofan Zhang, Shiguang Liu
2022Non-Autoregressive Cross-Modal Coherence Modelling.
Yi Bin, Wenhao Shi, Jipeng Zhang, Yujuan Ding, Yang Yang, Heng Tao Shen
2022Normalization-based Feature Selection and Restitution for Pan-sharpening.
Man Zhou, Jie Huang, Keyu Yan, Gang Yang, Aiping Liu, Chongyi Li, Feng Zhao
2022Not All Pixels Are Matched: Dense Contrastive Learning for Cross-Modality Person Re-Identification.
Hanzhe Sun, Jun Liu, Zhizhong Zhang, Chengjie Wang, Yanyun Qu, Yuan Xie, Lizhuang Ma
2022OCR-Pose: Occlusion-aware Contrastive Representation for Unsupervised 3D Human Pose Estimation.
Junjie Wang, Zhenbo Yu, Zhengyan Tong, Hang Wang, Jinxian Liu, Wenjun Zhang, Xiaoyan Wu
2022OISSR: Optical Image Stabilization Based Super Resolution on Smartphone Cameras.
Hao Pan, Feitong Tan, Wenhao Li, Yi-Chao Chen, Guangtao Xue
2022OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification.
Ye Liu, Lingfeng Qiao, Di Yin, Zhuoxuan Jiang, Xinghua Jiang, Deqiang Jiang, Bo Ren
2022On Generating Identifiable Virtual Faces.
Zhuowen Yuan, Zhengxin You, Sheng Li, Zhenxing Qian, Xinpeng Zhang, Alex C. Kot
2022On Leveraging Variational Graph Embeddings for Open World Compositional Zero-Shot Learning.
Muhammad Umer Anwaar, Zhihui Pan, Martin Kleinsteuber
2022One-step Low-Rank Representation for Clustering.
Zhiqiang Fu, Yao Zhao, Dongxia Chang, Yiming Wang, Jie Wen, Xingxing Zhang, Guodong Guo
2022Online Deep Learning from Doubly-Streaming Data.
Heng Lian, John Scovil Atwood, Bojian Hou, Jian Wu, Yi He
2022Open Challenges of Interactive Video Search and Evaluation.
Jakub Lokoc, Klaus Schoeffmann, Werner Bailer, Luca Rossetto, Björn Þór Jónsson
2022OpenHardwareVC: An Open Source Library for 8K UHD Video Coding Hardware Implementation.
Wei Gao, Hang Yuan, Yang Guo, Lvfang Tao, Zhanyuan Cai, Ge Li
2022OpenPointCloud: An Open-Source Algorithm Library of Deep Learning Based Point Cloud Compression.
Wei Gao, Hua Ye, Ge Li, Huiming Zheng, Yuyang Wu, Liang Xie
2022Opportunistic Backdoor Attacks: Exploring Human-imperceptible Vulnerabilities on Speech Recognition Systems.
Qiang Liu, Tongqing Zhou, Zhiping Cai, Yonghao Tang
2022Order-aware Human Interaction Manipulation.
Mandi Luo, Jie Cao, Ran He
2022Ordered Attention for Coherent Visual Storytelling.
Tom Braude, Idan Schwartz, Alexander G. Schwing, Ariel Shamir
2022Overview of the Multimedia Grand Challenges 2022.
Miriam Redi, Georges Quénot
2022PC
Chen Long, Wenxiao Zhang, Ruihui Li, Hao Wang, Zhen Dong, Bisheng Yang
2022PC-Dance: Posture-controllable Music-driven Dance Synthesis.
Jibin Gao, Junfu Pu, Honglun Zhang, Ying Shan, Wei-Shi Zheng
2022PDAS: Probability-Driven Adaptive Streaming for Short Video.
Chao Zhou, Yixuan Ban, Yangchao Zhao, Liang Guo, Bing Yu
2022PDD-GAN: Prior-based GAN Network with Decoupling Ability for Single Image Dehazing.
Xiaoxuan Chai, Junchi Zhou, Hang Zhou, Jui-Hsin Lai
2022PIA: Parallel Architecture with Illumination Allocator for Joint Enhancement and Detection in Low-Light.
Tengyu Ma, Long Ma, Xin Fan, Zhongxuan Luo, Risheng Liu
2022PIC'22: 4th Person in Context Workshop.
Si Liu, Qin Jin, Luoqi Liu, Zongheng Tang, Linli Lin
2022PIES-ME '22: 1st Workshop on Photorealistic Image and Environment Synthesis for Multimedia Experiments.
Ravi Prakash, Mylène C. Q. Farias, Marcelo M. Carvalho, Ryan P. McMahan
2022PIMoG: An Effective Screen-shooting Noise-Layer Simulation for Deep-Learning-Based Watermarking Network.
Han Fang, Zhaoyang Jia, Zehua Ma, Ee-Chien Chang, Weiming Zhang
2022PPMN: Pixel-Phrase Matching Network for One-Stage Panoptic Narrative Grounding.
Zihan Ding, Zi-han Ding, Tianrui Hui, Junshi Huang, Xiaoming Wei, Xiaolin Wei, Si Liu
2022PRO-Face: A Generic Framework for Privacy-preserving Recognizable Obfuscation of Face Images.
Lin Yuan, Linguo Liu, Xiao Pu, Zhao Li, Hongbo Li, Xinbo Gao
2022PVSeRF: Joint Pixel-, Voxel- and Surface-Aligned Radiance Field for Single-Image Novel View Synthesis.
Xianggang Yu, Jiapeng Tang, Yipeng Qin, Chenghong Li, Xiaoguang Han, Linchao Bao, Shuguang Cui
2022PYSKL: Towards Good Practices for Skeleton Action Recognition.
Haodong Duan, Jiaqi Wang, Kai Chen, Dahua Lin
2022PaCL: Part-level Contrastive Learning for Fine-grained Few-shot Image Classification.
Chuanming Wang, Huiyuan Fu, Huadong Ma
2022Paint and Distill: Boosting 3D Object Detection with Semantic Passing Network.
Bo Ju, Zhikang Zou, Xiaoqing Ye, Minyue Jiang, Xiao Tan, Errui Ding, Jingdong Wang
2022Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval.
Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao
2022Parameterization of Cross-token Relations with Relative Positional Encoding for Vision MLP.
Zhicai Wang, Yanbin Hao, Xingyu Gao, Hao Zhang, Shuo Wang, Tingting Mu, Xiangnan He
2022ParseMVS: Learning Primitive-aware Surface Representations for Sparse Multi-view Stereopsis.
Haiyang Ying, Jinzhi Zhang, Yuzhe Chen, Zheng Cao, Jing Xiao, Ruqi Huang, Lu Fang
2022Partially Relevant Video Retrieval.
Jianfeng Dong, Xianke Chen, Minsong Zhang, Xun Yang, Shujie Chen, Xirong Li, Xun Wang
2022PassWalk: Spatial Authentication Leveraging Lateral Shift and Gaze on Mobile Headsets.
Abhishek Kumar, Lik-Hang Lee, Jagmohan Chauhan, Xiang Su, Mohammad Ashraful Hoque, Susanna Pirttikangas, Sasu Tarkoma, Pan Hui
2022Patch-based Knowledge Distillation for Lifelong Person Re-Identification.
Zhicheng Sun, Yadong Mu
2022Pay Attention to Your Positive Pairs: Positive Pair Aware Contrastive Knowledge Distillation.
Zhipeng Yu, Qianqian Xu, Yangbangyan Jiang, Haoyu Qin, Qingming Huang
2022Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer.
Ailin Huang, Zhewei Huang, Shuchang Zhou
2022Personality-Driven Social Multimedia Content Recommendation.
Qi Yang, Sergey I. Nikolenko, Alfred Huang, Aleksandr Farseev
2022Personalized 360-Degree Video Streaming: A Meta-Learning Approach.
Yiyun Lu, Yifei Zhu, Zhi Wang
2022Phase-based Memory Network for Video Dehazing.
Ye Liu, Liang Wan, Huazhu Fu, Jing Qin, Lei Zhu
2022Phoneme-Aware Adaptation with Discrepancy Minimization and Dynamically-Classified Vector for Text-independent Speaker Verification.
Jia Wang, Tianhao Lan, Jie Chen, Chengwen Luo, Chao Wu, Jianqiang Li
2022Photorealistic Style Transfer via Adaptive Filtering and Channel Seperation.
Hong Ding, Fei Luo, Caoqing Jiang, Gang Fu, Zipei Chen, Shenghong Hu, Chunxia Xiao
2022Physical Backdoor Attacks to Lane Detection Systems in Autonomous Driving.
Xingshuo Han, Guowen Xu, Yuan Zhou, Xuehuan Yang, Jiwei Li, Tianwei Zhang
2022PicT: A Slim Weakly Supervised Vision Transformer for Pavement Distress Classification.
Wenhao Tang, Sheng Huang, Xiaoxian Zhang, Luwen Huangfu
2022Pixel Exclusion: Uncertainty-aware Boundary Discovery for Active Cross-Domain Semantic Segmentation.
Fuming You, Jingjing Li, Zhi Chen, Lei Zhu
2022Pixel-Level Anomaly Detection via Uncertainty-aware Prototypical Transformer.
Chao Huang, Chengliang Liu, Zheng Zhang, Zhihao Wu, Jie Wen, Qiuping Jiang, Yong Xu
2022PixelSeg: Pixel-by-Pixel Stochastic Semantic Segmentation for Ambiguous Medical Images.
Wei Zhang, Xiaohong Zhang, Sheng Huang, Yuting Lu, Kun Wang
2022Pixelwise Adaptive Discretization with Uncertainty Sampling for Depth Completion.
Rui Peng, Tao Zhang, Bing Li, Yitong Wang
2022Point Cloud Completion via Multi-Scale Edge Convolution and Attention.
Rui Cao, Kaiyi Zhang, Yang Chen, Ximing Yang, Cheng Jin
2022Point to Rectangle Matching for Image Text Retrieval.
Zheng Wang, Zhenwei Gao, Xing Xu, Yadan Luo, Yang Yang, Heng Tao Shen
2022PreyNet: Preying on Camouflaged Objects.
Miao Zhang, Shuang Xu, Yongri Piao, Dongxiang Shi, Shusen Lin, Huchuan Lu
2022Prism: Handling Packet Loss for Ultra-low Latency Video.
Devdeep Ray, Vicente Bobadilla Riquelme, Srinivasan Seshan
2022Privacy-preserving Reflection Rendering for Augmented Reality.
Yiqin Zhao, Sheng Wei, Tian Guo
2022ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech.
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren
2022Progressive Attribute Embedding for Accurate Cross-modality Person Re-ID.
Aihua Zheng, Peng Pan, Hongchao Li, Chenglong Li, Bin Luo, Chang Tan, Ruoran Jia
2022Progressive Cross-modal Knowledge Distillation for Human Action Recognition.
Jianyuan Ni, Anne H. H. Ngu, Yan Yan
2022Progressive Limb-Aware Virtual Try-On.
Xiaoyu Han, Shengping Zhang, Qinglin Liu, Zonglin Li, Chenyang Wang
2022Progressive Spatial-temporal Collaborative Network for Video Frame Interpolation.
Mengshun Hu, Kui Jiang, Liang Liao, Zhixiang Nie, Jing Xiao, Zheng Wang
2022Progressive Tree-Structured Prototype Network for End-to-End Image Captioning.
Pengpeng Zeng, Jinkuan Zhu, Jingkuan Song, Lianli Gao
2022Progressive Unsupervised Learning of Local Descriptors.
Wufan Wang, Lei Zhang, Hua Huang
2022Prompt-based Zero-shot Video Moment Retrieval.
Guolong Wang, Xun Wu, Zhaoyuan Liu, Junchi Yan
2022Prompting for Multi-Modal Tracking.
Jinyu Yang, Zhe Li, Feng Zheng, Ales Leonardis, Jingkuan Song
2022Prototype-based Selective Knowledge Distillation for Zero-Shot Sketch Based Image Retrieval.
Kai Wang, Yifan Wang, Xing Xu, Xin Liu, Weihua Ou, Huimin Lu
2022Proxy Probing Decoder for Weakly Supervised Object Localization: A Baseline Investigation.
Jingyuan Xu, Hongtao Xie, Chuanbin Liu, Yongdong Zhang
2022Purifier: Plug-and-play Backdoor Mitigation for Pre-trained Models Via Anomaly Activation Suppression.
Xiaoyu Zhang, Yulin Jin, Tao Wang, Jian Lou, Xiaofeng Chen
2022Pursuing Knowledge Consistency: Supervised Hierarchical Contrastive Learning for Facial Action Unit Recognition.
Yingjie Chen, Chong Chen, Xiao Luo, Jianqiang Huang, Xian-Sheng Hua, Tao Wang, Yun Liang
2022Pyramidal Transformer with Conv-Patchify for Person Re-identification.
He Li, Mang Ye, Cong Wang, Bo Du
2022QoE-aware Download Control and Bitrate Adaptation for Short Video Streaming.
Ximing Wu, Lei Zhang, Laizhong Cui
2022QoEVMA'22: 2nd Workshop on Quality of Experience (QoE) in Visual Multimedia Applications.
Jing Li, Patrick Le Callet, Xinbo Gao, Zhi Li, Wen Lu, Jiachen Yang, Junle Wang
2022QuadTreeCapsule: QuadTree Capsules for Deep Regression Tracking.
Ding Ma, Xiangqian Wu
2022Quality Assessment of Image Super-Resolution: Balancing Deterministic and Statistical Fidelity.
Wei Zhou, Zhou Wang
2022Query Prior Matters: A MRC Framework for Multimodal Named Entity Recognition.
Meihuizi Jia, Xin Shen, Lei Shen, Jinhui Pang, Lejian Liao, Yang Song, Meng Chen, Xiaodong He
2022Query-driven Generative Network for Document Information Extraction in the Wild.
Haoyu Cao, Xin Li, Jiefeng Ma, Deqiang Jiang, Antai Guo, Yiqing Hu, Hao Liu, Yinsong Liu, Bo Ren
2022R-FEC: RL-based FEC Adjustment for Better QoE in WebRTC.
Insoo Lee, Seyeon Kim, Sandesh Dhawaskar Sathyanarayana, Kyungmin Bin, Song Chong, Kyunghan Lee, Dirk Grunwald, Sangtae Ha
2022RCRN: Real-world Character Image Restoration Network via Skeleton Extraction.
Daqian Shi, Xiaolei Diao, Hao Tang, Xiaomin Li, Hao Xing, Hao Xu
2022REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer.
Quanwei Yang, Xinchen Liu, Wu Liu, Hongtao Xie, Xiaoyan Gu, Lingyun Yu, Yongdong Zhang
2022RKformer: Runge-Kutta Transformer with Random-Connection Attention for Infrared Small Target Detection.
Mingjin Zhang, Haichen Bai, Jing Zhang, Rui Zhang, Chaoyue Wang, Jie Guo, Xinbo Gao
2022ROMA: Cross-Domain Region Similarity Matching for Unpaired Nighttime Infrared to Daytime Visible Video Translation.
Zhenjie Yu, Kai Chen, Shuang Li, Bingfeng Han, Chi Harold Liu, Shuigen Wang
2022RONF: Reliable Outlier Synthesis under Noisy Feature Space for Out-of-Distribution Detection.
Rundong He, Zhongyi Han, Xiankai Lu, Yilong Yin
2022RPPformer-Flow: Relative Position Guided Point Transformer for Scene Flow Estimation.
Hanlin Li, Guanting Dong, Yueyi Zhang, Xiaoyan Sun, Zhiwei Xiong
2022Rail Detection: An Efficient Row-based Network and a New Benchmark.
Xinpeng Li, Xiaojiang Peng
2022Rate-Distortion-Guided Learning Approach with Cross-Projection Information for V-PCC Fast CU Decision.
Hang Yuan, Wei Gao, Ge Li, Zhu Li
2022Re-ordered Micro Image based High Efficient Residual Coding in Light Field Compression.
Hyunmin Jung, Hyuk-Jae Lee, Chae-Eun Rhee
2022ReCoRo: Region-Controllable Robust Light Enhancement with User-Specified Imprecise Masks.
Dejia Xu, Hayk Poghosyan, Shant Navasardyan, Yifan Jiang, Humphrey Shi, Zhangyang Wang
2022ReFormer: The Relational Transformer for Image Captioning.
Xuewen Yang, Yingru Liu, Xin Wang
2022ReFu: Refine and Fuse the Unobserved View for Detail-Preserving Single-Image 3D Human Reconstruction.
Gyumin Shim, Minsoo Lee, Jaegul Choo
2022ReLyMe: Improving Lyric-to-Melody Generation by Incorporating Lyric-Melody Relationships.
Chen Zhang, LuChin Chang, Songruoyao Wu, Xu Tan, Tao Qin, Tie-Yan Liu, Kejun Zhang
2022Read Your Voice: A Playful Interactive Sound Encoder/Decoder.
Hugo Pauget Ballesteros, Gilles Azzaro, Jean Mélou, Yvain Quéau, Jean-Denis Durou
2022Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition.
Mingkun Yang, Minghui Liao, Pu Lu, Jing Wang, Shenggao Zhu, Hualin Luo, Qi Tian, Xiang Bai
2022Real-World Blind Super-Resolution via Feature Matching with Implicit High-Resolution Priors.
Chaofeng Chen, Xinyu Shi, Yipeng Qin, Xiaoming Li, Xiaoguang Han, Tao Yang, Shihui Guo
2022Real-time Semantic Segmentation with Parallel Multiple Views Feature Augmentation.
Jian-Jun Qiao, Zhi-Qi Cheng, Xiao Wu, Wei Li, Ji Zhang
2022Real-time Streaming Video Denoising with Bidirectional Buffers.
Chenyang Qi, Junming Chen, Xin Yang, Qifeng Chen
2022Recipe-oriented Food Logging for Nutritional Management.
Yoko Yamakata, Akihisa Ishino, Akiko Sunto, Sosuke Amano, Kiyoharu Aizawa
2022Recurrent Meta-Learning against Generalized Cold-start Problem in CTR Prediction.
Junyu Chen, Qianqian Xu, Zhiyong Yang, Ke Ma, Xiaochun Cao, Qingming Huang
2022Reducing the Vision and Language Bias for Temporal Sentence Grounding.
Daizong Liu, Xiaoye Qu, Wei Hu
2022RefCrowd: Grounding the Target in Crowd with Referring Expressions.
Heqian Qiu, Hongliang Li, Taijin Zhao, Lanxiao Wang, Qingbo Wu, Fanman Meng
2022Reflecting on Experiences for Response Generation.
Chenchen Ye, Lizi Liao, Suyu Liu, Tat-Seng Chua
2022Region-based Pixels Integration Mechanism for Weakly Supervised Semantic Segmentation.
Chen Qian, Hui Zhang
2022Relation-enhanced Negative Sampling for Multimodal Knowledge Graph Completion.
Derong Xu, Tong Xu, Shiwei Wu, Jingbo Zhou, Enhong Chen
2022Relational Representation Learning in Visually-Rich Documents.
Xin Li, Yan Zheng, Yiqing Hu, Haoyu Cao, Yunfei Wu, Deqiang Jiang, Yinsong Liu, Bo Ren
2022Relative Alignment Network for Source-Free Multimodal Video Domain Adaptation.
Yi Huang, Xiaoshan Yang, Ji Zhang, Changsheng Xu
2022Relative Pose Estimation for Multi-Camera Systems from Point Correspondences with Scale Ratio.
Banglei Guan, Ji Zhao
2022RepSR: Training Efficient VGG-style Super-Resolution Networks with Structural Re-Parameterization and Batch Normalization.
Xintao Wang, Chao Dong, Ying Shan
2022Repainting and Imitating Learning for Lane Detection.
Yue He, Minyue Jiang, Xiaoqing Ye, Liang Du, Zhikang Zou, Wei Zhang, Xiao Tan, Errui Ding
2022Representation Learning through Multimodal Attention and Time-Sync Comments for Affective Video Content Analysis.
Jicai Pan, Shangfei Wang, Lin Fang
2022Reproducibility Companion Paper: Focusing on Persons: Colorizing Old Images Learning from Modern Historical Movies.
Xin Jin, Ke Liu, Dongqing Zou, Zhonglan Li, Heng Huang, Vajira Thambawita
2022Restoration of Analog Videos Using Swin-UNet.
Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo
2022Restoration of User Videos Shared on Social Media.
Hongming Luo, Fei Zhou, Kin-Man Lam, Guoping Qiu
2022Rethinking Open-World Object Detection in Autonomous Driving Scenarios.
Zeyu Ma, Yang Yang, Guoqing Wang, Xing Xu, Heng Tao Shen, Mingxing Zhang
2022Rethinking Optical Flow Methods for Micro-Expression Spotting.
Yuan Zhao, Xin Tong, Zichong Zhu, Jianda Sheng, Lei Dai, Lingling Xu, Xuehai Xia, Yu Jiang, Jiao Li
2022Rethinking Super-Resolution as Text-Guided Details Generation.
Chenxi Ma, Bo Yan, Qing Lin, Weimin Tan, Siming Chen
2022Rethinking the Mechanism of the Pattern Pruning and the Circle Importance Hypothesis.
Hengyi Zhou, Longjun Liu, Haonan Zhang, Nanning Zheng
2022Rethinking the Metric in Few-shot Learning: From an Adaptive Multi-Distance Perspective.
Jinxiang Lai, Siqian Yang, Guannan Jiang, Xi Wang, Yuxi Li, Zihui Jia, Xiaochen Chen, Jun Liu, Bin-Bin Gao, Wei Zhang, Yuan Xie, Chengjie Wang
2022Rethinking the Reference-based Distinctive Image Captioning.
Yangjun Mao, Long Chen, Zhihong Jiang, Dong Zhang, Zhimeng Zhang, Jian Shao, Jun Xiao
2022Rethinking the Vulnerability of DNN Watermarking: Are Watermarks Robust against Naturalness-aware Perturbations?
Run Wang, Haoxuan Li, Lingzhou Mu, Jixing Ren, Shangwei Guo, Li Liu, Liming Fang, Jing Chen, Lina Wang
2022Revisiting Stochastic Learning for Generalizable Person Re-identification.
Jiajian Zhao, Yifan Zhao, Xiaowu Chen, Jia Li
2022Robust Actor Recognition in Entertainment Multimedia at Scale.
Abhinav Aggarwal, Yash Pandya, Lokesh A. Ravindranathan, Laxmi S. Ahire, Manivel Sethu, Kaustav Nandy
2022Robust Attention Deraining Network for Synchronous Rain Streaks and Raindrops Removal.
Yanyan Wei, Zhao Zhang, Mingliang Xu, Richang Hong, Jicong Fan, Shuicheng Yan
2022Robust Diversified Graph Contrastive Network for Incomplete Multi-view Clustering.
Zhe Xue, Junping Du, Hai Zhu, Zhongchao Guan, Yunfei Long, Yu Zang, MeiYu Liang
2022Robust Industrial UAV/UGV-Based Unsupervised Domain Adaptive Crack Recognitions with Depth and Edge Awareness: From System and Database Constructions to Real-Site Inspections.
Kangcheng Liu
2022Robust Low-Rank Convolution Network for Image Denoising.
Jiahuan Ren, Zhao Zhang, Richang Hong, Mingliang Xu, Haijun Zhang, Mingbo Zhao, Meng Wang
2022Robust Multimodal Depth Estimation using Transformer based Generative Adversarial Networks.
Md Fahim Faysal Khan, Anusha Devulapally, Siddharth Advani, Vijaykrishnan Narayanan
2022Rotation Invariant Transformer for Recognizing Object in UAVs.
Shuoyi Chen, Mang Ye, Bo Du
2022S-CCR: Super-Complete Comparative Representation for Low-Light Image Quality Inference In-the-wild.
Miaohui Wang, Zhuowei Xu, Yuanhao Gong, Wuyuan Xie
2022SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.
Kangneng Zhou, Xiaobin Zhu, Daiheng Gao, Kai Lee, Xinjie Li, Xu-Cheng Yin
2022SDRTV-to-HDRTV via Hierarchical Dynamic Context Feature Mapping.
Gang He, Kepeng Xu, Li Xu, Chang Wu, Ming Sun, Xing Wen, Yu-Wing Tai
2022SER30K: A Large-Scale Dataset for Sticker Emotion Recognition.
Shengzhe Liu, Xin Zhang, Jufeng Yang
2022SGINet: Toward Sufficient Interaction Between Single Image Deraining and Semantic Segmentation.
Yanyan Wei, Zhao Zhang, Huan Zheng, Richang Hong, Yi Yang, Meng Wang
2022SIM-Trans: Structure Information Modeling Transformer for Fine-grained Visual Categorization.
Hongbo Sun, Xiangteng He, Yuxin Peng
2022SIR-Former: Stereo Image Restoration Using Transformer.
Zizheng Yang, Mingde Yao, Jie Huang, Man Zhou, Feng Zhao
2022SPTS: Single-Point Text Spotting.
Dezhi Peng, Xinyu Wang, Yuliang Liu, Jiaxin Zhang, Mingxin Huang, Songxuan Lai, Jing Li, Shenggao Zhu, Dahua Lin, Chunhua Shen, Xiang Bai, Lianwen Jin
2022SUMAC '22: 4th ACM International workshop on Structuring and Understanding of Multimedia heritAge Contents.
Valérie Gouet-Brunet, Ronak Kosti, Li Weng
2022Saliency in Augmented Reality.
Huiyu Duan, Wei Shen, Xiongkuo Min, Danyang Tu, Jing Li, Guangtao Zhai
2022Sample Weighted Multiple Kernel K-means via Min-Max optimization.
Yi Zhang, Weixuan Liang, Xinwang Liu, Sisi Dai, Siwei Wang, Liyang Xu, En Zhu
2022Scale-flow: Estimating 3D Motion from Video.
Han Ling, Quansen Sun, Zhenwen Ren, Yazhou Liu, Hongyuan Wang, Zichen Wang
2022ScatterNet: Point Cloud Learning via Scatters.
Qi Liu, Nianjuan Jiang, Jiangbo Lu, Mingang Chen, Ran Yi, Lizhuang Ma
2022ScoreActuary: Hoop-Centric Trajectory-Aware Network for Fine-Grained Basketball Shot Analysis.
Ting-Yang Kao, Tse-Yu Pan, Chen-Ni Chen, Tsung-Hsun Tsai, Hung-Kuo Chu, Min-Chun Hu
2022Search-oriented Micro-video Captioning.
Liqiang Nie, Leigang Qu, Dai Meng, Min Zhang, Qi Tian, Alberto Del Bimbo
2022Searching Lightweight Neural Network for Image Signal Processing.
Haojia Lin, Lijiang Li, Xiawu Zheng, Fei Chao, Rongrong Ji
2022Seeing Speech: Magnetic Resonance Imaging-Based Vocal Tract Deformation Visualization Using Cross-Modal Transformer.
Kele Xu, Ming Feng, Weiquan Huang
2022Self-Aligned Concave Curve: Illumination Enhancement for Unsupervised Adaptation.
Wenjing Wang, Zhengbo Xu, Haofeng Huang, Jiaying Liu
2022Self-Paced Label Distribution Learning for In-The-Wild Facial Expression Recognition.
Jianjian Shao, Zhenqian Wu, Yuanyan Luo, Shudong Huang, Xiaorong Pu, Yazhou Ren
2022Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation.
Jin Yuan, Feng Hou, Yangzhou Du, Zhongchao Shi, Xin Geng, Jianping Fan, Yong Rui
2022Self-Supervised Human Pose based Multi-Camera Video Synchronization.
Liqiang Yin, Ruize Han, Wei Feng, Song Wang
2022Self-Supervised Multi-view Stereo via Adjacent Geometry Guided Volume Completion.
Luoyuan Xu, Tao Guan, Yuesong Wang, Yawei Luo, Zhuo Chen, Wenkai Liu, Wei Yang
2022Self-Supervised Representation Learning for Skeleton-Based Group Activity Recognition.
Cunling Bian, Wei Feng, Song Wang
2022Self-Supervised Text Erasing with Controllable Image Synthesis.
Gangwei Jiang, Shiyao Wang, Tiezheng Ge, Yuning Jiang, Ying Wei, Defu Lian
2022Self-supervised Exclusive Learning for 3D Segmentation with Cross-Modal Unsupervised Domain Adaptation.
Yachao Zhang, Miaoyu Li, Yuan Xie, Cuihua Li, Cong Wang, Zhizhong Zhang, Yanyun Qu
2022Self-supervised Multi-view Stereo via Inter and Intra Network Pseudo Depth.
Ke Qiu, Yawen Lai, Shiyi Liu, Ronggang Wang
2022Self-supervised Scene Text Segmentation with Object-centric Layered Representations Augmented by Text Regions.
Yibo Wang, Yunhu Ye, Yuanpeng Mao, Yanwei Yu, Yuanping Song
2022Semantic Data Augmentation based Distance Metric Learning for Domain Generalization.
Mengzhu Wang, Jianlong Yuan, Qi Qian, Zhibin Wang, Hao Li
2022Semantic Structure Enhanced Contrastive Adversarial Hash Network for Cross-media Representation Learning.
MeiYu Liang, Junping Du, Xiaowen Cao, Yang Yu, Kangkang Lu, Zhe Xue, Min Zhang
2022Semantic-aware Responsive Listener Head Synthesis.
Wei Zhao, Peng Xiao, Rongju Zhang, Yijun Wang, Jianxin Lin
2022Semantically-Consistent Dynamic Blurry Image Generation for Image Deblurring.
Zhaohui Jing, Youjian Zhang, Chaoyue Wang, Daqing Liu, Yong Xia
2022Semantics-Driven Generative Replay for Few-Shot Class Incremental Learning.
Aishwarya Agarwal, Biplab Banerjee, Fabio Cuzzolin, Subhasis Chaudhuri
2022Semi-supervised Crowd Counting via Density Agency.
Hui Lin, Zhiheng Ma, Xiaopeng Hong, Yaowei Wang, Zhou Su
2022Semi-supervised Human Pose Estimation in Art-historical Images.
Matthias Springstein, Stefanie Schneider, Christian Althaus, Ralph Ewerth
2022Semi-supervised Learning for Multi-label Video Action Detection.
Hongcheng Zhang, Xu Zhao, Dongqi Wang
2022Semi-supervised Semantic Segmentation via Prototypical Contrastive Learning.
Zenggui Chen, Zhouhui Lian
2022Semi-supervised Video Shadow Detection via Image-assisted Pseudo-label Generation.
Zipei Chen, Xiao Lu, Ling Zhang, Chunxia Xiao
2022Sentiment-aware Classifier for Out-of-Context Caption Detection.
Muhannad Alkaddour, Abhinav Dhall, Usman Tariq, Hasan Al-Nashash, Fares Al-Shargie
2022Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary.
Jiong Wang, Zhou Zhao, Fei Wu
2022Shifting Perspective to See Difference: A Novel Multi-view Method for Skeleton based Action Recognition.
Ruijie Hou, Yanran Li, Ningyu Zhang, Yulin Zhou, Xiaosong Yang, Zhao Wang
2022Show Me What I Like: Detecting User-Specific Video Highlights Using Content-Based Multi-Head Attention.
Uttaran Bhattacharya, Gang Wu, Stefano Petrangeli, Viswanathan Swaminathan, Dinesh Manocha
2022Simple Self-supervised Multiplex Graph Representation Learning.
Yujie Mo, Yuhuan Chen, Liang Peng, Xiaoshuang Shi, Xiaofeng Zhu
2022SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation.
Rongjie Huang, Chenye Cui, Feiyang Chen, Yi Ren, Jinglin Liu, Zhou Zhao, Baoxing Huai, Zhefeng Wang
2022SingMaster: A Sight-singing Evaluation System of "Shoot and Sing" Based on Smartphone.
Wei Xu, Bowen Tian, Lijie Luo, Weiming Yang, Xianke Wang, Lei Wu
2022Single Image Shadow Detection via Complementary Mechanism.
Yurui Zhu, Xueyang Fu, Chengzhi Cao, Xi Wang, Qibin Sun, Zheng-Jun Zha
2022Situational Perception Guided Image Matting.
Bo Xu, Jiake Xie, Han Huang, Ziwen Li, Cheng Lu, Yong Tang, Yandong Guo
2022Skeleton-based Action Recognition via Adaptive Cross-Form Learning.
Xuanhan Wang, Yan Dai, Lianli Gao, Jingkuan Song
2022Skeleton2Humanoid: Animating Simulated Characters for Physically-plausible Motion In-betweening.
Yunhao Li, Zhenbo Yu, Yucheng Zhu, Bingbing Ni, Guangtao Zhai, Wei Shen
2022Sketch Transformer: Asymmetrical Disentanglement Learning from Dynamic Synthesis.
Cuiqun Chen, Mang Ye, Meibin Qi, Bo Du
2022Skimming, Locating, then Perusing: A Human-Like Framework for Natural Language Video Localization.
Daizong Liu, Wei Hu
2022SlimSeg: Slimmable Semantic Segmentation with Boundary Supervision.
Danna Xue, Fei Yang, Pei Wang, Luis Herranz, Jinqiu Sun, Yu Zhu, Yanning Zhang
2022SoftSkip: Empowering Multi-Modal Dynamic Pruning for Single-Stage Referring Comprehension.
Dulanga Weerakoon, Vigneshwaran Subbaraju, Tuan Tran, Archan Misra
2022SongDriver: Real-time Music Accompaniment Generation without Logical Latency nor Exposure Bias.
Zihao Wang, Kejun Zhang, Yuxing Wang, Chen Zhang, Qihao Liang, Pengfei Yu, Yongsheng Feng, Wenbo Liu, Yikai Wang, Yuntai Bao, Yiheng Yang
2022Sophon: Super-Resolution Enhanced 360° Video Streaming with Visual Saliency-aware Prefetch.
Jianxin Shi, Lingjun Pu, Xinjing Yuan, Qianyun Gong, Jingdong Xu
2022Source-Free Active Domain Adaptation via Energy-Based Locality Preserving Transfer.
Xinyao Li, Zhekai Du, Jingjing Li, Lei Zhu, Ke Lu
2022Source-Free Domain Adaptation for Real-World Image Dehazing.
Hu Yu, Jie Huang, Yajing Liu, Qi Zhu, Man Zhou, Feng Zhao
2022Span-based Audio-Visual Localization.
Yiling Wu, Xinfeng Zhang, Yaowei Wang, Qingming Huang
2022Spatial-Temporal Aligned Multi-Agent Learning for Visual Dialog Systems.
Yong Zhuang, Tong Yu, Junda Wu, Shiqu Wu, Shuai Li
2022Speech Fusion to Face: Bridging the Gap Between Human's Vocal Characteristics and Facial Imaging.
Yeqi Bai, Tao Ma, Lipo Wang, Zhenjie Zhang
2022Split-PU: Hardness-aware Training Strategy for Positive-Unlabeled Learning.
Chengming Xu, Chen Liu, Siqian Yang, Yabiao Wang, Shijie Zhang, Lijie Jia, Yanwei Fu
2022StimulusLoop: Game-Actuated Mutuality Artwork for Evoking Affective State.
Tai-Chen Tsai, Tse-Yu Pan, Min-Chun Hu, Ya-Lun Tao
2022Structure- and Texture-Aware Learning for Low-Light Image Enhancement.
Jinghao Zhang, Jie Huang, Mingde Yao, Man Zhou, Feng Zhao
2022Structure-Enhanced Pop Music Generation via Harmony-Aware Learning.
Xueyao Zhang, Jinchao Zhang, Yao Qiu, Li Wang, Jie Zhou
2022Structure-Inferred Bi-level Model for Underwater Image Enhancement.
Pan Mu, Haotian Qian, Cong Bai
2022Structure-Preserving Motion Estimation for Learned Video Compression.
Han Gao, Jinzhong Cui, Mao Ye, Shuai Li, Yu Zhao, Xiatian Zhu
2022Sundial-GAN: A Cascade Generative Adversarial Networks Framework for Deciphering Oracle Bone Inscriptions.
Xiang Chang, Fei Chao, Changjing Shang, Qiang Shen
2022Support for Teaching Mathematics of the Blind by Sighted Tutors Through Multisensual Access to Formulas with Braille Converters and Speech.
Dariusz Mikulowski
2022Symmetric Uncertainty-Aware Feature Transmission for Depth Super-Resolution.
Wuxuan Shi, Mang Ye, Bo Du
2022Sync Sofa: Sofa-type Side-by-side Communication Experience Based on Multimodal Expression.
Yuki Tajima, Shota Okubo, Tomoaki Konno, Toshiharu Horiuchi, Tatsuya Kobayashi
2022Synthesizing Counterfactual Samples for Effective Image-Text Matching.
Hao Wei, Shuhui Wang, Xinzhe Han, Zhe Xue, Bin Ma, Xiaoming Wei, Xiaolin Wei
2022Synthetic Data Supervised Salient Object Detection.
Zhenyu Wu, Lin Wang, Wei Wang, Tengfei Shi, Chenglizhao Chen, Aimin Hao, Shuo Li
2022T-former: An Efficient Transformer for Image Inpainting.
Ye Deng, Siqi Hui, Sanping Zhou, Deyu Meng, Jinjun Wang
2022TA-CNN: A Unified Network for Human Behavior Analysis in Multi-Person Conversations.
Fuyan Ma, Ziyu Ma, Bin Sun, Shutao Li
2022TAGPerson: A Target-Aware Generation Pipeline for Person Re-identification.
Kai Chen, Weihua Chen, Tao He, Rong Du, Fan Wang, Xiuyu Sun, Yuchen Guo, Guiguang Ding
2022TFF-Former: Temporal-Frequency Fusion Transformer for Zero-training Decoding of Two BCI Tasks.
Xujin Li, Wei Wei, Shuang Qiu, Huiguang He
2022TGDM: Target Guided Dynamic Mixup for Cross-Domain Few-Shot Learning.
Linhai Zhuo, Yuqian Fu, Jingjing Chen, Yixin Cao, Yu-Gang Jiang
2022TPSNet: Reverse Thinking of Thin Plate Splines for Arbitrary Shape Scene Text Representation.
Wei Wang, Yu Zhou, Jiahao Lyu, Dayan Wu, Guoqing Zhao, Ning Jiang, Weiping Wang
2022TSRFormer: Table Structure Recognition with Transformers.
Weihong Lin, Zheng Sun, Chixiang Ma, Mingze Li, Jiawei Wang, Lei Sun, Qiang Huo
2022TVFormer: Trajectory-guided Visual Quality Assessment on 360° Images with Transformers.
Li Yang, Mai Xu, Tie Liu, Liangyu Huo, Xinbo Gao
2022TWIZ: The Multimodal Conversational Task Wizard.
Rafael Ferreira, Diogo Silva, Diogo Tavares, Frederico Vicente, Mariana Bonito, Gustavo Gonçalves, Rui Margarido, Paula Figueiredo, Helder Rodrigues, David Semedo, João Magalhães
2022Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration.
Manyi Zhang, Yuxin Ren, Zihao Wang, Chun Yuan
2022Talk2Face: A Unified Sequence-based Framework for Diverse Face Generation and Analysis Tasks.
Yudong Li, Xianxu Hou, Zhe Zhao, Linlin Shen, Xuefeng Yang, Kimmo Yan
2022Talking Head from Speech Audio using a Pre-trained Image Generator.
Mohammed M. Alghamdi, He Wang, Andrew J. Bulpitt, David C. Hogg
2022Target-Driven Structured Transformer Planner for Vision-Language Navigation.
Yusheng Zhao, Jinyu Chen, Chen Gao, Wenguan Wang, Lirong Yang, Haibing Ren, Huaxia Xia, Si Liu
2022Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition.
Huabin Liu, Weixian Lv, John See, Weiyao Lin
2022Temporal Sentiment Localization: Listen and Look in Untrimmed Videos.
Zhicheng Zhang, Jufeng Yang
2022Text Style Transfer based on Multi-factor Disentanglement and Mixture.
Anna Zhu, Zhanhui Yin, Brian Kenji Iwana, Xinyu Zhou, Shengwu Xiong
2022Text's Armor: Optimized Local Adversarial Perturbation Against Scene Text Editing Attacks.
Tao Xiang, Hangcheng Liu, Shangwei Guo, Hantao Liu, Tianwei Zhang
2022TextBlock: Towards Scene Text Spotting without Fine-grained Detection.
Jin Wei, Yuan Zhang, Yu Zhou, Gangyan Zeng, Zhi Qiao, Youhui Guo, Haiying Wu, Hongbin Wang, Weiping Wang
2022The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes.
Björn W. Schuller, Anton Batliner, Shahin Amiriparian, Christian Bergler, Maurice Gerczuk, Natalie Holz, Pauline Larrouy-Maestri, Sebastian P. Bayerl, Korbinian Riedhammer, Adria Mallol-Ragolta, Maria Pateraki, Harry Coppock, Ivan Kiskin, Marianne Sinka, Stephen J. Roberts
2022The ACM Multimedia 2022 Deep Video Understanding Grand Challenge.
Keith Curtis, George Awad, Shahzad Rajput, Ian Soboroff
2022The Beauty of Repetition in Machine Composition Scenarios.
Zhejing Hu, Xiao Ma, Yan Liu, Gong Chen, Yongxu Liu
2022The First Impression: Understanding the Impact of Multimodal System Responses on User Behavior in Task-oriented Agents.
Diogo Silva
2022The More, The Better? Active Silencing of Non-Positive Transfer for Efficient Multi-Domain Few-Shot Classification.
Xingxing Zhang, Zhizhe Liu, Weikai Yang, Liyuan Wang, Jun Zhu
2022Time and Memory Efficient Large-Scale Canonical Correlation Analysis in Fourier Domain.
Xiang-Jun Shen, Zhaorui Xu, Liangjun Wang, Zechao Li
2022Title-and-Tag Contrastive Vision-and-Language Transformer for Social Media Popularity Prediction.
Weilong Chen, Chenghao Huang, Weimin Yuan, Xiaolu Chen, Wenhao Hu, Xinran Zhang, Yanru Zhang
2022Token Embeddings Alignment for Cross-Modal Retrieval.
Chen-Wei Xie, Jianmin Wu, Yun Zheng, Pan Pan, Xian-Sheng Hua
2022TopicVAE: Topic-aware Disentanglement Representation Learning for Enhanced Recommendation.
Zhiqiang Guo, Guohui Li, Jianjun Li, Huaicong Chen
2022Towards Accurate Post-Training Quantization for Vision Transformer.
Yifu Ding, Haotong Qin, Qinghua Yan, Zhenhua Chai, Junjie Liu, Xiaolin Wei, Xianglong Liu
2022Towards Adversarial Attack on Vision-Language Pre-training Models.
Jiaming Zhang, Qi Yi, Jitao Sang
2022Towards All Weather and Unobstructed Multi-Spectral Image Stitching: Algorithm and Benchmark.
Zhiying Jiang, Zengxi Zhang, Xin Fan, Risheng Liu
2022Towards Blind Watermarking: Combining Invertible and Non-invertible Mechanisms.
Rui Ma, Mengxi Guo, Yi Hou, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie
2022Towards Causality Inference for Very Important Person Localization.
Xiao Wang, Zheng Wang, Wu Liu, Xin Xu, Qijun Zhao, Shin'ichi Satoh
2022Towards Complex Document Understanding By Discrete Reasoning.
Fengbin Zhu, Wenqiang Lei, Fuli Feng, Chao Wang, Haozhou Zhang, Tat-Seng Chua
2022Towards Continual Adaptation in Industrial Anomaly Detection.
Wujin Li, Jiawei Zhan, Jinbao Wang, Bizhong Xia, Bin-Bin Gao, Jun Liu, Chengjie Wang, Feng Zheng
2022Towards Counterfactual Image Manipulation via CLIP.
Yingchen Yu, Fangneng Zhan, Rongliang Wu, Jiahui Zhang, Shijian Lu, Miaomiao Cui, Xuansong Xie, Xian-Sheng Hua, Chunyan Miao
2022Towards Further Comprehension on Referring Expression with Rationale.
Rengang Li, Baoyu Fan, Xiaochuan Li, Runze Zhang, Zhenhua Guo, Kun Zhao, Yaqian Zhao, Weifeng Gong, Endong Wang
2022Towards High-Fidelity Face Normal Estimation.
Meng Wang, Chaoyue Wang, Xiaojie Guo, Jiawan Zhang
2022Towards Open-Ended Text-to-Face Generation, Combination and Manipulation.
Jun Peng, Han Pan, Yiyi Zhou, Jing He, Xiaoshuai Sun, Yan Wang, Yongjian Wu, Rongrong Ji
2022Towards Robust Video Object Segmentation with Adaptive Object Calibration.
Xiaohao Xu, Jinglu Wang, Xiang Ming, Yan Lu
2022Towards Unbiased Visual Emotion Recognition via Causal Intervention.
Yuedong Chen, Xu Yang, Tat-Jen Cham, Jianfei Cai
2022Towards Understanding Cross Resolution Feature Matching for Surveillance Face Recognition.
Chiawei Kuo, Yi-Ting Tsai, Hong-Han Shuai, Yi-Ren Yeh, Ching-Chun Huang
2022Tracking Game: Self-adaptative Agent based Multi-object Tracking.
Shuai Wang, Da Yang, Yubin Wu, Yang Liu, Hao Sheng
2022Trajectory Prediction from Hierarchical Perspective.
Tangwen Qian, Yongjun Xu, Zhao Zhang, Fei Wang
2022TransCNN-HAE: Transformer-CNN Hybrid AutoEncoder for Blind Image Inpainting.
Haoru Zhao, Zhaorui Gu, Bing Zheng, Haiyong Zheng
2022Transcript to Video: Efficient Clip Sequencing from Texts.
Yu Xiong, Fabian Caba Heilbron, Dahua Lin
2022Transductive Aesthetic Preference Propagation for Personalized Image Aesthetics Assessment.
Yaohui Li, Yuzhe Yang, Huaxiong Li, Haoxing Chen, Liwu Xu, Leida Li, Yaqian Li, Yandong Guo
2022Transformers in Spectral Domain for Estimating Image Geometric Transformation.
Mingii Choi, Sangyeong Lee, Heesun Jung, Jong-Uk Hou
2022Two stage Multi-Modal Modeling for Video Interaction Analysis in Deep Video Understanding Challenge.
Siyang Sun, Xiong Xiong, Yun Zheng
2022Two-Stage Multi-Scale Resolution-Adaptive Network for Low-Resolution Face Recognition.
Haihan Wang, Shangfei Wang, Lin Fang
2022Two-Stream Transformer for Multi-Label Image Classification.
Xuelin Zhu, Jiuxin Cao, Jiawei Ge, Weijia Liu, Bo Liu
2022TxVAD: Improved Video Action Detection by Transformers.
Zhenyu Wu, Zhou Ren, Yi Wu, Zhangyang Wang, Gang Hua
2022UConNet: Unsupervised Controllable Network for Image and Video Deraining.
Jun-Hao Zhuang, Yi-Si Luo, Xile Zhao, Tai-Xiang Jiang, Bichuan Guo
2022UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior.
Yonghui Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li
2022Unbiased Directed Object Attention Graph for Object Navigation.
Ronghao Dang, Zhuofan Shi, Liuyi Wang, Zongtao He, Chengju Liu, Qijun Chen
2022Uncertainty-Aware 3D Human Pose Estimation from Monocular Video.
Jinlu Zhang, Yujin Chen, Zhigang Tu
2022Uncertainty-Aware Semi-Supervised Learning of 3D Face Rigging from Single Image.
Yong Zhao, Haifeng Chen, Hichem Sahli, Ke Lu, Dongmei Jiang
2022Understanding News Text and Images Connection with Context-enriched Multimodal Transformers.
Cláudio Bartolomeu, Rui Nóbrega, David Semedo
2022Understanding Political Polarization via Jointly Modeling Users, Connections and Multimodal Contents on Heterogeneous Graphs.
Hanjia Lyu, Jiebo Luo
2022Understanding and Identifying Artwork Plagiarism with the Wisdom of Designers: A Case Study on Poster Artworks.
Shenglan Cui, Fang Liu, Tongqing Zhou, Mohan Zhang
2022Unified Multi-modal Pre-training for Few-shot Sentiment Analysis with Prompt-based Learning.
Yang Yu, Dong Zhang, Shoushan Li
2022Unified Multimodal Model with Unlikelihood Training for Visual Dialog.
Zihao Wang, Junli Wang, Changjun Jiang
2022Unified Normalization for Accelerating and Stabilizing Transformers.
Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, Shiliang Pu
2022Unified QA-aware Knowledge Graph Generation Based on Multi-modal Modeling.
Penggang Qin, Jiarui Yu, Yan Gao, Derong Xu, Yunkai Chen, Shiwei Wu, Tong Xu, Enhong Chen, Yanbin Hao
2022Universal Domain Adaptive Object Detector.
Wenxu Shi, Lei Zhang, Weijie Chen, Shiliang Pu
2022Unsupervised Domain Adaptation Integrating Transformer and Mutual Information for Cross-Corpus Speech Emotion Recognition.
Shiqing Zhang, Ruixin Liu, Yijiao Yang, Xiaoming Zhao, Jun Yu
2022Unsupervised Multi-object Tracking via Dynamical VAE and Variational Inference.
Xiaoyu Lin
2022Unsupervised Textured Terrain Generation via Differentiable Rendering.
Peichi Zhou, Dingbo Lu, Chen Li, Jian Zhang, Long Liu, Changbo Wang
2022Unsupervised Video Hashing with Multi-granularity Contextualization and Multi-structure Preservation.
Yanbin Hao, Jingru Duan, Hao Zhang, Bin Zhu, Pengyuan Zhou, Xiangnan He
2022Unsupervised and Pseudo-Supervised Vision-Language Alignment in Visual Dialog.
Feilong Chen, Duzhen Zhang, Xiuyi Chen, Jing Shi, Shuang Xu, Bo Xu
2022UoLMM'22: 2nd International Workshop on Robust Understanding of Low-quality Multimedia Data: Unitive Enhancement, Analysis and Evaluation.
Liang Liao, Dan Xu, Yang Wu, Xiao Wang, Jing Xiao
2022VMRF: View Matching Neural Radiance Fields.
Jiahui Zhang, Fangneng Zhan, Rongliang Wu, Yingchen Yu, Wenqing Zhang, Song Bai, Xiaoqin Zhang, Shijian Lu
2022VQ-DcTr: Vector-Quantized Autoencoder With Dual-channel Transformer Points Splitting for 3D Point Cloud Completion.
Ben Fei, Weidong Yang, Wen-Ming Chen, Lipeng Ma
2022Video Coding Enhancements for HTTP Adaptive Streaming.
Vignesh V. Menon
2022Video Coding using Learned Latent GAN Compression.
Mustafa Shukor, Bharath Bhushan Damodaran, Xu Yao, Pierre Hellier
2022Video Grounding and Its Generalization.
Xin Wang, Xiaohan Lan, Wenwu Zhu
2022Video Instance Lane Detection via Deep Temporal and Geometry Consistency Constraints.
Mingqian Wang, Yujun Zhang, Wei Feng, Lei Zhu, Song Wang
2022Video Moment Retrieval with Hierarchical Contrastive Learning.
Bolin Zhang, Chao Yang, Bin Jiang, Xiaokang Zhou
2022Video-Guided Curriculum Learning for Spoken Video Grounding.
Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren
2022VigilanceNet: Decouple Intra- and Inter-Modality Learning for Multimodal Vigilance Estimation in RSVP-Based BCI.
Xinyu Cheng, Wei Wei, Changde Du, Shuang Qiu, Sanli Tian, Xiaojun Ma, Huiguang He
2022Visual Dialog for Spotting the Differences between Pairs of Similar Images.
Duo Zheng, Fandong Meng, Qingyi Si, Hairun Fan, Zipeng Xu, Jie Zhou, Fangxiang Feng, Xiaojie Wang
2022Visual Grounding in Remote Sensing Images.
Yuxi Sun, Shanshan Feng, Xutao Li, Yunming Ye, Jian Kang, Xu Huang
2022Visual Knowledge Graph for Human Action Reasoning in Videos.
Yue Ma, Yali Wang, Yue Wu, Ziyu Lyu, Siran Chen, Xiu Li, Yu Qiao
2022Viva Contemporary! Mobile Music Laboratory.
Emily Graber, Charles Picasso, Elaine Chew
2022WOC: A Handy Webcam-based 3D Online Chatroom.
Chuanhang Yan, Yu Sun, Qian Bao, Jinhui Pang, Wu Liu, Tao Mei
2022Wander: An AI-driven Chatbot to Visit the Future Earth.
Yuqian Sun, Chenhang Cheng, Ying Xu, Yihua Li, Chang Hee Lee, Ali Asadipour
2022Wav2vec2-based Paralinguistic Systems to Recognise Vocalised Emotions and Stuttering.
Tamás Grósz, Dejan Porjazovski, Yaroslav Getman, Sudarsana Reddy Kadiri, Mikko Kurimo
2022Wavelet-enhanced Weakly Supervised Local Feature Learning for Face Forgery Detection.
Jiaming Li, Hongtao Xie, Lingyun Yu, Yongdong Zhang
2022Weakly Supervised Video Salient Object Detection via Point Supervision.
Shuyong Gao, Haozhe Xing, Wei Zhang, Yan Wang, Qianyu Guo, Wenqiang Zhang
2022Weakly-Supervised Temporal Action Alignment Driven by Unbalanced Spectral Fused Gromov-Wasserstein Distance.
Dixin Luo, Yutong Wang, Angxiao Yue, Hongteng Xu
2022Weakly-supervised Disentanglement Network for Video Fingerspelling Detection.
Ziqi Jiang, Shengyu Zhang, Siyuan Yao, Wenqiao Zhang, Sihan Zhang, Juncheng Li, Zhou Zhao, Fei Wu
2022Webly Supervised Image Hashing with Lightweight Semantic Transfer Network.
Hui Cui, Lei Zhu, Jingjing Li, Zheng Zhang, Weili Guan
2022When True Becomes False: Few-Shot Link Prediction beyond Binary Relations through Mining False Positive Entities.
Xuan Zhang, Xun Liang, Xiangping Zheng, Bo Wu, Yuhui Guo
2022Where Are You Looking?: A Large-Scale Dataset of Head and Gaze Behavior for 360-Degree Videos and a Pilot Study.
Yili Jin, Junhua Liu, Fangxin Wang, Shuguang Cui
2022X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval.
Yiwei Ma, Guohai Xu, Xiaoshuai Sun, Ming Yan, Ji Zhang, Rongrong Ji
2022You Can even Annotate Text with Voice: Transcription-only-Supervised Text Spotting.
Jingqun Tang, Su Qiao, Benlei Cui, Yuhang Ma, Sheng Zhang, Dimitrios Kanoulas
2022You Only Align Once: Bidirectional Interaction for Spatial-Temporal Video Super-Resolution.
Mengshun Hu, Kui Jiang, Zhixiang Nie, Zheng Wang
2022You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors.
Haiping Wang, Yuan Liu, Zhen Dong, Wenping Wang
2022Zero-shot Generalization of Multimodal Dialogue Agents.
Diogo Tavares
2022Zero-shot Video Classification with Appropriate Web and Task Knowledge Transfer.
Junbao Zhuo, Yan Zhu, Shuhao Cui, Shuhui Wang, Bin Ma, Qingming Huang, Xiaoming Wei, Xiaolin Wei
2022mmBody Benchmark: 3D Body Reconstruction Dataset and Analysis for Millimeter Wave Radar.
Anjun Chen, Xiangyu Wang, Shaohao Zhu, Yanxu Li, Jiming Chen, Qi Ye
2022mmLayout: Multi-grained MultiModal Transformer for Document Understanding.
Wenjin Wang, Zhengjie Huang, Bin Luo, Qianglong Chen, Qiming Peng, Yinxu Pan, Weichong Yin, Shikun Feng, Yu Sun, Dianhai Yu, Yin Zhang
2022xCloth: Extracting Template-free Textured 3D Clothes from a Monocular Image.
Astitva Srivastava, Chandradeep Pokhariya, Sai Sagar Jinka, Avinash Sharma