Papers
Multi-Modal Gait Recognition via Effective Spatial-Temporal Feature Fusion
Yufeng Cui, Yimei Kang
Multimodal Industrial Anomaly Detection via Hybrid Fusion
Yue Wang, Jinlong Peng, Jiangning Zhang et al.
Multimodality Helps Unimodality: Cross-Modal Few-Shot Learning With Multimodal Models
Zhiqiu Lin, Samuel Yu, Zhiyi Kuang et al.
Multi-Modal Learning With Missing Modality via Shared-Specific Feature Modelling
Hu Wang, Yuanhong Chen, Congbo Ma et al.
Multimodal Prompting With Missing Modalities for Visual Recognition
Yi-Lun Lee, Yi-Hsuan Tsai, Wei-Chen Chiu et al.
Multi-Modal Representation Learning With Text-Driven Soft Masks
Jaeyoo Park, Bohyung Han
Multi-Mode Online Knowledge Distillation for Self-Supervised Visual Representation Learning
Kaiyou Song, Jin Xie, Shan Zhang et al.
Multi-Object Manipulation via Object-Centric Neural Scattering Functions
Stephen Tian, Yancheng Cai, Hong-Xing Yu et al.
Multiple Instance Learning via Iterative Self-Paced Supervised Contrastive Learning
Kangning Liu, Weicheng Zhu, Yiqiu Shen et al.
Multiplicative Fourier Level of Detail
Yishun Dou, Zhong Zheng, Qiaoqiao Jin et al.
Multi-Realism Image Compression With a Conditional Generator
Eirikur Agustsson, David Minnen, George Toderici et al.
Multi-Sensor Large-Scale Dataset for Multi-View 3D Reconstruction
Oleg Voynov, Gleb Bobrovskikh, Pavel Karpyshev et al.
Multi-Space Neural Radiance Fields
Ze-Xin Yin, Jiaxiong Qiu, Ming-Ming Cheng et al.
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline
Wei Ji, Jingjing Li, Cheng Bian et al.
Multivariate, Multi-Frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation
Feiyu Chen, Jie Shao, Shuyuan Zhu et al.
Multi-View Adversarial Discriminator: Mine the Non-Causal Factors for Object Detection in Unseen Domains
Mingjun Xu, Lingyun Qin, Weijie Chen et al.
Multi-View Azimuth Stereo via Tangent Space Consistency
Xu Cao, Hiroaki Santo, Fumio Okura et al.
Multiview Compressive Coding for 3D Reconstruction
Chao-Yuan Wu, Justin Johnson, Jitendra Malik et al.
Multi-View Inverse Rendering for Large-Scale Real-World Indoor Scenes
Zhen Li, Lingli Wang, Mofang Cheng et al.
Multi-View Reconstruction Using Signed Ray Distance Functions (SRDF)
Pierre Zins, Yuanlu Xu, Edmond Boyer et al.
Multi-View Stereo Representation Revist: Region-Aware MVSNet
Yisu Zhang, Jianke Zhu, Lixiang Lin
Music-Driven Group Choreography
Nhat Le, Thang Pham, Tuong Do et al.
Mutual Information-Based Temporal Difference Learning for Human Pose Estimation in Video
Runyang Feng, Yixing Gao, Xueqing Ma et al.
MVImgNet: A Large-Scale Dataset of Multi-View Images
Xianggang Yu, Mutian Xu, Yidan Zhang et al.