Papers
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling With Informative-Preserved Reconstruction and Self-Distilled Consistency
Mingye Xu, Mutian Xu, Tong He et al.
MMANet: Margin-Aware Distillation and Modality-Aware Regularization for Incomplete Multimodal Learning
Shicai Wei, Chunbo Luo, Yang Luo
MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Ludan Ruan, Yiyang Ma, Huan Yang et al.
MMG-Ego4D: Multimodal Generalization in Egocentric Action Recognition
Xinyu Gong, Sreyas Mohan, Naina Dhingra et al.
MMVC: Learned Multi-Mode Video Compression With Block-Based Prediction Mode Selection and Density-Adaptive Entropy Coding
Bowen Liu, Yu Chen, Rakesh Chowdary Machineni et al.
MobileBrick: Building LEGO for 3D Reconstruction on Mobile Devices
Kejie Li, Jia-Wang Bian, Robert Castle et al.
MobileNeRF: Exploiting the Polygon Rasterization Pipeline for Efficient Neural Field Rendering on Mobile Architectures
Zhiqin Chen, Thomas Funkhouser, Peter Hedman et al.
MobileOne: An Improved One Millisecond Mobile Backbone
Pavan Kumar Anasosalu Vasu, James Gabriel, Jeff Zhu et al.
Mobile User Interface Element Detection via Adaptively Prompt Tuning
Zhangxuan Gu, Zhuoer Xu, Haoxing Chen et al.
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning Meets Knowledge Distillation
Roy Miles, Mehmet Kerim Yucel, Bruno Manganelli et al.
Modality-Agnostic Debiasing for Single Domain Generalization
Sanqing Qu, Yingwei Pan, Guang Chen et al.
Modality-Invariant Visual Odometry for Embodied Vision
Marius Memmel, Roman Bachmann, Amir Zamir
MoDAR: Using Motion Forecasting for 3D Object Detection in Point Cloud Sequences
Yingwei Li, Charles R. Qi, Yin Zhou et al.
Model-Agnostic Gender Debiased Image Captioning
Yusuke Hirota, Yuta Nakashima, Noa Garcia
Model Barrier: A Compact Un-Transferable Isolation Domain for Model Intellectual Property Protection
Lianyu Wang, Meng Wang, Daoqiang Zhang et al.
Modeling Entities As Semantic Points for Visual Information Extraction in the Wild
Zhibo Yang, Rujiao Long, Pengfei Wang et al.
Modeling Inter-Class and Intra-Class Constraints in Novel Class Discovery
Wenbin Li, Zhichen Fan, Jing Huo et al.
Modeling the Distributional Uncertainty for Salient Object Detection Models
Xinyu Tian, Jing Zhang, Mochu Xiang et al.
Modeling Video As Stochastic Processes for Fine-Grained Video Representation Learning
Heng Zhang, Daqing Liu, Qi Zheng et al.
Modernizing Old Photos Using Multiple References via Photorealistic Style Transfer
Agus Gunawan, Soo Ye Kim, Hyeonjun Sim et al.
MoDi: Unconditional Motion Synthesis From Diverse Data
Sigal Raab, Inbal Leibovitch, Peizhuo Li et al.
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners
Zitian Chen, Yikang Shen, Mingyu Ding et al.
Modular Memorability: Tiered Representations for Video Memorability Prediction
Théo Dumont, Juan Segundo Hevia, Camilo L. Fosco
Mofusion: A Framework for Denoising-Diffusion-Based Motion Synthesis
Rishabh Dabral, Muhammad Hamza Mughal, Vladislav Golyanik et al.
MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition
Xiang Wang, Shiwei Zhang, Zhiwu Qing et al.