Papers
Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation
Samaneh Azadi, Akbar Shah, Thomas Hayes et al.
Make Encoder Great Again in 3D GAN Inversion through Geometry and Occlusion-Aware Encoding
Ziyang Yuan, Yiming Zhu, Yu Li et al.
Make-It-3D: High-fidelity 3D Creation from A Single Image with Diffusion Prior
Junshu Tang, Tengfei Wang, Bo Zhang et al.
MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
Rajeev Yasarla, Hong Cai, Jisoo Jeong et al.
Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations
Jianren Wang, Sudeep Dasari, Mohan Kumar Srirama et al.
MAPConNet: Self-supervised 3D Pose Transfer with Mesh and Point Contrastive Learning
Jiaze Sun, Zhixiang Chen, Tae-Kyun Kim
MapFormer: Boosting Change Detection by Using Pre-change Information
Maximilian Bernhard, Niklas Strauß, Matthias Schubert
MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models
Xiyue Zhu, Vlas Zyrianov, Zhijian Liu et al.
MAP: Towards Balanced Generalization of IID and OOD through Model-Agnostic Adapters
Min Zhang, Junkun Yuan, Yue He et al.
March in Chat: Interactive Prompting for Remote Embodied Referring Expression
Yanyuan Qiao, Yuankai Qi, Zheng Yu et al.
Markov Game Video Augmentation for Action Segmentation
Nicolas Aziere, Sinisa Todorovic
MARS: Model-agnostic Biased Object Removal without Additional Supervision for Weakly-Supervised Semantic Segmentation
Sanghyun Jo, In-Jae Yu, Kyungsu Kim
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
Mingdeng Cao, Xintao Wang, Zhongang Qi et al.
Mask-Attention-Free Transformer for 3D Instance Segmentation
Xin Lai, Yuhui Yuan, Ruihang Chu et al.
Masked Autoencoders are Efficient Class Incremental Learners
Jiang-Tian Zhai, Xialei Liu, Andrew D. Bagdanov et al.
Masked Autoencoders Are Stronger Knowledge Distillers
Shanshan Lao, Guanglu Song, Boxiao Liu et al.
Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao, Pan Zhou, Ming-Ming Cheng et al.
Masked Motion Predictors are Strong 3D Action Representation Learners
Yunyao Mao, Jiajun Deng, Wengang Zhou et al.
Masked Retraining Teacher-Student Framework for Domain Adaptive Object Detection
Zijing Zhao, Sitong Wei, Qingchao Chen et al.
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
Zhiqiang Shen, Xiaoxiao Sheng, Hehe Fan et al.
Masked Spiking Transformer
Ziqing Wang, Yuetong Fang, Jiahang Cao et al.
MasQCLIP for Open-Vocabulary Universal Image Segmentation
Xin Xu, Tianyi Xiong, Zheng Ding et al.
Mastering Spatial Graph Prediction of Road Networks
Anagnostidis Sotiris, Aurelien Lucchi, Thomas Hofmann
MAS: Towards Resource-Efficient Federated Multiple-Task Learning
Weiming Zhuang, Yonggang Wen, Lingjuan Lyu et al.
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge
Wei Lin, Leonid Karlinsky, Nina Shvetsova et al.