Papers
Masked Diffusion Transformer is a Strong Image Synthesizer
Shanghua Gao, Pan Zhou, Ming-Ming Cheng et al.
Masked Motion Predictors are Strong 3D Action Representation Learners
Yunyao Mao, Jiajun Deng, Wengang Zhou et al.
Masked Retraining Teacher-Student Framework for Domain Adaptive Object Detection
Zijing Zhao, Sitong Wei, Qingchao Chen et al.
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
Zhiqiang Shen, Xiaoxiao Sheng, Hehe Fan et al.
Masked Spiking Transformer
Ziqing Wang, Yuetong Fang, Jiahang Cao et al.
MasQCLIP for Open-Vocabulary Universal Image Segmentation
Xin Xu, Tianyi Xiong, Zheng Ding et al.
Mastering Spatial Graph Prediction of Road Networks
Sotiris Anagnostidis, Aurelien Lucchi, Thomas Hofmann
MAS: Towards Resource-Efficient Federated Multiple-Task Learning
Weiming Zhuang, Yonggang Wen, Lingjuan Lyu et al.
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge
Wei Lin, Leonid Karlinsky, Nina Shvetsova et al.
MATE: Masked Autoencoders are Online 3D Test-Time Learners
M. Jehanzeb Mirza, Inkyu Shin, Wei Lin et al.
MatrixCity: A Large-scale City Dataset for City-scale Neural Rendering and Beyond
Yixuan Li, Lihan Jiang, Linning Xu et al.
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception
Hongyu Zhou, Zheng Ge, Zeming Li et al.
MBPTrack: Improving 3D Point Cloud Tracking with Memory Networks and Box Priors
Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai et al.
MB-TaylorFormer: Multi-Branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing
Yuwei Qiu, Kaihao Zhang, Chenxi Wang et al.
MDCS: More Diverse Experts with Consistency Self-distillation for Long-tailed Recognition
Qihao Zhao, Chen Jiang, Wei Hu et al.
Measuring Asymmetric Gradient Discrepancy in Parallel Continual Learning
Fan Lyu, Qing Sun, Fanhua Shang et al.
MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis
Chaoyi Wu, Xiaoman Zhang, Ya Zhang et al.
MEFLUT: Unsupervised 1D Lookup Tables for Multi-exposure Image Fusion
Ting Jiang, Chuan Wang, Xinpeng Li et al.
MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation
Najmeh Sadoughi, Xinyu Li, Avijit Vajpayee et al.
Membrane Potential Batch Normalization for Spiking Neural Networks
Yufei Guo, Yuhan Zhang, Yuanpei Chen et al.
Memory-and-Anticipation Transformer for Online Action Understanding
Jiahao Wang, Guo Chen, Yifei Huang et al.
MemorySeg: Online LiDAR Semantic Segmentation with a Latent Memory
Enxu Li, Sergio Casas, Raquel Urtasun
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
Ruopeng Gao, Limin Wang
Mesh2Tex: Generating Mesh Textures from Image Queries
Alexey Bokhovkin, Shubham Tulsiani, Angela Dai
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation
Chongjian Ge, Junsong Chen, Enze Xie et al.