Papers
MolGrapher: Graph-based Visual Recognition of Chemical Structures
Lucas Morin, Martin Danelljan, Maria Isabel Agea et al.
Moment Detection in Long Tutorial Videos
Ioana Croitoru, Simion-Vlad Bogolin, Samuel Albanie et al.
Monocular 3D Object Detection with Bounding Box Denoising in 3D by Perceiver
Xianpeng Liu, Ce Zheng, Kelvin B Cheng et al.
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection
Renrui Zhang, Han Qiu, Tai Wang et al.
MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection
Junkai Xu, Liang Peng, Haoran Cheng et al.
MonoNeRF: Learning a Generalizable Dynamic Radiance Field from Monocular Videos
Fengrui Tian, Shaoyi Du, Yueqi Duan
Monte Carlo Linear Clustering with Single-Point Supervision is Enough for Infrared Small Target Detection
Boyang Li, Yingqian Wang, Longguang Wang et al.
MoreauGrad: Sparse and Robust Interpretation of Neural Networks via Moreau Envelope
Jingwei Zhang, Farzan Farnia
MosaiQ: Quantum Generative Adversarial Networks for Image Generation on NISQ Computers
Daniel Silver, Tirthak Patel, William Cutler et al.
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Henghui Ding, Chang Liu, Shuting He et al.
Most Important Person-Guided Dual-Branch Cross-Patch Attention for Group Affect Recognition
Hongxia Xie, Ming-Xian Lee, Tzu-Jui Chen et al.
MOST: Multiple Object Localization with Self-Supervised Transformers for Object Discovery
Sai Saketh Rambhatla, Ishan Misra, Rama Chellappa et al.
MoTIF: Learning Motion Trajectories with Local Implicit Neural Functions for Continuous Space-Time Video Super-Resolution
Yi-Hsin Chen, Si-Cun Chen, Yi-Hsin Chen et al.
MotionBERT: A Unified Perspective on Learning Human Motion Representations
Wentao Zhu, Xiaoxuan Ma, Zhaoyang Liu et al.
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos with Spherical Buffers and Padded Convolutions
Mathias Parger, Chengcheng Tang, Thomas Neff et al.
Motion-Guided Masking for Spatiotemporal Representation Learning
David Fan, Jue Wang, Shuai Liao et al.
MotionLM: Multi-Agent Motion Forecasting as Language Modeling
Ari Seff, Brian Cera, Dian Chen et al.
Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection
Zixuan Zhao, Dongqi Wang, Xu Zhao
MPCViT: Searching for Accurate and Efficient MPC-Friendly Vision Transformer with Heterogeneous Attention
Wenxuan Zeng, Meng Li, Wenjie Xiong et al.
MPI-Flow: Learning Realistic Optical Flow with Multiplane Images
Yingping Liang, Jiaming Liu, Debing Zhang et al.
MRM: Masked Relation Modeling for Medical Image Pre-Training with Genetics
Qiushi Yang, Wuyang Li, Baopu Li et al.
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition
Tianlun Zheng, Zhineng Chen, Bingchen Huang et al.
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree
Quang Hieu Vo, Linh-Tam Tran, Sung-Ho Bae et al.
MULLER: Multilayer Laplacian Resizer for Vision
Zhengzhong Tu, Peyman Milanfar, Hossein Talebi
Multi3DRefer: Grounding Text Description to Multiple 3D Objects
Yiming Zhang, ZeMing Gong, Angel X. Chang