Papers
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation
Rongyu Zhang, Menghang Dong, Yuan Zhang et al.
MoLoRA: Boosting LLM-based End-to-end Speech Translation with Mixture of Low-rank Experts
Hao Zhang, Yaqi Chen, Nianwen Si et al.
MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning
Wenrui Zhang, Xinggang Wang, Bin Feng et al.
MoMoREC: A Multi-agent Motivation Generation Framework for Residual Semantic ID-Aware Recommendation
Yige Wang, Mingming Li, Li Wang et al.
Mono3DVG-EnSD: Enhanced Spatial-aware and Dimension-decoupled Text Encoding for Monocular 3D Visual Grounding
Yuzhen Li, Min Liu, Zhaoyang Li et al.
MonoCLUE: Object-Aware Clustering Enhances Monocular 3D Object Detection
Sunghun Yang, Minhyeok Lee, Jungho Lee et al.
Monocular Mesh Recovery and Body Measurement of Female Saanen Goats
Bo Jin, Shichao Zhao, Jin Lyu et al.
Monocular Vehicle Pose and Shape Reconstruction via Dynamic Context Adaptation and Progressive Geometry Refinement
Wei Li, Long Ji, Ying Wang et al.
MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
Shuo Wang, Yongcai Wang, Zhaoxin Fan et al.
Monte Carlo Diffusion for Generalizable Learning-Based RANSAC
Jiale Wang, Chen Zhao, Wei Ke et al.
Moral Change or Noise? On Problems of Aligning AI with Temporally Unstable Human Feedback
Vijay Keswani, Cyrus Cousins, Breanna Nguyen et al.
MoReMouse: Monocular Reconstruction of Laboratory Mouse
Yuan Zhong, Jingxiang Sun, Zhongbin Zhang et al.
More than Irrational: Modeling Belief-Biased Agents
Yifan Zhu, Sammie Katt, Samuel Kaski
MORGAN: To Bridge Mixture of Experts and Spectral Graph Neural Network
Lihui Liu, Yuchen Yan
MosaicDoc: A Large-Scale Bilingual Benchmark for Visually Rich Document Understanding
Ketong Chen, Yuhao Chen, Yang Xue
Mosaic Pruning: A Hierarchical Framework for Generalizable Pruning of Mixture-of-Experts Models
Wentao Hu, Mingkuan Zhao, Shuangyong Song et al.
MoSE: Hierarchical Self-Distillation Enhances Early Layer Embeddings
Andrea Gurioli, Federico Pennino, Joao Monteiro et al.
MoSs: Mixture of Scales for Efficient High-Resolution Autoregressive Image Generation
Yaoxiu Lian, Hao Liang, Zhihong Gou et al.
MOTIF: Multi-strategy Optimization via Turn-based Interactive Framework
Nguyen Viet Tuan Kiet, Tung Dao, Cong Dao Tran et al.
Motion-Aware Object Tracking via Motion and Geometry-Aware Cues
Hongtao Yang, Bineng Zhong, Qihua Liang et al.
MotionCharacter: Fine-Grained Motion Controllable Human Video Generation
Haopeng Fang, Di Qiu, Binjie Mao et al.
MotionFlow: Attention-Driven Motion Transfer in Video Diffusion Models
Tuna Han Salih Meral, Hidir Yesiltepe, Connor Dunlop et al.
MotionPhysics: Learnable Motion Distillation for Text-Guided Simulation
Miaowei Wang, Jakub Zadrożny, Oisin Mac Aodha et al.