Papers
4,428 papers found
Modeling and Learning Multiple Hypotheses for Monocular 3D Object Detection
Hyeonjeong Park, Peixi Xiong, Pei Yu et al.
Moire Zero: An Efficient and High-Performance Neural Architecture for Moire Removal
Seungryong Lee, Woojeong Baek, Younghyun Kim et al.
MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval
Seojeong Park, Jiho Choi, Kyungjune Baek et al.
MooTrack360: A Novel Fisheye Camera Dataset for Robust Multi Diary Cow Detection and Tracking
Rasmus Gjerlund K. Christiansen, Toan Van Nguyen, Lasse Rose Malskær et al.
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Dongki Jung, Jaehoon Choi, Yonghan Lee et al.
More Than Memory Savings: Zeroth-Order Optimization Mitigates Forgetting in Continual Learning
Wanhao Yu, Zheng Wang, Shuteng Niu et al.
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
Seyedehanita Madani, Vishal M. Patel
MorphXAI: An Explainable Framework for Morphological Analysis of Parasites in Blood Smear Images
Aqsa Yousaf, Sint Sint Win, Megan Coffee et al.
MoSCo: Real-time and Efficient Text-to-Motion Synthesis via Delta Training
Zhiyuan Zhang, Lingqiao Liu
Motion-Aware Graph Fusion Network for 3D Human Pose Estimation
Yen Pham, Xiaohui Yuan, Chengyuan Zhuang
MR-Pruner: Training-free Multi-resolution Visual Token Pruning for Multi-modal Large Language Models
Seunghoon Han, Hyewon Lee, Soyoung Park et al.
MSRTrack: LLM-Powered Object Tracking with Motion and Semantic Reasoning
Tong Shen, Di Wang, José M. F. Moura
Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios
Mingwei Tang, Jiahao Nie, Guang Yang et al.
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships
Futa Waseda, Antonio Tejero-de-Pablos, Isao Echizen
Multimodal Graph Representation Learning over Arbitrary Sets of Modalities
Santosh Patapati, Trisanth Srinivasan
Multimodal Medical Image Binding via Shared Text Embeddings
Yunhao Liu, Suyang Xi, Shiqi Liu et al.
Multi-Modal Soccer Scene Analysis with Masked Pre-Training
Marc Peral, Guillem Capellera, Luis Ferraz et al.
Multi-view Stereo with Multiple Projectors for Oneshot Entire Shape Scan based on Neural SDF and DSSS Demultiplexing
Kota Nishihara, Ryo Furukawa, Ryusuke Sagawa et al.
MuSACo: Multimodal Subject-Specific Selection and Adaptation for Expression Recognition with Co-Training
Muhammad Osama Zeeshan, Natacha Gillet, Alessandro Lameiras Koerich et al.
MuseDance: A Diffusion-based Music-Driven Image Animation System
Zhikang Dong, Weituo Hao, Ju-Chiang Wang et al.
MUSE: Model-based Uncertainty-aware Similarity Estimation for zero-shot 2D Object Detection and Segmentation
Sungmin Cho, Sungbum Park, Insoo Oh
MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection
Saad Lahlali, Alexandre Fournier-Mongieux, Nicolas Granger et al.
NAPP: Noise-Adaptive Prototype Perturbation for Few-Shot Learning
Ilhwan Kim, Sangwoo Yun, Dongheon Lee et al.
Narrating For You: Prompt-guided Audio-visual Narrating Face Generation Employing Multi-entangled Latent Space
Aashish Chandra K, Aashutosh A V, Abhijit Das
NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction
Thomas Monninger, Zihan Zhang, Steffen Staab et al.