Papers
4,428 papers found
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
Tooba Tehreem Sheikh, Jean Lahoud, Rao Muhammad Anwer et al.
MEGA-PCC: A Mamba-based Efficient Approach for Joint Geometry and Attribute Point Cloud Compression
Kai-Hsiang Hsieh, Monyneath Yim, Wen-Hsiao Peng et al.
MemeTAG: Keyword-Driven Meme Classification through Tag Embedding Reconstruction
Akshit Sharma, Prashant W Patil
Mem-MLP: Real-Time 3D Human Motion Generation from Sparse Inputs
Sinan Mutlu, Georgios F. Angelis, Savas Ozkan et al.
Memoire: Learning User Personas from Gallery Tags for Personalized Photo Curation
Praful Mathur, Mohsin Iftekhar, Aman Sharma et al.
Memory-Augmented Representation for Efficient Event-based Visuomotor Policy Learning with Adaptive Perception and Control
Uday Kamal, Saibal Mukhopadhyay
mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval
Kyeong Seon Kim, Baek Seong-Eun, Lee Jung-Mok et al.
M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models
Ju-Hsuan Weng, Jia-Wei Liao, Cheng-Fu Chou et al.
MergeSlide: Continual Model Merging and Task-to-Class Prompt-Aligned Inference for Lifelong Learning on Whole Slide Images
Doanh C. Bui, Ba Hung Ngo, Hoai Luan Pham et al.
Meta-YOLO: Metadata-Guided Real-Time Object Detector in Aerial Imagery
Deukryeol Yoon, Seonghak Kim, Young Hwa Sung et al.
milliMamba: Specular-Aware Human Pose Estimation via Dual mmWave Radar with Multi-Frame Mamba Fusion
Niraj Prakash Kini, Shiau-Rung Tsai, Guan-Hsun Lin et al.
MIST: Multilingual Incidental Dataset for Scene Text Detection
Saumya Mundra, Ajoy Mondal, C.V. Jawahar
Mitigating Backdoor Attacks via Trigger Reconstruction and Model Hardening
Guanhong Tao, Siyuan Cheng, Guangyu Shen et al.
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment
Kai-Po Chang, Wei-Yuan Cheng, Chi-Pin Huang et al.
Mitigating the Modality Gap: Few-Shot Out-of-Distribution Detection with Multi-modal Prototypes and Image Bias Estimation
Yimu Wang, Evelien Riddell, Adrian Chow et al.
MIX-based Foreground and Background Patch Augmentation Guided by Physics and Material Properties for X-ray Detection
Xintong Liu, Dongliang Chang, Yujun Tong et al.
Mixed Diffusion for 3D Indoor Scene Synthesis
Siyi Hu, Diego MartÃn Arroyo, Stephanie Debats et al.
MixER: From Cross-Modal to Mixed-Modal Visible-Infrared Re-Identification
Mahdi Alehdaghi, Rajarshi Bhattacharya, Dai Yannick et al.
MMCM: Multimodality-aware Metric using Clustering-based Modes for Probabilistic Human Motion Prediction
Kyotaro Tokoro, Hiromu Taketsugu, Norimichi Ukita
MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions
Kaen Kogashi, Anoop Cherian, Meng-Yu Jennifer Kuo
MM-TS: Multi-Modal Temperature and Margin Schedules for Contrastive Learning with Long-Tail Data
Siarhei Sheludzko, Dhimitrios Duka, Bernt Schiele et al.
mmWEAVER: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description
Mahathir Monjur, Shahriar Nirjon
Mobile-Oriented Video Diffusion: Enabling Text-to-Video Generation on Mobile Devices Without Retraining, Compression, or Pruning
Bosung Kim, Kyuhwan Lee, Isu Jeong et al.
Model-free Domain Adaptation for Concealed Multimodal Large-Language Models
Yu Mitsuzumi, Akisato Kimura, Hisashi Kashima