Papers
Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients
Niklas Penzel, Joachim Denzler
LogicCBMs: Logic-Enhanced Concept-Based Learning
Deepika SN Vemuri, Gautham Bellamkonda, Aditya Pola et al.
Logit-Adjusted Test-Time Adaptation under Partial Class Imbalance
Thilina Weerasinghe, Ruwan Tennakoon, WeiQin Chuah et al.
LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization
Jie Li, Kwan-Yee K. Wong, Kai Han
Lorentz Entailment Cone for Semantic Segmentation
Zahid Hasan, Masud Ahmed, Nirmalya Roy
Lose Your Self (LoYS): An Adversarial Entropy-based Unsupervised Approach for Model Debiasing
Vito Paolo Pastore, Massimiliano Ciranni, Vittorio Murino
Low-Rank Expert Merging for Multi-Source Domain Adaptation in Person Re-Identification
Taha Mustapha Nehdi, Nairouz Mrabah, Atif Belal et al.
LVM-Lite: Training Large Vision Models with Efficient Sequential Modeling
Xianhang Li, Hongru Zhu, Sucheng Ren et al.
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
Hongyu Wang, Jiayu Xu, Senwei Xie et al.
MAESTRO: Masked AutoEncoders for Multimodal, Multitemporal, and Multispectral Earth Observation Data
Antoine Labatie, Michael Vaccaro, Nina Lardiere et al.
MAFM3: Modular Adaptation of Foundation Models for Multi-Modal Medical AI
Mohammad Areeb Qazi, Munachiso S Nwadike, Ibrahim Almakky et al.
MageBench: Bridging Large Multimodal Models to Agents
Miaosen Zhang, Qi Dai, Yifan Yang et al.
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Ruiyuan Gao, Kai Chen, Zhihao Li et al.
MANTA: Physics-Informed Generalized Underwater Object Tracking
Suhas Srinath, Hemang Jamadagni, Aditya Chandrasekar et al.
MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping
Vineet Bhat, Naman Patel, Prashanth Krishnamurthy et al.
MapVerse: A Benchmark for Geospatial Question Answering on Diverse Real-World Maps
Sharat Bhat, Harshita Khandelwal, Tushar Kataria et al.
MarineEval: Assessing the Marine Intelligence of Vision-Language Models
Yuk Kwan Wong, Tuan-An To, Jipeng Zhang et al.
MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation
Nico Catalano, Stefano Samele, Paolo Pertino et al.
Marshaled Learning: Bridging Large Neural Networks with Memory-Constrained Trusted Execution Environments in Federated Learning
Shiwei Ding, Xiaoyong Yuan, Zhenlin Wang et al.
Matching Semantically Similar Non-Identical Objects
Yusuke Marumo, Kazuhiko Kawamoto, Satomi Tanaka et al.
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding
Pengyi Li, Irina Abdullaeva, Alexander Gambashidze et al.
MBTI: Metric-Based Textual Inversion for Fine-Grained Image Generation
Byungkwan Chae, Youngjae Choi, Heewon Kim
MDUNet: Multimodal Decoding UNet for Passive Occluder-Aided Non-line-of-sight 3D Imaging
Fadlullah Raji, John Murray-Bruce
Mean-Shift Distillation for Diffusion Mode Seeking
Vikas Thamizharasan, Nikitas Chatzis, Iliyan Georgiev et al.
MEDAL: multi-modal MEta-space Distillation and ALignment for Visual Compatibility Learning
Dween Rabius Sanny, Vinay Kumar Verma, Prateek Sircar et al.