Papers
11,015 papers found
Multimodal Lego: Model Merging and Fine-Tuning Across Topologies and Modalities in Biomedicine
Konstantin Hemker, Nikola Simidjievski, Mateja Jamnik
Multimodal Quantitative Language for Generative Recommendation
Jianyang Zhai, Zi-Feng Mai, Chang-Dong Wang et al.
Multimodal Situational Safety
Kaiwen Zhou, Chengzhi Liu, Xuandong Zhao et al.
Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap
Christopher Liao, Christian So, Theodoros Tsiligkaridis et al.
Multi-objective antibody design with constrained preference optimization
Milong Ren, ZaiKai He, Haicang Zhang
Multi-objective Differentiable Neural Architecture Search
Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler et al.
Multi-Perspective Data Augmentation for Few-shot Object Detection
Anh Khoa Nguyen Vu, Truong Quoc Truong, Vinh-Tiep Nguyen et al.
Multiple Heads are Better than One: Mixture of Modality Knowledge Experts for Entity Representation Learning
Yichi Zhang, Zhuo Chen, Lingbing Guo et al.
Multiplicative Logit Adjustment Approximates Neural-Collapse-Aware Decision Boundary Adjustment
Naoya Hasegawa, Issei Sato
Multi-Resolution Decomposable Diffusion Model for Non-Stationary Time Series Anomaly Detection
Guojin Zhong, pan wang, Jin Yuan et al.
Multi-Reward as Condition for Instruction-based Image Editing
Xin Gu, Ming Li, Libo Zhang et al.
Multi-Robot Motion Planning with Diffusion Models
Yorai Shaoul, Itamar Mishani, Shivam Vats et al.
Multi-Scale Fusion for Object Representation
Rongzhen Zhao, Vivienne Huiling Wang, Juho Kannala et al.
Multi-session, multi-task neural decoding from distinct cell-types and brain regions
Mehdi Azabou, Krystal Xuejing Pan, Vinam Arora et al.
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation
Sungnyun Kim, Sungwoo Cho, Sangmin Bae et al.
Multi-Task Dense Predictions via Unleashing the Power of Diffusion
Yuqi Yang, Peng-Tao Jiang, Qibin Hou et al.
Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning
Yang You, Yixin Li, Congyue Deng et al.
MuPT: A Generative Symbolic Music Pretrained Transformer
Xingwei Qu, yuelin bai, Yinghao Ma et al.
MuseGNN: Forming Scalable, Convergent GNN Layers that Minimize a Sampling-Based Energy
Haitian Jiang, Renjie Liu, Zengfeng Huang et al.
MUSE: Machine Unlearning Six-Way Evaluation for Language Models
Weijia Shi, Jaechan Lee, Yangsibo Huang et al.
Mutual Effort for Efficiency: A Similarity-based Token Pruning for Vision Transformers in Self-Supervised Learning
Sheng Li, Qitao Tan, Yue Dai et al.
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver
Zhenting Qi, Mingyuan MA, Jiahang Xu et al.
MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow
Hanzhuo Huang, Yuan Liu, Ge Zheng et al.
NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative
Asmar Nadeem, Faegheh Sardari, Robert Dawes et al.
Narrowing Information Bottleneck Theory for Multimodal Image-Text Representations Interpretability
Zhiyu Zhu, Zhibo Jin, Jiayu Zhang et al.