Papers
MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang, Yahan Yu, Jiahua Dong et al.
MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception
Yuhao Wang, Yusheng Liao, Heyang Liu et al.
MM-SOC: Benchmarking Multimodal Large Language Models in Social Media Platforms
Yiqiao Jin, Minje Choi, Gaurav Verma et al.
MMToM-QA: Multimodal Theory of Mind Question Answering
Chuanyang Jin, Yutong Wu, Jing Cao et al.
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
Shihan Deng, Weikai Xu, Hongda Sun et al.
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech
Shengpeng Ji, Ziyue Jiang, Hanting Wang et al.
MODABS: Multi-Objective Learning for Dynamic Aspect-Based Summarization
Xiaobo Guo, Soroush Vosoughi
Modality-Aware Integration with Large Language Models for Knowledge-Based Visual Question Answering
Junnan Dong, Qinggang Zhang, Huachi Zhou et al.
MODDP: A Multi-modal Open-domain Chinese Dataset for Dialogue Discourse Parsing
Chen Gong, DeXin Kong, Suxian Zhao et al.
Model Composition for Multimodal Large Language Models
Chi Chen, Yiyang Du, Zheng Fang et al.
Model Editing at Scale leads to Gradual and Catastrophic Forgetting
Akshat Gupta, Anurag Rao, Gopala Anumanchipalli
Model Editing by Standard Fine-Tuning
Govind Krishnan Gangadhar, Karl Stratos
Modeling Complex Interactions in Long Documents for Aspect-Based Sentiment Analysis
Zehong Yan, Wynne Hsu, Mong-Li Lee et al.
Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion
Xiaobao Wu, Xinshuai Dong, Liangming Pan et al.
Modeling Emotional Trajectories in Written Stories Utilizing Transformers and Weakly-Supervised Learning
Lukas Christ, Shahin Amiriparian, Manuel Milling et al.
Modeling Overregularization in Children with Small Language Models
Akari Haga, Saku Sugawara, Akiyo Fukatsu et al.
Modeling Uncertainty and Using Post-fusion as Fallback Improves Retrieval Augmented Generation with LLMs
Ye Liu, Rui Meng, Meghana Moorthy Bhat et al.
Modelling Commonsense Commonalities with Multi-Facet Concept Embeddings
Hanane Kteich, Na Li, Usashi Chatterjee et al.
Modelling Variability in Human Annotator Simulation
Wen Wu, Wenlin Chen, Chao Zhang et al.
MODOS at ArAIEval Shared Task: Multimodal Propagandistic Memes Classification Using Weighted SAM, CLIP and ArabianGPT
Abdelhamid Haouhat, Hadda Cherroun, Slimane Bellaouar et al.
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
Xuxin Cheng, Zhihong Zhu, Xianwei Zhuang et al.
MoExtend: Tuning New Experts for Modality and Task Extension
Shanshan Zhong, Shanghua Gao, Zhongzhan Huang et al.
Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion
Duong Thanh Tran, Nhat Truong Pham, Nguyen Doan Hieu Nguyen et al.
MolTC: Towards Molecular Relational Modeling In Language Models
Junfeng Fang, Shuai Zhang, Chang Wu et al.
Monitoring Depression Severity and Symptoms in User-Generated Content: An Annotation Scheme and Guidelines
Falwah Alhamed, Rebecca Bendayan, Julia Ive et al.