Papers
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
Daniel Tamayo, Aitor Gonzalez-Agirre, Javier Hernando et al.
Match More, Extract Better! Hybrid Matching Model for Open Domain Web Keyphrase Extraction
Mingyang Song, Liping Jing, Yi Feng
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Hongwei Liu, Zilong Zheng, Yuxuan Qiao et al.
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs
Zimu Lu, Aojun Zhou, Houxing Ren et al.
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Peiyi Wang, Lei Li, Zhihong Shao et al.
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
Zhiyu Yang, Zihan Zhou, Shuo Wang et al.
MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources
Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald et al.
MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation
Xiaozhi Wang, Hao Peng, Yong Guan et al.
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends
Giuliano Martinelli, Edoardo Barba, Roberto Navigli
MBIAS: Mitigating Bias in Large Language Models While Retaining Context
Shaina Raza, Ananya Raval, Veronica Chatrath
mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs
Gregor Geigle, Abhay Jain, Radu Timofte et al.
MC2: Towards Transparent and Culturally-Aware NLP for Minority Languages in China
Chen Zhang, Mingxu Tao, Quzhe Huang et al.
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Huiyuan Lai, Malvina Nissim
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe
Measuring and Addressing Indexical Bias in Information Retrieval
Caleb Ziems, William Held, Jane Dwivedi-Yu et al.
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method
Tian Xia, Zhiwei He, Tong Ren et al.
Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models
Changjiang Gao, Jixing Li, Jiajun Chen et al.
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said
Yejin Bang, Delong Chen, Nayeon Lee et al.
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg et al.
Measuring the Inconsistency of Large Language Models in Preferential Ranking
Xiutian Zhao, Ke Wang, Wei Peng
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
Xiangru Tang, Anni Zou, Zhuosheng Zhang et al.
MedDec: A Dataset for Extracting Medical Decisions from Discharge Summaries
Mohamed Elgaar, Jiali Cheng, Nidhi Vakil et al.
MedExQA: Medical Question Answering Benchmark with Multiple Explanations
Yunsoo Kim, Jinge Wu, Yusuf Abdulle et al.
Media Framing: A typology and Survey of Computational Approaches Across Disciplines
Yulia Otmakhova, Shima Khanehzar, Lea Frermann
Medical Dialogue System: A Survey of Categories, Methods, Evaluation and Challenges
Xiaoming Shi, Zeming Liu, Li Du et al.