Papers
MARIO: MAth Reasoning with code Interpreter Output - A Reproducible Pipeline
Minpeng Liao, Chengxi Li, Wei Luo et al.
MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs
Yavuz Faruk Bakman, Duygu Nur Yaldiz, Baturalp Buyukates et al.
MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin
Tianshuo Zhou, Sen Mei, Xinze Li et al.
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
Changyu Chen, Xiting Wang, Ting-En Lin et al.
MaskLID: Code-Switching Language Identification through Iterative Masking
Amir Hossein Kargaran, François Yvon, Hinrich Schuetze
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
Daniel Tamayo, Aitor Gonzalez-Agirre, Javier Hernando et al.
Match More, Extract Better! Hybrid Matching Model for Open Domain Web Keyphrase Extraction
Mingyang Song, Liping Jing, Yi Feng
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark
Hongwei Liu, Zilong Zheng, Yuxuan Qiao et al.
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs
Zimu Lu, Aojun Zhou, Houxing Ren et al.
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Peiyi Wang, Lei Li, Zhihong Shao et al.
MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization
Zhiyu Yang, Zihan Zhou, Shuo Wang et al.
MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources
Dongkyu Lee, Chandana Satya Prakash, Jack FitzGerald et al.
MAVEN-ARG: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation
Xiaozhi Wang, Hao Peng, Yong Guan et al.
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends
Giuliano Martinelli, Edoardo Barba, Roberto Navigli
MBIAS: Mitigating Bias in Large Language Models While Retaining Context
Shaina Raza, Ananya Raval, Veronica Chatrath
mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs
Gregor Geigle, Abhay Jain, Radu Timofte et al.
MC2: Towards Transparent and Culturally-Aware NLP for Minority Languages in China
Chen Zhang, Mingxu Tao, Quzhe Huang et al.
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
Huiyuan Lai, Malvina Nissim
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe
Measuring and Addressing Indexical Bias in Information Retrieval
Caleb Ziems, William Held, Jane Dwivedi-Yu et al.
Measuring Bargaining Abilities of LLMs: A Benchmark and A Buyer-Enhancement Method
Tian Xia, Zhiwei He, Tong Ren et al.
Measuring Meaning Composition in the Human Brain with Composition Scores from Large Language Models
Changjiang Gao, Jixing Li, Jiajun Chen et al.
Measuring Political Bias in Large Language Models: What Is Said and How It Is Said
Yejin Bang, Delong Chen, Nayeon Lee et al.
Measuring Retrieval Complexity in Question Answering Systems
Matteo Gabburo, Nicolaas Paul Jedema, Siddhant Garg et al.
Measuring the Inconsistency of Large Language Models in Preferential Ranking
Xiutian Zhao, Ke Wang, Wei Peng