Papers
Measuring Distribution Shift in User Prompts and Its Effects on LLM Performance
Parker Seegmiller, Sarah Masud Preum
Measuring Human Contribution in AI-Assisted Content Generation
Yueqi Xie, Tao Qi, Jingwei Yi et al.
Measuring Idiomaticity in Text Embedding Models with epsilon-compositionality
Sondre Wold, Étienne Simon, Erik Velldal et al.
Measuring Linguistic Competence of LLMs on Indigenous Languages of the Americas
Justin Vasselli, Arturo Mp, Frederikus Hudi et al.
Measuring LLMs’ Sensitivity to Paraphrased Opinion Prompts
Bushra Alhetelah, Irfan Ahmad
Measuring Mechanistic Independence: Can Bias Be Removed Without Erasing Demographics?
Zhengyang Shan, Aaron Mueller
Measuring Model Performance in the Presence of an Intervention
Winston Chen, Michael W. Sjoding, Jenna Wiens
Measuring Social Bias in Vision-Language Models with Face-Only Counterfactuals from Real Photos
Haodong Chen, Qiang Huang, Jiaqi Zhao et al.
Measuring Social Integration Through Participation: Categorizing Organizations and Leisure Activities in the Displaced Karelians Interview Archive using LLMs
Joonatan Laato, Veera Schroderus, Jenna Kanerva et al.
Measuring the Symbolic Power of Languages with LLM-based Multilingual Persuasion Simulation
Yin Jou Huang, Fei Cheng
Measuring the Unmeasurable: Unveiling Latent Cognitive Capabilities of LLM
Cui Danxin, Sihang Jiang, Keyi Wang et al.
Measuring User’s Mental Models of Speech Translation in Human-AI Collaboration
HyoJung Han, Nishant Balepur, Jordan Lee Boyd-Graber et al.
Measuring What Matters!! Assessing Therapeutic Principles in Mental-Health Conversation
Abdullah Mazhar, Het Riteshkumar Shah, Aseem Srivastava et al.
Measuring What Matters: Scenario-Driven Evaluation for Trajectory Predictors in Autonomous Driving
Longchao Da, David Isele, Hua Wei et al.
MECH: A Cost-Effective Multi-Task Cascade Framework for Classroom Opinion Evolution Recognition
Yancui Li, Xiaoyu Zhou, Guoyi Miao et al.
MechaFormer: Sequence Learning for Kinematic Mechanism Design Automation
Diana Bolanos, Mohammadmehdi Ataei, Pradeep Kumar Jayaraman
Mechanisms of Prompt-Induced Hallucination in Vision–Language Models
William Rudman, Michal Golovanevsky, Dana Arad et al.
Mechanistic Analysis Of Universality: Numerical Comparison Circuits Across Transformer Architectures
Arya Bhardia, Julian Ramirez, Siddhanta Verma et al.
Mechanistic Dissection of Cross-Attention Subspaces in Text-to-Image Diffusion Models
Jun-Hyun Bae, Wonyong Jo, Jaehyup Lee et al.
Mechanistic Interpretability Should Prioritize Feature Consistency in Sparse Autoencoders
Xiangchen Song, Aashiq Muhamed, Yujia Zheng et al.
MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Dialogue Evaluators
John Mendonça, Alon Lavie, Isabel Trancoso
MEDAL: multi-modal MEta-space Distillation and ALignment for Visual Compatibility Learning
Dween Rabius Sanny, Vinay Kumar Verma, Prateek Sircar et al.
MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text
Ronghao Xu, Zhen Huang, Yangbo Wei et al.
MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval
Shuheng Chen, Namratha Patil, Haonan Pan et al.