Papers
Med-HALT: Medical Domain Hallucination Test for Large Language Models
Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu
MediaHG: Rethinking Eye-catchy Features in Social Media Headline Generation
Boning Zhang, Yang Yang
Medical Text Simplification: Optimizing for Readability with Unlikelihood Training and Reranked Beam Search Decoding
Lorenzo Jaime Flores, Heyuan Huang, Kejian Shi et al.
MEE4 and XLsim : IIIT HYD’s Submissions’ for WMT23 Metrics Shared Task
Ananya Mukherjee, Manish Shrivastava
MEEP: Is this Engaging? Prompting Large Language Models for Dialogue Evaluation in Multilingual Settings
Amila Ferron, Amber Shore, Ekata Mitra et al.
MEGA: Multilingual Evaluation of Generative AI
Kabir Ahuja, Harshita Diddee, Rishav Hada et al.
MEGClass: Extremely Weakly Supervised Text Classification via Mutually-Enhancing Text Granularities
Priyanka Kargupta, Tanay Komarlu, Susik Yoon et al.
MemeCap: A Dataset for Captioning and Interpreting Memes
EunJeong Hwang, Vered Shwartz
Memorisation Cartography: Mapping out the Memorisation-Generalisation Continuum in Neural Machine Translation
Verna Dankers, Ivan Titov, Dieuwke Hupkes
Memory-Based Invariance Learning for Out-of-Domain Text Classification
Chen Jia, Yue Zhang
Memory Injections: Correcting Multi-Hop Reasoning Failures During Inference in Transformer-Based Language Models
Mansi Sakarvadia, Aswathy Ajith, Arham Khan et al.
MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models
Yifan Wei, Yisong Su, Huanhuan Ma et al.
Merging Experts into One: Improving Computational Efficiency of Mixture of Experts
Shwai He, Run-Ze Fan, Liang Ding et al.
Merging Generated and Retrieved Knowledge for Open-Domain QA
Yunxiang Zhang, Muhammad Khalifa, Lajanugen Logeswaran et al.
Meta-Learning of Prompt Generation for Lightweight Prompt Engineering on Language-Model-as-a-Service
Hyeonmin Ha, Jihye Lee, Wookje Han et al.
Meta-Learning Online Adaptation of Language Models
Nathan Hu, Eric Mitchell, Christopher Manning et al.
METAPROBE: A Representation- and Task-Agnostic Probe
Yichu Zhou, Vivek Srikumar
MetaReVision: Meta-Learning with Retrieval for Visually Grounded Compositional Concept Acquisition
Guangyue Xu, Parisa Kordjamshidi, Joyce Chai
Methodological Insights in Detecting Subtle Semantic Shifts with Contextualized and Static Language Models
Sanne Hoeken, Özge Alacam, Antske Fokkens et al.
Metric Score Landscape Challenge (MSLC23): Understanding Metrics’ Performance on a Wider Landscape of Translation Quality
Chi-kiu Lo, Samuel Larkin, Rebecca Knowles
MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task
Juraj Juraska, Mara Finkelstein, Daniel Deutsch et al.
MILAB at PragTag-2023: Enhancing Cross-Domain Generalization through Data Augmentation with Reduced Uncertainty
Yoonsang Lee, Dongryeol Lee, Kyomin Jung
MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments
Debtanu Datta, Shubham Soni, Rajdeep Mukherjee et al.
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
Damien Sileo, Antoine Lernould
Mind the Gap: Automated Corpus Creation for Enthymeme Detection and Reconstruction in Learner Arguments
Maja Stahl, Nick Düsterhus, Mei-Hua Chen et al.