Papers
Multi-Task Transfer Matters During Instruction-Tuning
David Mueller, Mark Dredze, Nicholas Andrews
muNERa at WojoodNER 2024: Multi-tasking NER Approach
Nouf Alotaibi, Haneen Alhomoud, Hanan Murayshid et al.
Must NLP be Extractive?
Steven Bird
MusTQ: A Temporal Knowledge Graph Question Answering Dataset for Multi-Step Temporal Reasoning
Tingyi Zhang, Jiaan Wang, Zhixu Li et al.
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
Marta R. Costa-jussà, Mariano Coria Meglioli, Pierre Andrews et al.
“My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Xinpeng Wang, Bolei Ma, Chengzhi Hu et al.
My Climate Advisor: An Application of NLP in Climate Adaptation for Agriculture
Vincent Nguyen, Sarvnaz Karimi, Willow Hallgren et al.
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling
Tomasz Limisiewicz, Terra Blevins, Hila Gonen et al.
NACL: A General and Effective KV Cache Eviction Framework for LLM at Inference Time
Yilong Chen, Guoxia Wang, Junyuan Shang et al.
NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task
Muhammad Abdul-Mageed, Amr Keleg, AbdelRahim Elmadany et al.
NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data
Manuel Tonneau, Pedro Quinta De Castro, Karim Lasri et al.
NAIST Simultaneous Speech Translation System for IWSLT 2024
Yuka Ko, Ryo Fukuda, Yuta Nishikawa et al.
Naming, Describing, and Quantifying Visual Objects in Humans and LLMs
Alberto Testoni, Juell Sprott, Sandro Pezzelle
Narrative Navigators at FIGNEWS 2024 Shared Task: New Frontiers in Bias and Propaganda Annotation Techniques
Maryam AlEmadi, Jana ElMesselmani, Lyna Bermak et al.
Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns
Antonina Sinelnik, Dirk Hovy
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers
Gal Yona, Roee Aharoni, Mor Geva
NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries
Shudan Zhang, Hanlin Zhao, Xiao Liu et al.
Natural Language Satisfiability: Exploring the Problem Distribution and Evaluating Transformer-based Language Models
Tharindu Madusanka, Ian Pratt-Hartmann, Riza Batista-Navarro
Navigate through Enigmatic Labyrinth A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future
Zheng Chu, Jingchang Chen, Qianglong Chen et al.
Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models
Zihao Lin, Mohammad Beigi, Hongxuan Li et al.
Navigating the Metrics Maze: Reconciling Score Magnitudes and Accuracies
Tom Kocmi, Vilém Zouhar, Christian Federmann et al.
Navigating the OverKill in Large Language Models
Chenyu Shi, Xiao Wang, Qiming Ge et al.
Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors
Ying Zhou, Ben He, Le Sun
Negative Object Presence Evaluation (NOPE) to Measure Object Hallucination in Vision-Language Models
Holy Lovenia, Wenliang Dai, Samuel Cahyawijaya et al.
NEO-BENCH: Evaluating Robustness of Large Language Models with Neologisms
Jonathan Zheng, Alan Ritter, Wei Xu