Papers
Open Grounded Planning: Challenges and Benchmark Construction
Shiguang Guo, Ziliang Deng, Hongyu Lin et al.
Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark
Chanjun Park, Hyeonwoo Kim, Dahyun Kim et al.
Open-Set Semi-Supervised Text Classification via Adversarial Disagreement Maximization
Junfan Chen, Richong Zhang, Junchi Chen et al.
OpenToM: A Comprehensive Benchmark for Evaluating Theory-of-Mind Reasoning Capabilities of Large Language Models
Hainiu Xu, Runcong Zhao, Lixing Zhu et al.
OpenVNA: A Framework for Analyzing the Behavior of Multimodal Language Understanding System under Noisy Scenarios
Ziqi Yuan, Baozheng Zhang, Hua Xu et al.
OpenWebAgent: An Open Toolkit to Enable Web Agents on Large Language Models
Iat Long Iong, Xiao Liu, Yuxuan Chen et al.
OPEx: A Component-Wise Analysis of LLM-Centric Agents in Embodied Instruction Following
Haochen Shi, Zhiyuan Sun, Xingdi Yuan et al.
Optimal Transport Guided Correlation Assignment for Multimodal Entity Linking
Zefeng Zhang, Jiawei Sheng, Chuang Zhang et al.
Optimizing Multimodal Large Language Models for Detection of Alcohol Advertisements via Adaptive Prompting
Daniel Cabrera Lozoya, Jiahe Liu, Simon D’Alfonso et al.
Order-Agnostic Data Augmentation for Few-Shot Named Entity Recognition
Huiming Wang, Liying Cheng, Wenxuan Zhang et al.
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection
Chenyang Huang, Abbas Ghaddar, Ivan Kobyzev et al.
Outdated Issue Aware Decoding for Factual Knowledge Editing
Zengkui Sun, Yijin Liu, Jiaan Wang et al.
Out-of-Domain Dependency Parsing for Dialects of Arabic: A Case Study
Noor Abo Mokh, Daniel Dakota, Sandra Kübler
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System
Chen Chen, Ruizhe Li, Yuchen Hu et al.
Overview of DialAM-2024: Argument Mining in Natural Language Dialogues
Ramon Ruiz-Dolz, John Lawrence, Ella Schad et al.
Overview of PerspectiveArg2024 The First Shared Task on Perspective Argument Retrieval
Neele Falk, Andreas Waldis, Iryna Gurevych
Overview of #SMM4H 2024 – Task 2: Cross-Lingual Few-Shot Relation Extraction for Pharmacovigilance in French, German, and Japanese
Lisa Raithel, Philippe Thomas, Bhuvanesh Verma et al.
Overview of the 9th Social Media Mining for Health Applications (#SMM4H) Shared Tasks at ACL 2024 – Large Language Models and Generalizability for Social Media NLP
Dongfang Xu, Guillermo Garcia, Lisa Raithel et al.
Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles
Tomas Goldsack, Carolina Scarton, Matthew Shardlow et al.
Overview of the Context24 Shared Task on Contextualizing Scientific Claims
Chu Sern Joel Chan, Aakanksha Naik, Matthew Akamatsu et al.
Overview of the DagPap24 Shared Task on Detecting Automatically Generated Scientific Paper
Savvas Chamezopoulos, Drahomira Herrmannova, Anita de Waard et al.
Overview of the First Shared Task on Clinical Text Generation: RRG24 and “Discharge Me!”
Justin Xu, Zhihong Chen, Andrew Johnston et al.
Overview of the Fourth Workshop on Scholarly Document Processing
Tirthankar Ghosal, Amanpreet Singh, Anita De Waard et al.
Overview of the Shared Task on Machine Translation Gender Bias Evaluation with Multilingual Holistic Bias
Marta R. Costa-jussà, Pierre Andrews, Christine Basta et al.