Papers
NTTSU at WMT2025 General Translation Task
Zhang Yin, Hiroyuki Deguchi, Haruto Azami et al.
NU_Internship team at ImageEval 2025: From Zero-Shot to Ensembles: Enhancing Grounded Arabic Image Captioning
Rana Gaber, Seif Eldin Amgad, Ahmed Sherif Nasri et al.
Nullspace Disentanglement for Red Teaming Language Models
Yi Han, Yuanxing Liu, Weinan Zhang et al.
NUMINA: A Natural Understanding Benchmark for Multi-dimensional Intelligence and Numerical Reasoning Abilities
Changyu Zeng, Yifan Wang, Zimu Wang et al.
NUR at IslamicEval 2025 Shared Task: Retrieval-Augmented LLMs for Qur’an and Hadith QA
Serag Amin, Ranwa Aly, Yara Allam et al.
NurseLLM: The First Specialized Language Model for Nursing
Md Tawkat Islam Khondaker, Julia Harrington, Shady Shehata
NUTMEG: Separating Signal From Noise in Annotator Disagreement
Jonathan Ivey, Susan Gauch, David Jurgens
Nvidia-Nemo’s WMT 2025 Metrics Shared Task Submission
Brian Yan, Shuoyang Ding, Kuang-Da Wang et al.
NyayGraph: A Knowledge Graph Enhanced Approach for Legal Statute Identification in Indian Law using Large Language Models
Siddharth Shukla, Tanuj Tyagi, Abhay Singh Bisht et al.
NYUAD at QIAS Shared Task: Benchmarking the Legal Reasoning of LLMs in Arabic Islamic Inheritance Cases
Nouar AlDahoul, Yasir Zaki
OAgents: An Empirical Study of Building Effective Agents
He Zhu, Tianrui Qin, King Zhu et al.
OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models
Xiaoyu Xu, Minxin Du, Qingqing Ye et al.
Octopus: Towards Building the Arabic Speech LLM Suite
Sara Althubaiti, Vasista Sai Lodagala, Tjad Clark et al.
Offloaded Reasoning: Efficient Inference for Large Language Models via Modular Reasoning and Refinement
Ishan Jindal, Jayant Taneja, Badrinath Chandana et al.
OG-RAG: Ontology-grounded retrieval-augmented generation for large language models
Kartik Sharma, Peeyush Kumar, Yunqing Li
OkraLong: A Flexible Retrieval-Augmented Framework for Long-Text Question Answering
Yulong Hui, Yihao Liu, Yao Lu et al.
o-MEGA: Optimized Methods for Explanation Generation and Analysis
Ľuboš Kriš, Jaroslav Kopčan, Qiwei Peng et al.
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain
Shuting Wang, Jiejun Tan, Zhicheng Dou et al.
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Zekun Xi, Wenbiao Yin, Jizhan Fang et al.
OMS: On-the-fly, Multi-Objective, Self-Reflective Ad Keyword Generation via LLM Agent
Bowen Chen, Zhao Wang, Shingo Takamatsu
On Assigning Product and Software Codes to Customer Service Requests with Large Language Models
Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du et al.
Once Upon a Time: Interactive Learning for Storytelling with Small Language Models
Jonas Mayer Martins, Ali Hamza Bashir, Muhammad Rehan Khalid et al.
On Collaborating Small and Large Models For Few-shot Intent Detection
Peng Chen, Bang Wang