Papers

5,479 papers found
2025 NAACL
2025 NAACL
MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria
Wentao Ge, Shunian Chen, Hardy Chen et al.
2025 NAACL
2025 NAACL
2025 NAACL
My LLM might Mimic AAE - But When Should It?
Sandra Camille Sandoval, Christabel Acquaye, Kwesi Adu Cobbina et al.
2025 NAACL
Arabic Dataset for LLM Safeguard Evaluation
Yasser Ashraf, Yuxia Wang, Bin Gu et al.
2025 NAACL
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators
Daniil Moskovskiy, Nikita Sushko, Sergey Pletenev et al.
2025 NAACL
Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities
Chung-En Sun, Xiaodong Liu, Weiwei Yang et al.
2025 NAACL
AEGIS2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails
Shaona Ghosh, Prasoon Varshney, Makesh Narsimhan Sreedhar et al.
2025 NAACL