conftrace_

Papers

8,918 papers found · incl. 1,966 without abstracts Only with abstracts
Has this Fact been Edited? Detecting Knowledge Edits in Language Models
Paul Youssef, Zhixue Zhao, Christin Seifert et al.
2025 NAACL
HateImgPrompts: Mitigating Generation of Images Spreading Hate Speech
Vineet Kumar Khullar, Venkatesh Velugubantla, Bhanu Prakash Reddy Rella et al.
2025 NAACL
Have LLMs Reopened the Pandora’s Box of AI-Generated Fake News?
Xinyu Wang, Wenbo Zhang, Sai Koneru et al.
2025 NAACL
2025 NAACL
2025 NAACL
2025 NAACL
2025 NAACL
2025 NAACL
2025 NAACL
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya, Delong Chen, Yejin Bang et al.
2025 NAACL
HISTOIRESMORALES: A French Dataset for Assessing Moral Alignment
Thibaud Leteno, Irina Proskurina, Antoine Gourru et al.
2025 NAACL
2025 NAACL
2025 NAACL
How Inclusively do LMs Perceive Social and Moral Norms?
Michael Galarnyk, Agam Shah, Dipanwita Guhathakurta et al.
2025 NAACL