Papers

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges
Aman Singh Thakur, Kartik Choudhary, Venkat Srinik Ramayapally et al.
2025 ACL
2024 COLING
A Survey on Detection of LLMs-Generated Content
Xianjun Yang, Liangming Pan, Xuandong Zhao et al.
2024 EMNLP
2025 EMNLP
Benchmarking the Detection of LLMs-Generated Modern Chinese Poetry
Shanshan Wang, Junchao Wu, Fengying Ye et al.
2025 EMNLP
On scalable oversight with weak LLMs judging strong LLMs
Zachary Kenton, Noah Y. Siegel, János Kramár et al.
2024 NeurIPS