Papers

20,003 papers found
2026 AAAI
Aligning Attention with Human Rationales for Self-Explaining Hate Speech Detection
Brage Eilertsen, Røskva Bjørgfinsdóttir, Francielle Vargas et al.
2026 AAAI
2026 AAAI
Align to Structure: Aligning Large Language Models with Structural Information
Zae Myung Kim, Anand Ramachandran, Farideh Tavazoee et al.
2026 AAAI
2026 AAAI
2026 AAAI
ALPHA: Action-Based Learning for Pluralistic Human Alignment in Large Language Models
Aanisha Bhattacharyya, Susmit Agrawal, Yaman Kumar Singla et al.
2026 AAAI
2026 AAAI
2026 AAAI
2026 AAAI
2026 AAAI