Papers

2,781 papers found
Style-Specific Neurons for Steering LLMs in Text Style Transfer
Wen Lai, Viktor Hangya, Alexander Fraser
2024 EMNLP
2024 EMNLP
Are LLMs Good Zero-Shot Fallacy Classifiers?
Fengjun Pan, Xiaobao Wu, Zongrui Li et al.
2024 EMNLP
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs
Zhiting Fan, Ruizhe Chen, Ruiling Xu et al.
2024 EMNLP
Commonsense Knowledge Editing Based on Free-Text in LLMs
Xiusheng Huang, Yequan Wang, Jun Zhao et al.
2024 EMNLP
2024 EMNLP
2024 EMNLP
Towards Measuring and Modeling “Culture” in LLMs: A Survey
Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania et al.
2024 EMNLP
Hate Personified: Investigating the role of LLMs in content moderation
Sarah Masud, Sahajpreet Singh, Viktor Hangya et al.
2024 EMNLP
InfiniPot: Infinite Context Processing on Memory-Constrained LLMs
Minsoo Kim, Kyuhong Shim, Jungwook Choi et al.
2024 EMNLP
2024 EMNLP
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma et al.
2024 EMNLP
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent
Weizhou Shen, Chenliang Li, Hongzhan Chen et al.
2024 EMNLP
Do LLMs learn a true syntactic universal?
John T. Hale, Miloš Stanojević
2024 EMNLP
From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking
Siyuan Wang, Zhuohan Long, Zhihao Fan et al.
2024 EMNLP
2024 EMNLP
On the Universal Truthfulness Hyperplane Inside LLMs
Junteng Liu, Shiqi Chen, Yu Cheng et al.
2024 EMNLP