Papers
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
Ziru Chen, Michael White, Ray Mooney et al.
Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding
Ruohao Guo, Wei Xu, Alan Ritter
Evaluating Very Long-Term Conversational Memory of LLM Agents
Adyasha Maharana, Dong-Ho Lee, Sergey Tulyakov et al.
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee et al.
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
Qisen Yang, Zekun Wang, Honghui Chen et al.
Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View
Jintian Zhang, Xin Xu, Ningyu Zhang et al.
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs
Fengqing Jiang, Zhangchen Xu, Luyao Niu et al.
Pride and Prejudice: LLM Amplifies Self-Bias in Self-Refinement
Wenda Xu, Guanglei Zhu, Xuandong Zhao et al.
DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents
Yilun Zhao, Yitao Long, Hongjun Liu et al.
Unintended Impacts of LLM Alignment on Global Representation
Michael J Ryan, William Held, Diyi Yang
The Earth is Flat because...: Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation
Rongwu Xu, Brian Lin, Shujian Yang et al.
Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs
Akhila Yerukola, Saujas Vaduguru, Daniel Fried et al.
Naming, Describing, and Quantifying Visual Objects in Humans and LLMs
Alberto Testoni, Juell Sprott, Sandro Pezzelle
Are LLMs classical or nonmonotonic reasoners? Lessons from generics
Alina Leidinger, Robert Van Rooij, Ekaterina Shutova
Cross-Modal Projection in Multimodal LLMs Doesn’t Really Project Visual Attributes to Textual Space
Gaurav Verma, Minje Choi, Kartik Sharma et al.
OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Chuang Liu, Linhao Yu, Jiaxuan Li et al.
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs
Chaoqun He, Renjie Luo, Shengding Hu et al.
CharPoet: A Chinese Classical Poetry Generation System Based on Token-free LLM
Chengyue Yu, Lei Zang, Jiaotuan Wang et al.
ITAKE: Interactive Unstructured Text Annotation and Knowledge Extraction System with LLMs and ModelOps
Jiahe Song, Hongxin Ding, Zhiyuan Wang et al.
ELLA: Empowering LLMs for Interpretable, Accurate and Informative Legal Advice
Yutong Hu, Kangcheng Luo, Yansong Feng
LLMBox: A Comprehensive Library for Large Language Models
Tianyi Tang, Hu Yiwen, Bingqian Li et al.
Pragmatic inference of scalar implicature by LLMs
Ye-eun Cho, Seong mook Kim
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation
Yikun Wang, Rui Zheng, Haoming Li et al.
Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges
Vinay Samuel, Houda Aynaou, Arijit Chowdhury et al.
Bridging Distribution Gap via Semantic Rewriting with LLMs to Enhance OOD Robustness
Manas Madine, Rohan Pandey, Vara Prasad Gudi et al.