Papers
How Diversely Can Language Models Solve Problems? Exploring the Algorithmic Diversity of Model-Generated Code
Seonghyeon Lee, HeeJae Chon, Joonwon Jang et al.
How do autoregressive transformers solve full addition?
Wang Peixu, Chen Yu, Yu Ming et al.
How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations
Yoshiki Takenami, Yin Jou Huang, Yugo Murawaki et al.
How Does DPO Reduce Toxicity? A Mechanistic Neuron-Level Analysis
Yushi Yang, Filip Sondej, Harry Mayne et al.
How Does Knowledge Selection Help Retrieval Augmented Generation?
Xiangci Li, Jessica Ouyang
How do Language Models Reshape Entity Alignment? A Survey of LM-Driven EA Methods: Advances, Benchmarks, and Future
Zerui Chen, Huiming Fan, Qianyu Wang et al.
How Do Large Language Models Evaluate Lexical Complexity?
Abdelhak Kelious, Mathieu Constant, Christophe Coeur
How Do Large Language Models Perform on PDE Discovery: A Coarse-to-fine Perspective
Xiao Luo, Changhu Wang, Yizhou Sun et al.
How Do Large Vision-Language Models See Text in Image? Unveiling the Distinctive Role of OCR Heads
Ingeol Baek, Hwan Chang, Sunghyun Ryu et al.
How Do Social Bots Participate in Misinformation Spread? A Comprehensive Dataset and Analysis
Herun Wan, Minnan Luo, Zihan Ma et al.
How Far Can LLMs Improve from Experience? Measuring Test-Time Learning Ability in LLMs with Human Comparison
Jiayin Wang, Zhiqiang Guo, Weizhi Ma et al.
How Good are LLM-based Rerankers? An Empirical Analysis of State-of-the-Art Reranking Models
Abdelrahman Abdallah, Bhawna Piryani, Jamshid Mozafari et al.
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled Benchmark
Minglai Yang, Ethan Huang, Liang Zhang et al.
How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation
Zhuohan Long, Siyuan Wang, Shujun Liu et al.
How Much Do Large Language Models Know about Human Motion? A Case Study in 3D Avatar Control
Kunhang Li, Jason Naradowsky, Yansong Feng et al.
How Much Do LLMs Hallucinate across Languages? On Realistic Multilingual Estimation of LLM Hallucination
Saad Obaid Ul Islam, Anne Lauscher, Goran Glavaš
How Persuasive Is Your Context?
Tu Nguyen, Kevin Du, Alexander Miserlis Hoyle et al.
How Private are Language Models in Abstractive Summarization?
Anthony Hughes, Nikolaos Aletras, Ning Ma
How Real Are Synthetic Therapy Conversations? Evaluating Fidelity in Prolonged Exposure Dialogues
Suhas Bn, Dominik O. Mattioli, Andrew M. Sherrill et al.
How Reliable is Multilingual LLM-as-a-Judge?
Xiyan Fu, Wei Liu
How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study
Matthieu Dubois, François Yvon, Pablo Piantanida
How Sememic Components Can Benefit Link Prediction for Lexico-Semantic Knowledge Graphs?
Hansi Wang, Yue Wang, Qiliang Liang et al.
How to Fine-Tune Safely on a Budget: Model Adaptation Using Minimal Resources
Anh C. Pham, Mihir Thalanki, Michael Sun et al.
How to Generalize the Detection of AI-Generated Text: Confounding Neurons
Claudio Borile, Carlo Abrate