Papers
The Illusion of Randomness: How LLMs Fail to Emulate Stochastic Decision-Making in Rock-Paper-Scissors Games?
Zihao Guo, Hongtao Lv, Chaoli Zhang et al.
From Confidence to Collapse in LLM Factual Robustness
Alina Fastowski, Bardh Prenkaj, Gjergji Kasneci
Joint Enhancement of Relational Reasoning for Long-Context LLMs
Zhirui Chen, Wei Shen, Jiashui Huang et al.
Rethink Rumor Detection in the Era of LLMs: A Review
Chang Yang, Peng Zhang, Jing Zhang et al.
DRBO: Mitigating the Bottleneck Effect via Dynamic Reward Balancing in Multi-reward LLM Optimization
Nuo Chen, Yufei Gao, Yongnan Jin et al.
Enhancing LLM Knowledge Learning through Generalization
Mingkang Zhu, Xi Chen, Zhongdao Wang et al.
Chain of Ideas: Revolutionizing Research Via Novel Idea Development with LLM Agents
Long Li, Weiwen Xu, Jiayan Guo et al.
Unveiling Multimodal Processing: Exploring Activation Patterns in Multimodal LLMs for Interpretability and Efficiency
Chuan Wu, Meng Su, Youxuan Fang et al.
Hard Negatives, Hard Lessons: Revisiting Training Data Quality for Robust Information Retrieval with LLMs
Nandan Thakur, Crystina Zhang, Xueguang Ma et al.
S2LPP: Small-to-Large Prompt Prediction across LLMs
Liang Cheng, Tianyi Li, Zhaowei Wang et al.
Tool Zero: Training Tool-Augmented LLMs via Pure RL from Scratch
Yirong Zeng, Xiao Ding, Yutai Hou et al.
Extracting Conceptual Spaces from LLMs Using Prototype Embeddings
Nitesh Kumar, Usashi Chatterjee, Steven Schockaert
SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs
Samir Abdaljalil, Filippo Pallucchini, Andrea Seveso et al.
Understanding How Value Neurons Shape the Generation of Specified Values in LLMs
Yi Su, Jiayi Zhang, Shu Yang et al.
Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes
Tyler Loakman, William Thorne, Chenghua Lin
Modeling, Evaluating, and Embodying Personality in LLMs: A Survey
Iago Alves Brito, Julia Soares Dollis, Fernanda Bufon Färber et al.
FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference
Dongwei Wang, Zijie Liu, Song Wang et al.
AraSafe: Benchmarking Safety in Arabic LLMs
Hamdy Mubarak, Abubakr Mohamed, Majd Hawasly
Catch Me If You Can? Not Yet: LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday Authors
Zhengxiang Wang, Nafis Irtiza Tripto, Solha Park et al.
AIRepr: An Analyst-Inspector Framework for Evaluating Reproducibility of LLMs in Data Science
Qiuhai Zeng, Claire Jin, Xinyue Wang et al.
MisinfoBench: A Multi-Dimensional Benchmark for Evaluating LLMs’ Resilience to Misinformation
Ye Yang, Donghe Li, Zuchen Li et al.
Accelerating LLM Reasoning via Early Rejection with Partial Reward Modeling
Seyyed Saeid Cheshmi, Azal Ahmad Khan, Xinran Wang et al.
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Yuan Li, Qi Luo, Xiaonan Li et al.
‘Hello, World!’: Making GNNs Talk with LLMs
Sunwoo Kim, Soo Yong Lee, Jaemin Yoo et al.