Papers
2,781 papers found
Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs
Zeping Yu, Sophia Ananiadou
MC2: A Minimum-Coverage and Dataset-Agnostic Framework for Compositional Generalization of LLMs on Semantic Parsing
Ziyao Xu, Zhe Yang, Houfeng Wang
Learning to Instruct: Fine-Tuning a Task-Aware Instruction Optimizer for Black-Box LLMs
Yunzhe Qi, Jinjin Tian, Tianci Liu et al.
DynamicKV: Task-Aware Adaptive KV Cache Compression for Long Context LLMs
Xiabin Zhou, Wenbin Wang, Minyan Zeng et al.
Representing LLMs in Prompt Semantic Task Space
Idan Kashani, Avi Mendelson, Yaniv Nemcovsky
The Illusion of Randomness: How LLMs Fail to Emulate Stochastic Decision-Making in Rock-Paper-Scissors Games?
Zihao Guo, Hongtao Lv, Chaoli Zhang et al.
Joint Enhancement of Relational Reasoning for Long-Context LLMs
Zhirui Chen, Wei Shen, Jiashui Huang et al.
Rethink Rumor Detection in the Era of LLMs: A Review
Chang Yang, Peng Zhang, Jing Zhang et al.
Unveiling Multimodal Processing: Exploring Activation Patterns in Multimodal LLMs for Interpretability and Efficiency
Chuan Wu, Meng Su, Youxuan Fang et al.
Hard Negatives, Hard Lessons: Revisiting Training Data Quality for Robust Information Retrieval with LLMs
Nandan Thakur, Crystina Zhang, Xueguang Ma et al.
S2LPP: Small-to-Large Prompt Prediction across LLMs
Liang Cheng, Tianyi Li, Zhaowei Wang et al.
Tool Zero: Training Tool-Augmented LLMs via Pure RL from Scratch
Yirong Zeng, Xiao Ding, Yutai Hou et al.
Extracting Conceptual Spaces from LLMs Using Prototype Embeddings
Nitesh Kumar, Usashi Chatterjee, Steven Schockaert
SAFE: A Sparse Autoencoder-Based Framework for Robust Query Enrichment and Hallucination Mitigation in LLMs
Samir Abdaljalil, Filippo Pallucchini, Andrea Seveso et al.
Understanding How Value Neurons Shape the Generation of Specified Values in LLMs
Yi Su, Jiayi Zhang, Shu Yang et al.
Modeling, Evaluating, and Embodying Personality in LLMs: A Survey
Iago Alves Brito, Julia Soares Dollis, Fernanda Bufon Färber et al.
AraSafe: Benchmarking Safety in Arabic LLMs
Hamdy Mubarak, Abubakr Mohamed, Majd Hawasly
Catch Me If You Can? Not Yet: LLMs Still Struggle to Imitate the Implicit Writing Styles of Everyday Authors
Zhengxiang Wang, Nafis Irtiza Tripto, Solha Park et al.
AIRepr: An Analyst-Inspector Framework for Evaluating Reproducibility of LLMs in Data Science
Qiuhai Zeng, Claire Jin, Xinyue Wang et al.
MisinfoBench: A Multi-Dimensional Benchmark for Evaluating LLMs’ Resilience to Misinformation
Ye Yang, Donghe Li, Zuchen Li et al.
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
Yuan Li, Qi Luo, Xiaonan Li et al.
‘Hello, World!’: Making GNNs Talk with LLMs
Sunwoo Kim, Soo Yong Lee, Jaemin Yoo et al.
Text or Pixels? Evaluating Efficiency and Understanding of LLMs with Visual Text Inputs
Yanhong Li, Zixuan Lan, Jiawei Zhou