Papers
Prune ’n Predict: Optimizing LLM Decision-making with Conformal Prediction
Harit Vishwakarma, Alan Mishler, Thomas Cook et al.
Think Twice, Act Once: A Co-Evolution Framework of LLM and RL for Large-Scale Decision Making
Xu Wan, Wenyue Xu, Chao Yang et al.
TruthFlow: Truthful LLM Generation via Representation Flow Correction
Hanyu Wang, Bochuan Cao, Yuanpu Cao et al.
Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis
Xu Wang, Yan Hu, Wenyu Du et al.
The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)
Zihao Wang, Yibo Jiang, Jiahao Yu et al.
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Xiong Wang, Yangze Li, Chaoyou Fu et al.
NICE Data Selection for Instruction Tuning in LLMs with Non-differentiable Evaluation Metric
Jingtan Wang, Xiaoqiang Lin, Rui Qiao et al.
Teaching Physical Awareness to LLMs through Sounds
Weiguo Wang, Andy Nie, Wenrui Zhou et al.
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs
Yinong Oliver Wang, Nivedha Sivakumar, Falaah Arif Khan et al.
GRU: Mitigating the Trade-off between Unlearning and Retention for LLMs
Yue Wang, Qizhou Wang, Feng Liu et al.
Efficient and Privacy-Preserving Soft Prompt Transfer for LLMs
Xun Wang, Jing Xu, Franziska Boenisch et al.
Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning
Changsheng Wang, Yihua Zhang, Jinghan Jia et al.
AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism
Zhepei Wei, Wei-Lin Chen, Xinyu Zhu et al.
Emoji Attack: Enhancing Jailbreak Attacks Against Judge LLM Detection
Zhipeng Wei, Yuqi Liu, N. Benjamin Erichson
Improving Parallel Program Performance with LLM Optimizers via Agent-System Interfaces
Anjiang Wei, Allen Nie, Thiago S. F. X. Teixeira et al.
AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders
Zhengxuan Wu, Aryaman Arora, Atticus Geiger et al.
Thinking LLMs: General Instruction Following with Thought Generation
Tianhao Wu, Janice Lan, Weizhe Yuan et al.
When Do LLMs Help With Node Classification? A Comprehensive Analysis
Xixi Wu, Yifei Shen, Fangzhou Ge et al.
Adaptive Localization of Knowledge Negation for Continual LLM Unlearning
Abudukelimu Wuerkaixi, Qizhou Wang, Sen Cui et al.
GuardAgent: Safeguard LLM Agents via Knowledge-Enabled Reasoning
Zhen Xiang, Linzhi Zheng, Yanjie Li et al.
Everything Everywhere All at Once: LLMs can In-Context Learn Multiple Tasks in Superposition
Zheyang Xiong, Ziyang Cai, John Cooper et al.
DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
Kaixuan Xu, Jiajun Chai, Sicheng Li et al.
RLTHF: Targeted Human Feedback for LLM Alignment
Yifei Xu, Tusher Chakraborty, Emre Kiciman et al.
ProSec: Fortifying Code LLMs with Proactive Security Alignment
Xiangzhe Xu, Zian Su, Jinyao Guo et al.
Let LLM Tell What to Prune and How Much to Prune
Mingzhe Yang, Sihao Lin, Changlin Li et al.