Papers
Optimization and Robustness-Informed Membership Inference Attacks for LLMs
Zichen Song, Qixin Zhang, Ming Li et al.
CP-Router: An Uncertainty-Aware Router Between LLM and LRM
Jiayuan Su, Fulin Lin, Zhaopeng Feng et al.
Bridging the Language Gap: Uncovering and Aligning Shared Circuits for Multi-Hop Reasoning in Multilingual LLMs
Chenghao Sun, Zhen Huang, Yonggang Zhang et al.
Enhancing Pre-training Data Detection in LLMs Through Discriminative and Symmetric Prefix Selection
Kai Sun, Yuxin Lin, Bo Dong et al.
Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning
Yiliu Sun, Zicheng Zhao, Yang Wei et al.
RAG-R1:Incentivizing the Search and Reasoning Capabilities of LLMs Through Multi-Query Parallelism
Zhiwen Tan, Jiaming Huang, Qintong Wu et al.
Rectify Evaluation Preference: Improving LLMs’ Critique on Math Reasoning via Perplexity-aware Reinforcement Learning
Changyuan Tian, Zhicong Lu, Shuang Qian et al.
KeepKV: Achieving Periodic Lossless KV Cache Compression for Efficient LLM Inference
Yuxuan Tian, Zihan Wang, Yebo Peng et al.
PRAGWORLD: A Benchmark Evaluating LLMs’ Local World Model Under Minimal Linguistic Alterations and Conversational Dynamics
Sachin Vashistha, Aryan Bibhuti, Atharva Naik et al.
Deep Research Arena: The First Exam of LLMs’ Research Abilities via Seminar-Grounded Tasks
Haiyuan Wan, Chen Yang, Junchi Yu et al.
ICL-Router: In-Context Learned Model Representations for LLM Routing
Chenxu Wang, Hao Li, Yiqun Zhang et al.
Light-IF: Endowing LLMs with Generalizable Reasoning via Preview and Self-Checking for Complex Instruction Following
Chenyang Wang, Liang Wen, Shousheng Jia et al.
Improving Implicit Discourse Relation Recognition with Natural Language Explanations from LLMs
Heng Wang, Changxing Wu
Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Robust Response Generation in the Wild
Jiatai Wang, Zhiwei Xu, Di Jin et al.
CP-Search: A Chain Progressive Search Training Framework Incentivizing the Cognitive Behaviors for Searching in LLMs
Zehua Wang, Shipeng Li, Buzhou Tang
MetaEval: Measuring the Discrimination of Benchmarks for Efficient LLM Evaluation
Zhuo Wang, Wen Wu, Guoqing Wang et al.
Eliciting Chain-of-Thought in Base LLMs via Gradient-Based Representation Optimization
Zijian Wang, Yanxiang Ma, Chang Xu
Beyond ReAct: A Planner-Centric Framework for Complex Tool-Augmented LLM Reasoning
Xiaolong Wei, Yuehu Dong, Xingliang Wang et al.
Mixture-of-Trees: Learning to Select and Weigh Reasoning Paths for Efficient LLM Inference
Yangbo Wei, Zhen Huang, Shaoqiang Lu et al.
LAMDAS: LLM as an Implicit Classifier for Domain-specific Data Selection
Jian Wu, Hang Yu, Bingchang Liu et al.
Finding the Translation Switch: Discovering and Exploiting the Task-Initiation Features in LLMs
Xinwei Wu, Heng Liu, Xiaohu Zhao et al.
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs Through Knowledge-Reasoning Fusion
Yutong Wu, Di Huang, Ruosi Wan et al.
SDA: Steering-Driven Distribution Alignment for Open LLMs Without Fine-Tuning
Wei Xia, Zhi-Hong Deng
Enhancing Uncertainty Estimation in LLMs with Expectation of Aggregated Internal Belief
Zeguan Xiao, Diyang Dou, Boya Xiong et al.
Multi-Value Alignment for LLMs via Value Decorrelation and Extrapolation
Hefei Xu, Le Wu, Chen Cheng et al.