Papers
MemGuide: Intent-Driven Memory Selection for Goal-Oriented Multi-Session LLM Agents
Yiming Du, Bingbing Wang, Yang He et al.
Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs
Jiwei Fang, Bin Zhang, Changwei Wang et al.
Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance
Yue Fang, Yuxin Guo, Jiaran Gao et al.
FinMathBench: A Formula-Driven Benchmark for Evaluating LLMs’ Math Reasoning Capabilities in Finance
Yi He, Ping Wang, Shiqiang Xiong et al.
Format Matters: The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts
Xanh Ho, Yun-Ang Wu, Sunisth Kumar et al.
Benchmarking LLMs’ Mathematical Reasoning with Unseen Random Variables Questions
Zijin Hong, Hao Wu, Su Dong et al.
SPA: Achieving Consensus in LLM Alignment via Self-Priority Optimization
Yue Huang, Xiangqi Wang, Xiangliang Zhang
LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs
Junlong Jia, Xing Wu, Chaochen Gao et al.
Importance-Aware Data Selection for Efficient LLM Instruction Tuning
Tingyu Jiang, Shen Li, Yiyao Song et al.
From Chaos to Clarity: A Knowledge Graph-Driven Audit Dataset Generation Framework for LLM Unlearning
Weipeng Jiang, Juan Zhai, Shiqing Ma et al.
EduGuardBench: A Holistic Benchmark for Evaluating the Pedagogical Fidelity and Adversarial Safety of LLMs as Simulated Teachers
Yilin Jiang, Mingzi Zhang, Xuanyu Yin et al.
Difficulty Is Not Enough: Curriculum Learning for LLMs Fine-tuning Must Consider Utility
Zishang Jiang, Jinyi Han, Tingyun Li et al.
Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective
Deyang Kong, Qi Guo, Xiangyu Xi et al.
Template-Theorems Graph Construction to Enhance Mathematical Reasoning Capabilities of LLM
Yarong Lan, Yajing Xu, Huajun Chen
Do Not Merge My Model! Safeguarding Open-Source LLMs Against Unauthorized Model Merging
Qinfeng Li, Miao Pan, Jintao Chen et al.
OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification
Shangyu Li, Juyong Jiang, Tiancheng Zhao et al.
CoFact: Dynamic Coordination of Attention Heads for Improving Factual Consistency in LLMs
Shike Li, Xiaokai Wang, Xiaofeng Liu et al.
Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
Shuocheng Li, Yihao Liu, Silin Du et al.
Semantic Volume: Quantifying and Detecting Both External and Internal Uncertainty in LLMs
Xiaomin Li, Zhou Yu, Ziji Zhang et al.
Selection of LLM Fine-Tuning Data Based on Orthogonal Rules
Xiaomin Li, Mingye Gao, Zhiwei Zhang et al.
LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation
Xingyu Li, Xiaolei Liu, Cheng Liu et al.
Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning
Xinran Li, Yu Liu, Jiaqi Qiao et al.
AgentSwift: Efficient LLM Agent Design via Value-Guided Hierarchical Search
Yu Li, Lehui Li, Zhihao Wu et al.
GrayKD: Distilling Better Knowledge from Black-box LLM via Multi-rationale Injection
Hyeongsoo Lim, Hyung Yong Kim, Jin Young Kim et al.