Papers
5,479 papers found
TokenPowerBench: Benchmarking the Power Consumption of LLM Inference
Chenxu Niu, Wei Zhang, Jie Li et al.
Bias Association Discovery Framework for Open-Ended LLM Generations
Jinhao Pan, Chahat Raj, Ziwei Zhu
WALKSAFE: Risk-aware Graph Random Walk with Bi-GRPO for LLM Safety
Shilong Pan, Zhiliang Tian, Wanlong Yu et al.
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
Shinwoo Park, Hyejin Park, Hyeseon Ahn et al.
PoeTone: A Framework for Constrained Generation of Structured Chinese Songci with LLMs
Zhan Qu, Shuzhou Yuan, Michael Färber
RMO: Towards Better LLM Alignment via Reshaping Reward Margin Distributions
Yanchi Ru, Yue Huang, Xiangliang Zhang
Assessing the Capabilities of LLMs in Humor: A Multi-dimensional Analysis of Oogiri Generation and Evaluation
Ritsu Sakabe, Hwichan Kim, Tosho Hirasawa et al.
Positional Cognitive Specialization: Where Do LLMs Learn to Comprehend and Speak Your Language?
Luis Frentzen Salim, Lun-Wei Ku, Hsing-Kuo Kenneth Pao
AntiDote: Bi-level Adversarial Training for Tamper-Resistant LLMs
Debdeep Sanyal, Manodeep Ray, Murari Mandal
LLMdoctor: Token-Level Flow-Guided Preference Optimization for Efficient Test-Time Alignment of Large Language Models
Tiesunlong Shen, Rui Mao, Jin Wang et al.
Scaling LLM Speculative Decoding: Non-Autoregressive Forecasting in Large-Batch Scenarios
Luohe Shi, Zuchao Li, Lefei Zhang et al.
From Solver to Tutor: Evaluating the Pedagogical Intelligence of LLMs with KMP-Bench
Weikang Shi, Houxing Ren, Junting Pan et al.
Fine-Tuned LLMs Know They Don’t Know: A Parameter-Efficient Approach to Recovering Honesty
Zeyu Shi, Ziming Wang, Tianyu Chen et al.
qa-FLoRA: Data-free query-adaptive Fusion of LoRAs for LLMs
Shreya Shukla, Aditya Sriram, Milinda Kuppur Narayanaswamy et al.
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
Yuerong Song, Xiaoran Liu, Ruixiao Li et al.
Optimization and Robustness-Informed Membership Inference Attacks for LLMs
Zichen Song, Qixin Zhang, Ming Li et al.
CP-Router: An Uncertainty-Aware Router Between LLM and LRM
Jiayuan Su, Fulin Lin, Zhaopeng Feng et al.
Bridging the Language Gap: Uncovering and Aligning Shared Circuits for Multi-Hop Reasoning in Multilingual LLMs
Chenghao Sun, Zhen Huang, Yonggang Zhang et al.
Enhancing Pre-training Data Detection in LLMs Through Discriminative and Symmetric Prefix Selection
Kai Sun, Yuxin Lin, Bo Dong et al.
Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning
Yiliu Sun, Zicheng Zhao, Yang Wei et al.
RAG-R1:Incentivizing the Search and Reasoning Capabilities of LLMs Through Multi-Query Parallelism
Zhiwen Tan, Jiaming Huang, Qintong Wu et al.
Rectify Evaluation Preference: Improving LLMs’ Critique on Math Reasoning via Perplexity-aware Reinforcement Learning
Changyuan Tian, Zhicong Lu, Shuang Qian et al.
KeepKV: Achieving Periodic Lossless KV Cache Compression for Efficient LLM Inference
Yuxuan Tian, Zihan Wang, Yebo Peng et al.
PRAGWORLD: A Benchmark Evaluating LLMs’ Local World Model Under Minimal Linguistic Alterations and Conversational Dynamics
Sachin Vashistha, Aryan Bibhuti, Atharva Naik et al.
Deep Research Arena: The First Exam of LLMs’ Research Abilities via Seminar-Grounded Tasks
Haiyuan Wan, Chen Yang, Junchi Yu et al.