Papers
ShadeEdit: A Utility-Preserving and Defense-Evasive Knowledge Manipulation Attack in Federated LLMs
Xu Zhang, Hangcheng Liu, Shangwei Guo et al.
SCOPE: Intrinsic Semantic Space Control for Mitigating Copyright Infringement in LLMs
Zhenliang Zhang, Xinyu Hu, Xiaojun Wan
Don’t Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs
Ziyi Zhao, Chongming Gao, Yang Zhang et al.
M3UCD: A Multi-task Multimodal Metaphor Understanding Challenge Dataset for LLMs
Tianlong Zheng, Yating Yang, Rui Dong et al.
What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles
Mengtao Zhou, Sifan Wu, Huan Zhang et al.
Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes
Yang Zhou, Zhenting Sheng, Mingrui Tan et al.
Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study
Yuqi Zhu, Yi Zhong, Jintian Zhang et al.
ALTER: Asymmetric LoRA for Token-Entropy-Guided Unlearning of LLMs
Xunlei Chen, Jinyu Guo, Yuang Li et al.
MedOmni-45°: A Safety–Performance Benchmark for Reasoning-Oriented LLMs in Medicine
Kaiyuan Ji, Yijin Guo, Zicheng Zhang et al.
EchoBat: Echo-Vision Enhancement and Echo-Layered Sampling for Video LLMs Hallucination Mitigation
Shuai Liu, Da Chen, Yiheng Pan et al.
Dynamic Deep Prompt Optimization for Defending Against Jailbreak Attacks on LLMs
Doniyorkhon Obidov, Honggang Yu, Xiaolong Guo et al.
HalluClean: A Unified Framework to Combat Hallucinations in LLMs
Yaxin Zhao, Yu Zhang
EoH-S: Evolution of Heuristic Set Using LLMs for Automated Heuristic Design
Fei Liu, Yilu Liu, Qingfu Zhang et al.
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs
Oluwanifemi Bamgbose, Masoud Hashemi, Sathwik Tejaswi Madhusudhan et al.
A Course Correction in Steerability Evaluation: Revealing Miscalibration and Side Effects in LLMs
Trenton Chang, Tobias Schnabel, Adith Swaminathan et al.
MetaCipher: A Time-Persistent and Universal Multi-Agent Framework for Cipher-Based Jailbreak Attacks for LLMs
Boyuan Chen, Minghao Shao, Abdul Basit et al.
Resilience in Ambient Multi-Agent LLMs via Decentralized Bio-Autonomic Control and Immune-Inspired Anomaly Detection
Nastaran Darabi, Devashri Naik, Sina Tayebati et al.
Silenced Biases: The Dark Side LLMs Learned to Refuse
Rom Himelstein, Amit LeVi, Brit Youngmann et al.
MRACL: Multi-Reward Space Guided Adaptive Curriculum Reinforcement Learning for LLMs
Wenxuan Liu, Liangyu Huo, Yi Jing et al.
Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training
Jianfeng Si, Lin Sun, Zhewen Tan et al.
Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding
Youze Wang, Zijun Chen, Ruoyu Chen et al.
STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
Zijun Wang, Haoqin Tu, Yuhan Wang et al.
MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text
Ronghao Xu, Zhen Huang, Yangbo Wei et al.
GEM: Generative Entropy-Guided Preference Modeling for Few-Shot Alignment of LLMs
Yiyang Zhao, Huiyu Bai, Xuejiao Zhao
Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models
Tianyi Zhou, Johanne Medina, Sanjay Chawla