Papers

2,781 papers found

ShadeEdit: A Utility-Preserving and Defense-Evasive Knowledge Manipulation Attack in Federated LLMs

Xu Zhang, Hangcheng Liu, Shangwei Guo et al.

2026 AAAI

SCOPE: Intrinsic Semantic Space Control for Mitigating Copyright Infringement in LLMs

Zhenliang Zhang, Xinyu Hu, Xiaojun Wan

2026 AAAI

Don’t Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs

Ziyi Zhao, Chongming Gao, Yang Zhang et al.

2026 AAAI

M3UCD: A Multi-task Multimodal Metaphor Understanding Challenge Dataset for LLMs

Tianlong Zheng, Yating Yang, Rui Dong et al.

2026 AAAI

What to Ask Next? Probing the Imaginative Reasoning of LLMs with TurtleSoup Puzzles

Mengtao Zhou, Sifan Wu, Huan Zhang et al.

2026 AAAI

Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes

Yang Zhou, Zhenting Sheng, Mingrui Tan et al.

2026 AAAI

Why Do Open-Source LLMs Struggle with Data Analysis? A Systematic Empirical Study

Yuqi Zhu, Yi Zhong, Jintian Zhang et al.

2026 AAAI

ALTER: Asymmetric LoRA for Token-Entropy-Guided Unlearning of LLMs

Xunlei Chen, Jinyu Guo, Yuang Li et al.

2026 AAAI

MedOmni-45°: A Safety–Performance Benchmark for Reasoning-Oriented LLMs in Medicine

Kaiyuan Ji, Yijin Guo, Zicheng Zhang et al.

2026 AAAI

EchoBat: Echo-Vision Enhancement and Echo-Layered Sampling for Video LLMs Hallucination Mitigation

Shuai Liu, Da Chen, Yiheng Pan et al.

2026 AAAI

Dynamic Deep Prompt Optimization for Defending Against Jailbreak Attacks on LLMs

Doniyorkhon Obidov, Honggang Yu, Xiaolong Guo et al.

2026 AAAI

HalluClean: A Unified Framework to Combat Hallucinations in LLMs

Yaxin Zhao, Yu Zhang

2026 AAAI

EoH-S: Evolution of Heuristic Set Using LLMs for Automated Heuristic Design

Fei Liu, Yilu Liu, Qingfu Zhang et al.

2026 AAAI

DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs

Oluwanifemi Bamgbose, Masoud Hashemi, Sathwik Tejaswi Madhusudhan et al.

2026 AAAI

A Course Correction in Steerability Evaluation: Revealing Miscalibration and Side Effects in LLMs

Trenton Chang, Tobias Schnabel, Adith Swaminathan et al.

2026 AAAI

MetaCipher: A Time-Persistent and Universal Multi-Agent Framework for Cipher-Based Jailbreak Attacks for LLMs

Boyuan Chen, Minghao Shao, Abdul Basit et al.

2026 AAAI

Resilience in Ambient Multi-Agent LLMs via Decentralized Bio-Autonomic Control and Immune-Inspired Anomaly Detection

Nastaran Darabi, Devashri Naik, Sina Tayebati et al.

2026 AAAI

Silenced Biases: The Dark Side LLMs Learned to Refuse

Rom Himelstein, Amit LeVi, Brit Youngmann et al.

2026 AAAI

MRACL: Multi-Reward Space Guided Adaptive Curriculum Reinforcement Learning for LLMs

Wenxuan Liu, Liangyu Huo, Yi Jing et al.

2026 AAAI

Efficient Switchable Safety Control in LLMs via Magic-Token-Guided Co-Training

Jianfeng Si, Lin Sun, Zhewen Tan et al.

2026 AAAI

Benchmarking Trustworthiness in Multimodal LLMs for Video Understanding

Youze Wang, Zijun Chen, Ruoyu Chen et al.

2026 AAAI

STAR-1: Safer Alignment of Reasoning LLMs with 1K Data

Zijun Wang, Haoqin Tu, Yuhan Wang et al.

2026 AAAI

MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text

Ronghao Xu, Zhen Huang, Yangbo Wei et al.

2026 AAAI

GEM: Generative Entropy-Guided Preference Modeling for Few-Shot Alignment of LLMs

Yiyang Zhao, Huiyu Bai, Xuejiao Zhao

2026 AAAI

Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models

Tianyi Zhou, Johanne Medina, Sanjay Chawla

2026 AAAI