Baixiang Huang
5 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌍
Conference Polyglot
(2)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(15)
❓
The Questioner
(2)
Conferences
AAAI (2)
ACL (1)
EMNLP (1)
ICLR (1)
Top co-authors
Keywords
reinforcement learning
(1)
zero-shot learning
(1)
text classification
(1)
knowledge editing
(1)
authorship attribution
(1)
model editing
(1)
ai safety
(1)
safety alignment
(1)
information gain
(1)
agent behavior
(1)
shapley value
(1)
llm agent
(1)
linguistic feature
(1)
ethical behavior
(1)
large language model
(1)
harmful information
(1)
bias injection
(1)
misinformation injection
(1)
behavior steering
(1)
interactive medical questioning
(1)
Papers
Can Editing LLMs Inject Harm?
AAAI 2026
Model Editing as a Double-Edged Sword: Steering Agent Behavior Toward Beneficence or Harm
AAAI 2026
ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs
ACL 2026
Can Knowledge Editing Really Correct Hallucinations?
ICLR 2025
Can Large Language Models Identify Authorship?
EMNLP 2024