Xudong Han
29 papers · 2019–2026 · 10 top CS/AI conferences
Achievements
Renaissance Researcher (5)
Interdisciplinary Bridge
Academic Marathon (6)
Conference Polyglot (9)
Taxonomy Completionist (34)
Cross-Pollinator (14)
Deep Specialist (11)
Mega-Team (35)
Dynamic Duo (22)
Topic Evolution
Keyword Champion
The Questioner (3)
Century Club (26)
Keyword Collector (91)
Prolific Year (5)
Unstoppable (5)
Conferences
EACL (5)
EMNLP (5)
IJCNLP (4)
NAACL (4)
AACL (3)
ACL (3)
ICLR (2)
AAAI (1)
COLING (1)
CoRL (1)
Keywords
large language model (7)
bias mitigation (6)
debiasing method (4)
text classification (2)
group bias (2)
risk assessment (2)
predictive fairness (2)
harmful content detection (2)
model debiasing (2)
equal opportunity (2)
modifier dynamics (2)
bias detection (2)
adversarial training (2)
class imbalance (2)
model training (2)
fairness evaluation (2)
model safety (1)
binary classification (1)
model evaluation (1)
model security (1)
Papers
Control Illusion: The Failure of Instruction Hierarchies in Large Language Models
AAAI 2026
Nanda Family: Open-Weights Generative Large Language Models for Hindi
EACL 2026
SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning
EACL 2026
NAT: Enhancing Agent Tuning with Negative Samples
NAACL 2025
ToolGen: Unified Tool Retrieval and Calling via Generation
ICLR 2025
Stealthy Jailbreak Attacks on Large Language Models via Benign Data Mirroring
NAACL 2025
Loki: An Open-Source Tool for Fact Verification
COLING 2025
Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
NAACL 2025
Do-Not-Answer: Evaluating Safeguards in LLMs
EACL 2024
A Chinese Dataset for Evaluating the Safeguards in Large Language Models
ACL 2024
Demystifying Instruction Mixing for Fine-tuning Large Language Models
ACL 2024
Uncertainty Estimation for Debiased Models: Does Fairness Hurt Reliability?
AACL 2023
Uncertainty Estimation for Debiased Models: Does Fairness Hurt Reliability?
IJCNLP 2023
Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP
EACL 2023
Everybody Needs Good Neighbours: An Unsupervised Locality-based Method for Bias Mitigation
ICLR 2023
FairLib: A Unified Framework for Assessing and Improving Fairness
EMNLP 2022
Systematic Evaluation of Predictive Fairness
AACL 2022
Does Representational Fairness Imply Empirical Fairness?
AACL 2022
Balancing out Bias: Achieving Fairness Through Balanced Training
EMNLP 2022
Towards Fair Dataset Distillation for Text Classification
EMNLP 2022
Systematic Evaluation of Predictive Fairness
IJCNLP 2022
Optimising Equal Opportunity Fairness in Model Training
NAACL 2022
Evaluating Debiasing Techniques for Intersectional Biases
EMNLP 2021
Diverse Adversaries for Mitigating Bias in Training
EACL 2021
Decoupling Adversarial Training for Fair NLP
IJCNLP 2021
Visual Learning Towards Soft Robot Force Control using a 3D Metamaterial with Differential Stiffness
CoRL 2021
Decoupling Adversarial Training for Fair NLP
ACL 2021
Grounding learning of modifier dynamics: An application to color naming
IJCNLP 2019
Grounding learning of modifier dynamics: An application to color naming
EMNLP 2019