Yuxian Wang
4 papers · 2025–2026 · 3 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (2)
🌉 Interdisciplinary Bridge
🐝 Cross-Pollinator (15)
👥 Mega-Team (27)
Conferences
EMNLP (2)
ACL (1)
ICLR (1)
Keywords
reinforcement learning (3)
tool use (2)
language model (2)
language model evaluation (1)
instruction following (1)
monte carlo tree search (1)
ensemble method (1)
reward hacking (1)
zero-shot generalization (1)
supervised fine-tuning (1)
large language model (1)
knowledge distillation (1)
preference optimization (1)
Papers
TinyJudge: Unverifiable Constraint Alignment via Lightweight Specialist Ensembles
ACL 2026
iTool: Reinforced Fine-Tuning with Dynamic Deficiency Calibration for Advanced Tool Use
EMNLP 2025
Tool Zero: Training Tool-Augmented LLMs via Pure RL from Scratch
EMNLP 2025
ToolACE: Winning the Points of LLM Function Calling
ICLR 2025