Bosi Wen
11 papers · 2024–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (4) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (22) π§ Keyword Pioneer
π₯
Mega-Team
(20)
π
Keyword Champion
(2)
β‘
Prolific Year
(5)
Conferences
ACL (7)
AAAI (2)
EMNLP (1)
NIPS (1)
Top co-authors
Keywords
large language model
(8)
benchmark evaluation
(3)
critique generation
(3)
instruction following
(2)
preference optimization
(2)
dialogue system
(2)
reinforcement learning
(2)
language modeling
(1)
heuristic search
(1)
text generation
(1)
llm evaluation
(1)
model alignment
(1)
constraint satisfaction
(1)
pairwise comparison
(1)
theory of mind
(1)
model evaluation
(1)
prompt optimization
(1)
evaluation benchmark
(1)
language model
(1)
feedback loop
(1)
Papers
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
ACL 2026
RLMR: Reinforcement Learning with Mixed Rewards for Creative Writing
AAAI 2026
IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation
ACL 2026
HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
ACL 2025
CharacterBench: Benchmarking Character Customization of Large Language Models
AAAI 2025
Training Language Model to Critique for Better Refinement
ACL 2025
AlignBench: Benchmarking Chinese Alignment of Large Language Models
ACL 2024
CharacterGLM: Customizing Social Characters with Large Language Models
EMNLP 2024
ToMBench: Benchmarking Theory of Mind in Large Language Models
ACL 2024
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
ACL 2024
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
NIPS 2024