Shenzhi Wang
12 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Cross-Pollinator (9) π Interdisciplinary Bridge π Conference Polyglot (6) π§ Keyword Pioneer π Academic Marathon (5)
πΊοΈ
Taxonomy Completionist
(31)
π
Conference Polyglot
(6)
π₯
Mega-Team
(29)
π
Keyword Champion
(2)
π
Century Club
(10)
ποΈ
Keyword Collector
(72)
Conferences
ACL (5)
NIPS (2)
AAAI (1)
CVPR (1)
EACL (1)
ICML (1)
NAACL (1)
Top co-authors
Keywords
large language model
(3)
policy constraint
(2)
offline reinforcement learning
(2)
reward model
(2)
agent system
(2)
reward modeling
(1)
adversarial learning
(1)
contrastive learning
(1)
direct preference optimization
(1)
preference learning
(1)
preference alignment
(1)
data annotation
(1)
efficient inference
(1)
deep learning
(1)
benchmark evaluation
(1)
model alignment
(1)
reinforcement learning from human feedback
(1)
feature extraction
(1)
early exit
(1)
ai safety
(1)
Papers
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
ACL 2026
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values
EACL 2026
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment
ACL 2025
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints
AAAI 2025
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use
ACL 2025
Model Surgery: Modulating LLMβs Behavior Via Simple Parameter Editing
NAACL 2025
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution
NIPS 2024
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling
ACL 2024
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents
ACL 2024
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning
NIPS 2023
Boosting Offline Reinforcement Learning with Action Preference Query
ICML 2023
Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison
CVPR 2021