conftrace_

Shenzhi Wang

12 papers · 2021–2026 · 7 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (9) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🧭 Keyword Pioneer 🏃 Academic Marathon (5)

🗺️ Taxonomy Completionist (31) 👥 Mega-Team (29) 🏆 Keyword Champion (2) 💎 Century Club (10) 🗃️ Keyword Collector (72)

Conferences

ACL (5) NIPS (2) AAAI (1) CVPR (1) EACL (1) ICML (1) NAACL (1)

Top co-authors

Gao Huang (7) Shiji Song (6) Qisen Yang (4) Wenhao Huang (3) Wangchunshu Zhou (3) Andrew Zhao (3) Liwei Wu (2) Yang Yue (2) Zilong Zheng (2) Jiaheng Liu (2)

Keywords

large language model (3) policy constraint (2) offline reinforcement learning (2) reward model (2) agent system (2) reward modeling (1) adversarial learning (1) contrastive learning (1) direct preference optimization (1) preference learning (1) preference alignment (1) data annotation (1) efficient inference (1) deep learning (1) benchmark evaluation (1) model alignment (1) reinforcement learning from human feedback (1) feature extraction (1) early exit (1) ai safety (1)

Papers

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models · ACL 2026
COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values · EACL 2026
PopAlign: Diversifying Contrasting Patterns for a More Comprehensive Alignment · ACL 2025
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints · AAAI 2025
OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use · ACL 2025
Model Surgery: Modulating LLM’s Behavior Via Simple Parameter Editing · NAACL 2025
DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution · NIPS 2024
Boosting LLM Agents with Recursive Contemplation for Effective Deception Handling · ACL 2024
PsychoGAT: A Novel Psychological Measurement Paradigm through Interactive Fiction Games with LLM Agents · ACL 2024
Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning · NIPS 2023
Boosting Offline Reinforcement Learning with Action Preference Query · ICML 2023
Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison · CVPR 2021