conftrace_

Rong Bao

7 papers · 2022–2026 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🐝 Cross-Pollinator (15) 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge

🗺️ Taxonomy Completionist (23) 🏆 Keyword Champion (2) 🔥 Unstoppable (5)

Conferences

ACL (2) AAAI (1) COLING (1) EMNLP (1) ICLR (1) NIPS (1)

Top co-authors

Rui Zheng (5) Qi Zhang (5) Tao Gui (3) Liang Ding (3) Dacheng Tao (3) Xuanjing Huang (3) Xiao Wang (2) Qin Liu (1) Leszek Rutkowski (1) Rui Xie (1)

Keywords

text classification (2) reward hacking (2) language model (2) continual learning (1) catastrophic forgetting (1) reward modeling (1) domain adaptation (1) natural language processing (1) information bottleneck (1) model robustness (1) language model reasoning (1) chain-of-thought reasoning (1) adversarial training (1) language model alignment (1) gradient estimation (1) reinforcement learning from human feedback (1) monte carlo sampling (1) distribution shift (1) adversarial defense (1) adversarial detection (1)

Papers

Time-Frequency Token Advantage Clipping for Training Efficient Large Reasoning Model AAAI 2026 Fixing Distribution Shifts of LLM Self-Critique via On-Policy Self-Play Training ACL 2025 RMB: Comprehensively benchmarking reward models in LLM alignment ICLR 2025 InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling NIPS 2024 CASN:Class-Aware Score Network for Textual Adversarial Detection ACL 2023 Orthogonal Subspace Learning for Language Model Continual Learning EMNLP 2023 PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack COLING 2022