Rong Bao
7 papers · 2022–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π§ Keyword Pioneer π Conference Polyglot (5) π Renaissance Researcher (5) π Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(23)
π
Keyword Champion
(2)
π₯
Unstoppable
(5)
Conferences
ACL (2)
AAAI (1)
COLING (1)
EMNLP (1)
ICLR (1)
NIPS (1)
Top co-authors
Keywords
text classification
(2)
reward hacking
(2)
language model
(2)
continual learning
(1)
catastrophic forgetting
(1)
reward modeling
(1)
domain adaptation
(1)
natural language processing
(1)
information bottleneck
(1)
model robustness
(1)
language model reasoning
(1)
chain-of-thought reasoning
(1)
adversarial training
(1)
language model alignment
(1)
gradient estimation
(1)
reinforcement learning from human feedback
(1)
monte carlo sampling
(1)
distribution shift
(1)
adversarial defense
(1)
adversarial detection
(1)
Papers
Time-Frequency Token Advantage Clipping for Training Efficient Large Reasoning Model
AAAI 2026
Fixing Distribution Shifts of LLM Self-Critique via On-Policy Self-Play Training
ACL 2025
RMB: Comprehensively benchmarking reward models in LLM alignment
ICLR 2025
InfoRM: Mitigating Reward Hacking in RLHF via Information-Theoretic Reward Modeling
NIPS 2024
CASN:Class-Aware Score Network for Textual Adversarial Detection
ACL 2023
Orthogonal Subspace Learning for Language Model Continual Learning
EMNLP 2023
PlugAT: A Plug and Play Module to Defend against Textual Adversarial Attack
COLING 2022