Chenjia Bai

28 papers · 2021–2026 · 7 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (9) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (5)

🏃 Academic Marathon (5) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (38) 🤝 Dynamic Duo (11) 👑 Triple Crown 🏆 Grand Slam ⚡ Prolific Year (10) 🔥 Unstoppable (5) 🗃️ Keyword Collector (106) ❓ The Questioner 💎 Century Club (27)

Conferences

ICML (9) NIPS (7) AAAI (4) ICLR (4) ACL (2) CORL (1) EMNLP (1)

Top co-authors

Xuelong Li (11) Haoran He (6) Kang Xu (6) Peng Liu (6) Zhen Wang (6) Zhaoran Wang (6) Lingxiao Wang (5) Bin Zhao (5) Yang Zhang (5) Xiu Li (5)

Keywords

reinforcement learning (11) upper confidence bound (3) offline reinforcement learning (3) diffusion model (3) preference optimization (2) contrastive learning (2) radiology report generation (2) multi-objective optimization (2) policy transfer (2) dynamics mismatch (2) preference learning (2) medical imaging (2) information bottleneck (1) domain adaptation (1) multi-task learning (1) domain generalization (1) locomotion control (1) benchmark evaluation (1) sequential decision-making (1) sequential decision making (1)

Papers

Towards Adaptive Humanoid Control via Multi-Behavior Distillation and Reinforced Fine-Tuning AAAI 2026 Online Iterative Self-Alignment for Radiology Report Generation ACL 2025 Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration ACL 2025 VLP: Vision-Language Preference Learning for Embodied Manipulation EMNLP 2025 Online Preference Alignment for Language Models via Count-based Exploration ICLR 2025 Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning ICLR 2025 Discriminator-Guided Embodied Planning for LLM Agent ICLR 2025 Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner ICML 2025 Forward KL Regularized Preference Optimization for Aligning Diffusion Policies AAAI 2025 Radiology Report Generation via Multi-objective Preference Optimization AAAI 2025 Cross-Domain Policy Adaptation by Capturing Representation Mismatch ICML 2024 SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation ICML 2024 How Does Goal Relabeling Improve Sample Efficiency? ICML 2024 Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training NIPS 2024 ODRL: A Benchmark for Off-Dynamics Reinforcement Learning NIPS 2024 Regularized Conditional Diffusion Model for Multi-Task Preference Alignment NIPS 2024 Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective CORL 2024 OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments AAAI 2024 Constrained Ensemble Exploration for Unsupervised Skill Discovery ICML 2024 Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning ICML 2024 Behavior Contrastive Learning for Unsupervised Skill Discovery ICML 2023 Cross-Domain Policy Adaptation via Value-Guided Data Filtering NIPS 2023 Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning NIPS 2023 Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning ICLR 2022 RORL: Robust Offline Reinforcement Learning via Conservative Smoothing NIPS 2022 Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning ICML 2022 Principled Exploration via Optimistic Bootstrapping and Backward Induction ICML 2021 Dynamic Bottleneck for Robust Self-Supervised Exploration NIPS 2021