Chenjia Bai
28 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (7) π Cross-Pollinator (9) π§ Keyword Pioneer π Interdisciplinary Bridge π Academic Marathon (5)
π
Academic Marathon
(5)
π
Renaissance Researcher
(7)
πΊοΈ
Taxonomy Completionist
(38)
π€
Dynamic Duo
(11)
π
Triple Crown
π
Grand Slam
β‘
Prolific Year
(10)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(106)
β
The Questioner
π
Century Club
(27)
Conferences
ICML (9)
NIPS (7)
AAAI (4)
ICLR (4)
ACL (2)
CORL (1)
EMNLP (1)
Top co-authors
Keywords
reinforcement learning
(11)
upper confidence bound
(3)
offline reinforcement learning
(3)
diffusion model
(3)
preference optimization
(2)
contrastive learning
(2)
radiology report generation
(2)
multi-objective optimization
(2)
policy transfer
(2)
dynamics mismatch
(2)
preference learning
(2)
medical imaging
(2)
information bottleneck
(1)
domain adaptation
(1)
multi-task learning
(1)
domain generalization
(1)
locomotion control
(1)
benchmark evaluation
(1)
sequential decision-making
(1)
sequential decision making
(1)
Papers
Towards Adaptive Humanoid Control via Multi-Behavior Distillation and Reinforced Fine-Tuning
AAAI 2026
Online Iterative Self-Alignment for Radiology Report Generation
ACL 2025
Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration
ACL 2025
VLP: Vision-Language Preference Learning for Embodied Manipulation
EMNLP 2025
Online Preference Alignment for Language Models via Count-based Exploration
ICLR 2025
Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning
ICLR 2025
Discriminator-Guided Embodied Planning for LLM Agent
ICLR 2025
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
ICML 2025
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
AAAI 2025
Radiology Report Generation via Multi-objective Preference Optimization
AAAI 2025
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
ICML 2024
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
ICML 2024
How Does Goal Relabeling Improve Sample Efficiency?
ICML 2024
Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training
NIPS 2024
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
NIPS 2024
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
NIPS 2024
Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective
CORL 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
AAAI 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
ICML 2024
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
ICML 2024
Behavior Contrastive Learning for Unsupervised Skill Discovery
ICML 2023
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
NIPS 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
NIPS 2023
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
ICLR 2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
NIPS 2022
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
ICML 2022
Principled Exploration via Optimistic Bootstrapping and Backward Induction
ICML 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration
NIPS 2021