Zhang-Wei Hong
19 papers · 2017–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Conference Polyglot (6) π Academic Marathon (8) π Interdisciplinary Bridge π£ Hot Topic Early Bird π Cross-Pollinator (14)
πΊοΈ
Taxonomy Completionist
(17)
π
Conference Polyglot
(6)
π
Academic Marathon
(8)
π€
Dynamic Duo
(11)
π
Triple Crown
π
Century Club
(18)
β‘
Prolific Year
(5)
ποΈ
Keyword Collector
(53)
Conferences
ICLR (6)
ICML (4)
NIPS (4)
IJCAI (2)
ACL (1)
CORL (1)
L4DC (1)
Top co-authors
Keywords
deep reinforcement learning
(4)
reinforcement learning
(4)
imitation learning
(2)
policy optimization
(2)
mathematical reasoning
(1)
policy learning
(1)
sequential decision making
(1)
domain adaptation
(1)
offline reinforcement learning
(1)
logical reasoning
(1)
robotic manipulation
(1)
constrained optimization
(1)
sim-to-real transfer
(1)
model predictive control
(1)
sample efficiency
(1)
behavior cloning
(1)
importance sampling
(1)
robot control
(1)
off-policy learning
(1)
semantic segmentation
(1)
Papers
Tailored Primitive Initialization is the Secret Key to Reinforcement Learning
ACL 2026
ORSO: Accelerating Reward Design via Online Reward Selection and Policy Optimization
ICLR 2025
ReGen: Generative Robot Simulation via Inverse Design
ICLR 2025
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
ICML 2025
Going Beyond Heuristics by Imposing Policy Improvement as a Constraint
NIPS 2024
Curiosity-driven Red-teaming for Large Language Models
ICLR 2024
Random Latent Exploration for Deep Reinforcement Learning
ICML 2024
Parallel $Q$-Learning: Scaling Off-policy Reinforcement Learning under Massively Parallel Simulation
ICML 2023
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting
ICLR 2023
Model Predictive Control via On-Policy Imitation Learning
L4DC 2023
TGRL: An Algorithm for Teacher Guided Reinforcement Learning
ICML 2023
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
NIPS 2023
Topological Experience Replay
ICLR 2022
Redeeming intrinsic rewards via constrained optimization
NIPS 2022
Bi-linear Value Networks for Multi-goal Reinforcement Learning
ICLR 2022
Adversarial Active Exploration for Inverse Dynamics Model Learning
CORL 2019
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
NIPS 2018
Virtual-to-Real: Learning to Control in Visual Semantic Segmentation
IJCAI 2018
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
IJCAI 2017