Jianye Hao
136 papers · 2013–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (30) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Cross-Pollinator
(4)
π
Interdisciplinary Bridge
π
Academic Marathon
(12)
π
Conference Loyalist
(24)
π€
Dynamic Duo
(38)
π
Triple Crown
π¬
Deep Specialist
(11)
π§¬
Topic Evolution
π
Grand Slam
π₯
Mega-Team
(32)
π
Keyword Champion
(3)
π
Conference Pioneer
π₯
Unstoppable
(9)
π
Century Club
(135)
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(103)
Conferences
ICML (30)
ICLR (24)
NIPS (24)
AAAI (21)
IJCAI (20)
CVPR (6)
ACL (3)
CORL (2)
ICCV (2)
UAI (2)
AISTATS (1)
EMNLP (1)
Top co-authors
Keywords
reinforcement learning
(20)
deep reinforcement learning
(18)
multi-agent reinforcement learning
(14)
multi-agent system
(10)
graph neural network
(10)
offline reinforcement learning
(8)
policy learning
(7)
policy optimization
(7)
representation learning
(6)
transfer learning
(6)
model-based reinforcement learning
(6)
policy transfer
(4)
contrastive learning
(4)
large language model
(4)
attention mechanism
(3)
evolutionary algorithm
(3)
question answering
(3)
trajectory generation
(3)
diffusion model
(3)
variational autoencoder
(3)
Papers
AgentSwift: Efficient LLM Agent Design via Value-Guided Hierarchical Search
AAAI 2026
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
ICML 2025
Trajectory World Models for Heterogeneous Environments
ICML 2025
Accelerating Large Language Model Reasoning via Speculative Search
ICML 2025
Boosting Multi-Domain Fine-Tuning of Large Language Models through Evolving Interactions between Samples
ICML 2025
STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
ICML 2025
R*: Efficient Reward Design via Reward Structure Evolution and Parameter Alignment Optimization with Large Language Models
ICML 2025
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
ICML 2025
Differentiable Integer Linear Programming
ICLR 2025
Lightweight Neural App Control
ICLR 2025
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agent
ICLR 2025
The Graphβs Apprentice: Teaching an LLM Low-Level Knowledge for Circuit Quality Estimation
IJCAI 2025
Reinforced In-Context Black-Box Optimization
IJCAI 2025
SWAMamba: A Sliding Window Attention Mamba Framework for Predicting Translation Elongation Rates
AAAI 2025
Improving Generalization in Offline Reinforcement Learning via Latent Distribution Representation Learning
AAAI 2025
DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering
ACL 2025
War of Thoughts: Competition Stimulates Stronger Reasoning in Large Language Models
ACL 2025
3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds
ICLR 2025
SPA-BENCH: A COMPREHENSIVE BENCHMARK FOR SMARTPHONE AGENT EVALUATION
ICLR 2025
EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights
CVPR 2025
Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation
CVPR 2025
A Graph Enhanced Symbolic Discovery Framework For Efficient Logic Optimization
ICLR 2025
RoboAnnotatorX: A Comprehensive and Universal Annotation Framework for Accurate Understanding of Long-horizon Robot Demonstration
ICCV 2025
Apollo-MILP: An Alternating Prediction-Correction Neural Solving Framework for Mixed-Integer Linear Programming
ICLR 2025
Computing Circuits Optimization via Model-Based Circuit Genetic Evolution
ICLR 2025
LaMPlace: Learning to Optimize Cross-Stage Metrics in Macro Placement
ICLR 2025
A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design
ICML 2024
HarmonyDream: Task Harmonization Inside World Models
ICML 2024
Rethinking Decision Transformer via Hierarchical Reinforcement Learning
ICML 2024
Unlock the Cognitive Generalization of Deep Reinforcement Learning via Granular Ball Representation
ICML 2024
EvoRainbow: Combining Improvements in Evolutionary Reinforcement Learning for Policy Search
ICML 2024
Value-Evolutionary-Based Reinforcement Learning
ICML 2024
Towards General Algorithm Discovery for Combinatorial Optimization: Learning Symbolic Branching Policy from Bipartite Graph
ICML 2024
KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations
ICML 2024
Reinforcement Learning within Tree Search for Fast Macro Placement
ICML 2024
Rethinking Branching on Exact Combinatorial Optimization Solver: The First Deep Symbolic Discovery Framework
ICLR 2024
Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-based Monte Carlo Tree Search
UAI 2024
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles
IJCAI 2024
Improving Unsupervised Hierarchical Representation with Reinforcement Learning
CVPR 2024
Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces
AAAI 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
AAAI 2024
PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning
AAAI 2024
A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning
AAAI 2024
PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling
AAAI 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
IJCAI 2024
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
ICLR 2024
Sample-Efficient Quality-Diversity by Cooperative Coevolution
ICLR 2024
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback
ICLR 2024
Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts
CVPR 2024
EWEK-QA : Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems
ACL 2024
Sample-Efficient Multiagent Reinforcement Learning with Reset Replay
ICML 2024
PERIA: Perceive, Reason, Imagine, Act via Holistic Language and Vision Planning for Manipulation
NIPS 2024
FlexPlanner: Flexible 3D Floorplanning via Deep Reinforcement Learning in Hybrid Action Space with Multi-Modality Representation
NIPS 2024
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning
NIPS 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
NIPS 2024
The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional Space
NIPS 2024
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
NIPS 2024
Towards Next-Generation Logic Synthesis: A Scalable Neural Circuit Generation Framework
NIPS 2024
Unlock the Intermittent Control Ability of Model Free Reinforcement Learning
NIPS 2024
DiffuserLite: Towards Real-time Diffusion Planning
NIPS 2024
A Hierarchical Adaptive Multi-Task Reinforcement Learning Framework for Multiplier Circuit Design
ICML 2024
Improving Generalization in Offline Reinforcement Learning via Adversarial Data Splitting
ICML 2024
Co-Speech Gesture Synthesis by Reinforcement Learning With Contrastive Pre-Trained Rewards
CVPR 2023
Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems
AAAI 2023
SplitNet: A Reinforcement Learning Based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem
AAAI 2023
Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning
AAAI 2023
Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network
AAAI 2023
Spectral Augmentations for Graph Contrastive Learning
AISTATS 2023
Traj-MAE: Masked Autoencoders for Trajectory Prediction
ICCV 2023
EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model
ICLR 2023
Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks
ICLR 2023
ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation
ICLR 2023
CFlowNets: Continuous Control with Generative Flow Networks
ICLR 2023
Out-of-distribution Detection with Implicit Outlier Transformation
ICLR 2023
DAG Matters! GFlowNets Enhanced Explainer for Graph Neural Networks
ICLR 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
ICLR 2023
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer
ICML 2023
RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution
ICML 2023
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL
ICML 2023
Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs
IJCAI 2023
Online Ad Hoc Teamwork under Partial Observability
ICLR 2022
Neuro-Symbolic Hierarchical Rule Induction
ICML 2022
Learning State Representations via Retracing in Reinforcement Learning
ICLR 2022
Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System
CORL 2022
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning
ICML 2022
Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing
NIPS 2022
Multiagent Q-learning with Sub-Team Coordination
NIPS 2022
DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning
NIPS 2022
The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design
NIPS 2022
Versatile Multi-stage Graph Neural Network for Circuit Representation
NIPS 2022
GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis
NIPS 2022
Plan To Predict: Learning an Uncertainty-Foreseeing Model For Model-Based Reinforcement Learning
NIPS 2022
Cross-domain adaptive transfer reinforcement learning based on state-action correspondence
UAI 2022
PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations
IJCAI 2022
What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator
AAAI 2022
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
ICLR 2022
PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration
ICML 2022
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization
ICML 2022
Offline-to-Online Co-Evolutional User Simulator and Dialogue System
EMNLP 2022
Individual Reward Assisted Multi-Agent Reinforcement Learning
ICML 2022
Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
AAAI 2021
An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning
NIPS 2021
Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning
NIPS 2021
A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems
NIPS 2021
Ordering-Based Causal Discovery with Reinforcement Learning
IJCAI 2021
Dynamic Bottleneck for Robust Self-Supervised Exploration
NIPS 2021
Model-Based Reinforcement Learning via Imagination with Derived Memory
NIPS 2021
Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction
AAAI 2021
Adaptive Online Packing-guided Search for POMDPs
NIPS 2021
Addressing Action Oscillations through Learning Policy Inertia
AAAI 2021
CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models
CVPR 2021
Principled Exploration via Optimistic Bootstrapping and Backward Induction
ICML 2021
Action Semantics Network: Considering the Effects of Actions in Multiagent Systems
ICLR 2020
Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
ICML 2020
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
ICML 2020
From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning
AAAI 2020
Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning
AAAI 2020
Multi-Agent Game Abstraction via Graph Attention Neural Network
AAAI 2020
Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management
AAAI 2020
SMARTS: An Open-Source Scalable Multi-Agent RL Training School for Autonomous Driving
CORL 2020
KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge
IJCAI 2020
Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets
IJCAI 2020
Efficient Deep Reinforcement Learning via Adaptive Policy Transfer
IJCAI 2020
Generating Behavior-Diverse Game AIs with Evolutionary Multi-Objective Deep Reinforcement Learning
IJCAI 2020
Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising
IJCAI 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
NIPS 2020
An Optimal Rewiring Strategy for Cooperative Multiagent Social Learning
AAAI 2019
Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework
IJCAI 2019
Towards Efficient Detection and Optimal Response against Sophisticated Opponents
IJCAI 2019
Building Personalized Simulator for Interactive Search
IJCAI 2019
Explicitly Coordinated Policy Iteration
IJCAI 2019
Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces
IJCAI 2019
A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents
NIPS 2018
Recurrent Deep Multiagent Q-Learning for Autonomous Brokers in Smart Grid
IJCAI 2018
Defending Against Man-In-The-Middle Attack in Repeated Games
IJCAI 2017
The Dynamics of Reinforcement Social Learning in Cooperative Multiagent Systems
IJCAI 2013