Jiafei Lyu
15 papers · 2022–2026 · 7 top CS/AI conferences
Achievements
Cross-Pollinator (5)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (7)
Taxonomy Completionist (22)
Triple Crown
Dynamic Duo (13)
Grand Slam
Keyword Collector (61)
Prolific Year (5)
Century Club (14)
Conferences
AAAI (4)
ICML (3)
NIPS (3)
ICLR (2)
CVPR (1)
EMNLP (1)
NAACL (1)
Keywords
reinforcement learning (3)
model-based reinforcement learning (2)
offline reinforcement learning (2)
sample efficiency (2)
domain randomization (1)
domain adaptation (1)
image generation (1)
reward learning (1)
direct preference optimization (1)
chain-of-thought reasoning (1)
preference learning (1)
reinforcement learning from human feedback (1)
value function (1)
continuous control (1)
novelty detection (1)
mathematical reasoning (1)
distribution shift (1)
value estimation (1)
benchmark evaluation (1)
multi-agent reinforcement learning (1)
Papers
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
AAAI 2026
VLP: Vision-Language Preference Learning for Embodied Manipulation
EMNLP 2025
Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning
AAAI 2025
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning
AAAI 2025
Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint
ICLR 2025
World Models with Hints of Large Language Models for Goal Achieving
NAACL 2025
Exploration and Anti-Exploration with Distributional Random Network Distillation
ICML 2024
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
NIPS 2024
Cross-Domain Policy Adaptation by Capturing Representation Mismatch
ICML 2024
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
CVPR 2024
SEABO: A Simple Search-Based Method for Offline Imitation Learning
ICLR 2024
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation
ICML 2024
Mildly Conservative Q-Learning for Offline Reinforcement Learning
NIPS 2022
Efficient Continuous Control with Double Actors and Regularized Critics
AAAI 2022
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination
NIPS 2022