conftrace_

Jiafei Lyu

15 papers · 2022–2026 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+7 more ↓

🐝 Cross-Pollinator (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🗺️ Taxonomy Completionist (22)

🗺️ Taxonomy Completionist (22) 👑 Triple Crown 🤝 Dynamic Duo (13) 🏆 Grand Slam 🗃️ Keyword Collector (61) ⚡ Prolific Year (5) 💎 Century Club (14)

Conferences

AAAI (4) ICML (3) NIPS (3) ICLR (2) CVPR (1) EMNLP (1) NAACL (1)

Top co-authors

Xiu Li (14) Zongqing Lu (6) Runze Liu (5) Xiaoteng Ma (4) Jian Tao (4) Kai Yang (3) Chenjia Bai (3) Yali Du (2) Mengbei Yan (2) Jing-Wen Yang (2)

Keywords

reinforcement learning (3) model-based reinforcement learning (2) offline reinforcement learning (2) sample efficiency (2) domain randomization (1) domain adaptation (1) image generation (1) reward learning (1) direct preference optimization (1) chain-of-thought reasoning (1) preference learning (1) reinforcement learning from human feedback (1) value function (1) continuous control (1) novelty detection (1) mathematical reasoning (1) distribution shift (1) value estimation (1) benchmark evaluation (1) multi-agent reinforcement learning (1)

Papers

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning AAAI 2026 VLP: Vision-Language Preference Learning for Embodied Manipulation EMNLP 2025 Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning AAAI 2025 SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning AAAI 2025 Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint ICLR 2025 World Models with Hints of Large Language Models for Goal Achieving NAACL 2025 Exploration and Anti-Exploration with Distributional Random Network Distillation ICML 2024 ODRL: A Benchmark for Off-Dynamics Reinforcement Learning NIPS 2024 Cross-Domain Policy Adaptation by Capturing Representation Mismatch ICML 2024 Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model CVPR 2024 SEABO: A Simple Search-Based Method for Offline Imitation Learning ICLR 2024 PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation ICML 2024 Mildly Conservative Q-Learning for Offline Reinforcement Learning NIPS 2022 Efficient Continuous Control with Double Actors and Regularized Critics AAAI 2022 Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination NIPS 2022