Yuhang Jiang
15 papers · 2020–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🐝 Cross-Pollinator (4) 🧭 Keyword Pioneer 🏃 Academic Marathon (5) 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (7)
🧭
Keyword Pioneer
🌍
Conference Polyglot
(6)
🏃
Academic Marathon
(5)
🤝
Dynamic Duo
(12)
⚡
Prolific Year
(6)
💎
Century Club
(14)
🗃️
Keyword Collector
(77)
Conferences
NIPS (5)
AAAI (4)
ICML (2)
AACL (1)
CVPR (1)
IJCNLP (1)
OSDI (1)
Top co-authors
Keywords
multi-agent reinforcement learning
(4)
offline reinforcement learning
(4)
large language model
(3)
skill discovery
(2)
markov decision process
(2)
biomedical nlp
(2)
knowledge graph
(2)
policy learning
(2)
zero-shot learning
(2)
centralized training decentralized execution
(2)
relation extraction
(2)
partial observability
(2)
unsupervised reinforcement learning
(2)
weakly supervised learning
(1)
attention mechanism
(1)
embedding learning
(1)
similarity search
(1)
multi-task learning
(1)
reinforcement learning
(1)
computer vision
(1)
Papers
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
AAAI 2025
A benchmark for end-to-end zero-shot biomedical relation extraction with LLMs: experiments with OpenAI models
IJCNLP 2025
A benchmark for end-to-end zero-shot biomedical relation extraction with LLMs: experiments with OpenAI models
AACL 2025
LLM-Empowered State Representation for Reinforcement Learning
ICML 2024
Doubly Mild Generalization for Offline Reinforcement Learning
NIPS 2024
DARL: Distance-Aware Uncertainty Estimation for Offline Reinforcement Learning
AAAI 2023
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks
NIPS 2023
Complementary Attention for Multi-Agent Reinforcement Learning
ICML 2023
Wasserstein Unsupervised Reinforcement Learning
AAAI 2022
Self-Organized Group for Cooperative Multi-agent Reinforcement Learning
NIPS 2022
SPD: Synergy Pattern Diversifying Oriented Unsupervised Multi-agent Reinforcement Learning
NIPS 2022
State Deviation Correction for Offline Reinforcement Learning
AAAI 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning
NIPS 2022
FAERY: An FPGA-accelerated Embedding-based Retrieval System
OSDI 2022
PFRL: Pose-Free Reinforcement Learning for 6D Pose Estimation
CVPR 2020