Daiki E. Matsunaga
3 papers · 2023–2024 · 3 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (3)
Interdisciplinary Bridge
Taxonomy Completionist (13)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
AAAI (1)
EMNLP (1)
NIPS (1)
Top co-authors
Keywords
offline reinforcement learning (3)
direct preference optimization (1)
preference alignment (1)
conditional generation (1)
stationary distribution (1)
diffusion model (1)
language model (1)
reward model (1)
out-of-distribution action (1)
generative flow network (1)
value decomposition (1)
goal-conditioned planning (1)
large language model (1)
multi-agent system (1)
trajectory stitching (1)
diversity seeking (1)
reinforcement learning (1)
nash policy (1)
Papers
Stitching Sub-trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
AAAI 2024
GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets
EMNLP 2024
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
NIPS 2023