Youngsoo Jang
18 papers · 2017–2026 · 10 conferences · across top CS/AI conferences
Achievements
Academic Marathon (8)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (9)
Cross-Pollinator (14)
Taxonomy Completionist (33)
Dynamic Duo (11)
Grand Slam
Keyword Champion (2)
Unstoppable (7)
Century Club (16)
Keyword Collector (75)
Trend Setter
Conferences
ICML (4)
ACL (3)
EMNLP (2)
ICLR (2)
NIPS (2)
AAAI (1)
ACML (1)
EACL (1)
IJCAI (1)
IJCNLP (1)
Top co-authors
Keywords
reinforcement learning (3)
variational inference (2)
dialogue system (2)
dialogue state tracking (2)
stationary distribution (2)
policy learning (2)
recurrent neural network (2)
goal-oriented dialogue (2)
probabilistic rule (2)
spoken dialogue system (2)
preference optimization (1)
text generation (1)
dialogue generation (1)
policy optimization (1)
offline reinforcement learning (1)
in-context learning (1)
variance reduction (1)
uncertainty quantification (1)
divergence minimization (1)
model-based reinforcement learning (1)
Papers
IRPO: Implicit Policy Regularized Preference Optimization
EACL 2026
Efficiently Learning To Reason or Not to Reason: Root-token Policy Optimization for Adaptive Thinking
ACL 2026
Online Pre-Training for Offline-to-Online Reinforcement Learning
ICML 2025
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking
EMNLP 2024
Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments
ACL 2024
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration
ICML 2024
Information-Theoretic State Space Model for Multi-View Reinforcement Learning
ICML 2023
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations
NIPS 2023
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
ICLR 2022
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
NIPS 2022
Monte-Carlo Planning and Learning with Language Action Value Estimates
ICLR 2021
Variational Inference for Sequential Data with Future Likelihood Estimates
ICML 2020
End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2
ACL 2020
Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues
AAAI 2020
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules
IJCNLP 2019
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules
EMNLP 2019
Trust Region Sequential Variational Inference
ACML 2019
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming
IJCAI 2017