Youngsoo Jang

18 papers · 2017–2026 · 10 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🏃 Academic Marathon (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (14)

🌍 Conference Polyglot (9) 🏃 Academic Marathon (8) 🗺️ Taxonomy Completionist (33) 🤝 Dynamic Duo (11) 🏆 Grand Slam 🏆 Keyword Champion (2) 🔥 Unstoppable (7) 💎 Century Club (16) 🗃️ Keyword Collector (75) 📈 Trend Setter

Conferences

ICML (4) ACL (3) EMNLP (2) ICLR (2) NIPS (2) AAAI (1) ACML (1) EACL (1) IJCAI (1) IJCNLP (1)

Top co-authors

Kee-eung Kim (11) Jongmin Lee (9) Geon-hyeong Kim (9) Moontae Lee (6) Honglak Lee (4) Yu Jin Kim (3) Byoungjip Kim (3) Hongseok Yang (3) Seokin Seo (2) Pierre Lison (2)

Keywords

reinforcement learning (3) variational inference (2) dialogue system (2) dialogue state tracking (2) stationary distribution (2) policy learning (2) recurrent neural network (2) goal-oriented dialogue (2) probabilistic rule (2) spoken dialogue system (2) preference optimization (1) text generation (1) dialogue generation (1) policy optimization (1) offline reinforcement learning (1) in-context learning (1) variance reduction (1) uncertainty quantification (1) divergence minimization (1) model-based reinforcement learning (1)

Papers

IRPO: Implicit Policy Regularized Preference Optimization EACL 2026 Efficiently Learning To Reason or Not to Reason: Root-token Policy Optimization for Adaptive Thinking ACL 2026 Online Pre-Training for Offline-to-Online Reinforcement Learning ICML 2025 Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking EMNLP 2024 Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments ACL 2024 Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration ICML 2024 Information-Theoretic State Space Model for Multi-View Reinforcement Learning ICML 2023 SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations NIPS 2023 GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems ICLR 2022 LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation NIPS 2022 Monte-Carlo Planning and Learning with Language Action Value Estimates ICLR 2021 Variational Inference for Sequential Data with Future Likelihood Estimates ICML 2020 End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2 ACL 2020 Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues AAAI 2020 PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules IJCNLP 2019 PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules EMNLP 2019 Trust Region Sequential Variational Inference ACML 2019 Constrained Bayesian Reinforcement Learning via Approximate Linear Programming IJCAI 2017