conftrace_

Youngsoo Jang

18 papers · 2017–2026 · 10 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+10 more ↓ πŸƒ Academic Marathon (8) 🧭 Keyword Pioneer πŸŒ‰ Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (14)
🌍 Conference Polyglot (9) πŸƒ Academic Marathon (8) πŸ—ΊοΈ Taxonomy Completionist (33) 🀝 Dynamic Duo (11) πŸ† Grand Slam πŸ† Keyword Champion (2) πŸ”₯ Unstoppable (7) πŸ’Ž Century Club (16) πŸ—ƒοΈ Keyword Collector (75) πŸ“ˆ Trend Setter

Conferences

ICML (4) ACL (3) EMNLP (2) ICLR (2) NIPS (2) AAAI (1) ACML (1) EACL (1) IJCAI (1) IJCNLP (1)

Papers

IRPO: Implicit Policy Regularized Preference Optimization EACL 2026 Efficiently Learning To Reason or Not to Reason: Root-token Policy Optimization for Adaptive Thinking ACL 2026 Online Pre-Training for Offline-to-Online Reinforcement Learning ICML 2025 Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking EMNLP 2024 Semantic Skill Grounding for Embodied Instruction-Following in Cross-Domain Environments ACL 2024 Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration ICML 2024 Information-Theoretic State Space Model for Multi-View Reinforcement Learning ICML 2023 SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations NIPS 2023 GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems ICLR 2022 LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation NIPS 2022 Monte-Carlo Planning and Learning with Language Action Value Estimates ICLR 2021 Variational Inference for Sequential Data with Future Likelihood Estimates ICML 2020 End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2 ACL 2020 Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues AAAI 2020 PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules IJCNLP 2019 PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules EMNLP 2019 Trust Region Sequential Variational Inference ACML 2019 Constrained Bayesian Reinforcement Learning via Approximate Linear Programming IJCAI 2017