Kee-eung Kim
54 papers · 2011–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (13) π Interdisciplinary Bridge π Conference Polyglot (13)
π
Academic Marathon
(14)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(13)
π€
Dynamic Duo
(22)
π
Triple Crown
π§¬
Topic Evolution
π
Keyword Champion
π
Grand Slam
π¬
Deep Specialist
(11)
π
Trend Setter
π₯
Unstoppable
(11)
π
Conference Pioneer
β‘
Prolific Year
(9)
ποΈ
Keyword Collector
(216)
π
Century Club
(53)
Conferences
NIPS (15)
ICLR (9)
AAAI (7)
ICML (5)
IJCAI (5)
EMNLP (3)
ACL (2)
AISTATS (2)
NAACL (2)
ACML (1)
IJCNLP (1)
INTERSPEECH (1)
JMLR (1)
Top co-authors
Keywords
bayesian inference
(7)
offline reinforcement learning
(7)
reinforcement learning
(6)
variational inference
(5)
stationary distribution
(4)
reward function
(3)
inverse reinforcement learning
(3)
model-based reinforcement learning
(3)
imitation learning
(3)
language model
(3)
policy learning
(3)
monte-carlo tree search
(3)
dialogue state tracking
(2)
partially observable markov decision process
(2)
partial observability
(2)
direct preference optimization
(2)
adversarial learning
(2)
bayesian reinforcement learning
(2)
data augmentation
(2)
metric learning
(2)
Papers
Optimizing Preferential Rate in Retail Lending with Causal Inference and Domain Adaptation
AAAI 2026
Monet: Mixture of Monosemantic Experts for Transformers
ICLR 2025
Goal-Conditioned DPO: Prioritizing Safety in Misaligned Instructions
NAACL 2025
Diversification of Adaptive Policy for Effective Offline Reinforcement Learning
IJCAI 2024
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
ICLR 2024
GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets
EMNLP 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
ACL 2024
Stitching Sub-trajectories with Conditional Diffusion Model for Goal-Conditioned Offline RL
AAAI 2024
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization
INTERSPEECH 2024
Data Augmentation with Diffusion for Open-Set Semi-Supervised Learning
NIPS 2024
Mitigating Covariate Shift in Behavioral Cloning via Robust Stationary Distribution Correction
NIPS 2024
A Submodular Optimization Approach to Accountable Loan Approval
AAAI 2024
AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation
NIPS 2023
Bayesian Multi-Task Transfer Learning for Soft Prompt Tuning
EMNLP 2023
Information-Theoretic State Space Model for Multi-View Reinforcement Learning
ICML 2023
Trustworthy Residual Vehicle Value Prediction for Auto Finance
AAAI 2023
Regularized Behavior Cloning for Blocking the Leakage of Past Action Information
NIPS 2023
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
ICLR 2022
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
NIPS 2022
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
NIPS 2022
Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning
ICLR 2022
DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations
ICLR 2022
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
ICLR 2022
PAC-Net: A Model Pruning Approach to Inductive Transfer Learning
ICML 2022
Data Augmentation for Learning to Play in Text-Based Games
IJCAI 2022
Learning to Embed Multi-Modal Contexts for Situated Conversational Agents
NAACL 2022
Monte-Carlo Planning and Learning with Language Action Value Estimates
ICLR 2021
Representation Balancing Offline Model-based Reinforcement Learning
ICLR 2021
Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic
ICLR 2021
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
ICML 2021
Multi-View Representation Learning via Total Correlation Objective
NIPS 2021
Variational Interaction Information Maximization for Cross-domain Disentanglement
NIPS 2020
Reinforcement Learning for Control with Multiple Frequencies
NIPS 2020
Batch Reinforcement Learning with Hyperparameter Gradients
ICML 2020
Variational Inference for Sequential Data with Future Likelihood Estimates
ICML 2020
Residual Neural Processes
AAAI 2020
Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients
AAAI 2020
End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2
ACL 2020
Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues
AAAI 2020
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules
EMNLP 2019
PyOpenDial: A Python-based Domain-Independent Toolkit for Developing Spoken Dialogue Systems with Probabilistic Rules
IJCNLP 2019
Trust Region Sequential Variational Inference
ACML 2019
A Bayesian Approach to Generative Adversarial Imitation Learning
NIPS 2018
Monte-Carlo Tree Search for Constrained POMDPs
NIPS 2018
Hierarchically-partitioned Gaussian Process Approximation
AISTATS 2017
Constrained Bayesian Reinforcement Learning via Approximate Linear Programming
IJCAI 2017
Generative Local Metric Learning for Kernel Regression
NIPS 2017
Bayesian Reinforcement Learning with Behavioral Feedback
IJCAI 2016
Reactive bandits with attitude
AISTATS 2015
Bayesian Nonparametric Feature Construction for Inverse Reinforcement Learning
IJCAI 2013
Nonparametric Bayesian Inverse Reinforcement Learning for Multiple Reward Functions
NIPS 2012
Cost-Sensitive Exploration in Bayesian Reinforcement Learning
NIPS 2012
MAP Inference for Bayesian Inverse Reinforcement Learning
NIPS 2011
Inverse Reinforcement Learning in Partially Observable Environments
JMLR 2011