Geon-hyeong Kim
14 papers · 2018–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Academic Marathon (7) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (6) π Cross-Pollinator (14)
πΊοΈ
Taxonomy Completionist
(28)
π£
Hot Topic Early Bird
π
Conference Polyglot
(6)
π
Grand Slam
π
Century Club
(13)
π
Trend Setter
ποΈ
Keyword Collector
(63)
π₯
Unstoppable
(8)
Conferences
NIPS (5)
ICML (4)
AAAI (1)
ACML (1)
EACL (1)
EMNLP (1)
ICLR (1)
Top co-authors
Keywords
variational inference
(3)
monte-carlo tree search
(2)
reinforcement learning
(2)
policy learning
(2)
stationary distribution
(2)
latent representation
(2)
divergence minimization
(1)
imitation learning
(1)
representation learning
(1)
gradient-based optimization
(1)
in-context learning
(1)
preference optimization
(1)
bayesian inference
(1)
image-to-image translation
(1)
offline reinforcement learning
(1)
variance reduction
(1)
partially observable markov decision process
(1)
value function
(1)
gradient optimization
(1)
mutual information
(1)
Papers
IRPO: Implicit Policy Regularized Preference Optimization
EACL 2026
Online Pre-Training for Offline-to-Online Reinforcement Learning
ICML 2025
Prospector: Improving LLM Agents with Self-Asking and Trajectory Ranking
EMNLP 2024
Degeneration-free Policy Optimization: RL Fine-Tuning for Language Models without Degeneration
ICML 2024
Information-Theoretic State Space Model for Multi-View Reinforcement Learning
ICML 2023
SafeDICE: Offline Safe Imitation Learning with Non-Preferred Demonstrations
NIPS 2023
LobsDICE: Offline Learning from Observation via Stationary Distribution Correction Estimation
NIPS 2022
DemoDICE: Offline Imitation Learning with Supplementary Imperfect Demonstrations
ICLR 2022
Multi-View Representation Learning via Total Correlation Objective
NIPS 2021
Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients
AAAI 2020
Variational Inference for Sequential Data with Future Likelihood Estimates
ICML 2020
Variational Interaction Information Maximization for Cross-domain Disentanglement
NIPS 2020
Trust Region Sequential Variational Inference
ACML 2019
Monte-Carlo Tree Search for Constrained POMDPs
NIPS 2018