Yi Su
24 papers · 2009–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Academic Marathon (16) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (10) π Cross-Pollinator (14)
πΊοΈ
Taxonomy Completionist
(42)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π
Conference Pioneer
ποΈ
Keyword Collector
(84)
π₯
Unstoppable
(7)
π
Century Club
(22)
β‘
Prolific Year
(5)
Conferences
ACL (5)
ICML (4)
NIPS (4)
EMNLP (3)
ICLR (2)
WACV (2)
IJCNLP (1)
INTERSPEECH (1)
JMLR (1)
NAACL (1)
Top co-authors
Keywords
contextual bandit
(3)
domain adaptation
(3)
reinforcement learning
(2)
doubly robust estimator
(2)
hierarchical language modeling
(2)
off-policy learning
(2)
label shift
(2)
online learning
(2)
test-time adaptation
(2)
off-policy evaluation
(2)
recursive transformer
(2)
differentiable tree
(2)
offline reinforcement learning
(2)
distribution shift
(2)
large language model
(2)
unsupervised parsing
(2)
medical imaging
(1)
self-supervised learning
(1)
optimal transport
(1)
sequence labeling
(1)
Papers
OneRec-Think: In-Text Reasoning for Generative Recommendation
ACL 2026
Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains
ACL 2026
Understanding How Value Neurons Shape the Generation of Specified Values in LLMs
EMNLP 2025
CUNSB-RFIE: Context-Aware Unpaired Neural Schrodinger Bridge in Retinal Fundus Image Enhancement
WACV 2025
Accurate KV Cache Quantization with Outlier Tokens Tracing
ACL 2025
Training Language Models to Self-Correct via Reinforcement Learning
ICLR 2025
EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration
ICML 2025
Demonstration Augmentation for Zero-shot In-context Learning
ACL 2024
Online Feature Updates Improve Online (Generalized) Label Shift Adaptation
NIPS 2024
Ordinal Classification With Distance Regularization for Robust Brain Age Prediction
WACV 2024
Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering
EMNLP 2023
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective
NIPS 2023
Offline RL for Natural Language Generation with Implicit Language Q Learning
ICLR 2023
Data-Driven Offline Decision-Making via Invariant Representation Learning
NIPS 2022
Tianshou: A Highly Modularized Deep Reinforcement Learning Library
JMLR 2022
Context-Aware Language Modeling for Goal-Oriented Dialogue Systems
NAACL 2022
Online Adaptation to Label Distribution Shift
NIPS 2021
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
ACL 2021
R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling
IJCNLP 2021
Adaptive Estimator Selection for Off-Policy Evaluation
ICML 2020
Doubly robust off-policy evaluation with shrinkage
ICML 2020
CAB: Continuous Adaptive Blending for Policy Evaluation and Learning
ICML 2019
LSTM-Based NeuroCRFs for Named Entity Recognition
INTERSPEECH 2016
Model Adaptation via Model Interpolation and Boosting for Web Search Ranking
EMNLP 2009