Yi Su

24 papers · 2009–2026 · 10 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🏃 Academic Marathon (16) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐝 Cross-Pollinator (14)

🗺️ Taxonomy Completionist (42) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🚀 Conference Pioneer 🗃️ Keyword Collector (84) 🔥 Unstoppable (7) 💎 Century Club (22) ⚡ Prolific Year (5)

Conferences

ACL (5) ICML (4) NIPS (4) EMNLP (3) ICLR (2) WACV (2) IJCNLP (1) INTERSPEECH (1) JMLR (1) NAACL (1)

Top co-authors

Min Zhang (4) Juntao Li (4) Sergey Levine (3) Haitao Mi (3) Yafang Wang (2) Zujie Wen (2) Aviral Kumar (2) Akshay Krishnamurthy (2) Jing Zheng (2) Xiang Hu (2)

Keywords

contextual bandit (3) domain adaptation (3) reinforcement learning (2) doubly robust estimator (2) hierarchical language modeling (2) off-policy learning (2) label shift (2) online learning (2) test-time adaptation (2) off-policy evaluation (2) recursive transformer (2) differentiable tree (2) offline reinforcement learning (2) distribution shift (2) large language model (2) unsupervised parsing (2) medical imaging (1) self-supervised learning (1) optimal transport (1) sequence labeling (1)

Papers

OneRec-Think: In-Text Reasoning for Generative Recommendation ACL 2026 Crossing the Reward Bridge: Expanding Reinforcement Learning with Verifiable Rewards Across Diverse Domains ACL 2026 Understanding How Value Neurons Shape the Generation of Specified Values in LLMs EMNLP 2025 CUNSB-RFIE: Context-Aware Unpaired Neural Schrodinger Bridge in Retinal Fundus Image Enhancement WACV 2025 Accurate KV Cache Quantization with Outlier Tokens Tracing ACL 2025 Training Language Models to Self-Correct via Reinforcement Learning ICLR 2025 EVOLvE: Evaluating and Optimizing LLMs For In-Context Exploration ICML 2025 Demonstration Augmentation for Zero-shot In-context Learning ACL 2024 Online Feature Updates Improve Online (Generalized) Label Shift Adaptation NIPS 2024 Ordinal Classification With Distance Regularization for Robust Brain Age Prediction WACV 2024 Beware of Model Collapse! Fast and Stable Test-time Adaptation for Robust Question Answering EMNLP 2023 Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective NIPS 2023 Offline RL for Natural Language Generation with Implicit Language Q Learning ICLR 2023 Data-Driven Offline Decision-Making via Invariant Representation Learning NIPS 2022 Tianshou: A Highly Modularized Deep Reinforcement Learning Library JMLR 2022 Context-Aware Language Modeling for Goal-Oriented Dialogue Systems NAACL 2022 Online Adaptation to Label Distribution Shift NIPS 2021 R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling ACL 2021 R2D2: Recursive Transformer based on Differentiable Tree for Interpretable Hierarchical Language Modeling IJCNLP 2021 Adaptive Estimator Selection for Off-Policy Evaluation ICML 2020 Doubly robust off-policy evaluation with shrinkage ICML 2020 CAB: Continuous Adaptive Blending for Policy Evaluation and Learning ICML 2019 LSTM-Based NeuroCRFs for Named Entity Recognition INTERSPEECH 2016 Model Adaptation via Model Interpolation and Boosting for Web Search Ranking EMNLP 2009