Yong Yu
68 papers · 2008–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (18) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
π€
Dynamic Duo
(52)
π
Triple Crown
π§¬
Topic Evolution
π
Keyword Champion
π
Grand Slam
π±
Topic Pioneer
ποΈ
Keyword Collector
(253)
π
Conference Pioneer
β‘
Prolific Year
(7)
π₯
Unstoppable
(10)
β
The Questioner
π
Trend Setter
π
Century Club
(63)
Conferences
ACL (14)
AAAI (11)
IJCAI (8)
NIPS (7)
EMNLP (6)
ICLR (6)
ICML (6)
IJCNLP (3)
JMLR (3)
COLING (2)
AISTATS (1)
NAACL (1)
Top co-authors
Research topics
Keywords
reinforcement learning
(9)
large language model
(5)
transfer learning
(5)
model-based reinforcement learning
(4)
multi-agent system
(4)
policy optimization
(4)
recommender system
(4)
neural machine translation
(4)
policy gradient
(4)
graph neural network
(4)
code generation
(4)
sample efficiency
(3)
process reward model
(3)
domain adaptation
(3)
unsupervised learning
(3)
multi-agent reinforcement learning
(2)
offline reinforcement learning
(2)
neural architecture search
(2)
text generation
(2)
named entity recognition
(2)
Papers
LoopTool: Closing the DataβTraining Loop for Robust LLM Tool Calls
ACL 2026
A Survey of Large Language Model-Based Search Agents
ACL 2026
CoreCodeBench: Decoupling Code Intelligence via Fine-Grained Repository-Level Tasks
ACL 2026
Offline Fictitious Self-Play for Competitive Games
AAAI 2026
A Comprehensive Survey of Process Reward Models: Data Generation, Model Construction, and Usage
ACL 2026
Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs
AAAI 2025
CodePRM: Execution Feedback-enhanced Process Reward Model for Code Generation
ACL 2025
DebateCoder: Towards Collective Intelligence of LLMs via Test Case Driven LLM Debate for Code Generation
ACL 2025
Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning
ACL 2025
NL-Debugging: Exploiting Natural Language as an Intermediate Representation for Code Debugging
EMNLP 2025
RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation
EMNLP 2025
Large Language Models are Demonstration Pre-Selectors for Themselves
ICML 2025
Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation
ACL 2025
MADiff: Offline Multi-agent Learning with Diffusion Models
NIPS 2024
Lending Interaction Wings to Recommender Systems with Conversational Agents
NIPS 2023
MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning
JMLR 2023
Learning Decomposed Spatial Relations for Multi-Variate Time-Series Modeling
AAAI 2023
Adaptation Augmented Model-based Policy Optimization
JMLR 2023
Set-to-Sequence Ranking-Based Concept-Aware Learning Path Recommendation
AAAI 2023
Why Propagate Alone? Parallel Use of Labels and Features on Graphs
ICLR 2022
Inductive Relation Prediction Using Analogy Subgraph Embeddings
ICLR 2022
Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning
NIPS 2022
PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation
COLING 2022
Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization
ICML 2022
Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection
AAAI 2022
Nested Named Entity Recognition with Span-level Graphs
ACL 2022
Learning Logic Rules for Document-Level Relation Extraction
EMNLP 2021
Universal Trading for Order Execution with Oracle Policy Distillation
AAAI 2021
Glancing Transformer for Non-Autoregressive Neural Machine Translation
ACL 2021
On Effective Scheduling of Model-based Reinforcement Learning
NIPS 2021
MARS: Markov Molecular Sampling for Multi-objective Drug Discovery
ICLR 2021
Glancing Transformer for Non-Autoregressive Neural Machine Translation
IJCNLP 2021
MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks
IJCAI 2021
Aggregating Crowd Wisdom with Side Information via a Clustering-based Label-aware Autoencoder
IJCAI 2020
Model-based Policy Optimization with Unsupervised Model Adaptation
NIPS 2020
Efficient Projection-free Algorithms for Saddle Point Problems
NIPS 2020
Infomax Neural Joint Source-Channel Coding via Adversarial Bit Flip
AAAI 2020
Towards Making the Most of BERT in Neural Machine Translation
AAAI 2020
Efficient Spectrum-Revealing CUR Matrix Decomposition
AISTATS 2020
Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space
EMNLP 2020
Multi-Agent Interactions Modeling with Correlated Policies
ICLR 2020
Bidirectional Model-based Policy Optimization
ICML 2020
Improving Knowledge Tracing via Pre-training Question Embeddings
IJCAI 2020
DropNAS: Grouped Operation Dropout for Differentiable Architecture Search
IJCAI 2020
Efficient and Robust High-Dimensional Linear Contextual Bandits
IJCAI 2020
Large-Scale Interactive Recommendation with Tree-Structured Policy Gradient
AAAI 2019
Exploring Diverse Expressions for Paraphrase Generation
IJCNLP 2019
Exploring Diverse Expressions for Paraphrase Generation
EMNLP 2019
Dynamically Fused Graph Network for Multi-hop Reasoning
ACL 2019
AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods
ICLR 2019
Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
IJCAI 2019
Lipschitz Generative Adversarial Nets
ICML 2019
Deep Recurrent Survival Analysis
AAAI 2019
Guiding the One-to-One Mapping in CycleGAN via Optimal Transport
AAAI 2019
Activation Maximization Generative Adversarial Nets
ICLR 2018
Learning to Design Games: Strategic Environments in Reinforcement Learning
IJCAI 2018
Path-Level Network Transformation for Efficient Architecture Search
ICML 2018
Label-Aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition
NAACL 2018
Aggregating Crowd Wisdoms with Label-aware Autoencoders
IJCAI 2017
Context-Dependent Sense Embedding
EMNLP 2016
General Functional Matrix Factorization Using Gradient Boosting
ICML 2013
SVDFeature: A Toolkit for Feature-based Collaborative Filtering
JMLR 2012
Heterogeneous Transfer Learning for Image Clustering via the SocialWeb
IJCNLP 2009
Heterogeneous Transfer Learning for Image Clustering via the SocialWeb
ACL 2009
A Probabilistic Model for Fine-Grained Expert Search
ACL 2008
Understanding and Summarizing Answers in Community-Based Question Answering Services
COLING 2008
Searching Questions by Identifying Question Topic and Question Focus
ACL 2008
Translated Learning: Transfer Learning across Different Feature Spaces
NIPS 2008