Yong Yu

68 papers · 2008–2026 · 12 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (18) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (52) 👑 Triple Crown 🧬 Topic Evolution 🏆 Keyword Champion 🏆 Grand Slam 🌱 Topic Pioneer 🗃️ Keyword Collector (253) 🚀 Conference Pioneer ⚡ Prolific Year (7) 🔥 Unstoppable (10) ❓ The Questioner 📈 Trend Setter 💎 Century Club (63)

Conferences

ACL (14) AAAI (11) IJCAI (8) NIPS (7) EMNLP (6) ICLR (6) ICML (6) IJCNLP (3) JMLR (3) COLING (2) AISTATS (1) NAACL (1)

Top co-authors

Weinan Zhang (57) Lin Qiu (10) Ruiming Tang (9) Jun Wang (9) Hao Zhou (8) Jian Shen (8) Lei Li (7) Kounianhua Du (7) Weiwen Liu (6) Xinyi Dai (6)

Research topics

Education (2)

Keywords

reinforcement learning (9) large language model (5) transfer learning (5) model-based reinforcement learning (4) multi-agent system (4) policy optimization (4) recommender system (4) neural machine translation (4) policy gradient (4) graph neural network (4) code generation (4) sample efficiency (3) process reward model (3) domain adaptation (3) unsupervised learning (3) multi-agent reinforcement learning (2) offline reinforcement learning (2) neural architecture search (2) text generation (2) named entity recognition (2)

Papers

LoopTool: Closing the Data–Training Loop for Robust LLM Tool Calls ACL 2026 A Survey of Large Language Model-Based Search Agents ACL 2026 CoreCodeBench: Decoupling Code Intelligence via Fine-Grained Repository-Level Tasks ACL 2026 Offline Fictitious Self-Play for Competitive Games AAAI 2026 A Comprehensive Survey of Process Reward Models: Data Generation, Model Construction, and Usage ACL 2026 Beyond Graph Convolution: Multimodal Recommendation with Topology-aware MLPs AAAI 2025 CodePRM: Execution Feedback-enhanced Process Reward Model for Code Generation ACL 2025 DebateCoder: Towards Collective Intelligence of LLMs via Test Case Driven LLM Debate for Code Generation ACL 2025 Retrieval-Augmented Process Reward Model for Generalizable Mathematical Reasoning ACL 2025 NL-Debugging: Exploiting Natural Language as an Intermediate Representation for Code Debugging EMNLP 2025 RethinkMCTS: Refining Erroneous Thoughts in Monte Carlo Tree Search for Code Generation EMNLP 2025 Large Language Models are Demonstration Pre-Selectors for Themselves ICML 2025 Boost, Disentangle, and Customize: A Robust System2-to-System1 Pipeline for Code Generation ACL 2025 MADiff: Offline Multi-agent Learning with Diffusion Models NIPS 2024 Lending Interaction Wings to Recommender Systems with Conversational Agents NIPS 2023 MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning JMLR 2023 Learning Decomposed Spatial Relations for Multi-Variate Time-Series Modeling AAAI 2023 Adaptation Augmented Model-based Policy Optimization JMLR 2023 Set-to-Sequence Ranking-Based Concept-Aware Learning Path Recommendation AAAI 2023 Why Propagate Alone? Parallel Use of Labels and Features on Graphs ICLR 2022 Inductive Relation Prediction Using Analogy Subgraph Embeddings ICLR 2022 Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning NIPS 2022 PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation COLING 2022 Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization ICML 2022 Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection AAAI 2022 Nested Named Entity Recognition with Span-level Graphs ACL 2022 Learning Logic Rules for Document-Level Relation Extraction EMNLP 2021 Universal Trading for Order Execution with Oracle Policy Distillation AAAI 2021 Glancing Transformer for Non-Autoregressive Neural Machine Translation ACL 2021 On Effective Scheduling of Model-based Reinforcement Learning NIPS 2021 MARS: Markov Molecular Sampling for Multi-objective Drug Discovery ICLR 2021 Glancing Transformer for Non-Autoregressive Neural Machine Translation IJCNLP 2021 MapGo: Model-Assisted Policy Optimization for Goal-Oriented Tasks IJCAI 2021 Aggregating Crowd Wisdom with Side Information via a Clustering-based Label-aware Autoencoder IJCAI 2020 Model-based Policy Optimization with Unsupervised Model Adaptation NIPS 2020 Efficient Projection-free Algorithms for Saddle Point Problems NIPS 2020 Infomax Neural Joint Source-Channel Coding via Adversarial Bit Flip AAAI 2020 Towards Making the Most of BERT in Neural Machine Translation AAAI 2020 Efficient Spectrum-Revealing CUR Matrix Decomposition AISTATS 2020 Active Sentence Learning by Adversarial Uncertainty Sampling in Discrete Space EMNLP 2020 Multi-Agent Interactions Modeling with Correlated Policies ICLR 2020 Bidirectional Model-based Policy Optimization ICML 2020 Improving Knowledge Tracing via Pre-training Question Embeddings IJCAI 2020 DropNAS: Grouped Operation Dropout for Differentiable Architecture Search IJCAI 2020 Efficient and Robust High-Dimensional Linear Contextual Bandits IJCAI 2020 Large-Scale Interactive Recommendation with Tree-Structured Policy Gradient AAAI 2019 Exploring Diverse Expressions for Paraphrase Generation IJCNLP 2019 Exploring Diverse Expressions for Paraphrase Generation EMNLP 2019 Dynamically Fused Graph Network for Multi-hop Reasoning ACL 2019 AdaShift: Decorrelation and Convergence of Adaptive Learning Rate Methods ICLR 2019 Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space IJCAI 2019 Lipschitz Generative Adversarial Nets ICML 2019 Deep Recurrent Survival Analysis AAAI 2019 Guiding the One-to-One Mapping in CycleGAN via Optimal Transport AAAI 2019 Activation Maximization Generative Adversarial Nets ICLR 2018 Learning to Design Games: Strategic Environments in Reinforcement Learning IJCAI 2018 Path-Level Network Transformation for Efficient Architecture Search ICML 2018 Label-Aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition NAACL 2018 Aggregating Crowd Wisdoms with Label-aware Autoencoders IJCAI 2017 Context-Dependent Sense Embedding EMNLP 2016 General Functional Matrix Factorization Using Gradient Boosting ICML 2013 SVDFeature: A Toolkit for Feature-based Collaborative Filtering JMLR 2012 Heterogeneous Transfer Learning for Image Clustering via the SocialWeb IJCNLP 2009 Heterogeneous Transfer Learning for Image Clustering via the SocialWeb ACL 2009 A Probabilistic Model for Fine-Grained Expert Search ACL 2008 Understanding and Summarizing Answers in Community-Based Question Answering Services COLING 2008 Searching Questions by Identifying Question Topic and Question Focus ACL 2008 Translated Learning: Transfer Learning across Different Feature Spaces NIPS 2008