Yang Yu
142 papers · 2013–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (15) πΊοΈ Taxonomy Completionist (25) π Interdisciplinary Bridge π Academic Marathon (12)
π
Cross-Pollinator
(14)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(25)
π
Keyword Trendsetter Combo
(3)
π
Conference Loyalist
(28)
π
Grand Slam
π
Triple Crown
π€
Dynamic Duo
(28)
π¬
Deep Specialist
(19)
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(67)
π
Conference Pioneer
β‘
Prolific Year
(6)
β
The Questioner
π
Trend Setter
π
Century Club
(134)
π₯
Unstoppable
(11)
Conferences
AAAI (34)
NIPS (28)
IJCAI (25)
ICML (19)
ICLR (17)
EMNLP (5)
ACL (3)
CVPR (2)
IJCNLP (2)
UAI (2)
INTERSPEECH (1)
JMLR (1)
MICCAI (1)
NAACL (1)
OSDI (1)
Top co-authors
Research topics
Keywords
reinforcement learning
(12)
representation learning
(11)
offline reinforcement learning
(9)
multi-agent reinforcement learning
(8)
imitation learning
(7)
policy learning
(6)
subset selection
(6)
contrastive learning
(6)
model-based reinforcement learning
(5)
large language model
(5)
sample efficiency
(5)
domain adaptation
(5)
transfer learning
(5)
attention mechanism
(4)
hyperparameter optimization
(4)
news recommendation
(4)
self-supervised learning
(4)
adversarial learning
(4)
greedy algorithm
(4)
multi-agent system
(4)
Papers
An Interactive Simulation Framework by Ensemble Imitation Learning Agents for Training Robust Trading Policies
AAAI 2026
HEV Generative Sandbox: A Framework for Assessing Domain-Specific Social Risks Through Human-LLM Simulation
AAAI 2026
Reward Model Evaluation via Automatically-Ranked Policy Alignment
AAAI 2026
Multi-agent In-context Coordination via Decentralized Memory Retrieval
AAAI 2026
Exploring Reliable Spatiotemporal Dependencies for Efficient Visual Tracking
AAAI 2026
SurgPub-Video: A Comprehensive Surgical Video Framework for Enhanced Surgical Intelligence in Vision-Language Model
AAAI 2026
An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
AAAI 2026
MPBoCo: Multimodal Prompt-based Boundary-enhanced Continual Framework for Joint Entity and Relation Extraction
ACL 2026
OS Rendering Service Made Parallel with Out-of-Order Execution and In-Order Commit
OSDI 2025
MRE-MI: A Multi-image Dataset for Multimodal Relation Extraction in Social Media Posts
NAACL 2025
Spatiotemporal-Sensitive Network for Microvascular Obstruction Segmentation from Cine Cardiac Magnetic Resonance
MICCAI 2025
Learning to Reuse Policies in State Evolvable Environments
ICML 2025
Improving Reward Model Generalization from Adversarial Process Enhanced Preferences
ICML 2025
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
ICML 2025
LLM-Assisted Semantically Diverse Teammate Generation for Efficient Multi-agent Coordination
ICML 2025
Controlling Large Language Model with Latent Action
ICML 2025
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
ICML 2025
Learning View-invariant World Models for Visual Robotic Manipulation
ICLR 2025
On the Optimization Landscape of Low Rank Adaptation Methods for Large Language Models
ICLR 2025
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
ICLR 2025
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation
ICLR 2025
Efficient Multi-agent Offline Coordination via Diffusion-based Trajectory Stitching
ICLR 2025
LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch
ICLR 2025
Semantic Temporal Abstraction via Vision-Language Model Guidance for Efficient Reinforcement Learning
ICLR 2025
SOO-Bench: Benchmarks for Evaluating the Stability of Offline Black-Box Optimization
ICLR 2025
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion
EMNLP 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
CVPR 2025
GuideNER: Annotation Guidelines Are Better than Examples for In-Context Named Entity Recognition
AAAI 2025
GRAIN: Multi-Granular and Implicit Information Aggregation Graph Neural Network for Heterophilous Graphs
AAAI 2025
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
AAAI 2025
Pre-training General User Representation with Multi-type APP Behaviors
IJCAI 2024
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting
NIPS 2024
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
NIPS 2024
Provably and Practically Efficient Adversarial Imitation Learning with General Function Approximation
NIPS 2024
Multi-Agent Domain Calibration with a Handful of Offline Data
NIPS 2024
Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning
NIPS 2024
Bias and Volatility: A Statistical Framework for Evaluating Large Language Model's Stereotypes and the Associated Generation Inconsistency
NIPS 2024
KALM: Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts
NIPS 2024
Focus-Then-Decide: Segmentation-Assisted Reinforcement Learning
AAAI 2024
ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning
AAAI 2024
Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward
AAAI 2024
ANEDL: Adaptive Negative Evidential Deep Learning for Open-Set Semi-supervised Learning
AAAI 2024
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
AAAI 2024
Rethinking the Development of Large Language Models from the Causal Perspective: A Legal Text Prediction Case Study
AAAI 2024
Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-based Hyperspectral Image Synthesis
CVPR 2024
MGCL: Multi-Granularity Clue Learning for Emotion-Cause Pair Extraction via Cross-Grained Knowledge Distillation
EMNLP 2024
Policy Rehearsing: Training Generalizable Policies for Reinforcement Learning
ICLR 2024
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning
ICLR 2024
Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation
ICLR 2024
Language Model Self-improvement by Reinforcement Learning Contemplation
ICLR 2024
Limited Preference Aided Imitation Learning from Imperfect Demonstrations
ICML 2024
Policy-conditioned Environment Models are More Generalizable
ICML 2024
Offline Transition Modeling via Contrastive Energy Learning
ICML 2024
Deep Demonstration Tracing: Learning Generalizable Imitator Policy for Runtime Imitation from a Single Demonstration
ICML 2024
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models
ICML 2024
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning
ICML 2024
Causality Based Front-door Defense Against Backdoor Attack on Language Models
ICML 2024
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
ICML 2024
ADMN: Agent-Driven Modular Network for Dynamic Parameter Sharing in Cooperative Multi-Agent Reinforcement Learning
IJCAI 2024
Continual Multi-Objective Reinforcement Learning via Reward Model Rehearsal
IJCAI 2024
Untargeted Attack against Federated Recommendation Systems via Poisonous Item Embeddings and the Defense
AAAI 2023
Uncertainty Estimation by Fisher Information-based Evidential Deep Learning
ICML 2023
Policy Regularization with Dataset Constraint for Offline Reinforcement Learning
ICML 2023
Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning
ICML 2023
Provably Efficient Adversarial Imitation Learning with Unknown Transitions
UAI 2023
Fast Teammate Adaptation in the Presence of Sudden Policy Change
UAI 2023
A Unified View of Deep Learning for Reaction and Retrosynthesis Prediction: Current Status and Future Challenges
IJCAI 2023
Natural Language Instruction-following with Task-related Language Development and Translation
NIPS 2023
Imitation Learning from Imperfection: Theoretical Justifications and Algorithms
NIPS 2023
CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations
NIPS 2023
Policy-Independent Behavioral Metric-Based Representation for Deep Reinforcement Learning
AAAI 2023
Robust Multi-Agent Coordination via Evolutionary Generation of Auxiliary Adversarial Attackers
AAAI 2023
Deep Anomaly Detection and Search via Reinforcement Learning (Student Abstract)
AAAI 2023
Learning Generalizable Batch Active Learning Strategies via Deep Q-networks (Student Abstract)
AAAI 2023
Anti-drifting Feature Selection via Deep Reinforcement Learning (Student Abstract)
AAAI 2023
Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data
ICLR 2023
PronScribe: Highly Accurate Multimodal Phonemic Transcription From Speech and Text
INTERSPEECH 2023
Model-Based Offline Weighted Policy Optimization (Student Abstract)
AAAI 2023
Adversarial Counterfactual Environment Model Learning
NIPS 2023
Learning World Models with Identifiable Factorization
NIPS 2023
AdaptSSR: Pre-training User Model with Augmentation-Adaptive Self-Supervised Ranking
NIPS 2023
Doubly Stochastic Graph-based Non-autoregressive Reaction Prediction
IJCAI 2023
Active Hierarchical Exploration with Stable Subgoal Representation Learning
ICLR 2022
Learning Efficient Online 3D Bin Packing on Packing Configuration Trees
ICLR 2022
Efficient Multi-Agent Communication via Shapley Message Value
IJCAI 2022
Multi-Agent Concentrative Coordination with Decentralized Task Representation
IJCAI 2022
Multi-Agent Incentive Communication via Decentralized Teammate Modeling
AAAI 2022
Context-Aware Sparse Deep Coordination Graphs
ICLR 2022
Distributed Bootstrap for Simultaneous Inference Under High Dimensionality
JMLR 2022
Efficient Multi-agent Communication via Self-supervised Information Aggregation
NIPS 2022
Tiny-NewsRec: Effective and Efficient PLM-based News Recommendation
EMNLP 2022
The Teaching Dimension of Regularized Kernel Learners
ICML 2022
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
NIPS 2022
Multi-agent Dynamic Algorithm Configuration
NIPS 2022
Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning
NIPS 2022
Adapt to Environment Sudden Changes by Learning a Context Sensitive Policy
AAAI 2022
Invariant Action Effect Model for Reinforcement Learning
AAAI 2022
Offline Model-based Adaptable Policy Learning
NIPS 2021
Circles are like Ellipses, or Ellipses are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Word Embeddings and the Implications to Representation Learning
AAAI 2021
Regret Minimization Experience Replay in Off-Policy Reinforcement Learning
NIPS 2021
NewsBERT: Distilling Pre-trained Language Model for Intelligent News Application
EMNLP 2021
Enhancing Context-Based Meta-Reinforcement Learning Algorithms via An Efficient Task Encoder (Student Abstract)
AAAI 2021
HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation
IJCNLP 2021
HieRec: Hierarchical User Interest Modeling for Personalized News Recommendation
ACL 2021
Incorporating Bidirection-Interactive Information and Semantic Features for Relational Facts Extraction (Student Abstract)
AAAI 2021
Adaptive Online Packing-guided Search for POMDPs
NIPS 2021
Fast Pareto Optimization for Subset Selection with Dynamic Cost Constraints
IJCAI 2021
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning
NIPS 2021
LB-DESPOT: Efficient Online POMDP Planning Considering Lower Bound in Action Selection (Student Abstract)
AAAI 2021
QPLEX: Duplex Dueling Multi-Agent Q-Learning
ICLR 2021
Offline Imitation Learning with a Misspecified Simulator
NIPS 2020
An Efficient Evolutionary Algorithm for Subset Selection with General Cost Constraints
AAAI 2020
RetroXpert: Decompose Retrosynthesis Prediction Like A Chemist
NIPS 2020
Simultaneous Inference for Massive Data: Distributed Bootstrap
ICML 2020
Error Bounds of Imitating Policies and Environments
NIPS 2020
Out-of-Domain Detection for Low-Resource Text Classification Tasks
EMNLP 2019
Virtual-Taobao: Virtualizing Real-World Online Retail Environment for Reinforcement Learning
AAAI 2019
On Reinforcement Learning for Full-Length Game of StarCraft
AAAI 2019
Multi-Fidelity Automatic Hyper-Parameter Tuning via Transfer Series Expansion
AAAI 2019
Bridging Machine Learning and Logical Reasoning by Abductive Learning
NIPS 2019
Cascaded Algorithm-Selection and Hyper-Parameter Optimization with Extreme-Region Upper Confidence Bound Bandit
IJCAI 2019
Reinforcement Learning Experience Reuse with Policy Residual Representation
IJCAI 2019
Out-of-Domain Detection for Low-Resource Text Classification Tasks
IJCNLP 2019
Learning Environmental Calibration Actions for Policy Self-Evolution
IJCAI 2018
Mixture of GANs for Clustering
IJCAI 2018
Experienced Optimization with Reusable Directional Model for Hyper-Parameter Search
IJCAI 2018
Approximation Guarantees of Stochastic Greedy Algorithms for Subset Selection
IJCAI 2018
Multi-Layered Gradient Boosting Decision Trees
NIPS 2018
Towards Sample Efficient Reinforcement Learning
IJCAI 2018
Binary Linear Compression for Multi-label Classification
IJCAI 2017
Subset Selection under Noise
NIPS 2017
Life-Stage Modeling by Customer-Manifold Embedding
IJCAI 2017
On Subset Selection with General Cost Constraints
IJCAI 2017
Optimizing Ratio of Monotone Set Functions
IJCAI 2017
AGRA: An Analysis-Generation-Ranking Framework for Automatic Abbreviation from Paper Titles
IJCAI 2017
Open Category Classification by Adversarial Sample Generation
IJCAI 2017
Parallel Pareto Optimization for Subset Selection
IJCAI 2016
User Embedding for Scholarly Microblog Recommendation
ACL 2016
Derivative-Free Optimization of High-Dimensional Non-Convex Functions by Sequential Random Embeddings
IJCAI 2016
On Constrained Boolean Pareto Optimization
IJCAI 2015
Subset Selection by Pareto Optimization
NIPS 2015
On the Approximation Ability of Evolutionary Optimization with Application to Minimum Set Cover: Extended Abstract
IJCAI 2013