Yi Wu
69 papers · 2012–2026 · 17 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (17) πΊοΈ Taxonomy Completionist (11) π Interdisciplinary Bridge π Academic Marathon (13)
π
Academic Marathon
(13)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(7)
π
Triple Crown
π
Grand Slam
π§¬
Topic Evolution
π€
Dynamic Duo
(12)
π¬
Deep Specialist
(10)
π
Keyword Champion
π
Century Club
(68)
π
Trend Setter
π₯
Unstoppable
(11)
ποΈ
Keyword Collector
(223)
β
The Questioner
β‘
Prolific Year
(10)
π
Conference Pioneer
Conferences
NIPS (13)
ICLR (10)
AAAI (7)
ICML (7)
ICCV (6)
EMNLP (5)
IJCAI (5)
ECCV (3)
CVPR (2)
JMLR (2)
MICCAI (2)
ACL (2)
CORL (1)
AISTATS (1)
ACML (1)
IJCNLP (1)
SEMEVAL (1)
Top co-authors
Keywords
multi-agent reinforcement learning
(7)
reinforcement learning
(6)
unsupervised learning
(4)
deep reinforcement learning
(4)
sparse reward
(3)
deep learning
(3)
text classification
(3)
neural network
(3)
sequence labeling
(3)
policy gradient
(3)
convolutional neural network
(3)
large language model
(3)
zero-shot learning
(3)
source-free domain adaptation
(3)
knowledge distillation
(2)
curriculum learning
(2)
unsupervised domain adaptation
(2)
few-shot learning
(2)
domain generalization
(2)
variational inference
(2)
Papers
Learning Knowledge from Textual Descriptions for 3D Human Pose Estimation
AAAI 2026
Learning Global Nash Equilibrium in Team Competitive Games with Generalized Fictitious Cross-Play
JMLR 2025
BitNet: 1-bit Pre-training for Large Language Models
JMLR 2025
Offline Reinforcement Learning for LLM Multi-step Reasoning
ACL 2025
ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge
EMNLP 2025
Estimating 2D Camera Motion with Hybrid Motion Basis
ICCV 2025
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
ICML 2025
Toward Real-World Cooperative and Competitive Soccer with Quadrupedal Robot Teams
CORL 2025
Cross-Modal Contrastive Learning for Emotion Recognition: Aligning ECG with EEG-Derived Features
MICCAI 2025
Feature Copy-Paste Network for Lung Cancer EGFR Mutation Status Prediction in CT images
MICCAI 2025
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
ECCV 2024
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
ICML 2024
Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study
ICML 2024
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning
AAAI 2024
Adaptive-Gradient Policy Optimization: Enhancing Policy Learning in Non-Smooth Differentiable Simulations
ICML 2024
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets
ICLR 2024
Efficient Backdoor Attacks for Deep Neural Networks in Real-world Scenarios
ICLR 2024
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores
ICLR 2024
AlphaSnake: Policy Iteration on a Nondeterministic NP-Hard Markov Decision Process (Student Abstract)
AAAI 2023
SpeedyZero: Mastering Atari with Limited Data and Time
ICLR 2023
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
ICLR 2023
KnowComp Submission for WMT23 Word-Level AutoCompletion Task
EMNLP 2023
Iteratively Learn Diverse Strategies with State Distance Information
NIPS 2023
Domain Re-Modulation for Few-Shot Generative Domain Adaptation
NIPS 2023
KDLGT: A Linear Graph Transformer Framework via Kernel Decomposition Approach
IJCAI 2023
SOAR: Scene-debiasing Open-set Action Recognition
ICCV 2023
Single Image Super-resolution Based On Non-subsampled Shearlet Transform
ACML 2023
Automatic Truss Design with Reinforcement Learning
IJCAI 2023
Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination
AAAI 2023
Sequence Level Contrastive Learning for Text Summarization
AAAI 2022
Grounded Reinforcement Learning: Learning to Win the Game under Human Commands
NIPS 2022
Pre-Trained Image Encoder for Generalizable Visual Reinforcement Learning
NIPS 2022
The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games
NIPS 2022
Uncertainty-Based Spatial-Temporal Attention for Online Action Detection
ECCV 2022
Learning Efficient Multi-agent Cooperative Visual Exploration
ECCV 2022
PILE: Pairwise Iterative Logits Ensemble for Multi-Teacher Labeled Distillation
EMNLP 2022
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
ICLR 2022
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
ICML 2022
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
ICML 2022
Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems
NIPS 2021
BLCUFIGHT at SemEval-2021 Task 10: Novel Unsupervised Frameworks For Source-Free Domain Adaptation
IJCNLP 2021
Solving Compositional Reinforcement Learning Problems via Task Reduction
ICLR 2021
BLCUFIGHT at SemEval-2021 Task 10: Novel Unsupervised Frameworks For Source-Free Domain Adaptation
ACL 2021
BLCUFIGHT at SemEval-2021 Task 10: Novel Unsupervised Frameworks For Source-Free Domain Adaptation
SEMEVAL 2021
Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
ICLR 2021
Temporal Induced Self-Play for Stochastic Bayesian Games
IJCAI 2021
NovelD: A Simple yet Effective Exploration Criterion
NIPS 2021
Multi-Task Reinforcement Learning with Soft Modularization
NIPS 2020
Unsupervised Extractive Summarization by Pre-training Hierarchical Transformers
EMNLP 2020
Influence-Based Multi-Agent Exploration
ICLR 2020
SaccadeNet: A Fast and Accurate Object Detector
CVPR 2020
Emergent Tool Use From Multi-Agent Autocurricula
ICLR 2020
Stochastic Runge-Kutta Accelerates Langevin Monte Carlo and Beyond
NIPS 2019
Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient
AAAI 2019
Deep Reinforcement Learning for Green Security Games with Real-Time Information
AAAI 2019
Bayesian Relational Memory for Semantic Visual Navigation
ICCV 2019
Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms
ICML 2018
Meta-Learning MCMC Proposals
NIPS 2018
CoupleNet: Coupling Global Structure With Local Parts for Object Detection
ICCV 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
NIPS 2017
Adversarial Training for Relation Extraction
EMNLP 2017
Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks With Spatiotemporal Transformer Modules
ICCV 2017
Value Iteration Networks
IJCAI 2017
Swift: Compiled Inference for Probabilistic Programming Languages
IJCAI 2016
Value Iteration Networks
NIPS 2016
Understanding and Evaluating Sparse Linear Discriminant Analysis
AISTATS 2015
Multiple Non-rigid Surface Detection and Registration
ICCV 2013
Online Object Tracking: A Benchmark
CVPR 2013
Dual-Space Analysis of the Sparse Linear Model
NIPS 2012