Bo An
131 papers · 2013–2026 · 14 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer
Hot Topic Early Bird
Conference Polyglot (14)
Cross-Pollinator (11)
Interdisciplinary Bridge
Taxonomy Completionist (23)
Conference Loyalist (21)
Keyword Trendsetter Combo (4)
Dynamic Duo (28)
Triple Crown
Topic Evolution
Keyword Champion (2)
Grand Slam
Mega-Team (28)
Topic Pioneer
Deep Specialist (24)
The Questioner
Conference Pioneer
Trend Setter
Prolific Year (20)
Century Club (126)
Keyword Collector (101)
Unstoppable (11)
Conferences
IJCAI (35)
AAAI (23)
ICML (21)
NIPS (17)
ICLR (14)
ACL (5)
CVPR (4)
AISTATS (3)
COLING (3)
UAI (2)
EMNLP (1)
IJCNLP (1)
NAACL (1)
SEMEVAL (1)
Research topics
Keywords
game theory (15)
reinforcement learning (12)
multi-agent system (11)
weakly supervised learning (9)
extensive-form game (8)
security game (7)
nash equilibrium (7)
deep reinforcement learning (6)
multiplayer game (5)
out-of-distribution detection (5)
representation learning (5)
zero-sum game (5)
team-maxmin equilibrium (4)
label noise (4)
deep learning (4)
stackelberg game (4)
semi-supervised learning (3)
policy learning (3)
resource allocation (3)
semantic parsing (3)
Papers
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
ACL 2026
AgentOCR: Reimagining Agent History via Optical Self-Compression
ACL 2026
SeDev: Structured Semantic Exploration for LLM-Driven Code Generation
ACL 2026
ArchetypeTrader: Reinforcement Learning for Selecting and Refining Learnable Strategic Archetypes in Quantitative Trading
AAAI 2026
GDBA Revisited: Unleashing the Power of Guided Local Search for Distributed Constraint Optimization
AAAI 2026
Removing Prompt-template Bias in Reinforcement Learning from Human Feedback
ACL 2025
AgentStudio: A Toolkit for Building General Virtual Agents
ICLR 2025
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation
ICLR 2025
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit
CVPR 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
ICML 2025
Representation Surgery in Model Merging with Probabilistic Modeling
ICML 2025
Cradle: Empowering Foundation Agents towards General Computer Control
ICML 2025
A Closer Look at Backdoor Attacks on CLIP
ICML 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning
ICML 2025
Influence-Based Fair Selection for Sample-Discriminative Backdoor Attack
AAAI 2025
Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy
ICLR 2025
MaNo: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts
NIPS 2024
Reinforcement Learning from Diverse Human Preferences
IJCAI 2024
PoRank: A Practical Framework for Learning to Rank Policies
IJCAI 2024
Reinforcement Nash Equilibrium Solver
IJCAI 2024
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver
IJCAI 2024
Consistent Hierarchical Classification with A Generalized Metric
AISTATS 2024
EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading
AAAI 2024
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context
AAAI 2024
Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games
AAAI 2024
Mitigating Underfitting in Learning to Defer with Consistent Losses
AISTATS 2024
DAG-Based Column Generation for Adversarial Team Games
ICML 2024
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs
ICML 2024
Configurable Mirror Descent: Towards a Unification of Decision Making
ICML 2024
Safe and Robust Subgame Exploitation in Imperfect Information Games
ICML 2024
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree
ICML 2024
True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning
ICLR 2024
Improving Unsupervised Hierarchical Representation with Reinforcement Learning
CVPR 2024
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control
ICLR 2024
On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks
ICLR 2024
S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
ICLR 2024
Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution
ICLR 2024
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making
IJCAI 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement
IJCAI 2024
Consistent Multi-Class Classification from Multiple Unlabeled Datasets
ICLR 2024
Complex Contagion Influence Maximization: A Reinforcement Learning Approach
IJCAI 2023
Few-shot Generation via Recalling Brain-Inspired Episodic-Semantic Memory
NIPS 2023
Computing Optimal Nash Equilibria in Multiplayer Games
NIPS 2023
On the Importance of Feature Separability in Predicting Out-Of-Distribution Error
NIPS 2023
State Regularized Policy Optimization on Data with Dynamics Shift
NIPS 2023
In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer
NIPS 2023
Regression with Cost-based Rejection
NIPS 2023
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning
NIPS 2023
Offline RL with Discrete Proxy Representations for Generalizability in POMDPs
NIPS 2023
An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games
AAAI 2023
Partial-Label Regression
AAAI 2023
Solving Large-Scale Pursuit-Evasion Games Using Pre-trained Strategies
AAAI 2023
Consistent Complementary-Label Learning via Order-Preserving Losses
AISTATS 2023
Population-size-Aware Policy Optimization for Mean-Field Games
ICLR 2023
RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning
ICLR 2023
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
ICLR 2023
Weakly Supervised Regression with Interval Targets
ICML 2023
Mitigating Memorization of Noisy Labels by Clipping the Model Prediction
ICML 2023
Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification
ICML 2023
Exploring Leximin Principle for Fair Core-Selecting Combinatorial Auctions: Payment Rule Design and Implementation
IJCAI 2023
DO-GAN: A Double Oracle Framework for Generative Adversarial Networks
CVPR 2022
Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient
NIPS 2022
Deep Attentive Belief Propagation: Integrating Reasoning and Learning for Solving Constraint Optimization Problems
NIPS 2022
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning
ICML 2022
Correlation-Based Algorithm for Team-Maxmin Equilibrium in Multiplayer Extensive-Form Games
IJCAI 2022
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets
ICML 2022
Mitigating Neural Network Overconfidence with Logit Normalization
ICML 2022
Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses
NIPS 2022
Pretrained Cost Model for Distributed Constraint Optimization Problems
AAAI 2022
GearNet: Stepwise Dual Learning for Weakly Supervised Domain Adaptation
AAAI 2022
NSGZero: Efficiently Learning Non-exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search
AAAI 2022
Online Ad Hoc Teamwork under Partial Observability
ICLR 2022
Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE
NIPS 2022
Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games
AAAI 2021
Contingency-Aware Influence Maximization: A Reinforcement Learning Approach
UAI 2021
Learning from Similarity-Confidence Data
ICML 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents
NIPS 2021
Open-set Label Noise Can Improve Robustness Against Inherent Label Noise
NIPS 2021
Pointwise Binary Classification with Pairwise Confidence Comparisons
ICML 2021
Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games
AAAI 2021
Computing Quantal Stackelberg Equilibrium in Extensive-Form Games
AAAI 2021
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play
IJCAI 2021
CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space
IJCAI 2021
Neural Regret-Matching for Distributed Constraint Optimization Problems
IJCAI 2021
Commission Fee is not Enough: A Hierarchical Reinforced Framework for Portfolio Management
AAAI 2021
Personalized Adaptive Meta Learning for Cold-start User Preference Prediction
AAAI 2021
Can Cross Entropy Loss Be Robust to Label Noise?
IJCAI 2020
Speeding Up Incomplete GDL-based Algorithms for Multi-agent Optimization with Dense Local Utilities
IJCAI 2020
Learning Expensive Coordination: An Event-Based Deep RL Approach
ICLR 2020
Learning with Multiple Complementary Labels
ICML 2020
Learning Efficient Multi-agent Communication: An Information Bottleneck Approach
ICML 2020
Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games
ICML 2020
Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games
AAAI 2020
Enhancing Neural Models with Vulnerability via Adversarial Attack
COLING 2020
Learning Behaviors with Uncertain Human Feedback
UAI 2020
Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization
CVPR 2020
Provably Consistent Partial-Label Learning
NIPS 2020
I²HRL: Interactive Influence-based Hierarchical Reinforcement Learning
IJCAI 2020
Dinkelbach-Type Algorithm for Computing Quantal Stackelberg Equilibrium
IJCAI 2020
Manipulating a Learning Defender and Ways to Counteract
NIPS 2019
Collaboration Based Multi-Label Learning
AAAI 2019
Partial Label Learning by Semantic Difference Maximization
IJCAI 2019
Dynamic Electronic Toll Collection via Multi-Agent Deep Reinforcement Learning with Edge-Based Graph Convolutional Networks
IJCAI 2019
Who Should Pay the Cost: A Game-theoretic Model for Government Subsidized Investments to Improve National Cybersecurity
IJCAI 2019
Partial Label Learning with Self-Guided Retraining
AAAI 2019
On the Inducibility of Stackelberg Equilibrium for Security Games
AAAI 2019
Optimal Interdiction of Urban Criminals with the Aid of Real-Time Information
AAAI 2019
A Memetic Approach for Sequential Security Games on a Plane with Moving Targets
AAAI 2019
EUSP: An Easy-to-Use Semantic Parsing PlatForm
IJCNLP 2019
EUSP: An Easy-to-Use Semantic Parsing PlatForm
EMNLP 2019
Stackelberg Security Games: Looking Beyond a Decade of Success
IJCAI 2018
Model-Free Context-Aware Word Composition
COLING 2018
Semi-Supervised Lexicon Learning for Wide-Coverage Semantic Parsing
COLING 2018
Leveraging Latent Label Distributions for Partial Label Learning
IJCAI 2018
Impression Allocation for Combating Fraud in E-commerce Via Deep Reinforcement Learning with Action Norm Penalty
IJCAI 2018
Accurate Text-Enhanced Knowledge Graph Representation Learning
NAACL 2018
Defending Against Man-In-The-Middle Attack in Repeated Games
IJCAI 2017
Comparing Strategic Secrecy and Stackelberg Commitment in Security Games
IJCAI 2017
Playing Repeated Network Interdiction Games with Semi-Bandit Feedback
IJCAI 2017
Game Theoretic Analysis of Security and Sustainability
IJCAI 2017
Efficient Label Contamination Attacks Against Black-Box Learning Models
IJCAI 2017
Optimal Escape Interdiction on Transportation Networks
IJCAI 2017
Sentence Rewriting for Semantic Parsing
ACL 2016
ISCAS_NLP at SemEval-2016 Task 1: Sentence Similarity Based on Support Vector Regression using Multiple Features
SEMEVAL 2016
Optimally Protecting Elections
IJCAI 2016
Optimal Interdiction of Illegal Network Flow
IJCAI 2016
Efficient Resource Allocation for Protecting Coral Reef Ecosystems
IJCAI 2016
Optimal Electric Vehicle Charging Station Placement
IJCAI 2015
Computing Optimal Mixed Strategies for Security Games with Dynamic Payoffs
IJCAI 2015
Optimal Pricing for Improving Efficiency of Taxi Systems
IJCAI 2013
A Reputation Management Approach for Resource Constrained Trustee Agents
IJCAI 2013