conftrace_

Bo An

131 papers · 2013–2026 · 14 conferences · across top CS/AI conferences

Achievements


🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (23) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (14)

🐝 Cross-Pollinator (11) 🏠 Conference Loyalist (21) 🌟 Keyword Trendsetter Combo (4) 🤝 Dynamic Duo (28) 👑 Triple Crown 🧬 Topic Evolution 🏆 Keyword Champion (2) 🏆 Grand Slam 👥 Mega-Team (28) 🌱 Topic Pioneer 🔬 Deep Specialist (24) ❓ The Questioner 🚀 Conference Pioneer 📈 Trend Setter ⚡ Prolific Year (20) 💎 Century Club (126) 🗃️ Keyword Collector (101) 🔥 Unstoppable (11)

Conferences

IJCAI (35) AAAI (23) ICML (21) NIPS (17) ICLR (14) ACL (5) CVPR (4) AISTATS (3) COLING (3) UAI (2) EMNLP (1) IJCNLP (1) NAACL (1) SEMEVAL (1)

Top co-authors

Lei Feng (28) Xinrun Wang (23) Youzhi Zhang (12) Yuzhou Cao (11) Hongxin Wei (11) Pengjie Gu (9) Mengchen Zhao (8) Shuxin Li (8) Le Sun (7) Xianpei Han (7)

Research topics

Reinforcement Learning (1)

Keywords

game theory (15) reinforcement learning (12) multi-agent system (11) weakly supervised learning (9) extensive-form game (8) security game (7) nash equilibrium (7) deep reinforcement learning (6) multiplayer game (5) out-of-distribution detection (5) representation learning (5) zero-sum game (5) team-maxmin equilibrium (4) label noise (4) deep learning (4) stackelberg game (4) semi-supervised learning (3) policy learning (3) resource allocation (3) semantic parsing (3)

Papers

LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification ACL 2026
AgentOCR: Reimagining Agent History via Optical Self-Compression ACL 2026
SeDev: Structured Semantic Exploration for LLM-Driven Code Generation ACL 2026
ArchetypeTrader: Reinforcement Learning for Selecting and Refining Learnable Strategic Archetypes in Quantitative Trading AAAI 2026
GDBA Revisited: Unleashing the Power of Guided Local Search for Distributed Constraint Optimization AAAI 2026
Removing Prompt-template Bias in Reinforcement Learning from Human Feedback ACL 2025
AgentStudio: A Toolkit for Building General Virtual Agents ICLR 2025
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation ICLR 2025
OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit CVPR 2025
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning ICML 2025
Representation Surgery in Model Merging with Probabilistic Modeling ICML 2025
Cradle: Empowering Foundation Agents towards General Computer Control ICML 2025
A Closer Look at Backdoor Attacks on CLIP ICML 2025
Towards Efficient Online Tuning of VLM Agents via Counterfactual Soft Reinforcement Learning ICML 2025
Influence-Based Fair Selection for Sample-Discriminative Backdoor Attack AAAI 2025
Policy Optimization under Imperfect Human Interactions with Agent-Gated Shared Autonomy ICLR 2025
MaNo: Exploiting Matrix Norm for Unsupervised Accuracy Estimation Under Distribution Shifts NIPS 2024
Reinforcement Learning from Diverse Human Preferences IJCAI 2024
PoRank: A Practical Framework for Learning to Rank Policies IJCAI 2024
Reinforcement Nash Equilibrium Solver IJCAI 2024
Self-adaptive PSRO: Towards an Automatic Population-based Game Solver IJCAI 2024
Consistent Hierarchical Classification with A Generalized Metric AISTATS 2024
EarnHFT: Efficient Hierarchical Reinforcement Learning for High Frequency Trading AAAI 2024
Market-GAN: Adding Control to Financial Market Data Generation with Semantic Context AAAI 2024
Transition-Informed Reinforcement Learning for Large-Scale Stackelberg Mean-Field Games AAAI 2024
Mitigating Underfitting in Learning to Defer with Consistent Losses AISTATS 2024
DAG-Based Column Generation for Adversarial Team Games ICML 2024
Latent Logic Tree Extraction for Event Sequence Explanation from LLMs ICML 2024
Configurable Mirror Descent: Towards a Unification of Decision Making ICML 2024
Safe and Robust Subgame Exploitation in Imperfect Information Games ICML 2024
Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree ICML 2024
True Knowledge Comes from Practice: Aligning Large Language Models with Embodied Environments via Reinforcement Learning ICLR 2024
Improving Unsupervised Hierarchical Representation with Reinforcement Learning CVPR 2024
Synapse: Trajectory-as-Exemplar Prompting with Memory for Computer Control ICLR 2024
On the Vulnerability of Adversarially Trained Models Against Two-faced Attacks ICLR 2024
S²AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic ICLR 2024
Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution ICLR 2024
IMM: An Imitative Reinforcement Learning Approach with Predictive Representation Learning for Automatic Market Making IJCAI 2024
vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement IJCAI 2024
Consistent Multi-Class Classification from Multiple Unlabeled Datasets ICLR 2024
Complex Contagion Influence Maximization: A Reinforcement Learning Approach IJCAI 2023
Few-shot Generation via Recalling Brain-Inspired Episodic-Semantic Memory NIPS 2023
Computing Optimal Nash Equilibria in Multiplayer Games NIPS 2023
On the Importance of Feature Separability in Predicting Out-Of-Distribution Error NIPS 2023
State Regularized Policy Optimization on Data with Dynamics Shift NIPS 2023
In Defense of Softmax Parametrization for Calibrated and Consistent Learning to Defer NIPS 2023
Regression with Cost-based Rejection NIPS 2023
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning NIPS 2023
Offline RL with Discrete Proxy Representations for Generalizability in POMDPs NIPS 2023
An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Games AAAI 2023
Partial-Label Regression AAAI 2023
Solving Large-Scale Pursuit-Evasion Games Using Pre-trained Strategies AAAI 2023
Consistent Complementary-Label Learning via Order-Preserving Losses AISTATS 2023
Population-size-Aware Policy Optimization for Mean-Field Games ICLR 2023
RPM: Generalizable Multi-Agent Policies for Multi-Agent Reinforcement Learning ICLR 2023
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor ICLR 2023
Weakly Supervised Regression with Interval Targets ICML 2023
Mitigating Memorization of Noisy Labels by Clipping the Model Prediction ICML 2023
Controlling Type Confounding in Ad Hoc Teamwork with Instance-wise Teammate Feedback Rectification ICML 2023
Exploring Leximin Principle for Fair Core-Selecting Combinatorial Auctions: Payment Rule Design and Implementation IJCAI 2023
DO-GAN: A Double Oracle Framework for Generative Adversarial Networks CVPR 2022
Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient NIPS 2022
Deep Attentive Belief Propagation: Integrating Reasoning and Learning for Solving Constraint Optimization Problems NIPS 2022
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning ICML 2022
Correlation-Based Algorithm for Team-Maxmin Equilibrium in Multiplayer Extensive-Form Games IJCAI 2022
Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets ICML 2022
Mitigating Neural Network Overconfidence with Logit Normalization ICML 2022
Generalizing Consistent Multi-Class Classification with Rejection to be Compatible with Arbitrary Losses NIPS 2022
Pretrained Cost Model for Distributed Constraint Optimization Problems AAAI 2022
GearNet: Stepwise Dual Learning for Weakly Supervised Domain Adaptation AAAI 2022
NSGZero: Efficiently Learning Non-exploitable Policy in Large-Scale Network Security Games with Neural Monte Carlo Tree Search AAAI 2022
Online Ad Hoc Teamwork under Partial Observability ICLR 2022
Out-of-Distribution Detection with An Adaptive Likelihood Ratio on Informative Hierarchical VAE NIPS 2022
Computing Ex Ante Coordinated Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games AAAI 2021
Contingency-aware influence maximization: A reinforcement learning approach UAI 2021
Learning from Similarity-Confidence Data ICML 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents NIPS 2021
Open-set Label Noise Can Improve Robustness Against Inherent Label Noise NIPS 2021
Pointwise Binary Classification with Pairwise Confidence Comparisons ICML 2021
Complexity and Algorithms for Exploiting Quantal Opponents in Large Two-Player Games AAAI 2021
Computing Quantal Stackelberg Equilibrium in Extensive-Form Games AAAI 2021
Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play IJCAI 2021
CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space IJCAI 2021
Neural Regret-Matching for Distributed Constraint Optimization Problems IJCAI 2021
Commission Fee is not Enough: A Hierarchical Reinforced Framework for Portfolio Management AAAI 2021
Personalized Adaptive Meta Learning for Cold-start User Preference Prediction AAAI 2021
Can Cross Entropy Loss Be Robust to Label Noise? IJCAI 2020
Speeding Up Incomplete GDL-based Algorithms for Multi-agent Optimization with Dense Local Utilities IJCAI 2020
Learning Expensive Coordination: An Event-Based Deep RL Approach ICLR 2020
Learning with Multiple Complementary Labels ICML 2020
Learning Efficient Multi-agent Communication: An Information Bottleneck Approach ICML 2020
Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games ICML 2020
Computing Team-Maxmin Equilibria in Zero-Sum Multiplayer Extensive-Form Games AAAI 2020
Enhancing Neural Models with Vulnerability via Adversarial Attack COLING 2020
Learning Behaviors with Uncertain Human Feedback UAI 2020
Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization CVPR 2020
Provably Consistent Partial-Label Learning NIPS 2020
I²HRL: Interactive Influence-based Hierarchical Reinforcement Learning IJCAI 2020
Dinkelbach-Type Algorithm for Computing Quantal Stackelberg Equilibrium IJCAI 2020
Manipulating a Learning Defender and Ways to Counteract NIPS 2019
Collaboration Based Multi-Label Learning AAAI 2019
Partial Label Learning by Semantic Difference Maximization IJCAI 2019
Dynamic Electronic Toll Collection via Multi-Agent Deep Reinforcement Learning with Edge-Based Graph Convolutional Networks IJCAI 2019
Who Should Pay the Cost: A Game-theoretic Model for Government Subsidized Investments to Improve National Cybersecurity IJCAI 2019
Partial Label Learning with Self-Guided Retraining AAAI 2019
On the Inducibility of Stackelberg Equilibrium for Security Games AAAI 2019
Optimal Interdiction of Urban Criminals with the Aid of Real-Time Information AAAI 2019
A Memetic Approach for Sequential Security Games on a Plane with Moving Targets AAAI 2019
EUSP: An Easy-to-Use Semantic Parsing PlatForm IJCNLP 2019
EUSP: An Easy-to-Use Semantic Parsing PlatForm EMNLP 2019
Stackelberg Security Games: Looking Beyond a Decade of Success IJCAI 2018
Model-Free Context-Aware Word Composition COLING 2018
Semi-Supervised Lexicon Learning for Wide-Coverage Semantic Parsing COLING 2018
Leveraging Latent Label Distributions for Partial Label Learning IJCAI 2018
Impression Allocation for Combating Fraud in E-commerce Via Deep Reinforcement Learning with Action Norm Penalty IJCAI 2018
Accurate Text-Enhanced Knowledge Graph Representation Learning NAACL 2018
Defending Against Man-In-The-Middle Attack in Repeated Games IJCAI 2017
Comparing Strategic Secrecy and Stackelberg Commitment in Security Games IJCAI 2017
Playing Repeated Network Interdiction Games with Semi-Bandit Feedback IJCAI 2017
Game Theoretic Analysis of Security and Sustainability IJCAI 2017
Efficient Label Contamination Attacks Against Black-Box Learning Models IJCAI 2017
Optimal Escape Interdiction on Transportation Networks IJCAI 2017
Sentence Rewriting for Semantic Parsing ACL 2016
ISCAS_NLP at SemEval-2016 Task 1: Sentence Similarity Based on Support Vector Regression using Multiple Features SEMEVAL 2016
Optimally Protecting Elections IJCAI 2016
Optimal Interdiction of Illegal Network Flow IJCAI 2016
Efficient Resource Allocation for Protecting Coral Reef Ecosystems IJCAI 2016
Optimal Electric Vehicle Charging Station Placement IJCAI 2015
Computing Optimal Mixed Strategies for Security Games with Dynamic Payoffs IJCAI 2015
Optimal Pricing for Improving Efficiency of Taxi Systems IJCAI 2013
A Reputation Management Approach for Resource Constrained Trustee Agents IJCAI 2013