conftrace_

Hongning Wang

72 papers · 2010–2026 · 11 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+16 more ↓

🗺️ Taxonomy Completionist (22) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🐣 Hot Topic Early Bird

🌈 Renaissance Researcher (7) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (22) 🤝 Dynamic Duo (25) 👑 Triple Crown 🏆 Keyword Champion (4) 🏆 Grand Slam 👥 Mega-Team (20) 🔬 Deep Specialist (11) 🧬 Topic Evolution ⚡ Prolific Year (8) 🔥 Unstoppable (8) 🗃️ Keyword Collector (67) 💎 Century Club (64) ❓ The Questioner (4) 🚀 Conference Pioneer

Conferences

ACL (21) NIPS (12) ICLR (11) AAAI (8) EMNLP (7) ICML (5) AISTATS (3) IJCNLP (2) COLING (1) ICCV (1) IJCAI (1)

Top co-authors

Minlie Huang (33) Jie Tang (16) Pei Ke (15) Chuanhao Li (13) Jiale Cheng (11) Xiaotao Gu (10) Yuxiao Dong (9) Haifeng Xu (9) Xiao Liu (8) Bosi Wen (8)

Research topics

Keywords

large language model (19) benchmark evaluation (6) multi-armed bandit (6) contextual bandit (5) recommendation system (5) recommender system (5) instruction following (4) federated learning (4) domain adaptation (3) communication efficiency (3) adversarial attack (3) online learning (3) dialogue system (3) prompt optimization (3) game theory (2) off-policy learning (2) multi-objective optimization (2) reinforcement learning (2) safety alignment (2) reinforcement learning from human feedback (2)

Papers

Data Efficient RLVR via Off-Policy Influence Guidance ACL 2026 When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs’ Toxicity AAAI 2026 LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety ACL 2026 Glyph: Scaling Context Windows via Visual-Text Compression ACL 2026 IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation ACL 2026 IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation ACL 2026 How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study ACL 2026 HoWToBench: Holistic Evaluation for LLM’s Capability in Human-level Writing using Tree of Writing ACL 2026 HPSS: Heuristic Prompting Strategy Search for LLM Evaluators ACL 2025 Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues EMNLP 2025 SelfRACG: Enabling LLMs to Self-Express and Retrieve for Code Generation EMNLP 2025 SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models ICLR 2025 Data Selection via Optimal Control for Language Models ICLR 2025 CodePlan: Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning ICLR 2025 RecFlow: An Industrial Full Flow Recommendation Dataset ICLR 2025 MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science ICLR 2025 VPO: Aligning Text-to-Video Generation Models with Prompt Optimization ICCV 2025 SocialSim: Towards Socialized Simulation of Emotional Support Conversation AAAI 2025 CharacterBench: Benchmarking Character Customization of Large Language Models AAAI 2025 Tree-KG: An Expandable Knowledge Graph Construction Framework for Knowledge-intensive Domains ACL 2025 Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints ACL 2025 SocialEval: Evaluating Social Intelligence of Large Language Models ACL 2025 LongSafety: Evaluating Long-Context Safety of Large Language Models ACL 2025 LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models ACL 2025 Unveiling User Satisfaction and Creator Productivity Trade-Offs in Recommendation Platforms NIPS 2024 Mitigating Reward Overoptimization via Lightweight Uncertainty Estimation NIPS 2024 AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback NIPS 2024 Benchmarking Complex Instruction-Following with Multiple Constraints Composition NIPS 2024 Meta-Reinforcement Learning via Exploratory Task Clustering AAAI 2024 Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits AAAI 2024 Black-Box Prompt Optimization: Aligning Large Language Models without Model Training ACL 2024 Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization ACL 2024 AlignBench: Benchmarking Chinese Alignment of Large Language Models ACL 2024 Learning Task Decomposition to Assist Humans in Competitive Programming ACL 2024 CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation ACL 2024 Federated Linear Contextual Bandits with Heterogeneous Clients AISTATS 2024 CharacterGLM: Customizing Social Characters with Large Language Models EMNLP 2024 AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models EMNLP 2024 ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors EMNLP 2024 Incentivized Truthful Communication for Federated Bandits ICLR 2024 Language Model Decoding as Direct Metrics Optimization ICLR 2024 Towards Efficient Exact Optimization of Language Model Alignment ICML 2024 Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict? ICML 2024 Rethinking Incentives in Recommender Systems: Are Monotone Rewards Always Beneficial? NIPS 2023 COFFEE: Counterfactual Fairness for Personalized Text Generation in Explainable Recommendation EMNLP 2023 Incentivized Communication for Federated Bandits NIPS 2023 How Bad is Top-$K$ Recommendation under Competing Content Creators? ICML 2023 Learning Kernelized Contextual Bandits in a Distributed and Asynchronous Environment ICLR 2023 Spectral Augmentation for Self-Supervised Learning on Graphs ICLR 2023 Multi-Objective Intrinsic Reward Learning for Conversational Recommender Systems NIPS 2023 Uncertainty-Aware Instance Reweighting for Off-Policy Learning NIPS 2023 Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits AISTATS 2022 Learning Neural Contextual Bandits through Perturbed Rewards ICLR 2022 Communication Efficient Federated Learning for Generalized Linear Bandits NIPS 2022 Learning the Optimal Recommendation from Explorative Users AAAI 2022 When Are Linear Stochastic Bandits Attackable? ICML 2022 Learning from a Learning User for Optimal Recommendations ICML 2022 Communication Efficient Distributed Learning for Kernelized Contextual Bandits NIPS 2022 IMO^3: Interactive Multi-Objective Off-Policy Optimization IJCAI 2022 Learning from Crowds by Modeling Common Confusions AAAI 2021 Unifying Clustered and Non-stationary Bandits AISTATS 2021 Relation Inference among Sensor Time Series in Smart Buildings with Metric Learning AAAI 2020 Adversarial Domain Adaptation for Machine Reading Comprehension EMNLP 2019 A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation NIPS 2019 Adversarial Domain Adaptation for Machine Reading Comprehension IJCNLP 2019 Multi-Task Learning for Document Ranking and Query Suggestion ICLR 2018 Bandit Learning with Implicit Feedback NIPS 2018 Modeling Social Norms Evolution for Personalized Sentiment Classification ACL 2016 Model Adaptation for Personalized Opinion Analysis ACL 2015 Model Adaptation for Personalized Opinion Analysis IJCNLP 2015 Structural Topic Model for Latent Topical Structure Analysis ACL 2011 Exploiting Structured Ontology to Organize Scattered Online Opinions COLING 2010