Zihao Wang

74 papers · 2017–2026 · 16 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (16) 🏃 Academic Marathon (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (8)

🐝 Cross-Pollinator (8) 🌈 Renaissance Researcher (12) 🗺️ Taxonomy Completionist (120) 🏆 Grand Slam 🧬 Topic Evolution 👥 Mega-Team (40) 👑 Triple Crown 🤝 Dynamic Duo (11) 💎 Century Club (69) ⚡ Prolific Year (9) 📈 Trend Setter 🔥 Unstoppable (9) 🗃️ Keyword Collector (294) ❓ The Questioner (2)

Conferences

ACL (10) AAAI (9) EMNLP (9) ICLR (8) CVPR (6) ICML (6) IJCAI (5) NIPS (5) ICCV (4) COLING (3) ECCV (3) NAACL (2) ACML (1) COLT (1) IJCNLP (1) WACV (1)

Top co-authors

Yangqiu Song (11) Yitao Liang (11) Anji Liu (8) Shaofei Cai (7) Xiaojian Ma (6) Hang Yin (4) Wai Lam (4) Yong Zhang (4) Haowei Lin (4) Kejun Zhang (4)

Research topics

Architectures (1) Education (1)

Keywords

large language model (9) generative model (5) optimal transport (4) imitation learning (4) zero-shot learning (3) prompt engineering (3) knowledge graph completion (3) diffusion model (3) representation learning (3) knowledge graph (3) word embedding (3) transformer architecture (3) benchmark evaluation (2) unsupervised learning (2) multimodal learning (2) feature matching (2) attention mechanism (2) image generation (2) contrastive learning (2) in-context learning (2)

Papers

RPGen: Robust and Differentially Private Synthetic Image Generation AAAI 2026 Detecting AI-Generated Content on Social Media with Multi-modal Language Models ACL 2026 PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning ACL 2026 Activation-Guided Local Editing for Jailbreaking Attacks ACL 2026 Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation AAAI 2026 JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse ACL 2025 MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds AAAI 2025 Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception AAAI 2025 ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance AAAI 2025 Enhancing Transformers for Generalizable First-Order Logical Entailment ACL 2025 Extending Complex Logical Queries on Uncertain Knowledge Graphs ACL 2025 Generative Music Models’ Alignment with Professional and Amateur Users’ Expectations ACL 2025 Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models COLING 2025 ACE: Anti-Editing Concept Erasure in Text-to-Image Models CVPR 2025 ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting CVPR 2025 From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery EMNLP 2025 LogiDynamics: Unraveling the Dynamics of Inductive, Abductive and Deductive Logical Inferences in LLM Reasoning EMNLP 2025 Where am I? Cross-View Geo-localization with Natural Language Descriptions ICCV 2025 Open-World Skill Discovery from Unsegmented Demonstration Videos ICCV 2025 Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models ICCV 2025 SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget ICLR 2025 Learning Hierarchical Polynomials of Multiple Nonlinear Features ICLR 2025 GROOT-2: Weakly Supervised Multimodal Instruction Following Agents ICLR 2025 LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models ICLR 2025 The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) ICML 2025 A Recipe for Causal Graph Regression: Confounding Effects Revisited ICML 2025 MCU: An Evaluation Framework for Open-Ended Game Agents ICML 2025 AI-Assisted Human-Pet Artistic Musical Co-Creation for Wellness Therapy IJCAI 2025 MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music IJCAI 2024 OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents NIPS 2024 Transforming and Combining Rewards for Aligning Large Language Models ICML 2024 Selecting Large Language Model to Fine-tune via Rectified Scaling Law ICML 2024 SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning ECCV 2024 Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key? ACL 2024 ProAgent: Building Proactive Cooperative Agents with Large Language Models AAAI 2024 NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning AAAI 2024 A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis AAAI 2024 GROOT: Learning to Follow Instructions by Watching Gameplay Videos ICLR 2024 LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing EMNLP 2024 Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering EMNLP 2024 Rethinking Complex Queries on Knowledge Graphs with Neural Link Predictors ICLR 2024 SDformer: Transformer with Spectral Filter and Dynamic Attention for Multivariate Time Series Long-term Forecasting IJCAI 2024 Learning Hierarchical Polynomials with Three-Layer Neural Networks ICLR 2024 Concept Algebra for (Score-Based) Text-Controlled Generative Models NIPS 2023 Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents NIPS 2023 Logical Message Passing Networks with One-hop Inference on Atomic Formulas ICLR 2023 spred: Solving L1 Penalty with SGD ICML 2023 Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport ACL 2023 Theoretical Analysis of the Inductive Biases in Deep Convolutional Networks NIPS 2023 Information-Directed Selection for Top-Two Algorithms COLT 2023 Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction CVPR 2023 Learning Transformation-Predictive Representations for Detection and Description of Local Features CVPR 2023 MICO: A Multi-alternative Contrastive Learning Framework for Commonsense Knowledge Representation EMNLP 2022 Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild WACV 2022 OnePose: One-Shot Object Pose Estimation Without CAD Models CVPR 2022 Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap ECCV 2022 Posterior Collapse of a Linear Latent Variable Model NIPS 2022 Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics COLING 2022 A Neural-Symbolic Approach to Natural Language Understanding EMNLP 2022 SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising NAACL 2022 Query2Particles: Knowledge Graph Reasoning with Particle Embeddings NAACL 2022 IFDDS: An Anti-fraud Outbound Robot AAAI 2021 Local Representation is Not Enough: Soft Point-Wise Transformer for Descriptor and Detector of Local Features IJCAI 2021 A Relaxed Matching Procedure for Unsupervised BLI ACL 2020 Robust Document Distance with Wasserstein-Fisher-Rao metric ACML 2020 Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction EMNLP 2020 Weakly-supervised 3D Shape Completion in the Wild ECCV 2020 Two-stage Behavior Cloning for Spoken Dialogue System in Debt Collection IJCAI 2020 Tackling Long-Tailed Relations and Uncommon Entities in Knowledge Graph Completion EMNLP 2019 CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval ICCV 2019 Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing CVPR 2019 Tackling Long-Tailed Relations and Uncommon Entities in Knowledge Graph Completion IJCNLP 2019 Responding E-commerce Product Questions via Exploiting QA Collections and Reviews COLING 2018 Deep Recurrent Generative Decoder for Abstractive Text Summarization EMNLP 2017