Zihao Wang
74 papers · 2017–2026 · 16 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Conference Polyglot (16) π Academic Marathon (8) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (8)
π
Cross-Pollinator
(8)
π
Renaissance Researcher
(12)
πΊοΈ
Taxonomy Completionist
(120)
π
Grand Slam
π§¬
Topic Evolution
π₯
Mega-Team
(40)
π
Triple Crown
π€
Dynamic Duo
(11)
π
Century Club
(69)
β‘
Prolific Year
(9)
π
Trend Setter
π₯
Unstoppable
(9)
ποΈ
Keyword Collector
(294)
β
The Questioner
(2)
Conferences
ACL (10)
AAAI (9)
EMNLP (9)
ICLR (8)
CVPR (6)
ICML (6)
IJCAI (5)
NIPS (5)
ICCV (4)
COLING (3)
ECCV (3)
NAACL (2)
ACML (1)
COLT (1)
IJCNLP (1)
WACV (1)
Top co-authors
Research topics
Keywords
large language model
(9)
generative model
(5)
optimal transport
(4)
imitation learning
(4)
zero-shot learning
(3)
prompt engineering
(3)
knowledge graph completion
(3)
diffusion model
(3)
representation learning
(3)
knowledge graph
(3)
word embedding
(3)
transformer architecture
(3)
benchmark evaluation
(2)
unsupervised learning
(2)
multimodal learning
(2)
feature matching
(2)
attention mechanism
(2)
image generation
(2)
contrastive learning
(2)
in-context learning
(2)
Papers
RPGen: Robust and Differentially Private Synthetic Image Generation
AAAI 2026
Detecting AI-Generated Content on Social Media with Multi-modal Language Models
ACL 2026
PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning
ACL 2026
Activation-Guided Local Editing for Jailbreaking Attacks
ACL 2026
Diff-V2M: A Hierarchical Conditional Diffusion Model with Explicit Rhythmic Modeling for Video-to-Music Generation
AAAI 2026
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse
ACL 2025
MSV-PCT: Multi-Sparse-View Enhanced Transformer Framework for Salient Object Detection in Point Clouds
AAAI 2025
Transtreaming: Adaptive Delay-aware Transformer for Real-time Streaming Perception
AAAI 2025
ESEG: Event-Based Segmentation Boosted by Explicit Edge-Semantic Guidance
AAAI 2025
Enhancing Transformers for Generalizable First-Order Logical Entailment
ACL 2025
Extending Complex Logical Queries on Uncertain Knowledge Graphs
ACL 2025
Generative Music Modelsβ Alignment with Professional and Amateur Usersβ Expectations
ACL 2025
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models
COLING 2025
ACE: Anti-Editing Concept Erasure in Text-to-Image Models
CVPR 2025
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting
CVPR 2025
From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
EMNLP 2025
LogiDynamics: Unraveling the Dynamics of Inductive, Abductive and Deductive Logical Inferences in LLM Reasoning
EMNLP 2025
Where am I? Cross-View Geo-localization with Natural Language Descriptions
ICCV 2025
Open-World Skill Discovery from Unsegmented Demonstration Videos
ICCV 2025
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
SqueezeAttention: 2D Management of KV-Cache in LLM Inference via Layer-wise Optimal Budget
ICLR 2025
Learning Hierarchical Polynomials of Multiple Nonlinear Features
ICLR 2025
GROOT-2: Weakly Supervised Multimodal Instruction Following Agents
ICLR 2025
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
ICLR 2025
The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)
ICML 2025
A Recipe for Causal Graph Regression: Confounding Effects Revisited
ICML 2025
MCU: An Evaluation Framework for Open-Ended Game Agents
ICML 2025
AI-Assisted Human-Pet Artistic Musical Co-Creation for Wellness Therapy
IJCAI 2025
MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music
IJCAI 2024
OmniJARVIS: Unified Vision-Language-Action Tokenization Enables Open-World Instruction Following Agents
NIPS 2024
Transforming and Combining Rewards for Aligning Large Language Models
ICML 2024
Selecting Large Language Model to Fine-tune via Rectified Scaling Law
ICML 2024
SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning
ECCV 2024
Rethinking the Bounds of LLM Reasoning: Are Multi-Agent Discussions the Key?
ACL 2024
ProAgent: Building Proactive Cooperative Agents with Large Language Models
AAAI 2024
NestE: Modeling Nested Relational Structures for Knowledge Graph Reasoning
AAAI 2024
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis
AAAI 2024
GROOT: Learning to Follow Instructions by Watching Gameplay Videos
ICLR 2024
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
EMNLP 2024
Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering
EMNLP 2024
Rethinking Complex Queries on Knowledge Graphs with Neural Link Predictors
ICLR 2024
SDformer: Transformer with Spectral Filter and Dynamic Attention for Multivariate Time Series Long-term Forecasting
IJCAI 2024
Learning Hierarchical Polynomials with Three-Layer Neural Networks
ICLR 2024
Concept Algebra for (Score-Based) Text-Controlled Generative Models
NIPS 2023
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents
NIPS 2023
Logical Message Passing Networks with One-hop Inference on Atomic Formulas
ICLR 2023
spred: Solving L1 Penalty with SGD
ICML 2023
Wasserstein-Fisher-Rao Embedding: Logical Query Embeddings with Local Comparison and Global Transport
ACL 2023
Theoretical Analysis of the Inductive Biases in Deep Convolutional Networks
NIPS 2023
Information-Directed Selection for Top-Two Algorithms
COLT 2023
Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction
CVPR 2023
Learning Transformation-Predictive Representations for Detection and Description of Local Features
CVPR 2023
MICO: A Multi-alternative Contrastive Learning Framework for Commonsense Knowledge Representation
EMNLP 2022
Mending Neural Implicit Modeling for 3D Vehicle Reconstruction in the Wild
WACV 2022
OnePose: One-Shot Object Pose Estimation Without CAD Models
CVPR 2022
Quasi-Balanced Self-Training on Noise-Aware Synthesis of Object Point Clouds for Closing Domain Gap
ECCV 2022
Posterior Collapse of a Linear Latent Variable Model
NIPS 2022
Unsupervised Sentence Textual Similarity with Compositional Phrase Semantics
COLING 2022
A Neural-Symbolic Approach to Natural Language Understanding
EMNLP 2022
SeaD: End-to-end Text-to-SQL Generation with Schema-aware Denoising
NAACL 2022
Query2Particles: Knowledge Graph Reasoning with Particle Embeddings
NAACL 2022
IFDDS: An Anti-fraud Outbound Robot
AAAI 2021
Local Representation is Not Enough: Soft Point-Wise Transformer for Descriptor and Detector of Local Features
IJCAI 2021
A Relaxed Matching Procedure for Unsupervised BLI
ACL 2020
Robust Document Distance with Wasserstein-Fisher-Rao metric
ACML 2020
Semi-Supervised Bilingual Lexicon Induction with Two-way Interaction
EMNLP 2020
Weakly-supervised 3D Shape Completion in the Wild
ECCV 2020
Two-stage Behavior Cloning for Spoken Dialogue System in Debt Collection
IJCAI 2020
Tackling Long-Tailed Relations and Uncommon Entities in Knowledge Graph Completion
EMNLP 2019
CAMP: Cross-Modal Adaptive Message Passing for Text-Image Retrieval
ICCV 2019
Improving Referring Expression Grounding With Cross-Modal Attention-Guided Erasing
CVPR 2019
Tackling Long-Tailed Relations and Uncommon Entities in Knowledge Graph Completion
IJCNLP 2019
Responding E-commerce Product Questions via Exploiting QA Collections and Reviews
COLING 2018
Deep Recurrent Generative Decoder for Abstractive Text Summarization
EMNLP 2017