Handong Zhao

51 papers · 2015–2025 · 15 top CS/AI conferences

Achievements

🌍 Conference Polyglot (15) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🏃 Academic Marathon (10)

🐝 Cross-Pollinator (7) 🌈 Renaissance Researcher (10) 🏆 Keyword Champion 🤝 Dynamic Duo (13) 🏆 Grand Slam 🧬 Topic Evolution 🗃️ Keyword Collector (205) ⚡ Prolific Year (9) 💎 Century Club (51) 🔥 Unstoppable (7)

Conferences

EMNLP (7) CVPR (6) IJCAI (6) ACL (5) ICCV (5) NIPS (5) ICLR (4) NAACL (3) AAAI (2) EACL (2) ECCV (2) ICML (1) IJCNLP (1) JMLR (1) L4DC (1)

Top co-authors

Tong Yu (13) Ruiyi Zhang (11) Jiuxiang Gu (10) Sungchul Kim (10) Sheng Li (9) Yun Fu (9) Ryan Rossi (7) Jason Kuen (6) Ani Nenkova (6) Ricardo Henao (6)

Keywords

representation learning (6) contrastive learning (6) self-supervised learning (5) multimodal learning (5) few-shot learning (4) multimodal large language model (4) prompt tuning (4) domain adaptation (4) document understanding (3) knowledge distillation (3) vision-language model (3) causal inference (3) cross-modal retrieval (2) diffusion model (2) named entity recognition (2) reinforcement learning (2) adversarial learning (2) subspace clustering (2) federated learning (2) spectral clustering (2)

Papers

The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers CVPR 2025
MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities ACL 2025
Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion EMNLP 2025
GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration EMNLP 2025
VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs ICCV 2025
Advancing Vision-Language Models with Adapter Ensemble Strategies EMNLP 2024
Structured Dynamic Pricing: Optimal Regret in a Global Shrinkage Model JMLR 2024
Personalized Federated Learning for Text Classification with Gradient-Free Prompt Tuning NAACL 2024
Tag-grounded Visual Instruction Tuning with Retrieval Augmentation EMNLP 2024
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing EACL 2024
Generalizing to Unseen Domains via Text-guided Augmentation ECCV 2024
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt Tuning EACL 2024
Easy Regional Contrastive Learning of Expressive Fashion Representations NIPS 2024
Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback NAACL 2024
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation ICLR 2024
InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding NIPS 2023
Few-Shot Composition Learning for Image Retrieval with Prompt Tuning AAAI 2023
Federated Domain Adaptation for Named Entity Recognition via Distilling with Heterogeneous Tag Sets ACL 2023
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models CVPR 2023
A Critical Analysis of Document Out-of-Distribution Detection EMNLP 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis ICCV 2023
Better Generative Replay for Continual Federated Learning ICLR 2023
Few-Shot Class-Incremental Learning for Named Entity Recognition ACL 2022
Neural Point Process for Learning Spatiotemporal Event Dynamics L4DC 2022
Context-aware Information-theoretic Causal De-biasing for Interactive Sequence Labeling EMNLP 2022
XDC: Adversarial Adaptive Cross Domain Face Clustering (Student Abstract) AAAI 2022
Neural Contextual Bandits with Deep Representation and Shallow Exploration ICLR 2022
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval CVPR 2022
Learning Adaptive Axis Attentions in Fine-tuning: Beyond Fixed Sparse Attention Patterns ACL 2022
Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations EMNLP 2022
Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation NIPS 2021
SelfDoc: Self-Supervised Document Representation Learning CVPR 2021
Edge: Enriching Knowledge Graph Embeddings with External Text NAACL 2021
UniDoc: Unified Pretraining Framework for Document Understanding NIPS 2021
Learning Contextualized Knowledge Structures for Commonsense Reasoning ACL 2021
Adaptive Adversarial Network for Source-Free Domain Adaptation ICCV 2021
ECACL: A Holistic Framework for Semi-Supervised Domain Adaptation ICCV 2021
Learning Contextualized Knowledge Structures for Commonsense Reasoning IJCNLP 2021
Learning to Deceive Knowledge Graph Augmented Models via Targeted Perturbation ICLR 2021
A Survey on Representation Learning for User Modeling IJCAI 2020
Cross-Domain Document Object Detection: Benchmark Suite and Method CVPR 2020
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions ECCV 2020
Structured Policy Iteration for Linear Quadratic Regulator ICML 2020
Self-Supervised Relationship Probing NIPS 2020
Unpaired Image Captioning via Scene Graph Alignments ICCV 2019
Scene Graph Generation With External Knowledge and Image Reconstruction CVPR 2019
Projective Low-rank Subspace Clustering via Learning Deep Encoder IJCAI 2017
Large-scale Subspace Clustering by Fast Regression Coding IJCAI 2017
Incomplete Multi-Modal Visual Data Grouping IJCAI 2016
Dual-Regularized Multi-View Outlier Detection IJCAI 2015
Semantic Single Video Segmentation with Robust Graph Representation IJCAI 2015