Handong Zhao
51 papers · 2015–2025 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (15) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π Academic Marathon (10)
π
Academic Marathon
(10)
π
Cross-Pollinator
(7)
π
Renaissance Researcher
(10)
π
Keyword Champion
π€
Dynamic Duo
(13)
π
Grand Slam
π§¬
Topic Evolution
ποΈ
Keyword Collector
(205)
β‘
Prolific Year
(9)
π
Century Club
(51)
π₯
Unstoppable
(7)
Conferences
EMNLP (7)
CVPR (6)
IJCAI (6)
ACL (5)
ICCV (5)
NIPS (5)
ICLR (4)
NAACL (3)
AAAI (2)
EACL (2)
ECCV (2)
ICML (1)
IJCNLP (1)
JMLR (1)
L4DC (1)
Top co-authors
Keywords
representation learning
(6)
contrastive learning
(6)
self-supervised learning
(5)
multimodal learning
(5)
few-shot learning
(4)
multimodal large language model
(4)
prompt tuning
(4)
domain adaptation
(4)
document understanding
(3)
knowledge distillation
(3)
vision-language model
(3)
causal inference
(3)
cross-modal retrieval
(2)
diffusion model
(2)
named entity recognition
(2)
reinforcement learning
(2)
adversarial learning
(2)
subspace clustering
(2)
federated learning
(2)
spectral clustering
(2)
Papers
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers
CVPR 2025
MAGNET: Augmenting Generative Decoders with Representation Learning and Infilling Capabilities
ACL 2025
Augment before You Try: Knowledge-Enhanced Table Question Answering via Table Expansion
EMNLP 2025
GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration
EMNLP 2025
VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs
ICCV 2025
Advancing Vision-Language Models with Adapter Ensemble Strategies
EMNLP 2024
Structured Dynamic Pricing: Optimal Regret in a Global Shrinkage Model
JMLR 2024
Personalized Federated Learning for Text Classification with Gradient-Free Prompt Tuning
NAACL 2024
Tag-grounded Visual Instruction Tuning with Retrieval Augmentation
EMNLP 2024
Fine-tuning CLIP Text Encoders with Two-step Paraphrasing
EACL 2024
Generalizing to Unseen Domains via Text-guided Augmentation
ECCV 2024
Few-Shot Dialogue Summarization via Skeleton-Assisted Prompt Transfer in Prompt Tuning
EACL 2024
Easy Regional Contrastive Learning of Expressive Fashion Representations
NIPS 2024
Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback
NAACL 2024
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
ICLR 2024
InfoPrompt: Information-Theoretic Soft Prompt Tuning for Natural Language Understanding
NIPS 2023
Few-Shot Composition Learning for Image Retrieval with Prompt Tuning
AAAI 2023
Federated Domain Adaptation for Named Entity Recognition via Distilling with Heterogeneous Tag Sets
ACL 2023
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
CVPR 2023
A Critical Analysis of Document Out-of-Distribution Detection
EMNLP 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
ICCV 2023
Better Generative Replay for Continual Federated Learning
ICLR 2023
Few-Shot Class-Incremental Learning for Named Entity Recognition
ACL 2022
Neural Point Process for Learning Spatiotemporal Event Dynamics
L4DC 2022
Context-aware Information-theoretic Causal De-biasing for Interactive Sequence Labeling
EMNLP 2022
XDC: Adversarial Adaptive Cross Domain Face Clustering (Student Abstract)
AAAI 2022
Neural Contextual Bandits with Deep Representation and Shallow Exploration
ICLR 2022
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval
CVPR 2022
Learning Adaptive Axis Attentions in Fine-tuning: Beyond Fixed Sparse Attention Patterns
ACL 2022
Discovering Low-rank Subspaces for Language-agnostic Multilingual Representations
EMNLP 2022
Slow Learning and Fast Inference: Efficient Graph Similarity Computation via Knowledge Distillation
NIPS 2021
SelfDoc: Self-Supervised Document Representation Learning
CVPR 2021
Edge: Enriching Knowledge Graph Embeddings with External Text
NAACL 2021
UniDoc: Unified Pretraining Framework for Document Understanding
NIPS 2021
Learning Contextualized Knowledge Structures for Commonsense Reasoning
ACL 2021
Adaptive Adversarial Network for Source-Free Domain Adaptation
ICCV 2021
ECACL: A Holistic Framework for Semi-Supervised Domain Adaptation
ICCV 2021
Learning Contextualized Knowledge Structures for Commonsense Reasoning
IJCNLP 2021
Learning to Deceive Knowledge Graph Augmented Models via Targeted Perturbation
ICLR 2021
A Survey on Representation Learning for User Modeling
IJCAI 2020
Cross-Domain Document Object Detection: Benchmark Suite and Method
CVPR 2020
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
ECCV 2020
Structured Policy Iteration for Linear Quadratic Regulator
ICML 2020
Self-Supervised Relationship Probing
NIPS 2020
Unpaired Image Captioning via Scene Graph Alignments
ICCV 2019
Scene Graph Generation With External Knowledge and Image Reconstruction
CVPR 2019
Projective Low-rank Subspace Clustering via Learning Deep Encoder
IJCAI 2017
Large-scale Subspace Clustering by Fast Regression Coding
IJCAI 2017
Incomplete Multi-Modal Visual Data Grouping
IJCAI 2016
Dual-Regularized Multi-View Outlier Detection
IJCAI 2015
Semantic Single Video Segmentation with Robust Graph Representation
IJCAI 2015