Wenqiao Zhang
29 papers · 2021–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (5)
🌈
Renaissance Researcher
(6)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(9)
🤝
Dynamic Duo
(15)
🏆
Grand Slam
🔬
Deep Specialist
(10)
🧬
Topic Evolution
🗃️
Keyword Collector
(147)
❓
The Questioner
⚡
Prolific Year
(6)
🔥
Unstoppable
(5)
💎
Century Club
(25)
Conferences
ACL (7)
CVPR (6)
AAAI (5)
ICCV (4)
EMNLP (2)
ICML (2)
COLING (1)
ICLR (1)
NIPS (1)
Top co-authors
Keywords
multimodal large language model
(5)
video understanding
(4)
multimodal learning
(4)
large language model
(3)
active learning
(3)
object detection
(2)
representation learning
(2)
visual reasoning
(2)
video grounding
(2)
low-rank adaptation
(2)
instruction tuning
(2)
uncertainty quantification
(2)
image captioning
(2)
parameter-efficient fine-tuning
(2)
symbolic reasoning
(1)
adversarial learning
(1)
domain adaptation
(1)
reinforcement learning
(1)
domain generalization
(1)
knowledge distillation
(1)
Papers
MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation
AAAI 2026
Evolving Generalist Virtual Agents with Generative and Associative Memory
AAAI 2026
PILOT: Planning via Internalized Latent Optimization Trajectories for Large Language Models
ACL 2026
MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models
ACL 2026
Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness
ICCV 2025
Align2LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation
ACL 2025
ITERATE: Image-Text Enhancement, Retrieval, and Alignment for Transmodal Evolution with LLMs
COLING 2025
Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark
ICML 2025
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
ICML 2025
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining
ICCV 2025
TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition
ACL 2025
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
CVPR 2025
Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration
AAAI 2025
Meta-Reflection: A Feedback-Free Reflection Learning Framework
ACL 2025
Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer
CVPR 2024
DIEM: Decomposition-Integration Enhancing Multimodal Insights
CVPR 2024
Bridging Local Details and Global Context in Text-Attributed Graphs
EMNLP 2024
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions
ICLR 2024
Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels
ICCV 2023
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models
ICCV 2023
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning
CVPR 2023
WINNER: Weakly-Supervised hIerarchical decompositioN and aligNment for Spatio-tEmporal Video gRounding
CVPR 2023
Multi-modal Action Chain Abductive Reasoning
ACL 2023
ART: rule bAsed futuRe-inference deducTion
EMNLP 2023
BoostMIS: Boosting Medical Image Semi-Supervised Learning With Adaptive Pseudo Labeling and Informative Active Annotation
CVPR 2022
DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes
NIPS 2022
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning
AAAI 2022
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding
ACL 2022
Consensus Graph Representation Learning for Better Grounded Image Captioning
AAAI 2021