Wenqiao Zhang

29 papers · 2021–2026 · 9 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (5)

🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (9) 🤝 Dynamic Duo (15) 🏆 Grand Slam 🔬 Deep Specialist (10) 🧬 Topic Evolution 🗃️ Keyword Collector (147) ❓ The Questioner ⚡ Prolific Year (6) 🔥 Unstoppable (5) 💎 Century Club (25)

Conferences

ACL (7) CVPR (6) AAAI (5) ICCV (4) EMNLP (2) ICML (2) COLING (1) ICLR (1) NIPS (1)

Top co-authors

Siliang Tang (18) Yueting Zhuang (15) Juncheng Li (14) Mengze Li (8) Shengyu Zhang (8) Wei Ji (6) Fei Wu (5) Tat-Seng Chua (5) Yuqian Yuan (4) Minghe Gao (4)

Keywords

multimodal large language model (5) video understanding (4) multimodal learning (4) large language model (3) active learning (3) object detection (2) representation learning (2) visual reasoning (2) video grounding (2) low-rank adaptation (2) instruction tuning (2) uncertainty quantification (2) image captioning (2) parameter-efficient fine-tuning (2) symbolic reasoning (1) adversarial learning (1) domain adaptation (1) reinforcement learning (1) domain generalization (1) knowledge distillation (1)

Papers

MAU-GPT: Enhancing Multi-type Industrial Anomaly Understanding via Anomaly-aware and Generalist Experts Adaptation AAAI 2026 Evolving Generalist Virtual Agents with Generative and Associative Memory AAAI 2026 PILOT: Planning via Internalized Latent Optimization Trajectories for Large Language Models ACL 2026 MoA: Heterogeneous Mixture of Adapters for Parameter-Efficient Fine-Tuning of Large Language Models ACL 2026 Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness ICCV 2025 Align2LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation ACL 2025 ITERATE: Image-Text Enhancement, Retrieval, and Alignment for Transmodal Evolution with LLMs COLING 2025 Boosting Virtual Agent Learning and Reasoning: A Step-Wise, Multi-Dimensional, and Generalist Reward Model with Benchmark ICML 2025 HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation ICML 2025 Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining ICCV 2025 TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition ACL 2025 VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM CVPR 2025 Optimize Incompatible Parameters Through Compatibility-aware Knowledge Integration AAAI 2025 Meta-Reflection: A Feedback-Free Reflection Learning Framework ACL 2025 Revisiting the Domain Shift and Sample Uncertainty in Multi-source Active Domain Transfer CVPR 2024 DIEM: Decomposition-Integration Enhancing Multimodal Insights CVPR 2024 Bridging Local Details and Global Context in Text-Attributed Graphs EMNLP 2024 Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions ICLR 2024 Learning in Imperfect Environment: Multi-Label Classification with Long-Tailed Distribution and Partial Labels ICCV 2023 Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models ICCV 2023 Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-Based Active Learning CVPR 2023 WINNER: Weakly-Supervised hIerarchical decompositioN and aligNment for Spatio-tEmporal Video gRounding CVPR 2023 Multi-modal Action Chain Abductive Reasoning ACL 2023 ART: rule bAsed futuRe-inference deducTion EMNLP 2023 BoostMIS: Boosting Medical Image Semi-Supervised Learning With Adaptive Pseudo Labeling and Informative Active Annotation CVPR 2022 DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes NIPS 2022 MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning AAAI 2022 End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding ACL 2022 Consensus Graph Representation Learning for Better Grounded Image Captioning AAAI 2021