Qing-Guo Chen

17 papers · 2019–2026 · 8 top CS/AI conferences

Achievements

🏃 Academic Marathon (6) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (15) 🏆 Grand Slam 🗃️ Keyword Collector (58) 💎 Century Club (15) ⚡ Prolific Year (8)

Conferences

ICLR (3) IJCAI (3) AAAI (2) ACL (2) ICCV (2) ICML (2) NIPS (2) CVPR (1)

Top co-authors

Weihua Luo (9) Kaifu Zhang (9) Zhao Xu (8) Yao Hu (4) Shiyin Lu (4) Jianfeng Lu (3) Xiangyu Wu (3) De-Chuan Zhan (3) Han-Jia Ye (3) Yang Yang (3)

Keywords

multimodal large language model (3) multi-label classification (2) multi-view learning (2) representation learning (2) image generation (1) feature extraction (1) catastrophic forgetting (1) preference learning (1) adversarial learning (1) direct preference optimization (1) attention mechanism (1) weakly supervised learning (1) text-to-image synthesis (1) knowledge distillation (1) image synthesis (1) visual question answering (1) document understanding (1) text-to-image generation (1) label disambiguation (1) multi-label learning (1)

Papers

Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding · ACL 2026
MirrorCAPTCHA: Wild CAPTCHA, Wild Distribution, Wild Web-based Platform Meet Multimodal LLM Agents · ACL 2026
ZooProbe: A Data Engine for Evaluating, Exploring, and Evolving Large-scale Training Data for Multimodal LLMs · ICLR 2025
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation · CVPR 2025
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs · ICCV 2025
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance · ICCV 2025
Multi-Label Test-Time Adaptation with Bound Entropy Minimization · ICLR 2025
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis · ICLR 2025
CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation · ICML 2025
Parrot: Multilingual Visual Instruction Tuning · ICML 2025
Wings: Learning Multimodal LLMs without Text-only Forgetting · NIPS 2024
TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable Prompt · IJCAI 2024
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees · NIPS 2024
Deep Time-Stream Framework for Click-through Rate Prediction by Tracking Interest Evolution · AAAI 2020
Multi-View Partial Multi-Label Learning with Graph-Based Disambiguation · AAAI 2020
Multi-View Active Learning for Video Recommendation · IJCAI 2019
Multi-View Multi-Label Learning with View-Specific Information Extraction · IJCAI 2019