Qing-Guo Chen
17 papers · 2019–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🏃 Academic Marathon (6) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (15)
🌍
Conference Polyglot
(7)
🏃
Academic Marathon
(6)
🐝
Cross-Pollinator
(15)
🏆
Grand Slam
🗃️
Keyword Collector
(58)
💎
Century Club
(15)
⚡
Prolific Year
(8)
Conferences
ICLR (3)
IJCAI (3)
AAAI (2)
ACL (2)
ICCV (2)
ICML (2)
NIPS (2)
CVPR (1)
Top co-authors
Keywords
multimodal large language model
(3)
multi-label classification
(2)
multi-view learning
(2)
representation learning
(2)
image generation
(1)
feature extraction
(1)
catastrophic forgetting
(1)
preference learning
(1)
adversarial learning
(1)
direct preference optimization
(1)
attention mechanism
(1)
weakly supervised learning
(1)
text-to-image synthesis
(1)
knowledge distillation
(1)
image synthesis
(1)
visual question answering
(1)
document understanding
(1)
text-to-image generation
(1)
label disambiguation
(1)
multi-label learning
(1)
Papers
Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding
ACL 2026
MirrorCAPTCHA: Wild CAPTCHA, Wild Distribution, Wild Web-based Platform Meet Multimodal LLM Agents
ACL 2026
ZooProbe: A Data Engine for Evaluating, Exploring, and Evolving Large-scale Training Data for Multimodal LLMs
ICLR 2025
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
CVPR 2025
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs
ICCV 2025
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance
ICCV 2025
Multi-Label Test-Time Adaptation with Bound Entropy Minimization
ICLR 2025
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
ICLR 2025
CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
ICML 2025
Parrot: Multilingual Visual Instruction Tuning
ICML 2025
Wings: Learning Multimodal LLMs without Text-only Forgetting
NIPS 2024
TAI++: Text as Image for Multi-Label Image Classification by Co-Learning Transferable Prompt
IJCAI 2024
Advancing Tool-Augmented Large Language Models: Integrating Insights from Errors in Inference Trees
NIPS 2024
Deep Time-Stream Framework for Click-through Rate Prediction by Tracking Interest Evolution
AAAI 2020
Multi-View Partial Multi-Label Learning with Graph-Based Disambiguation
AAAI 2020
Multi-View Active Learning for Video Recommendation
IJCAI 2019
Multi-View Multi-Label Learning with View-Specific Information Extraction
IJCAI 2019