Gen Luo

21 papers · 2020–2026 · 7 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (9)

🗺️ Taxonomy Completionist (46) 🌍 Conference Polyglot (7) 🤝 Dynamic Duo (15) 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (2) 🗃️ Keyword Collector (88) ⚡ Prolific Year (7) 🔥 Unstoppable (6) 💎 Century Club (20)

Conferences

CVPR (8) AAAI (3) NIPS (3) ECCV (2) ICLR (2) ICML (2) ACL (1)

Top co-authors

Rongrong Ji (15) Xiaoshuai Sun (14) Yiyi Zhou (11) Jiayi Ji (7) Yuxin Zhang (4) Liujuan Cao (3) Yunhang Shen (3) Yaxin Luo (3) GUANNAN JIANG (3) Qi Chen (2)

Keywords

referring expression comprehension (5) semantic segmentation (4) weakly supervised learning (3) multimodal large language model (3) contrastive learning (2) knowledge distillation (2) parameter-efficient fine-tuning (2) teacher-student learning (2) 3d referring expression segmentation (2) object grounding (2) pseudo labeling (2) attention mechanism (2) multi-task learning (2) multimodal learning (2) referring expression (2) semi-supervised learning (1) object detection (1) image segmentation (1) domain adaptation (1) 3d vision (1)

Papers

Earth-Adapter: Bridge the Geospatial Domain Gaps with a Frequency-Guided Mixture of Adapters AAAI 2026 DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension CVPR 2025 Training Long-Context LLMs Efficiently via Chunk-wise Optimization ACL 2025 WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation CVPR 2025 Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training CVPR 2025 FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression CVPR 2025 Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models ICLR 2025 $\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models ICLR 2025 3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation AAAI 2024 APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension ECCV 2024 ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models NIPS 2024 CaM: Cache Merging for Memory-efficient LLMs Inference ICML 2024 RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation NIPS 2024 Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization ICML 2024 Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models NIPS 2023 RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension CVPR 2023 RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension CVPR 2023 Active Teacher for Semi-Supervised Object Detection CVPR 2022 SeqTR: A Simple Yet Universal Network for Visual Grounding ECCV 2022 Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network AAAI 2021 Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation CVPR 2020