Gen Luo
21 papers · 2020–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Academic Marathon (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (7) π Cross-Pollinator (9)
πΊοΈ
Taxonomy Completionist
(46)
π
Conference Polyglot
(7)
π€
Dynamic Duo
(15)
π
Grand Slam
π§¬
Topic Evolution
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(88)
β‘
Prolific Year
(7)
π₯
Unstoppable
(6)
π
Century Club
(20)
Conferences
CVPR (8)
AAAI (3)
NIPS (3)
ECCV (2)
ICLR (2)
ICML (2)
ACL (1)
Top co-authors
Keywords
referring expression comprehension
(5)
semantic segmentation
(4)
weakly supervised learning
(3)
multimodal large language model
(3)
contrastive learning
(2)
knowledge distillation
(2)
parameter-efficient fine-tuning
(2)
teacher-student learning
(2)
3d referring expression segmentation
(2)
object grounding
(2)
pseudo labeling
(2)
attention mechanism
(2)
multi-task learning
(2)
multimodal learning
(2)
referring expression
(2)
semi-supervised learning
(1)
object detection
(1)
image segmentation
(1)
domain adaptation
(1)
3d vision
(1)
Papers
Earth-Adapter: Bridge the Geospatial Domain Gaps with a Frequency-Guided Mixture of Adapters
AAAI 2026
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension
CVPR 2025
Training Long-Context LLMs Efficiently via Chunk-wise Optimization
ACL 2025
WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
CVPR 2025
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
CVPR 2025
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression
CVPR 2025
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models
ICLR 2025
$\gamma-$MoD: Exploring Mixture-of-Depth Adaptation for Multimodal Large Language Models
ICLR 2025
3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
AAAI 2024
APL: Anchor-based Prompt Learning for One-stage Weakly Supervised Referring Expression Comprehension
ECCV 2024
ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models
NIPS 2024
CaM: Cache Merging for Memory-efficient LLMs Inference
ICML 2024
RG-SAN: Rule-Guided Spatial Awareness Network for End-to-End 3D Referring Expression Segmentation
NIPS 2024
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
ICML 2024
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models
NIPS 2023
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension
CVPR 2023
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension
CVPR 2023
Active Teacher for Semi-Supervised Object Detection
CVPR 2022
SeqTR: A Simple Yet Universal Network for Visual Grounding
ECCV 2022
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
AAAI 2021
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
CVPR 2020