Guanglu Song

34 papers · 2018–2025 · 7 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (7) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)

🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (34) 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (2) 📈 Trend Setter 🗃️ Keyword Collector (96) 🚀 Conference Pioneer ⚡ Prolific Year (12) 🔥 Unstoppable (6) 💎 Century Club (34)

Conferences

ECCV (12) NIPS (7) ICCV (6) CVPR (5) ICLR (2) AAAI (1) ICML (1)

Top co-authors

Yu Liu (34) Hongsheng Li (18) Zhuofan Zong (10) Boxiao Liu (6) Manyuan Zhang (6) Dazhong Shen (6) Dongzhi Jiang (5) Zeyue Xue (4) Fu-Yun Wang (4) Bingqi Ma (4)

Keywords

object detection (8) diffusion model (6) text-to-image generation (5) knowledge distillation (3) image generation (3) semantic segmentation (2) spatial disentanglement (2) large language model (2) consistency model (2) mixture of expert (2) classifier-free guidance (2) autonomous driving (2) face detection (2) neural network optimization (2) convolutional neural network (2) feature pyramid network (2) text-to-image synthesis (1) self-supervised learning (1) visual question answering (1) temporal modeling (1)

Papers

MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines · ICLR 2025
See Further When Clear: Curriculum Consistency Model · CVPR 2025
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM · ICML 2025
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models · ECCV 2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models · NIPS 2024
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis · ECCV 2024
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation · ECCV 2024
ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model · ECCV 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance · CVPR 2024
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning · NIPS 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching · NIPS 2024
Phased Consistency Models · NIPS 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context · NIPS 2024
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks · ECCV 2024
LMDrive: Closed-Loop End-to-End Driving with Large Language Models · CVPR 2024
DETRs with Collaborative Hybrid Assignments Training · ICCV 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction · ICCV 2023
Masked Autoencoders Are Stronger Knowledge Distillers · ICCV 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths · NIPS 2023
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection · ICCV 2023
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors · ICCV 2023
Rethinking Robust Representation Learning under Fine-Grained Noisy Faces · ECCV 2022
Unifying Visual Perception by Dispersible Points Learning · ECCV 2022
Self-Slimmed Vision Transformer · ECCV 2022
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes · NIPS 2022
Towards Robust Face Recognition with Comprehensive Search · ECCV 2022
UniNet: Unified Architecture Search with Convolution, Transformer, and MLP · ECCV 2022
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning · ICLR 2022
Switchable K-Class Hyperplanes for Noise-Robust Representation Learning · ICCV 2021
Discriminability Distillation in Group Representation Learning · ECCV 2020
Revisiting the Sibling Head in Object Detector · CVPR 2020
KPNet: Towards Minimal Face Detector · AAAI 2020
Transductive Centroid Projection for Semi-supervised Large-scale Recognition · ECCV 2018
Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy · CVPR 2018