Guanglu Song
34 papers · 2018–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
🌍 Conference Polyglot (7) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)
🌈
Renaissance Researcher
(5)
🐣
Hot Topic Early Bird
🌍
Conference Polyglot
(7)
🤝
Dynamic Duo
(34)
🏆
Grand Slam
🧬
Topic Evolution
🏆
Keyword Champion
(2)
📈
Trend Setter
🗃️
Keyword Collector
(96)
🚀
Conference Pioneer
⚡
Prolific Year
(12)
🔥
Unstoppable
(6)
💎
Century Club
(34)
Conferences
ECCV (12)
NIPS (7)
ICCV (6)
CVPR (5)
ICLR (2)
AAAI (1)
ICML (1)
Top co-authors
Keywords
object detection
(8)
diffusion model
(6)
text-to-image generation
(5)
knowledge distillation
(3)
image generation
(3)
semantic segmentation
(2)
spatial disentanglement
(2)
large language model
(2)
consistency model
(2)
mixture of expert
(2)
classifier-free guidance
(2)
autonomous driving
(2)
face detection
(2)
neural network optimization
(2)
convolutional neural network
(2)
feature pyramid network
(2)
text-to-image synthesis
(1)
self-supervised learning
(1)
visual question answering
(1)
temporal modeling
(1)
Papers
MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines
ICLR 2025
See Further When Clear: Curriculum Consistency Model
CVPR 2025
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
ICML 2025
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
ECCV 2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
NIPS 2024
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
ECCV 2024
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
ECCV 2024
ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model
ECCV 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
CVPR 2024
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
NIPS 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
NIPS 2024
Phased Consistency Models
NIPS 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
NIPS 2024
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks
ECCV 2024
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
CVPR 2024
DETRs with Collaborative Hybrid Assignments Training
ICCV 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
ICCV 2023
Masked Autoencoders Are Stronger Knowledge Distillers
ICCV 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
NIPS 2023
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
ICCV 2023
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors
ICCV 2023
Rethinking Robust Representation Learning under Fine-Grained Noisy Faces
ECCV 2022
Unifying Visual Perception by Dispersible Points Learning
ECCV 2022
Self-Slimmed Vision Transformer
ECCV 2022
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes
NIPS 2022
Towards Robust Face Recognition with Comprehensive Search
ECCV 2022
"UniNet: Unified Architecture Search with Convolution, Transformer, and MLP"
ECCV 2022
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning
ICLR 2022
Switchable K-Class Hyperplanes for Noise-Robust Representation Learning
ICCV 2021
Discriminability Distillation in Group Representation Learning
ECCV 2020
Revisiting the Sibling Head in Object Detector
CVPR 2020
KPNet: Towards Minimal Face Detector
AAAI 2020
Transductive Centroid Projection for Semi-supervised Large-scale Recognition
ECCV 2018
Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy
CVPR 2018