conftrace_

Guanglu Song

34 papers · 2018–2025 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+13 more ↓ 🌍 Conference Polyglot (7) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)
🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (7) 🤝 Dynamic Duo (34) 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (2) 📈 Trend Setter 🗃️ Keyword Collector (96) 🚀 Conference Pioneer Prolific Year (12) 🔥 Unstoppable (6) 💎 Century Club (34)

Conferences

ECCV (12) NIPS (7) ICCV (6) CVPR (5) ICLR (2) AAAI (1) ICML (1)

Papers

MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines ICLR 2025 See Further When Clear: Curriculum Consistency Model CVPR 2025 EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ICML 2025 Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models ECCV 2024 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models NIPS 2024 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis ECCV 2024 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation ECCV 2024 ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model ECCV 2024 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance CVPR 2024 Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning NIPS 2024 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching NIPS 2024 Phased Consistency Models NIPS 2024 MoVA: Adapting Mixture of Vision Experts to Multimodal Context NIPS 2024 Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks ECCV 2024 LMDrive: Closed-Loop End-to-End Driving with Large Language Models CVPR 2024 DETRs with Collaborative Hybrid Assignments Training ICCV 2023 Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction ICCV 2023 Masked Autoencoders Are Stronger Knowledge Distillers ICCV 2023 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths NIPS 2023 Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection ICCV 2023 UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors ICCV 2023 Rethinking Robust Representation Learning under Fine-Grained Noisy Faces ECCV 2022 Unifying Visual Perception by Dispersible Points Learning ECCV 2022 Self-Slimmed Vision Transformer ECCV 2022 Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes NIPS 2022 Towards Robust Face Recognition with Comprehensive Search ECCV 2022 "UniNet: Unified Architecture Search with Convolution, Transformer, and MLP" ECCV 2022 UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning ICLR 2022 Switchable K-Class Hyperplanes for Noise-Robust Representation Learning ICCV 2021 Discriminability Distillation in Group Representation Learning ECCV 2020 Revisiting the Sibling Head in Object Detector CVPR 2020 KPNet: Towards Minimal Face Detector AAAI 2020 Transductive Centroid Projection for Semi-supervised Large-scale Recognition ECCV 2018 Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy CVPR 2018