Song Wang

102 papers · 2013–2026 · 14 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🌍 Conference Polyglot (14) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (12) 🏠 Conference Loyalist (22) 🤝 Dynamic Duo (23) 🏆 Grand Slam 🔬 Deep Specialist (13) 🧬 Topic Evolution 🏆 Keyword Champion (2) ⚡ Prolific Year (5) ❓ The Questioner (3) 📈 Trend Setter 🗃️ Keyword Collector (421) 💎 Century Club (101) 🔥 Unstoppable (11) 🚀 Conference Pioneer

Conferences

CVPR (22) AAAI (17) ICCV (13) EMNLP (12) ECCV (10) ACL (6) IJCAI (6) ICLR (5) NAACL (4) NIPS (3) AACL (1) ICML (1) IJCNLP (1) WACV (1)

Top co-authors

Jundong Li (23) Zhenyao Wu (14) Xinyi Wu (14) Wei Feng (14) Jianke Zhu (12) Zhen Tan (11) Wentong Li (10) Zihan Chen (9) Lili Ju (8) Qing Guo (7)

Keywords

large language model (11) autonomous driving (9) graph neural network (7) knowledge distillation (7) convolutional neural network (6) few-shot learning (6) semantic segmentation (6) unsupervised learning (5) image restoration (5) in-context learning (4) generative model (4) language model (4) point cloud (4) zero-shot learning (4) domain adaptation (4) instance segmentation (4) knowledge graph (4) attention mechanism (3) 3d reconstruction (3) feature extraction (3)

Papers

GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving AAAI 2026 Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation EMNLP 2025 AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction EMNLP 2025 Learning from Diverse Reasoning Paths with Routing and Collaboration EMNLP 2025 CoRAG: Enhancing Hybrid Retrieval-Augmented Generation through a Cooperative Retriever Architecture EMNLP 2025 Reasoning of Large Language Models over Knowledge Graphs with Super-Relations ICLR 2025 The Source Image is the Best Attention for Infrared and Visible Image Fusion ICCV 2025 Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations ICCV 2025 SAM4D: Segment Anything in Camera and LiDAR Streams ICCV 2025 Monocular Semantic Scene Completion via Masked Recurrent Networks ICCV 2025 From Cross-Task Examples to In-Task Prompts: A Graph-Based Pseudo-Labeling Framework for In-context Learning EMNLP 2025 FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference EMNLP 2025 Interpreting Pretrained Language Models via Concept Bottlenecks (Extended Abstract) IJCAI 2025 Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction CVPR 2025 DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation CVPR 2025 PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning CVPR 2025 Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning CVPR 2025 Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration CVPR 2025 From Implicit Exploration to Structured Reasoning: Guideline and Refinement for LLMs EMNLP 2025 Reliable and Calibrated Semantic Occupancy Prediction by Hybrid Uncertainty Learning IJCAI 2025 DIIN: Diffusion Iterative Implicit Networks for Arbitrary-scale Super-resolution IJCAI 2025 Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service NAACL 2025 Revisiting Graph Contrastive Learning on Anomaly Detection: A Structural Imbalance Perspective AAAI 2025 BrainMAP: Learning Multiple Activation Pathways in Brain Networks AAAI 2025 Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning AAAI 2025 Tuning-Free Accountable Intervention for LLM Deployment – a Metacognitive Approach AAAI 2025 Bias Unveiled: Investigating Social Bias in LLM-Generated Code AAAI 2025 The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI) AACL 2025 The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI) IJCNLP 2025 Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models ACL 2025 MAPLE: Many-Shot Adaptive Pseudo-Labeling for In-Context Learning ICML 2025 Integrative Decoding: Improving Factuality via Implicit Self-consistency ICLR 2025 Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective ICLR 2025 PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance ICLR 2025 CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models ICLR 2025 Large Language Models for Data Annotation and Synthesis: A Survey EMNLP 2024 EINet: Point Cloud Completion via Extrapolation and Interpolation ECCV 2024 SAIR: Learning Semantic-aware Implicit Representation ECCV 2024 From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration CVPR 2024 Label-efficient Semantic Scene Completion with Scribble Annotations IJCAI 2024 Mixture of Demonstrations for In-Context Learning NIPS 2024 Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models EMNLP 2024 Few-shot Knowledge Graph Relational Reasoning via Subgraph Adaptation NAACL 2024 Orthogonal Dictionary Guided Shape Completion Network for Point Cloud AAAI 2024 MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction CVPR 2024 Knowledge Graph-Enhanced Large Language Models via Path Selection ACL 2024 FastGAS: Fast Graph-based Annotation Selection for In-Context Learning ACL 2024 Bidirectional Autoregessive Diffusion Model for Dance Generation CVPR 2024 Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation CVPR 2024 CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification ICCV 2023 Label-efficient Segmentation via Affinity Propagation NIPS 2023 Parametric Surface Constrained Upsampler Network for Point Cloud AAAI 2023 Few-Shot 3D Point Cloud Semantic Segmentation via Stratified Class-Specific Attention Based Transformer Network AAAI 2023 Interpreting Unfairness in Graph Neural Networks via Training Node Attribution AAAI 2023 Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization ACL 2023 Joint Generator-Ranker Learning for Natural Language Generation ACL 2023 LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation CVPR 2023 Noise-Robust Fine-Tuning of Pretrained Language Models via External Guidance EMNLP 2023 LMGQS: A Large-scale Dataset for Query-focused Summarization EMNLP 2023 Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport ICCV 2023 Leveraging Inpainting for Single-Image Shadow Removal ICCV 2023 Self-Supervised Social Relation Representation for Human Group Detection ECCV 2022 Graph Few-shot Learning with Task-specific Structures NIPS 2022 Background-Insensitive Scene Text Recognition with Text Semantic Segmentation ECCV 2022 Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior ECCV 2022 Style-Guided Shadow Removal ECCV 2022 Panoramic Human Activity Recognition ECCV 2022 SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network ECCV 2022 MISF: Multi-Level Interactive Siamese Filtering for High-Fidelity Image Inpainting CVPR 2022 Can You Spot the Chameleon? Adversarially Camouflaging Images From Co-Salient Object Detection CVPR 2022 An End-to-End Dialogue Summarization System for Sales Calls NAACL 2022 Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation AAAI 2022 DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation ACL 2022 FAITH: Few-Shot Graph Classification with Hierarchical Task Graphs IJCAI 2022 Connecting the Complementary-View Videos: Joint Camera Identification and Subject Association CVPR 2022 Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation? ECCV 2022 Deep Poisoning: Towards Robust Image Data Sharing Against Visual Disclosure WACV 2021 VIL-100: A New Dataset and a Baseline Model for Video Instance Lane Detection ICCV 2021 From Shadow Generation To Shadow Removal CVPR 2021 DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation CVPR 2021 Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-Balanced Samplings CVPR 2021 Auto-Exposure Fusion for Single-Image Shadow Removal CVPR 2021 Multi-Domain Multi-Task Rehearsal for Lifelong Learning AAAI 2021 Binaural Audio-Visual Localization AAAI 2021 Hierarchical Heterogeneous Graph Representation Learning for Short Text Classification EMNLP 2021 A Multi-Task Mean Teacher for Semi-Supervised Shadow Detection CVPR 2020 Multi-Spectral Salient Object Detection by Adversarial Domain Adaptation AAAI 2020 SalSAC: A Video Saliency Prediction Model with Shuffled Attentions and Correlation-Based ConvLSTM AAAI 2020 Complementary-View Multiple Human Tracking AAAI 2020 Multi-Type Self-Attention Guided Degraded Saliency Detection AAAI 2020 Semantic Stereo Matching With Pyramid Cost Volumes ICCV 2019 Goal-Oriented End-to-End Conversational Models with Profile Features in a Real-World Setting NAACL 2019 Spatial Correspondence With Generative Adversarial Network: Learning Depth From Monocular Videos ICCV 2019 Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification CVPR 2019 Does Haze Removal Help CNN-based Image Classification? ECCV 2018 Learning View-Invariant Features for Person Identification in Temporally Synchronized Videos Taken by Wearable Cameras ICCV 2017 Learning Dynamic Siamese Network for Visual Object Tracking ICCV 2017 Groupwise Tracking of Crowded Similar-Appearance Targets From Low-Continuity Image Sequences CVPR 2016 Combining Local Appearance and Holistic View: Dual-Source Deep Neural Networks for Human Pose Estimation CVPR 2015 Simple Atom Selection Strategy for Greedy Matrix Completion IJCAI 2015 Co-Interest Person Detection From Multiple Wearable Camera Videos ICCV 2015 Recognize Human Activities from Partially Observed Videos CVPR 2013