Song Wang
102 papers · 2013–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🌍 Conference Polyglot (14) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(12)
🏠
Conference Loyalist
(22)
🤝
Dynamic Duo
(23)
🏆
Grand Slam
🔬
Deep Specialist
(13)
🧬
Topic Evolution
🏆
Keyword Champion
(2)
⚡
Prolific Year
(5)
❓
The Questioner
(3)
📈
Trend Setter
🗃️
Keyword Collector
(421)
💎
Century Club
(101)
🔥
Unstoppable
(11)
🚀
Conference Pioneer
Conferences
CVPR (22)
AAAI (17)
ICCV (13)
EMNLP (12)
ECCV (10)
ACL (6)
IJCAI (6)
ICLR (5)
NAACL (4)
NIPS (3)
AACL (1)
ICML (1)
IJCNLP (1)
WACV (1)
Top co-authors
Keywords
large language model
(11)
autonomous driving
(9)
graph neural network
(7)
knowledge distillation
(7)
convolutional neural network
(6)
few-shot learning
(6)
semantic segmentation
(6)
unsupervised learning
(5)
image restoration
(5)
in-context learning
(4)
generative model
(4)
language model
(4)
point cloud
(4)
zero-shot learning
(4)
domain adaptation
(4)
instance segmentation
(4)
knowledge graph
(4)
attention mechanism
(3)
3d reconstruction
(3)
feature extraction
(3)
Papers
GUIDE: Gaussian Unified Instance Detection for Enhanced Obstacle Perception in Autonomous Driving
AAAI 2026
Separate the Wheat from the Chaff: Winnowing Down Divergent Views in Retrieval Augmented Generation
EMNLP 2025
AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction
EMNLP 2025
Learning from Diverse Reasoning Paths with Routing and Collaboration
EMNLP 2025
CoRAG: Enhancing Hybrid Retrieval-Augmented Generation through a Cooperative Retriever Architecture
EMNLP 2025
Reasoning of Large Language Models over Knowledge Graphs with Super-Relations
ICLR 2025
The Source Image is the Best Attention for Infrared and Visible Image Fusion
ICCV 2025
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations
ICCV 2025
SAM4D: Segment Anything in Camera and LiDAR Streams
ICCV 2025
Monocular Semantic Scene Completion via Masked Recurrent Networks
ICCV 2025
From Cross-Task Examples to In-Task Prompts: A Graph-Based Pseudo-Labeling Framework for In-context Learning
EMNLP 2025
FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference
EMNLP 2025
Interpreting Pretrained Language Models via Concept Bottlenecks (Extended Abstract)
IJCAI 2025
Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction
CVPR 2025
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
CVPR 2025
PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
CVPR 2025
Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
CVPR 2025
Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration
CVPR 2025
From Implicit Exploration to Structured Reasoning: Guideline and Refinement for LLMs
EMNLP 2025
Reliable and Calibrated Semantic Occupancy Prediction by Hybrid Uncertainty Learning
IJCAI 2025
DIIN: Diffusion Iterative Implicit Networks for Arbitrary-scale Super-resolution
IJCAI 2025
Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service
NAACL 2025
Revisiting Graph Contrastive Learning on Anomaly Detection: A Structural Imbalance Perspective
AAAI 2025
BrainMAP: Learning Multiple Activation Pathways in Brain Networks
AAAI 2025
Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning
AAAI 2025
Tuning-Free Accountable Intervention for LLM Deployment – a Metacognitive Approach
AAAI 2025
Bias Unveiled: Investigating Social Bias in LLM-Generated Code
AAAI 2025
The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI)
AACL 2025
The Visual Counter Turing Test (VCT²): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (V_AI)
IJCNLP 2025
Question-Aware Knowledge Graph Prompting for Enhancing Large Language Models
ACL 2025
MAPLE: Many-Shot Adaptive Pseudo-Labeling for In-Context Learning
ICML 2025
Integrative Decoding: Improving Factuality via Implicit Self-consistency
ICLR 2025
Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective
ICLR 2025
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance
ICLR 2025
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
ICLR 2025
Large Language Models for Data Annotation and Synthesis: A Survey
EMNLP 2024
EINet: Point Cloud Completion via Extrapolation and Interpolation
ECCV 2024
SAIR: Learning Semantic-aware Implicit Representation
ECCV 2024
From a Bird's Eye View to See: Joint Camera and Subject Registration without the Camera Calibration
CVPR 2024
Label-efficient Semantic Scene Completion with Scribble Annotations
IJCAI 2024
Mixture of Demonstrations for In-Context Learning
NIPS 2024
Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models
EMNLP 2024
Few-shot Knowledge Graph Relational Reasoning via Subgraph Adaptation
NAACL 2024
Orthogonal Dictionary Guided Shape Completion Network for Point Cloud
AAAI 2024
MGMap: Mask-Guided Learning for Online Vectorized HD Map Construction
CVPR 2024
Knowledge Graph-Enhanced Large Language Models via Path Selection
ACL 2024
FastGAS: Fast Graph-based Annotation Selection for In-Context Learning
ACL 2024
Bidirectional Autoregessive Diffusion Model for Dance Generation
CVPR 2024
Not All Voxels Are Equal: Hardness-Aware Semantic Scene Completion with Self-Distillation
CVPR 2024
CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification
ICCV 2023
Label-efficient Segmentation via Affinity Propagation
NIPS 2023
Parametric Surface Constrained Upsampler Network for Point Cloud
AAAI 2023
Few-Shot 3D Point Cloud Semantic Segmentation via Stratified Class-Specific Attention Based Transformer Network
AAAI 2023
Interpreting Unfairness in Graph Neural Networks via Training Node Attribution
AAAI 2023
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
ACL 2023
Joint Generator-Ranker Learning for Natural Language Generation
ACL 2023
LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation
CVPR 2023
Noise-Robust Fine-Tuning of Pretrained Language Models via External Guidance
EMNLP 2023
LMGQS: A Large-scale Dataset for Query-focused Summarization
EMNLP 2023
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
ICCV 2023
Leveraging Inpainting for Single-Image Shadow Removal
ICCV 2023
Self-Supervised Social Relation Representation for Human Group Detection
ECCV 2022
Graph Few-shot Learning with Task-specific Structures
NIPS 2022
Background-Insensitive Scene Text Recognition with Text Semantic Segmentation
ECCV 2022
Rethinking Video Rain Streak Removal: A New Synthesis Model and a Deraining Network with Video Rain Prior
ECCV 2022
Style-Guided Shadow Removal
ECCV 2022
Panoramic Human Activity Recognition
ECCV 2022
SiamDoGe: Domain Generalizable Semantic Segmentation Using Siamese Network
ECCV 2022
MISF: Multi-Level Interactive Siamese Filtering for High-Fidelity Image Inpainting
CVPR 2022
Can You Spot the Chameleon? Adversarially Camouflaging Images From Co-Salient Object Detection
CVPR 2022
An End-to-End Dialogue Summarization System for Sales Calls
NAACL 2022
Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation
AAAI 2022
DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation
ACL 2022
FAITH: Few-Shot Graph Classification with Hierarchical Task Graphs
IJCAI 2022
Connecting the Complementary-View Videos: Joint Camera Identification and Subject Association
CVPR 2022
Is It Necessary to Transfer Temporal Knowledge for Domain Adaptive Video Semantic Segmentation?
ECCV 2022
Deep Poisoning: Towards Robust Image Data Sharing Against Visual Disclosure
WACV 2021
VIL-100: A New Dataset and a Baseline Model for Video Instance Lane Detection
ICCV 2021
From Shadow Generation To Shadow Removal
CVPR 2021
DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation
CVPR 2021
Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-Balanced Samplings
CVPR 2021
Auto-Exposure Fusion for Single-Image Shadow Removal
CVPR 2021
Multi-Domain Multi-Task Rehearsal for Lifelong Learning
AAAI 2021
Binaural Audio-Visual Localization
AAAI 2021
Hierarchical Heterogeneous Graph Representation Learning for Short Text Classification
EMNLP 2021
A Multi-Task Mean Teacher for Semi-Supervised Shadow Detection
CVPR 2020
Multi-Spectral Salient Object Detection by Adversarial Domain Adaptation
AAAI 2020
SalSAC: A Video Saliency Prediction Model with Shuffled Attentions and Correlation-Based ConvLSTM
AAAI 2020
Complementary-View Multiple Human Tracking
AAAI 2020
Multi-Type Self-Attention Guided Degraded Saliency Detection
AAAI 2020
Semantic Stereo Matching With Pyramid Cost Volumes
ICCV 2019
Goal-Oriented End-to-End Conversational Models with Profile Features in a Real-World Setting
NAACL 2019
Spatial Correspondence With Generative Adversarial Network: Learning Depth From Monocular Videos
ICCV 2019
Visual Attention Consistency Under Image Transforms for Multi-Label Image Classification
CVPR 2019
Does Haze Removal Help CNN-based Image Classification?
ECCV 2018
Learning View-Invariant Features for Person Identification in Temporally Synchronized Videos Taken by Wearable Cameras
ICCV 2017
Learning Dynamic Siamese Network for Visual Object Tracking
ICCV 2017
Groupwise Tracking of Crowded Similar-Appearance Targets From Low-Continuity Image Sequences
CVPR 2016
Combining Local Appearance and Holistic View: Dual-Source Deep Neural Networks for Human Pose Estimation
CVPR 2015
Simple Atom Selection Strategy for Greedy Matrix Completion
IJCAI 2015
Co-Interest Person Detection From Multiple Wearable Camera Videos
ICCV 2015
Recognize Human Activities from Partially Observed Videos
CVPR 2013