Yunhong Wang
61 papers · 2017–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (12) πΊοΈ Taxonomy Completionist (11) π Interdisciplinary Bridge π Academic Marathon (8)
π
Cross-Pollinator
(13)
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π
Conference Loyalist
(22)
π
Grand Slam
π
Triple Crown
π€
Dynamic Duo
(16)
π¬
Deep Specialist
(10)
π§¬
Topic Evolution
π
Trend Setter
π
Conference Pioneer
ποΈ
Keyword Collector
(281)
β‘
Prolific Year
(12)
π
Century Club
(60)
π₯
Unstoppable
(9)
Conferences
CVPR (22)
AAAI (12)
ECCV (6)
ACL (4)
ICCV (4)
EMNLP (3)
IJCAI (3)
ICML (2)
NIPS (2)
ICLR (1)
NAACL (1)
UAI (1)
Top co-authors
Keywords
graph neural network
(5)
model compression
(5)
multimodal learning
(4)
video understanding
(4)
visual tracking
(4)
object detection
(4)
large language model
(4)
vision transformer
(4)
attention mechanism
(4)
contrastive learning
(3)
autonomous driving
(3)
self-supervised learning
(3)
post-training quantization
(3)
3d object detection
(3)
semantic segmentation
(2)
point cloud
(2)
depth estimation
(2)
video classification
(2)
multi-task learning
(2)
anomaly detection
(2)
Papers
Mem2Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation
ACL 2026
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
CVPR 2025
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
CVPR 2025
KwaiChat: A Large-Scale Video-Driven Multilingual Mixed-Type Dialogue Corpus
NAACL 2025
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
ICCV 2025
RepoDebug: Repository-Level Multi-Task and Multi-Language Debugging Evaluation of Large Language Models
EMNLP 2025
Weak2Wise: An Automated, Lightweight Framework for Weak-LLM-Friendly Reasoning Synthesis
EMNLP 2025
RETAIL: Towards Real-world Travel Planning for Large Language Models
EMNLP 2025
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
CVPR 2025
Multi-modal Deepfake Detection via Multi-task Audio-Visual Prompt Learning
AAAI 2025
Unified Knowledge Maintenance Pruning and Progressive Recovery with Weight Recalling for Large Vision-Language Models
AAAI 2025
GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection
AAAI 2025
TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models
AAAI 2025
GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art
ACL 2025
TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments
ACL 2025
FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation
CVPR 2025
ToolSpectrum: Towards Personalized Tool Utilization for Large Language Models
ACL 2025
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
NIPS 2024
Transforming Vision Transformer: Towards Efficient Multi-Task Asynchronous Learner
NIPS 2024
AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack
AAAI 2024
ActiveDC: Distribution Calibration for Active Finetuning
CVPR 2024
Leveraging Predicate and Triplet Learning for Scene Graph Generation
CVPR 2024
HIPTrack: Visual Tracking with Historical Prompts
CVPR 2024
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
ECCV 2024
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
ECCV 2024
AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer
ECCV 2024
Rotation Has Two Sides: Evaluating Data Augmentation for Deep One-class Classification
ICLR 2024
DSD-DA: Distillation-based Source Debiasing for Domain Adaptive Object Detection
ICML 2024
Understanding Heterophily for Graph Neural Networks
ICML 2024
Global-Local Characteristic Excited Cross-Modal Attacks from Images to Videos
AAAI 2023
Deepfake Video Detection via Facial Action Dependencies Estimation
AAAI 2023
Learning Discriminative Representations for Skeleton Based Action Recognition
CVPR 2023
SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection
ICCV 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
ICCV 2023
Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection
ICCV 2023
Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast
CVPR 2022
Lagrange Motion Analysis and View Embeddings for Improved Gait Recognition
CVPR 2022
Video Anomaly Detection by Solving Decoupled Spatio-Temporal Jigsaw Puzzles
ECCV 2022
SparseTT: Visual Tracking with Sparse Transformers
IJCAI 2022
PACE: Predictive and Contrastive Embedding for Unsupervised Action Segmentation
IJCAI 2022
PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection
AAAI 2021
MIEHDR CNN: Main Image Enhancement based Ghost-Free High Dynamic Range Imaging using Dual-Lens Systems
AAAI 2021
STMTrack: Template-Free Visual Tracking With Space-Time Memory Networks
CVPR 2021
Bi-GCN: Binary Graph Convolutional Network
CVPR 2021
Cross-View Gait Recognition With Deep Universal Linear Embeddings
CVPR 2021
Path-BN: Towards effective batch normalization in the Path Space for ReLU networks
UAI 2021
Multi-Scale Positive Sample Refinement for Few-Shot Object Detection
ECCV 2020
I4R: Promoting Deep Reinforcement Learning by the Indicator for Expressive Representations
IJCAI 2020
Cycle-CNN for Colorization towards Real Monochrome-Color Camera Systems
AAAI 2020
Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism
AAAI 2020
Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation
CVPR 2020
KE-GAN: Knowledge Embedded Generative Adversarial Networks for Semi-Supervised Scene Parsing
CVPR 2019
Led3D: A Lightweight and Efficient Deep Approach to Recognizing Low-Quality 3D Faces
CVPR 2019
Adaptive NMS: Refining Pedestrian Detection in a Crowd
CVPR 2019
Attentive Relational Networks for Mapping Images to Scene Graphs
CVPR 2019
Learning a Deep Convolutional Network for Colorization in Monochrome-Color Dual-Lens System
AAAI 2019
Learning Face Age Progression: A Pyramid Architecture of GANs
CVPR 2018
stagNet: An Attentive Semantic RNN for Group Activity Recognition
ECCV 2018
Binary Coding for Partial Action Analysis With Limited Observation Ratios
CVPR 2017
Fast Person Re-Identification via Cross-Camera Semantic Binary Transformation
CVPR 2017
Zero-Shot Action Recognition With Error-Correcting Output Codes
CVPR 2017