Wengang Zhou
98 papers · 2014–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π Academic Marathon (11) π Conference Polyglot (12) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (9)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(9)
πΊοΈ
Taxonomy Completionist
(132)
π
Conference Loyalist
(25)
π
Triple Crown
π
Grand Slam
π
Keyword Champion
(2)
π€
Dynamic Duo
(86)
π¬
Deep Specialist
(15)
ποΈ
Keyword Collector
(393)
π
Century Club
(95)
π
Trend Setter
π₯
Unstoppable
(8)
β
The Questioner
β‘
Prolific Year
(16)
π
Conference Pioneer
Conferences
CVPR (25)
AAAI (18)
ICCV (14)
NIPS (9)
ECCV (8)
ICLR (8)
ACL (5)
IJCAI (4)
COLING (2)
EMNLP (2)
ICML (2)
UAI (1)
Top co-authors
Keywords
representation learning
(9)
semantic segmentation
(6)
sign language recognition
(6)
transformer architecture
(5)
weakly supervised learning
(5)
multi-agent reinforcement learning
(5)
video understanding
(5)
semi-supervised learning
(4)
image generation
(4)
contrastive learning
(4)
image retrieval
(4)
metric learning
(4)
feature representation
(4)
diffusion model
(4)
pose estimation
(3)
reinforcement learning
(3)
unsupervised learning
(3)
feature embedding
(3)
self-supervised learning
(3)
model compression
(3)
Papers
DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding
AAAI 2026
Bias Fitting to Mitigate Length Bias of Reward Model in RLHF
ACL 2026
Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling
AAAI 2026
Controllable Style Arithmetic with Language Models
ACL 2025
Incremental Transformer: Efficient Encoder for Incremented Text Over MRC and Conversation Tasks
COLING 2025
Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments
ICCV 2025
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation
CVPR 2025
Self-Classification Enhancement and Correction for Weakly Supervised Object Detection
IJCAI 2025
Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters
CVPR 2025
Robust Multimodal Large Language Models Against Modality Conflict
ICML 2025
EG4D: Explicit Generation of 4D Object without Score Distillation
ICLR 2025
Uni-Sign: Toward Unified Sign Language Understanding at Scale
ICLR 2025
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
CVPR 2025
SmartEraser: Remove Anything from Images using Masked-Region Guidance
CVPR 2025
Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models
AAAI 2025
I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models
CVPR 2025
Aligning Global Semantics and Local Textures in Generative Video Enhancement
ICCV 2025
TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy
NIPS 2024
Semi-Supervised Spoken Language Glossification
ACL 2024
Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval
ICLR 2024
Sinkhorn Distance Minimization for Knowledge Distillation
COLING 2024
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?
EMNLP 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
ECCV 2024
Revisiting Open-Set Panoptic Segmentation
AAAI 2024
Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution
CVPR 2024
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
ICML 2024
Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation
CVPR 2024
SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning
AAAI 2024
State Sequences Prediction via Fourier Transform for Representation Learning
NIPS 2023
DIFFER:Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning
NIPS 2023
Learning robust representation for reinforcement learning with distractions by reward sequence prediction
UAI 2023
MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning
IJCAI 2023
Multi-Agent First Order Constrained Optimization in Policy Space
NIPS 2023
Low-Light Video Enhancement with Synthetic Event Guidance
AAAI 2023
BEST: BERT Pre-training for Sign Language Recognition with Coupling Tokenization
AAAI 2023
Hybrid and Collaborative Passage Reranking
ACL 2023
A General Rank Preserving Framework for Asymmetric Image Retrieval
ICLR 2023
$\mathcal{O}$-GNN: incorporating ring priors into molecular modeling
ICLR 2023
Making Better Decision by Directly Planning in Continuous Control
ICLR 2023
Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection
ICCV 2023
Masked Motion Predictors are Strong 3D Action Representation Learners
ICCV 2023
Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation
ICCV 2023
SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
ICCV 2023
DIRE for Diffusion-Generated Image Detection
ICCV 2023
Sign Language Translation with Iterative Prototype
ICCV 2023
CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection
NIPS 2023
Hierarchical Multi-Agent Skill Discovery
NIPS 2023
AnchorFormer: Point Cloud Completion From Discriminative Nodes
CVPR 2023
Asymmetric Feature Fusion for Image Retrieval
CVPR 2023
AltFreezing for More General Video Face Forgery Detection
CVPR 2023
HandNeRF: Neural Radiance Fields for Animatable Interacting Hands
CVPR 2023
MVP: Multimodality-Guided Visual Pre-training
ECCV 2022
LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning
NIPS 2022
Hand-Object Interaction Image Generation
NIPS 2022
Learning Token-Based Representation for Image Retrieval
AAAI 2022
Uformer: A General U-Shaped Transformer for Image Restoration
CVPR 2022
Contextual Similarity Distillation for Asymmetric Image Retrieval
CVPR 2022
Domain-Agnostic Prior for Transfer Semantic Segmentation
CVPR 2022
CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation
ECCV 2022
TAPE: Task-Agnostic Prior Embedding for Image Restoration
ECCV 2022
CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds
ECCV 2022
Geometric Representation Learning for Document Image Rectification
ECCV 2022
Instance Mining with Class Feature Banks for Weakly Supervised Object Detection
AAAI 2021
Contrastive Transformation for Self-supervised Correspondence Learning
AAAI 2021
IOT: Instance-wise Layer Reordering for Transformer Structures
ICLR 2021
Contextual Similarity Aggregation with Self-attention for Visual Re-ranking
NIPS 2021
Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
AAAI 2021
Joint Inductive and Transductive Learning for Video Object Segmentation
ICCV 2021
SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition
ICCV 2021
Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation
ICCV 2021
Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval
ICCV 2021
TransVG: End-to-End Visual Grounding With Transformers
ICCV 2021
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
CVPR 2021
ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation
CVPR 2021
Model-Aware Gesture-to-Gesture Translation
CVPR 2021
Improving Sign Language Translation With Monolingual Data by Sign Back-Translation
CVPR 2021
Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding
EMNLP 2021
Hand-Model-Aware Sign Language Recognition
AAAI 2021
Incorporating BERT into Neural Machine Translation
ICLR 2020
Wavelet-Based Dual-Branch Network for Image DemoirΓ©ing
ECCV 2020
Transformation GAN for Unsupervised Image Synthesis and Representation Learning
CVPR 2020
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
AAAI 2020
POST: POlicy-Based Switch Tracking
AAAI 2020
Relation-Guided Spatial Attention and Temporal Refinement for Video-Based Person Re-Identification
AAAI 2020
Attentive Experience Replay
AAAI 2020
Soft Contextual Data Augmentation for Neural Machine Translation
ACL 2019
Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification
AAAI 2019
Iterative Alignment Network for Continuous Sign Language Recognition
CVPR 2019
Re2EMA: Regularized and Reinitialized Exponential Moving Average for Target Model Update in Object Tracking
AAAI 2019
Relation Distillation Networks for Video Object Detection
ICCV 2019
Unsupervised Deep Tracking
CVPR 2019
Improving Deep Neural Network Sparsity through Decorrelation Regularization
IJCAI 2018
Multi-Cue Correlation Filters for Robust Visual Tracking
CVPR 2018
Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition
IJCAI 2018
Affinity Derivation and Graph Merge for Instance Segmentation
ECCV 2018
Picking Deep Filter Responses for Fine-Grained Image Recognition
CVPR 2016
SOM: Semantic Obviousness Metric for Image Quality Assessment
CVPR 2015
Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval
CVPR 2014