Wengang Zhou

98 papers · 2014–2026 · 12 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🏃 Academic Marathon (11) 🌍 Conference Polyglot (12) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (9)

🐝 Cross-Pollinator (9) 🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (132) 🏠 Conference Loyalist (25) 👑 Triple Crown 🏆 Grand Slam 🏆 Keyword Champion (2) 🤝 Dynamic Duo (86) 🔬 Deep Specialist (15) 🗃️ Keyword Collector (393) 💎 Century Club (95) 📈 Trend Setter 🔥 Unstoppable (8) ❓ The Questioner ⚡ Prolific Year (16) 🚀 Conference Pioneer

Conferences

CVPR (25) AAAI (18) ICCV (14) NIPS (9) ECCV (8) ICLR (8) ACL (5) IJCAI (4) COLING (2) EMNLP (2) ICML (2) UAI (1)

Top co-authors

Houqiang Li (89) Qi Tian (13) Jiajun Deng (12) Min Wang (11) Hezhen Hu (8) Jinhua Zhu (8) Li Li (7) Weichao Zhao (6) Hui Wu (6) Jianmin Bao (6)

Keywords

representation learning (9) semantic segmentation (6) sign language recognition (6) transformer architecture (5) weakly supervised learning (5) multi-agent reinforcement learning (5) video understanding (5) semi-supervised learning (4) image generation (4) contrastive learning (4) image retrieval (4) metric learning (4) feature representation (4) diffusion model (4) pose estimation (3) reinforcement learning (3) unsupervised learning (3) feature embedding (3) self-supervised learning (3) model compression (3)

Papers

DocR1: Evidence Page-Guided GRPO for Multi-Page Document Understanding AAAI 2026 Bias Fitting to Mitigate Length Bias of Reward Model in RLHF ACL 2026 Rethinking Long-tailed Dataset Distillation: A Uni-Level Framework with Unbiased Recovery and Relabeling AAAI 2026 Controllable Style Arithmetic with Language Models ACL 2025 Incremental Transformer: Efficient Encoder for Incremented Text Over MRC and Conversation Tasks COLING 2025 Active Perception Meets Rule-Guided RL: A Two-Phase Approach for Precise Object Navigation in Complex Environments ICCV 2025 OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation CVPR 2025 Self-Classification Enhancement and Correction for Weakly Supervised Object Detection IJCAI 2025 Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters CVPR 2025 Robust Multimodal Large Language Models Against Modality Conflict ICML 2025 EG4D: Explicit Generation of 4D Object without Score Distillation ICLR 2025 Uni-Sign: Toward Unified Sign Language Understanding at Scale ICLR 2025 DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models CVPR 2025 SmartEraser: Remove Anything from Images using Masked-Region Guidance CVPR 2025 Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models AAAI 2025 I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models CVPR 2025 Aligning Global Semantics and Local Textures in Generative Video Enhancement ICCV 2025 TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy NIPS 2024 Semi-Supervised Spoken Language Glossification ACL 2024 Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval ICLR 2024 Sinkhorn Distance Minimization for Knowledge Distillation COLING 2024 BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language? EMNLP 2024 Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis ECCV 2024 Revisiting Open-Set Panoptic Segmentation AAAI 2024 Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution CVPR 2024 Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning ICML 2024 Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation CVPR 2024 SUF: Stabilized Unconstrained Fine-Tuning for Offline-to-Online Reinforcement Learning AAAI 2024 State Sequences Prediction via Fourier Transform for Representation Learning NIPS 2023 DIFFER:Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning NIPS 2023 Learning robust representation for reinforcement learning with distractions by reward sequence prediction UAI 2023 MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning IJCAI 2023 Multi-Agent First Order Constrained Optimization in Policy Space NIPS 2023 Low-Light Video Enhancement with Synthetic Event Guidance AAAI 2023 BEST: BERT Pre-training for Sign Language Recognition with Coupling Tokenization AAAI 2023 Hybrid and Collaborative Passage Reranking ACL 2023 A General Rank Preserving Framework for Asymmetric Image Retrieval ICLR 2023 $\mathcal{O}$-GNN: incorporating ring priors into molecular modeling ICLR 2023 Making Better Decision by Directly Planning in Continuous Control ICLR 2023 Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection ICCV 2023 Masked Motion Predictors are Strong 3D Action Representation Learners ICCV 2023 Focus on Your Target: A Dual Teacher-Student Framework for Domain-Adaptive Semantic Segmentation ICCV 2023 SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning ICCV 2023 DIRE for Diffusion-Generated Image Detection ICCV 2023 Sign Language Translation with Iterative Prototype ICCV 2023 CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection NIPS 2023 Hierarchical Multi-Agent Skill Discovery NIPS 2023 AnchorFormer: Point Cloud Completion From Discriminative Nodes CVPR 2023 Asymmetric Feature Fusion for Image Retrieval CVPR 2023 AltFreezing for More General Video Face Forgery Detection CVPR 2023 HandNeRF: Neural Radiance Fields for Animatable Interacting Hands CVPR 2023 MVP: Multimodality-Guided Visual Pre-training ECCV 2022 LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning NIPS 2022 Hand-Object Interaction Image Generation NIPS 2022 Learning Token-Based Representation for Image Retrieval AAAI 2022 Uformer: A General U-Shaped Transformer for Image Restoration CVPR 2022 Contextual Similarity Distillation for Asymmetric Image Retrieval CVPR 2022 Domain-Agnostic Prior for Transfer Semantic Segmentation CVPR 2022 CMD: Self-Supervised 3D Action Representation Learning with Cross-Modal Mutual Distillation ECCV 2022 TAPE: Task-Agnostic Prior Embedding for Image Restoration ECCV 2022 CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds ECCV 2022 Geometric Representation Learning for Document Image Rectification ECCV 2022 Instance Mining with Class Feature Banks for Weakly Supervised Object Detection AAAI 2021 Contrastive Transformation for Self-supervised Correspondence Learning AAAI 2021 IOT: Instance-wise Layer Reordering for Transformer Structures ICLR 2021 Contextual Similarity Aggregation with Self-attention for Visual Re-ranking NIPS 2021 Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection AAAI 2021 Joint Inductive and Transductive Learning for Video Object Segmentation ICCV 2021 SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition ICCV 2021 Instance-Wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation ICCV 2021 Learning Deep Local Features With Multiple Dynamic Attentions for Large-Scale Image Retrieval ICCV 2021 TransVG: End-to-End Visual Grounding With Transformers ICCV 2021 Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking CVPR 2021 ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation CVPR 2021 Model-Aware Gesture-to-Gesture Translation CVPR 2021 Improving Sign Language Translation With Monolingual Data by Sign Back-Translation CVPR 2021 Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding EMNLP 2021 Hand-Model-Aware Sign Language Recognition AAAI 2021 Incorporating BERT into Neural Machine Translation ICLR 2020 Wavelet-Based Dual-Branch Network for Image Demoiréing ECCV 2020 Transformation GAN for Unsupervised Image Synthesis and Representation Learning CVPR 2020 Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition AAAI 2020 POST: POlicy-Based Switch Tracking AAAI 2020 Relation-Guided Spatial Attention and Temporal Refinement for Video-Based Person Re-Identification AAAI 2020 Attentive Experience Replay AAAI 2020 Soft Contextual Data Augmentation for Neural Machine Translation ACL 2019 Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification AAAI 2019 Iterative Alignment Network for Continuous Sign Language Recognition CVPR 2019 Re2EMA: Regularized and Reinitialized Exponential Moving Average for Target Model Update in Object Tracking AAAI 2019 Relation Distillation Networks for Video Object Detection ICCV 2019 Unsupervised Deep Tracking CVPR 2019 Improving Deep Neural Network Sparsity through Decorrelation Regularization IJCAI 2018 Multi-Cue Correlation Filters for Robust Visual Tracking CVPR 2018 Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition IJCAI 2018 Affinity Derivation and Graph Merge for Instance Segmentation ECCV 2018 Picking Deep Filter Responses for Fine-Grained Image Recognition CVPR 2016 SOM: Semantic Obviousness Metric for Image Quality Assessment CVPR 2015 Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval CVPR 2014