Errui Ding

102 papers · 2017–2025 · 9 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (9) 🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (8)

🐝 Cross-Pollinator (8) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (109) 🏠 Conference Loyalist (34) 🔬 Deep Specialist (18) 🤝 Dynamic Duo (51) 🏆 Grand Slam 🏆 Keyword Champion (2) 🚀 Conference Pioneer 🔥 Unstoppable (9) 📈 Trend Setter 🗃️ Keyword Collector (408) 💎 Century Club (102) ⚡ Prolific Year (22)

Conferences

CVPR (34) ICCV (20) ECCV (17) AAAI (11) NIPS (10) ICLR (4) ICML (2) IJCAI (2) WACV (2)

Top co-authors

Jingdong Wang (51) Junyu Han (31) Xiao Tan (25) Haocheng Feng (22) Jingtuo Liu (18) Jian Wang (15) Dongliang He (14) Xiaoqing Ye (12) Hang Zhou (11) Shilei Wen (11)

Research topics

Computer Vision (1)

Keywords

object detection (9) image generation (7) 3d object detection (6) convolutional neural network (6) semantic segmentation (5) self-supervised learning (4) diffusion model (4) pseudo label (4) point cloud (4) knowledge distillation (4) contrastive learning (4) vision transformer (4) depth estimation (4) few-shot learning (4) attention mechanism (4) temporal modeling (4) video generation (4) domain adaptation (4) feature representation (4) representation learning (3)

Papers

Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model CVPR 2025 TexGarment: Consistent Garment UV Texture Generation via Efficient 3D Structure-Guided Diffusion Transformer CVPR 2025 Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection ICLR 2025 Splatter-360: Generalizable 360 Gaussian Splatting for Wide-baseline Panoramic Images CVPR 2025 AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers CVPR 2025 TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting CVPR 2025 MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction ICLR 2025 Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization ICML 2025 Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models AAAI 2025 KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling CVPR 2024 TexOct: Generating Textures of 3D Models with Octree-based Diffusion CVPR 2024 Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection CVPR 2024 VRP-SAM: SAM with Visual Reference Prompt CVPR 2024 MS-DETR: Efficient DETR Training with Mixed Supervision CVPR 2024 ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer ECCV 2024 OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection ECCV 2024 LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction ECCV 2024 Interactive 3D Object Detection with Prompts ECCV 2024 Multi-Domain Incremental Learning for Face Presentation Attack Detection AAAI 2024 GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time ECCV 2024 HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation WACV 2024 OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding NIPS 2024 ShowMaker: Creating High-Fidelity 2D Human Video via Fine-Grained Diffusion Modeling NIPS 2024 Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding NIPS 2024 Towards Unified Multi-granularity Text Detection with Interactive Attention ICML 2024 CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision ICCV 2023 Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection ICCV 2023 Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation ICCV 2023 Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment ICCV 2023 Semi-DETR: Semi-Supervised Object Detection With Detection Transformers CVPR 2023 Graph Contrastive Learning for Skeleton-based Action Recognition ICLR 2023 HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception NIPS 2023 Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection CVPR 2023 StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator CVPR 2023 CAPE: Camera View Position Embedding for Multi-View 3D Object Detection CVPR 2023 StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-Based 3D Object Detection AAAI 2023 PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers CVPR 2023 Cyclically Disentangled Feature Translation for Face Anti-spoofing AAAI 2023 Effective Invertible Arbitrary Image Rescaling WACV 2023 Robust Video Portrait Reenactment via Personalized Representation Quantization AAAI 2023 StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training ICLR 2023 Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement ICCV 2023 Forward Flow for Novel View Synthesis of Dynamic Scenes ICCV 2023 LMR: A Large-Scale Multi-Reference Dataset for Reference-Based Super-Resolution ICCV 2023 Neural Color Operators for Sequential Image Retouching ECCV 2022 Delving into Sequential Patches for Deepfake Detection NIPS 2022 RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer NIPS 2022 Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning NIPS 2022 MobileFaceSwap: A Lightweight Framework for Video Face Swapping AAAI 2022 Human-Object Interaction Detection via Disentangled Transformer CVPR 2022 Few-Shot Head Swapping in the Wild CVPR 2022 Few-Shot Font Generation by Learning Fine-Grained Local Styles CVPR 2022 MixFormer: Mixing Features Across Windows and Dimensions CVPR 2022 Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence CVPR 2022 Expressive Talking Head Generation With Granular Audio-Visual Control CVPR 2022 Implicit Sample Extension for Unsupervised Person Re-Identification CVPR 2022 ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval CVPR 2022 Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task CVPR 2022 Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model CVPR 2022 GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation ECCV 2022 Action Quality Assessment with Temporal Parsing Transformer ECCV 2022 StyleSwap: Style-Based Generator Empowers Robust Face Swapping ECCV 2022 UFO: Unified Feature Optimization ECCV 2022 Diverse Learner: Exploring Diverse Supervision for Semi-Supervised Object Detection ECCV 2022 CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval ECCV 2022 Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification IJCAI 2022 Dual-stream Network for Visual Recognition NIPS 2021 Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer CVPR 2021 DOLG: Single-Stage Image Retrieval With Deep Orthogonal Fusion of Local and Global Features ICCV 2021 Paint Transformer: Feed Forward Neural Painting With Stroke Prediction ICCV 2021 EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS ICCV 2021 The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection ICCV 2021 AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer ICCV 2021 ASCNet: Self-Supervised Video Representation Learning With Appearance-Speed Consistency ICCV 2021 Revealing the Reciprocal Relations Between Self-Supervised Stereo and Monocular Depth Estimation ICCV 2021 Dynamic Class Queue for Large Scale Face Recognition in the Wild CVPR 2021 Unsupervised Multi-Source Domain Adaptation for Person Re-Identification CVPR 2021 FaceController: Controllable Attribute Editing for Face in the Wild AAAI 2021 MVFNet: Multi-View Fusion Network for Efficient Video Recognition AAAI 2021 PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network AAAI 2021 Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video IJCAI 2021 Segment as Points for Efficient Online Multi-Object Tracking and Segmentation ECCV 2020 Dynamic Instance Normalization for Arbitrary Style Transfer AAAI 2020 Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection CVPR 2020 Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement ECCV 2020 Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching NIPS 2020 ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection AAAI 2020 Towards Accurate Scene Text Recognition With Semantic Reasoning Networks CVPR 2020 Monocular 3D Object Detection via Feature Domain Adaptation ECCV 2020 Attentive Feedback Network for Boundary-Aware Salient Object Detection CVPR 2019 Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning ICCV 2019 Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes CVPR 2019 A Mutual Learning Method for Salient Object Detection With Intertwined Multi-Supervision CVPR 2019 STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing CVPR 2019 Perspective-Guided Convolution Networks for Crowd Counting ICCV 2019 BMN: Boundary-Matching Network for Temporal Action Proposal Generation ICCV 2019 ACFNet: Attentional Class Feature Network for Semantic Segmentation ICCV 2019 Image Inpainting With Learnable Bidirectional Attention Maps ICCV 2019 Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition ECCV 2018 Fine-grained Video Categorization with Redundancy Reduction Attention ECCV 2018 Compact Generalized Non-local Network NIPS 2018 WordSup: Exploiting Word Annotations for Character Based Text Detection ICCV 2017