Errui Ding
102 papers · 2017–2025 · 9 top CS/AI conferences
Achievements
Conference Polyglot (9)
Academic Marathon (8)
Interdisciplinary Bridge
Keyword Pioneer
Cross-Pollinator (8)
Renaissance Researcher (6)
Taxonomy Completionist (109)
Conference Loyalist (34)
Deep Specialist (18)
Dynamic Duo (51)
Grand Slam
Keyword Champion (2)
Conference Pioneer
Unstoppable (9)
Trend Setter
Keyword Collector (408)
Century Club (102)
Prolific Year (22)
Conferences
CVPR (34)
ICCV (20)
ECCV (17)
AAAI (11)
NIPS (10)
ICLR (4)
ICML (2)
IJCAI (2)
WACV (2)
Research topics
Keywords
object detection (9)
image generation (7)
3d object detection (6)
convolutional neural network (6)
semantic segmentation (5)
self-supervised learning (4)
diffusion model (4)
pseudo label (4)
point cloud (4)
knowledge distillation (4)
contrastive learning (4)
vision transformer (4)
depth estimation (4)
few-shot learning (4)
attention mechanism (4)
temporal modeling (4)
video generation (4)
domain adaptation (4)
feature representation (4)
representation learning (3)
Papers
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
CVPR 2025
TexGarment: Consistent Garment UV Texture Generation via Efficient 3D Structure-Guided Diffusion Transformer
CVPR 2025
Uni$^2$Det: Unified and Universal Framework for Prompt-Guided Multi-dataset 3D Detection
ICLR 2025
Splatter-360: Generalizable 360 Gaussian Splatting for Wide-baseline Panoramic Images
CVPR 2025
AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
CVPR 2025
TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting
CVPR 2025
MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction
ICLR 2025
Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization
ICML 2025
Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models
AAAI 2025
KD-DETR: Knowledge Distillation for Detection Transformer with Consistent Distillation Points Sampling
CVPR 2024
TexOct: Generating Textures of 3D Models with Octree-based Diffusion
CVPR 2024
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection
CVPR 2024
VRP-SAM: SAM with Visual Reference Prompt
CVPR 2024
MS-DETR: Efficient DETR Training with Mixed Supervision
CVPR 2024
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
ECCV 2024
OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
ECCV 2024
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction
ECCV 2024
Interactive 3D Object Detection with Prompts
ECCV 2024
Multi-Domain Incremental Learning for Face Presentation Attack Detection
AAAI 2024
GGRt: Towards Generalizable 3D Gaussians without Pose Priors in Real-Time
ECCV 2024
HD-Fusion: Detailed Text-to-3D Generation Leveraging Multiple Noise Estimation
WACV 2024
OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding
NIPS 2024
ShowMaker: Creating High-Fidelity 2D Human Video via Fine-Grained Diffusion Modeling
NIPS 2024
Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding
NIPS 2024
Towards Unified Multi-granularity Text Detection with Interactive Attention
ICML 2024
CFCG: Semi-Supervised Semantic Segmentation via Cross-Fusion and Contour Guidance Supervision
ICCV 2023
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection
ICCV 2023
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation
ICCV 2023
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
ICCV 2023
Semi-DETR: Semi-Supervised Object Detection With Detection Transformers
CVPR 2023
Graph Contrastive Learning for Skeleton-based Action Recognition
ICLR 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
NIPS 2023
Ambiguity-Resistant Semi-Supervised Learning for Dense Object Detection
CVPR 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator
CVPR 2023
CAPE: Camera View Position Embedding for Multi-View 3D Object Detection
CVPR 2023
StereoDistill: Pick the Cream from LiDAR for Distilling Stereo-Based 3D Object Detection
AAAI 2023
PSVT: End-to-End Multi-Person 3D Pose and Shape Estimation With Progressive Video Transformers
CVPR 2023
Cyclically Disentangled Feature Translation for Face Anti-spoofing
AAAI 2023
Effective Invertible Arbitrary Image Rescaling
WACV 2023
Robust Video Portrait Reenactment via Personalized Representation Quantization
AAAI 2023
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
ICLR 2023
Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement
ICCV 2023
Forward Flow for Novel View Synthesis of Dynamic Scenes
ICCV 2023
LMR: A Large-Scale Multi-Reference Dataset for Reference-Based Super-Resolution
ICCV 2023
Neural Color Operators for Sequential Image Retouching
ECCV 2022
Delving into Sequential Patches for Deepfake Detection
NIPS 2022
RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer
NIPS 2022
Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
NIPS 2022
MobileFaceSwap: A Lightweight Framework for Video Face Swapping
AAAI 2022
Human-Object Interaction Detection via Disentangled Transformer
CVPR 2022
Few-Shot Head Swapping in the Wild
CVPR 2022
Few-Shot Font Generation by Learning Fine-Grained Local Styles
CVPR 2022
MixFormer: Mixing Features Across Windows and Dimensions
CVPR 2022
Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence
CVPR 2022
Expressive Talking Head Generation With Granular Audio-Visual Control
CVPR 2022
Implicit Sample Extension for Unsupervised Person Re-Identification
CVPR 2022
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
CVPR 2022
Rope3D: The Roadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
CVPR 2022
Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
CVPR 2022
GitNet: Geometric Prior-Based Transformation for Birds-Eye-View Segmentation
ECCV 2022
Action Quality Assessment with Temporal Parsing Transformer
ECCV 2022
StyleSwap: Style-Based Generator Empowers Robust Face Swapping
ECCV 2022
UFO: Unified Feature Optimization
ECCV 2022
Diverse Learner: Exploring Diverse Supervision for Semi-Supervised Object Detection
ECCV 2022
CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval
ECCV 2022
Self-Guided Hard Negative Generation for Unsupervised Person Re-Identification
IJCAI 2022
Dual-stream Network for Visual Recognition
NIPS 2021
Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer
CVPR 2021
DOLG: Single-Stage Image Retrieval With Deep Orthogonal Fusion of Local and Global Features
ICCV 2021
Paint Transformer: Feed Forward Neural Painting With Stroke Prediction
ICCV 2021
EC-DARTS: Inducing Equalized and Consistent Optimization Into DARTS
ICCV 2021
The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection
ICCV 2021
AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer
ICCV 2021
ASCNet: Self-Supervised Video Representation Learning With Appearance-Speed Consistency
ICCV 2021
Revealing the Reciprocal Relations Between Self-Supervised Stereo and Monocular Depth Estimation
ICCV 2021
Dynamic Class Queue for Large Scale Face Recognition in the Wild
CVPR 2021
Unsupervised Multi-Source Domain Adaptation for Person Re-Identification
CVPR 2021
FaceController: Controllable Attribute Editing for Face in the Wild
AAAI 2021
MVFNet: Multi-View Fusion Network for Efficient Video Recognition
AAAI 2021
PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network
AAAI 2021
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video
IJCAI 2021
Segment as Points for Efficient Online Multi-Object Tracking and Segmentation
ECCV 2020
Dynamic Instance Normalization for Arbitrary Style Transfer
AAAI 2020
Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection
CVPR 2020
Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
ECCV 2020
Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching
NIPS 2020
ZoomNet: Part-Aware Adaptive Zooming Neural Network for 3D Object Detection
AAAI 2020
Towards Accurate Scene Text Recognition With Semantic Reasoning Networks
CVPR 2020
Monocular 3D Object Detection via Feature Domain Adaptation
ECCV 2020
Attentive Feedback Network for Boundary-Aware Salient Object Detection
CVPR 2019
Chinese Street View Text: Large-Scale Chinese Text Reading With Partially Supervised Learning
ICCV 2019
Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes
CVPR 2019
A Mutual Learning Method for Salient Object Detection With Intertwined Multi-Supervision
CVPR 2019
STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
CVPR 2019
Perspective-Guided Convolution Networks for Crowd Counting
ICCV 2019
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
ICCV 2019
ACFNet: Attentional Class Feature Network for Semantic Segmentation
ICCV 2019
Image Inpainting With Learnable Bidirectional Attention Maps
ICCV 2019
Multi-Attention Multi-Class Constraint for Fine-grained Image Recognition
ECCV 2018
Fine-grained Video Categorization with Redundancy Reduction Attention
ECCV 2018
Compact Generalized Non-local Network
NIPS 2018
WordSup: Exploiting Word Annotations for Character Based Text Detection
ICCV 2017