Xiaodong Gu
30 papers · 2014–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (5) π Conference Polyglot (9) π Academic Marathon (11) πΊοΈ Taxonomy Completionist (50)
π£
Hot Topic Early Bird
π
Conference Polyglot
(9)
π
Academic Marathon
(11)
π€
Dynamic Duo
(13)
β‘
Prolific Year
(7)
π
Century Club
(29)
ποΈ
Keyword Collector
(127)
π₯
Unstoppable
(7)
Conferences
CVPR (10)
ICLR (4)
COLING (3)
ECCV (3)
ICCV (3)
AAAI (2)
EMNLP (2)
NIPS (2)
IJCAI (1)
Top co-authors
Keywords
depth estimation
(2)
animatable avatar
(2)
large language model
(2)
vision-language model
(2)
transformer architecture
(2)
dialogue system
(2)
3d gaussian splatting
(2)
cost volume
(2)
vision transformer
(1)
3d shape generation
(1)
adversarial learning
(1)
motion estimation
(1)
attention mechanism
(1)
data augmentation
(1)
contrastive learning
(1)
point cloud registration
(1)
text generation
(1)
pose estimation
(1)
metric learning
(1)
response generation
(1)
Papers
Anti-adversarial Learning: Desensitizing Prompts for Large Language Model
AAAI 2026
MMRL: Multi-Modal Representation Learning for Vision-Language Models
CVPR 2025
AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction
CVPR 2025
Motions as Queries: One-Stage Multi-Person Holistic Human Motion Capture
CVPR 2025
Transplant Then Regenerate: A New Paradigm for Text Data Augmentation
EMNLP 2025
LastingBench: Defend Benchmarks Against Knowledge Leakage
EMNLP 2025
LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds
ICCV 2025
LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning
ICLR 2025
Surface-Centric Modeling for High-Fidelity Generalizable Neural Surface Reconstruction
ECCV 2024
MoGenTS: Motion Generation based on Spatial-Temporal Joint Modeling
NIPS 2024
High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial Decoding
ECCV 2024
An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes
ECCV 2024
RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D
CVPR 2024
GPLD3D: Latent Diffusion of 3D Shape Generative Models by Enforcing Geometric and Physical Priors
CVPR 2024
JoAPR: Cleaning the Lens of Prompt Learning for Vision-Language Models
CVPR 2024
Monocular Scene Reconstruction with 3D SDF Transformers
ICLR 2023
DENSE RGB SLAM WITH NEURAL IMPLICIT MAPS
ICLR 2023
GenS: Generalizable Neural Surface Reconstruction from Multi-View Images
NIPS 2023
Neural Window Fully-Connected CRFs for Monocular Depth Estimation
CVPR 2022
RCP: Recurrent Closest Point for Point Cloud
CVPR 2022
Building Joint Relationship Attention Network for Image-Text Generation
COLING 2022
Continuous Decomposition of Granularity for Neural Paraphrase Generation
COLING 2022
UTC: A Unified Transformer With Inter-Task Contrastive Learning for Visual Dialog
CVPR 2022
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank Utterances
AAAI 2021
Cascade Cost Volume for High-Resolution Multi-View Stereo and Stereo Matching
CVPR 2020
DialogWAE: Multimodal Response Generation with Conditional Wasserstein Auto-Encoder
ICLR 2019
Batch DropBlock Network for Person Re-Identification and Beyond
ICCV 2019
Attribute-Driven Spontaneous Motion in Unpaired Image Translation
ICCV 2019
DeepAM: Migrate APIs with Multi-modal Sequence to Sequence Learning
IJCAI 2017
Reducing Over-Weighting in Supervised Term Weighting for Sentiment Analysis
COLING 2014