Dong Xu

63 papers · 2013–2026 · 12 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (12) 🏃 Academic Marathon (12) 🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (112) 🌟 Keyword Trendsetter Combo (5) 🏠 Conference Loyalist (31) 🌱 Topic Pioneer 🏆 Grand Slam 🏆 Keyword Champion (5) 🤝 Dynamic Duo (10) ⚡ Prolific Year (9) 💎 Century Club (62) 🗃️ Keyword Collector (265) 🔥 Unstoppable (13) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

CVPR (31) ECCV (9) ICCV (7) AAAI (6) NIPS (3) ACL (1) COLING (1) ICLR (1) ICML (1) IJCAI (1) MIDL (1) WACV (1)

Top co-authors

Wanli Ouyang (10) Wen Li (10) Guo Lu (10) Lu Sheng (10) Jing Zhang (8) Qian Yu (7) Zhihao Hu (7) Zhenghao Chen (5) Luping Zhou (4) Stephen Lin (3)

Keywords

video compression (6) point cloud (5) motion compensation (5) model compression (4) neural network optimization (4) neural network (4) weakly supervised learning (4) low-rank representation (3) 3d vision (3) image classification (3) channel pruning (3) multimodal learning (3) support vector machine (3) domain adaptation (3) convolutional neural network (3) visual grounding (2) feature alignment (2) entropy coding (2) unsupervised domain adaptation (2) object localization (2)

Papers

Learning Diffusion Policy from Primitive Skills for Robot Manipulation · AAAI 2026
Improving Long-Text Alignment for Text-to-Image Diffusion Models · ICLR 2025
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation · ICCV 2025
AutoAlign: Get Your LLM Aligned with Minimal Annotations · ACL 2025
Empowering LLMs to Understand and Generate Complex Vector Graphics · CVPR 2025
Data-Free Generalized Zero-Shot Learning · AAAI 2024
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing · CVPR 2024
RaFE: Generative Radiance Fields Restoration · ECCV 2024
Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds · ECCV 2024
UFDA: Universal Federated Domain Adaptation with Practical Assumptions · AAAI 2024
SVGDreamer: Text Guided SVG Generation with Diffusion Model · CVPR 2024
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models · NIPS 2024
Adaptive Conformal Inference by Betting · ICML 2024
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation · AAAI 2024
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models · NIPS 2023
CS-Isolate: Extracting Hard Confident Examples by Content and Style Isolation · NIPS 2023
Complexity-Guided Slimmable Decoder for Efficient Deep Video Compression · CVPR 2023
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud · CVPR 2023
Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation · CVPR 2023
Content Adaptive Latents and Decoder for Neural Image Compression · ECCV 2022
SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling · ECCV 2022
Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation · ECCV 2022
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds · CVPR 2022
Learning Based Multi-Modality Image and Video Compression · CVPR 2022
Coarse-To-Fine Deep Video Coding With Hyperprior-Guided Mode Prediction · CVPR 2022
LSVC: A Learning-Based Stereo Video Compression Framework · CVPR 2022
Region Aware Transformer for Automatic Breast Ultrasound Tumor Segmentation · MIDL 2022
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis · COLING 2022
Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods · CVPR 2021
Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds · CVPR 2021
VoxelContext-Net: An Octree Based Framework for Point Cloud Compression · CVPR 2021
StyleFormer: Real-Time Arbitrary Style Transfer via Parametric Style Composition · ICCV 2021
STVGBert: A Visual-Linguistic Transformer Based Framework for Spatio-Temporal Video Grounding · ICCV 2021
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds · ICCV 2021
IncreACO: Incrementally Learned Automatic Check-Out With Photorealistic Exemplar Augmentation · WACV 2021
SRDAN: Scale-Aware and Range-Aware Domain Adaptation Network for Cross-Dataset 3D Object Detection · CVPR 2021
Inception Convolution With Efficient Dilation Search · CVPR 2021
FVC: A New Framework Towards Deep Video Compression in Feature Space · CVPR 2021
Multi-Dimensional Pruning: A Unified Framework for Model Compression · CVPR 2020
Hashing Based Answer Selection · AAAI 2020
Improving Deep Video Compression by Resolution-adaptive Flow Coding · ECCV 2020
Content Adaptive and Error Propagation Aware Deep Video Compression · ECCV 2020
Channel Pruning Guided by Classification Loss and Feature Importance · AAAI 2020
Improving Action Localization by Progressive Cross-Stream Cooperation · CVPR 2019
DVC: An End-To-End Deep Video Compression Framework · CVPR 2019
Dividing and Aggregating Network for Multi-view Action Recognition · ECCV 2018
Collaborative and Adversarial Network for Unsupervised Domain Adaptation · CVPR 2018
Deep Kalman Filtering Network for Video Compression Artifact Reduction · ECCV 2018
Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition · IJCAI 2017
SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos · CVPR 2017
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos · ICCV 2017
Fast Algorithms for Linear and Kernel SVM+ · CVPR 2016
Proximal Riemannian Pursuit for Large-Scale Trace-Norm Minimization · CVPR 2016
Object-Based RGBD Image Co-Segmentation With Mutex Constraint · CVPR 2015
Multi-View Domain Generalization for Visual Recognition · ICCV 2015
FaLRR: A Fast Low Rank Representation Solver · CVPR 2015
Visual Recognition by Learning From Web Data: A Weakly Supervised Domain Generalization Approach · CVPR 2015
Object-based Multiple Foreground Video Co-segmentation · CVPR 2014
Recognizing RGB Images by Learning from RGB-D Data · CVPR 2014
Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild · CVPR 2013
Semantically-Based Human Scanpath Estimation with HMMs · ICCV 2013
Event Recognition in Videos by Learning from Heterogeneous Web Sources · CVPR 2013
Learning by Associating Ambiguously Labeled Images · CVPR 2013