Dong Xu
63 papers · 2013–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (12) π Academic Marathon (12) π Renaissance Researcher (9) πΊοΈ Taxonomy Completionist (112)
πΊοΈ
Taxonomy Completionist
(112)
π
Conference Polyglot
(12)
π
Academic Marathon
(12)
π
Keyword Trendsetter Combo
(5)
π
Conference Loyalist
(31)
π±
Topic Pioneer
π
Grand Slam
π
Keyword Champion
(5)
π€
Dynamic Duo
(10)
β‘
Prolific Year
(9)
π
Century Club
(62)
ποΈ
Keyword Collector
(265)
π₯
Unstoppable
(13)
π
Trend Setter
π
Conference Pioneer
Conferences
CVPR (31)
ECCV (9)
ICCV (7)
AAAI (6)
NIPS (3)
ACL (1)
COLING (1)
ICLR (1)
ICML (1)
IJCAI (1)
MIDL (1)
WACV (1)
Top co-authors
Keywords
video compression
(6)
point cloud
(5)
motion compensation
(5)
model compression
(4)
neural network optimization
(4)
neural network
(4)
weakly supervised learning
(4)
low-rank representation
(3)
3d vision
(3)
image classification
(3)
channel pruning
(3)
multimodal learning
(3)
support vector machine
(3)
domain adaptation
(3)
convolutional neural network
(3)
visual grounding
(2)
feature alignment
(2)
entropy coding
(2)
unsupervised domain adaptation
(2)
object localization
(2)
Papers
Learning Diffusion Policy from Primitive Skills for Robot Manipulation
AAAI 2026
Improving Long-Text Alignment for Text-to-Image Diffusion Models
ICLR 2025
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
ICCV 2025
AutoAlign: Get Your LLM Aligned with Minimal Annotations
ACL 2025
Empowering LLMs to Understand and Generate Complex Vector Graphics
CVPR 2025
Data-Free Generalized Zero-Shot Learning
AAAI 2024
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
CVPR 2024
RaFE: Generative Radiance Fields Restoration
ECCV 2024
Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds
ECCV 2024
UFDA: Universal Federated Domain Adaptation with Practical Assumptions
AAAI 2024
SVGDreamer: Text Guided SVG Generation with Diffusion Model
CVPR 2024
An In-depth Investigation of Sparse Rate Reduction in Transformer-like Models
NIPS 2024
Adaptive Conformal Inference by Betting
ICML 2024
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models
NIPS 2023
CS-Isolate: Extracting Hard Confident Examples by Content and Style Isolation
NIPS 2023
Complexity-Guided Slimmable Decoder for Efficient Deep Video Compression
CVPR 2023
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
CVPR 2023
Conflict-Based Cross-View Consistency for Semi-Supervised Semantic Segmentation
CVPR 2023
Content Adaptive Latents and Decoder for Neural Image Compression
ECCV 2022
SketchSampler: Sketch-Based 3D Reconstruction via View-Dependent Depth Sampling
ECCV 2022
Improving RGB-D Point Cloud Registration by Learning Multi-Scale Local Linear Transformation
ECCV 2022
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
CVPR 2022
Learning Based Multi-Modality Image and Video Compression
CVPR 2022
Coarse-To-Fine Deep Video Coding With Hyperprior-Guided Mode Prediction
CVPR 2022
LSVC: A Learning-Based Stereo Video Compression Framework
CVPR 2022
Region Aware Transformer for Automatic Breast Ultrasound Tumor Segmentation
MIDL 2022
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis
COLING 2022
Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods
CVPR 2021
Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds
CVPR 2021
VoxelContext-Net: An Octree Based Framework for Point Cloud Compression
CVPR 2021
StyleFormer: Real-Time Arbitrary Style Transfer via Parametric Style Composition
ICCV 2021
STVGBert: A Visual-Linguistic Transformer Based Framework for Spatio-Temporal Video Grounding
ICCV 2021
3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
ICCV 2021
IncreACO: Incrementally Learned Automatic Check-Out With Photorealistic Exemplar Augmentation
WACV 2021
SRDAN: Scale-Aware and Range-Aware Domain Adaptation Network for Cross-Dataset 3D Object Detection
CVPR 2021
Inception Convolution With Efficient Dilation Search
CVPR 2021
FVC: A New Framework Towards Deep Video Compression in Feature Space
CVPR 2021
Multi-Dimensional Pruning: A Unified Framework for Model Compression
CVPR 2020
Hashing Based Answer Selection
AAAI 2020
Improving Deep Video Compression by Resolution-adaptive Flow Coding
ECCV 2020
Content Adaptive and Error Propagation Aware Deep Video Compression
ECCV 2020
Channel Pruning Guided by Classification Loss and Feature Importance
AAAI 2020
Improving Action Localization by Progressive Cross-Stream Cooperation
CVPR 2019
DVC: An End-To-End Deep Video Compression Framework
CVPR 2019
Dividing and Aggregating Network for Multi-view Action Recognition
ECCV 2018
Collaborative and Adversarial Network for Unsupervised Domain Adaptation
CVPR 2018
Deep Kalman Filtering Network for Video Compression Artifact Reduction
ECCV 2018
Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition
IJCAI 2017
SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos
CVPR 2017
Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos
ICCV 2017
Fast Algorithms for Linear and Kernel SVM+
CVPR 2016
Proximal Riemannian Pursuit for Large-Scale Trace-Norm Minimization
CVPR 2016
Object-Based RGBD Image Co-Segmentation With Mutex Constraint
CVPR 2015
Multi-View Domain Generalization for Visual Recognition
ICCV 2015
FaLRR: A Fast Low Rank Representation Solver
CVPR 2015
Visual Recognition by Learning From Web Data: A Weakly Supervised Domain Generalization Approach
CVPR 2015
Object-based Multiple Foreground Video Co-segmentation
CVPR 2014
Recognizing RGB Images by Learning from RGB-D Data
CVPR 2014
Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild
CVPR 2013
Semantically-Based Human Scanpath Estimation with HMMs
ICCV 2013
Event Recognition in Videos by Learning from Heterogeneous Web Sources
CVPR 2013
Learning by Associating Ambiguously Labeled Images
CVPR 2013