Qiang Xu
48 papers · 2020–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (13) π Academic Marathon (6) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (10)
π
Cross-Pollinator
(10)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(78)
π
Triple Crown
π¬
Deep Specialist
(10)
π€
Dynamic Duo
(15)
π
Grand Slam
π
Conference Pioneer
π₯
Unstoppable
(7)
π
Century Club
(43)
ποΈ
Keyword Collector
(165)
β
The Questioner
β‘
Prolific Year
(12)
Conferences
AAAI (7)
ICLR (7)
NIPS (6)
CVPR (5)
ECCV (5)
ICCV (5)
ICML (3)
WACV (3)
ACL (2)
INTERSPEECH (2)
EMNLP (1)
IJCAI (1)
NAACL (1)
Top co-authors
Keywords
diffusion model
(9)
large language model
(4)
graph neural network
(3)
video generation
(3)
multimodal learning
(3)
vision-language model
(3)
autonomous driving
(3)
human pose estimation
(3)
image generation
(3)
human image generation
(2)
reasoning verification
(2)
diffusion transformer
(2)
adversarial attack
(2)
object detection
(2)
3d pose estimation
(2)
image editing
(2)
time series forecasting
(2)
automatic speech recognition
(2)
text-to-image model
(2)
process verification
(2)
Papers
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
WACV 2026
Solve-Detect-Verify: Inference-Time Scaling with Flexible Generative Verifier
ACL 2026
Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models
AAAI 2026
Activations as Features: Probing LLMs for Generalizable Essay Scoring Representations
AAAI 2026
FIXME: Towards End-to-End Benchmarking of LLM-Aided Design Verification
AAAI 2026
DynamicRTL: RTL Representation Learning for Dynamic Circuit Behavior
AAAI 2026
Non-Cross Diffusion for Semantic Consistency
WACV 2025
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls
AAAI 2025
DeepRTL2: A Versatile Model for RTL-Related Tasks
ACL 2025
Dyve: Thinking Fast and Slow for Dynamic Process Verification
EMNLP 2025
MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control
ICCV 2025
FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention
ICCV 2025
DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model
ICLR 2025
HiBug2: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging
ICLR 2025
DeepGate4: Efficient and Effective Representation Learning for Circuit Design at Scale
ICLR 2025
DeepLayout: Learning Neural Representations of Circuit Placement Layout
ICML 2025
Guideline Compliance in Task-Oriented Dialogue: The Chained Prior Approach
NAACL 2025
Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised Learning
ICML 2024
MagicDrive: Street View Generation with Diverse 3D Geometry Control
ICLR 2024
GuardT2I: Defending Text-to-Image Models from Adversarial Prompts
NIPS 2024
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
NIPS 2024
Vector Quantization Prompting for Continual Learning
NIPS 2024
Multi-Patch Prediction: Adapting Language Models for Time Series Representation Learning
ICML 2024
Text Image Inpainting via Global Structure-Guided Diffusion Models
AAAI 2024
MMA-Diffusion: MultiModal Attack on Diffusion Models
CVPR 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
CVPR 2024
BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion
ECCV 2024
FITS: Modeling Time Series with $10k$ Parameters
ICLR 2024
PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code
ICLR 2024
HiBug: On Human-Interpretable Model Debug
NIPS 2023
HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation
ICCV 2023
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
CVPR 2023
Are Transformers Effective for Time Series Forecasting?
AAAI 2023
DIFFGUARD: Semantic Mismatch-Guided Out-of-Distribution Detection Using Pre-Trained Diffusion Models
ICCV 2023
DeciWatch: A Simple Baseline for 10Γ Efficient 2D and 3D Pose Estimation
ECCV 2022
Out-of-Distribution Detection with Semantic Mismatch under Masking
ECCV 2022
T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis
ICLR 2022
Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model
INTERSPEECH 2022
Language-specific Characteristic Assistance for Code-switching Speech Recognition
INTERSPEECH 2022
SCINet: Time Series Modeling and Forecasting with Sample Convolution and Interaction
NIPS 2022
Active Teacher for Semi-Supervised Object Detection
CVPR 2022
SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos
ECCV 2022
TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks
NIPS 2021
Information Bottleneck Approach to Spatial Attention Learning
IJCAI 2021
Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation
ICCV 2021
SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach
ECCV 2020
DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image
WACV 2020
nuScenes: A Multimodal Dataset for Autonomous Driving
CVPR 2020