Qiang Xu

48 papers · 2020–2026 · 13 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🌍 Conference Polyglot (13) 🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (10)

🐝 Cross-Pollinator (10) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (78) 👑 Triple Crown 🔬 Deep Specialist (10) 🤝 Dynamic Duo (15) 🏆 Grand Slam 🚀 Conference Pioneer 🔥 Unstoppable (7) 💎 Century Club (43) 🗃️ Keyword Collector (165) ❓ The Questioner ⚡ Prolific Year (12)

Conferences

AAAI (7) ICLR (7) NIPS (6) CVPR (5) ECCV (5) ICCV (5) ICML (3) WACV (3) ACL (2) INTERSPEECH (2) EMNLP (1) IJCAI (1) NAACL (1)

Top co-authors

Ailing Zeng (15) Ruiyuan Gao (11) Xuan Ju (10) Qiuxia LAI (6) Lanqing Hong (6) Minhao LIU (6) Zhijian Xu (6) Jianyuan Zhong (5) Yijun Yang (5) Kai Chen (4)

Keywords

diffusion model (9) large language model (4) graph neural network (3) video generation (3) multimodal learning (3) vision-language model (3) autonomous driving (3) human pose estimation (3) image generation (3) human image generation (2) reasoning verification (2) diffusion transformer (2) adversarial attack (2) object detection (2) 3d pose estimation (2) image editing (2) time series forecasting (2) automatic speech recognition (2) text-to-image model (2) process verification (2)

Papers

MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes WACV 2026 Solve-Detect-Verify: Inference-Time Scaling with Flexible Generative Verifier ACL 2026 Multi-Faceted Attack: Exposing Cross-Model Vulnerabilities in Defense-Equipped Vision-Language Models AAAI 2026 Activations as Features: Probing LLMs for Generalizable Essay Scoring Representations AAAI 2026 FIXME: Towards End-to-End Benchmarking of LLM-Aided Design Verification AAAI 2026 DynamicRTL: RTL Representation Learning for Dynamic Circuit Behavior AAAI 2026 Non-Cross Diffusion for Semantic Consistency WACV 2025 MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls AAAI 2025 DeepRTL2: A Versatile Model for RTL-Related Tasks ACL 2025 Dyve: Thinking Fast and Slow for Dynamic Process Verification EMNLP 2025 MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control ICCV 2025 FullDiT: Video Generative Foundation Models with Multimodal Control via Full Attention ICCV 2025 DeepRTL: Bridging Verilog Understanding and Generation with a Unified Representation Model ICLR 2025 HiBug2: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging ICLR 2025 DeepGate4: Efficient and Effective Representation Learning for Circuit Design at Scale ICLR 2025 DeepLayout: Learning Neural Representations of Circuit Placement Layout ICML 2025 Guideline Compliance in Task-Oriented Dialogue: The Chained Prior Approach NAACL 2025 Be Your Own Neighborhood: Detecting Adversarial Examples by the Neighborhood Relations Built on Self-Supervised Learning ICML 2024 MagicDrive: Street View Generation with Diverse 3D Geometry Control ICLR 2024 GuardT2I: Defending Text-to-Image Models from Adversarial Prompts NIPS 2024 MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions NIPS 2024 Vector Quantization Prompting for Continual Learning NIPS 2024 Multi-Patch Prediction: Adapting Language Models for Time Series Representation Learning ICML 2024 Text Image Inpainting via Global Structure-Guided Diffusion Models AAAI 2024 MMA-Diffusion: MultiModal Attack on Diffusion Models CVPR 2024 DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception CVPR 2024 BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion ECCV 2024 FITS: Modeling Time Series with $10k$ Parameters ICLR 2024 PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code ICLR 2024 HiBug: On Human-Interpretable Model Debug NIPS 2023 HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation ICCV 2023 Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes CVPR 2023 Are Transformers Effective for Time Series Forecasting? AAAI 2023 DIFFGUARD: Semantic Mismatch-Guided Out-of-Distribution Detection Using Pre-Trained Diffusion Models ICCV 2023 DeciWatch: A Simple Baseline for 10× Efficient 2D and 3D Pose Estimation ECCV 2022 Out-of-Distribution Detection with Semantic Mismatch under Masking ECCV 2022 T-WaveNet: A Tree-Structured Wavelet Neural Network for Time Series Signal Analysis ICLR 2022 Self-Distillation Based on High-level Information Supervision for Compressing End-to-End ASR Model INTERSPEECH 2022 Language-specific Characteristic Assistance for Code-switching Speech Recognition INTERSPEECH 2022 SCINet: Time Series Modeling and Forecasting with Sample Convolution and Interaction NIPS 2022 Active Teacher for Semi-Supervised Object Detection CVPR 2022 SmoothNet: A Plug-and-Play Network for Refining Human Poses in Videos ECCV 2022 TestRank: Bringing Order into Unlabeled Test Instances for Deep Learning Tasks NIPS 2021 Information Bottleneck Approach to Spatial Attention Learning IJCAI 2021 Learning Skeletal Graph Neural Networks for Hard 3D Pose Estimation ICCV 2021 SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach ECCV 2020 DeepFuse: An IMU-Aware Network for Real-Time 3D Human Pose Estimation from Multi-View Image WACV 2020 nuScenes: A Multimodal Dataset for Autonomous Driving CVPR 2020