Shu-Tao Xia

109 papers · 2016–2026 · 11 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (13) 🌍 Conference Polyglot (11)

🏃 Academic Marathon (9) 🐝 Cross-Pollinator (10) 🏠 Conference Loyalist (20) 🏆 Keyword Champion 🏆 Grand Slam 🤝 Dynamic Duo (45) 🔬 Deep Specialist (13) 💎 Century Club (103) 📈 Trend Setter 🗃️ Keyword Collector (390) ⚡ Prolific Year (22) 🔥 Unstoppable (10)

Conferences

AAAI (20) CVPR (17) ICLR (14) NIPS (13) ECCV (11) ICCV (11) IJCAI (10) ACL (6) ICML (4) EMNLP (2) UAI (1)

Top co-authors

Tao Dai (47) Bin Chen (39) Yong Jiang (17) Jinpeng Wang (15) Yaohua Zha (12) Hang Guo (12) Naiqi Li (12) Hao Fang (10) Yiming Li (10) Kuofeng Gao (10)

Keywords

contrastive learning (9) adversarial learning (7) transfer learning (7) backdoor attack (7) adversarial attack (6) vision-language model (6) image restoration (6) self-supervised learning (6) point cloud (6) multimodal learning (5) large language model (5) representation learning (4) diffusion model (4) video retrieval (4) uncertainty quantification (3) video hashing (3) domain adaptation (3) video understanding (3) model security (3) image generation (3)

Papers

Retrievals Can Be Detrimental: Unveiling the Backdoor Vulnerability of Retrieval-Augmented Diffusion Models ACL 2026
Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning AAAI 2026
From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents ACL 2026
Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation AAAI 2026
When Efficiency Meets Safety: A Benchmark Security Analysis of KV Cache Compression in Large Language Models ACL 2026
CASL: Curvature-Augmented Self-supervised Learning for 3D Anomaly Detection AAAI 2026
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models ICCV 2025
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing CVPR 2025
MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds EMNLP 2025
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors EMNLP 2025
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning ICCV 2025
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning ICCV 2025
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression ICCV 2025
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud AAAI 2025
Efficient Self-Supervised Video Hashing with Selective State Spaces AAAI 2025
Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution AAAI 2025
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning AAAI 2025
Pre-Trained Vision-Language Models as Noisy Partial Annotators AAAI 2025
Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings ACL 2025
Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models ACL 2025
TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting ICML 2025
3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors ICML 2025
TimeFilter: Patch-Specific Spatial-Temporal Graph Filtration for Time Series Forecasting ICML 2025
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models ICML 2025
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter CVPR 2025
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning CVPR 2025
Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception CVPR 2025
Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation CVPR 2025
MambaIRv2: Attentive State Space Restoration CVPR 2025
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations CVPR 2025
Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information ICLR 2025
Error-quantified Conformal Inference for Time Series ICLR 2025
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation ICLR 2025
An Exploration with Entropy Constrained 3D Gaussians for 2D Video Compression ICLR 2025
Stealthy Shield Defense: A Conditional Mutual Information-Based Approach against Black-Box Model Inversion Attacks ICLR 2025
Efficient Differentiable Approximation of Generalized Low-rank Regularization IJCAI 2025
Point Cloud Mixture-of-Domain-Experts Model for 3D Self-supervised Learning IJCAI 2025
DDN: Dual-domain Dynamic Normalization for Non-stationary Time Series Forecasting NIPS 2024
A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks ECCV 2024
MambaIR: A Simple Baseline for Image Restoration with State-Space Model ECCV 2024
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling NIPS 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping NIPS 2024
Everyday Object Meets Vision-and-Language Navigation Agent via Backdoor NIPS 2024
Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark ICLR 2024
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach ECCV 2024
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval AAAI 2024
Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders AAAI 2024
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding AAAI 2024
Controller-Guided Partial Label Consistency Regularization with Unlabeled Data AAAI 2024
CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks ECCV 2024
Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images ICLR 2024
Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-Experts NIPS 2024
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation NIPS 2024
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers CVPR 2024
GladCoder: Stylized QR Code Generation with Grayscale-Aware Denoising Process IJCAI 2024
BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP CVPR 2024
Periodicity Decoupling Framework for Long-term Series Forecasting ICLR 2024
Boundary-aware Decoupled Flow Networks for Realistic Extreme Rescaling IJCAI 2024
Invertible Residual Rescaling Models IJCAI 2024
Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models ICCV 2023
Domain Watermark: Effective and Harmless Dataset Copyright Protection is Closed at Hand NIPS 2023
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer AAAI 2023
FSR: A General Frequency-Oriented Framework to Accelerate Image Super-resolution Networks AAAI 2023
Contrastive Masked Autoencoders for Self-Supervised Video Hashing AAAI 2023
Combating Unknown Bias with Effective Bias-Conflicting Scoring and Gradient Alignment AAAI 2023
Learned Distributed Image Compression with Multi-Scale Patch Matching in Feature Domain AAAI 2023
Learning Transferable Spatiotemporal Representations From Natural Script Knowledge CVPR 2023
Backdoor Defense via Adaptively Splitting Poisoned Dataset CVPR 2023
Towards Robust Model Watermark via Reducing Parametric Vulnerability ICCV 2023
Unsupervised Surface Anomaly Detection with Diffusion Probabilistic Model ICCV 2023
GIFD: A Generative Gradient Inversion Method with Feature Domain Optimization ICCV 2023
One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training ICCV 2023
DELTA: DEGRADATION-FREE FULLY TEST-TIME ADAPTATION ICLR 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection ICLR 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement IJCAI 2023
PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework CVPR 2022
Improving Vision Transformers by Revisiting High-Frequency Components ECCV 2022
Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips ECCV 2022
Few-Shot Backdoor Attacks on Visual Object Tracking ICLR 2022
Boosting Black-Box Attack With Partially Transferred Conditional Adversarial Distribution CVPR 2022
NeXT: Towards High Quality Neural Radiance Fields via Multi-Skip Transformer ECCV 2022
Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protection NIPS 2022
Deep Dirichlet process mixture models UAI 2022
Defending against Model Stealing via Verifying Embedded External Features AAAI 2022
Contrastive Quantization with Code Memory for Unsupervised Image Retrieval AAAI 2022
SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation ECCV 2022
TokenPose: Learning Keypoint Tokens for Human Pose Estimation ICCV 2021
Clustering Effect of Adversarial Robust Models NIPS 2021
Improving Adversarial Robustness via Channel-wise Activation Suppressing ICLR 2021
Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits ICLR 2021
Skip Connections Matter: On the Transferability of Adversarial Examples Generated with ResNets ICLR 2020
Stochastic Deep Gaussian Processes over Graphs NIPS 2020
Adversarial Weight Perturbation Helps Robust Generalization NIPS 2020
Improving Query Efficiency of Black-box Adversarial Attack ECCV 2020
Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters ECCV 2020
Targeted Attack for Deep Hashing based Retrieval ECCV 2020
One-Shot Adversarial Attacks on Visual Tracking With Dual Attention CVPR 2020
Maintaining Discrimination and Fairness in Class Incremental Learning CVPR 2020
Adversarial Attack on Deep Product Quantization Network for Image Retrieval AAAI 2020
Automatic Grassland Degradation Estimation Using Deep Learning IJCAI 2019
Hilbert-Based Generative Defense for Adversarial Examples ICCV 2019
Second-Order Attention Network for Single Image Super-Resolution CVPR 2019
Exploiting Common Characters in Chinese and Japanese to Learn Cross-Lingual Word Embeddings via Matrix Factorization ACL 2018
BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training NIPS 2018
Iterative Learning With Open-Set Noisy Labels CVPR 2018
Student-t Process Regression with Student-t Likelihood IJCAI 2017
Accelerated Stochastic Greedy Coordinate Descent by Soft Thresholding Projection onto Simplex NIPS 2017
Robust Survey Aggregation with Student-t Distribution and Sparse Representation IJCAI 2017
Bernoulli Random Forests: Closing the Gap between Theoretical Consistency and Empirical Soundness IJCAI 2016