Shu-Tao Xia
109 papers · 2016–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (13) π Conference Polyglot (11)
π
Conference Polyglot
(11)
π
Academic Marathon
(9)
π
Cross-Pollinator
(10)
π
Conference Loyalist
(20)
π
Keyword Champion
π
Grand Slam
π€
Dynamic Duo
(45)
π¬
Deep Specialist
(13)
π
Century Club
(103)
π
Trend Setter
ποΈ
Keyword Collector
(390)
β‘
Prolific Year
(22)
π₯
Unstoppable
(10)
Conferences
AAAI (20)
CVPR (17)
ICLR (14)
NIPS (13)
ECCV (11)
ICCV (11)
IJCAI (10)
ACL (6)
ICML (4)
EMNLP (2)
UAI (1)
Top co-authors
Keywords
contrastive learning
(9)
adversarial learning
(7)
transfer learning
(7)
backdoor attack
(7)
adversarial attack
(6)
vision-language model
(6)
image restoration
(6)
self-supervised learning
(6)
point cloud
(6)
multimodal learning
(5)
large language model
(5)
representation learning
(4)
diffusion model
(4)
video retrieval
(4)
uncertainty quantification
(3)
video hashing
(3)
domain adaptation
(3)
video understanding
(3)
model security
(3)
image generation
(3)
Papers
Retrievals Can Be Detrimental: Unveiling the Backdoor Vulnerability of Retrieval-Augmented Diffusion Models
ACL 2026
Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning
AAAI 2026
From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents
ACL 2026
Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation
AAAI 2026
When Efficiency Meets Safety: A Benchmark Security Analysis of KV Cache Compression in Large Language Models
ACL 2026
CASL: Curvature-Augmented Self-supervised Learning for 3D Anomaly Detection
AAAI 2026
One Perturbation is Enough: On Generating Universal Adversarial Perturbations against Vision-Language Pre-training Models
ICCV 2025
AutoSSVH: Exploring Automated Frame Sampling for Efficient Self-Supervised Video Hashing
CVPR 2025
MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds
EMNLP 2025
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors
EMNLP 2025
Enhancing Partially Relevant Video Retrieval with Hyperbolic Learning
ICCV 2025
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning
ICCV 2025
Cassic: Towards Content-Adaptive State-Space Models for Learned Image Compression
ICCV 2025
GCD-Sampling: A General Cross-scale Decoupled Sampling for Point Cloud
AAAI 2025
Efficient Self-Supervised Video Hashing with Selective State Spaces
AAAI 2025
Diffusion Prior Interpolation for Flexibility Real-World Face Super-Resolution
AAAI 2025
CALF: Aligning LLMs for Time Series Forecasting via Cross-modal Fine-Tuning
AAAI 2025
Pre-Trained Vision-Language Models as Noisy Partial Annotators
AAAI 2025
Modeling Uncertainty in Composed Image Retrieval via Probabilistic Embeddings
ACL 2025
Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models
ACL 2025
TimeBridge: Non-Stationarity Matters for Long-term Time Series Forecasting
ICML 2025
3D-LMVIC: Learning-based Multi-View Image Compression with 3D Gaussian Geometric Priors
ICML 2025
TimeFilter: Patch-Specific Spatial-Temporal Graph Filtration for Time Series Forecasting
ICML 2025
IntLoRA: Integral Low-rank Adaptation of Quantized Diffusion Models
ICML 2025
PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
CVPR 2025
Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
CVPR 2025
Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception
CVPR 2025
Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation
CVPR 2025
MambaIRv2: Attentive State Space Restoration
CVPR 2025
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
CVPR 2025
Going Beyond Feature Similarity: Effective Dataset distillation based on Class-aware Conditional Mutual Information
ICLR 2025
Error-quantified Conformal Inference for Time Series
ICLR 2025
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
ICLR 2025
An Exploration with Entropy Constrained 3D Gaussians for 2D Video Compression
ICLR 2025
Stealthy Shield Defense: A Conditional Mutual Information-Based Approach against Black-Box Model Inversion Attacks
ICLR 2025
Efficient Differentiable Approximation of Generalized Low-rank Regularization
IJCAI 2025
Point Cloud Mixture-of-Domain-Experts Model for 3D Self-supervised Learning
IJCAI 2025
DDN: Dual-domain Dynamic Normalization for Non-stationary Time Series Forecasting
NIPS 2024
A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks
ECCV 2024
MambaIR: A Simple Baseline for Image Restoration with State-Space Model
ECCV 2024
LCM: Locally Constrained Compact Point Cloud Model for Masked Point Modeling
NIPS 2024
BoostAdapter: Improving Vision-Language Test-Time Adaptation via Regional Bootstrapping
NIPS 2024
Everyday Object Meets Vision-and-Language Navigation Agent via Backdoor
NIPS 2024
Towards Faithful XAI Evaluation via Generalization-Limited Backdoor Watermark
ICLR 2024
Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach
ECCV 2024
GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
AAAI 2024
Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders
AAAI 2024
Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding
AAAI 2024
Controller-Guided Partial Label Consistency Regularization with Unlabeled Data
AAAI 2024
CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks
ECCV 2024
Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
ICLR 2024
Parameter Efficient Adaptation for Image Restoration with Heterogeneous Mixture-of-Experts
NIPS 2024
ReFIR: Grounding Large Restoration Models with Retrieval Augmentation
NIPS 2024
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transfomers
CVPR 2024
GladCoder: Stylized QR Code Generation with Grayscale-Aware Denoising Process
IJCAI 2024
BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP
CVPR 2024
Periodicity Decoupling Framework for Long-term Series Forecasting
ICLR 2024
Boundary-aware Decoupled Flow Networks for Realistic Extreme Rescaling
IJCAI 2024
Invertible Residual Rescaling Models
IJCAI 2024
Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models
ICCV 2023
Domain Watermark: Effective and Harmless Dataset Copyright Protection is Closed at Hand
NIPS 2023
Open-Vocabulary Multi-Label Classification via Multi-Modal Knowledge Transfer
AAAI 2023
FSR: A General Frequency-Oriented Framework to Accelerate Image Super-resolution Networks
AAAI 2023
Contrastive Masked Autoencoders for Self-Supervised Video Hashing
AAAI 2023
Combating Unknown Bias with Effective Bias-Conflicting Scoring and Gradient Alignment
AAAI 2023
Learned Distributed Image Compression with Multi-Scale Patch Matching in Feature Domain
AAAI 2023
Learning Transferable Spatiotemporal Representations From Natural Script Knowledge
CVPR 2023
Backdoor Defense via Adaptively Splitting Poisoned Dataset
CVPR 2023
Towards Robust Model Watermark via Reducing Parametric Vulnerability
ICCV 2023
Unsupervised Surface Anomaly Detection with Diffusion Probabilistic Model
ICCV 2023
GIFD: A Generative Gradient Inversion Method with Feature Domain Optimization
ICCV 2023
One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training
ICCV 2023
DELTA: DEGRADATION-FREE FULLY TEST-TIME ADAPTATION
ICLR 2023
Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection
ICLR 2023
Towards Robust Scene Text Image Super-resolution via Explicit Location Enhancement
IJCAI 2023
PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework
CVPR 2022
Improving Vision Transformers by Revisiting High-Frequency Components
ECCV 2022
Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips
ECCV 2022
Few-Shot Backdoor Attacks on Visual Object Tracking
ICLR 2022
Boosting Black-Box Attack With Partially Transferred Conditional Adversarial Distribution
CVPR 2022
NeXT: Towards High Quality Neural Radiance Fields via Multi-Skip Transformer
ECCV 2022
Untargeted Backdoor Watermark: Towards Harmless and Stealthy Dataset Copyright Protection
NIPS 2022
Deep Dirichlet process mixture models
UAI 2022
Defending against Model Stealing via Verifying Embedded External Features
AAAI 2022
Contrastive Quantization with Code Memory for Unsupervised Image Retrieval
AAAI 2022
SimCC: A Simple Coordinate Classification Perspective for Human Pose Estimation
ECCV 2022
TokenPose: Learning Keypoint Tokens for Human Pose Estimation
ICCV 2021
Clustering Effect of Adversarial Robust Models
NIPS 2021
Improving Adversarial Robustness via Channel-wise Activation Suppressing
ICLR 2021
Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits
ICLR 2021
Skip Connections Matter: On the Transferability of Adversarial Examples Generated with ResNets
ICLR 2020
Stochastic Deep Gaussian Processes over Graphs
NIPS 2020
Adversarial Weight Perturbation Helps Robust Generalization
NIPS 2020
Improving Query Efficiency of Black-box Adversarial Attack
ECCV 2020
Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters
ECCV 2020
Targeted Attack for Deep Hashing based Retrieval
ECCV 2020
One-Shot Adversarial Attacks on Visual Tracking With Dual Attention
CVPR 2020
Maintaining Discrimination and Fairness in Class Incremental Learning
CVPR 2020
Adversarial Attack on Deep Product Quantization Network for Image Retrieval
AAAI 2020
Automatic Grassland Degradation Estimation Using Deep Learning
IJCAI 2019
Hilbert-Based Generative Defense for Adversarial Examples
ICCV 2019
Second-Order Attention Network for Single Image Super-Resolution
CVPR 2019
Exploiting Common Characters in Chinese and Japanese to Learn Cross-Lingual Word Embeddings via Matrix Factorization
ACL 2018
BML: A High-performance, Low-cost Gradient Synchronization Algorithm for DML Training
NIPS 2018
Iterative Learning With Open-Set Noisy Labels
CVPR 2018
Student-t Process Regression with Student-t Likelihood
IJCAI 2017
Accelerated Stochastic Greedy Coordinate Descent by Soft Thresholding Projection onto Simplex
NIPS 2017
Robust Survey Aggregation with Student-t Distribution and Sparse Representation
IJCAI 2017
Bernoulli Random Forests: Closing the Gap between Theoretical Consistency and Empirical Soundness
IJCAI 2016