Bin Xiao
54 papers · 2017–2026 · 11 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Academic Marathon (8)
Conference Polyglot (11)
Renaissance Researcher (9)
Taxonomy Completionist (80)
Cross-Pollinator (15)
Mega-Team (20)
Grand Slam
Dynamic Duo (16)
Topic Evolution
Trend Setter
The Questioner (2)
Century Club (51)
Conference Pioneer
Keyword Collector (196)
Prolific Year (8)
Unstoppable (9)
Conferences
CVPR (16)
AAAI (9)
ECCV (9)
ICCV (9)
ACML (3)
ICLR (3)
ICML (1)
IJCAI (1)
MICCAI (1)
NAACL (1)
NIPS (1)
Research topics
Keywords
semantic segmentation (5)
adversarial attack (5)
image classification (4)
vision transformer (4)
human pose estimation (4)
metric learning (3)
contrastive learning (3)
neural network (3)
adversarial learning (3)
knowledge distillation (3)
convolutional neural network (3)
object detection (3)
zero-shot learning (3)
self-supervised learning (3)
multimodal learning (3)
black-box attack (3)
feature fusion (3)
image forgery detection (2)
representation learning (2)
medical image segmentation (2)
Papers
TGDD: Trajectory Guided Dataset Distillation with Balanced Distribution
AAAI 2026
SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation
AAAI 2026
Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration
AAAI 2026
Improving Transferable Targeted Attacks with Feature Tuning Mixup
CVPR 2025
PLA: Prompt Learning Attack against Text-to-Image Generative Models
ICCV 2025
Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network
CVPR 2025
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
CVPR 2025
Efficient Dynamic Ensembling for Multiple LLM Experts
IJCAI 2025
Test-Time Learning for Large Language Models
ICML 2025
UV-Attack: Physical-World Adversarial Attacks on Person Detection via Dynamic-NeRF-based UV Mapping
ICLR 2025
Breaking Grid Constraints: Dynamic Graph Reconstruction Network for Multi-organ Segmentation
ICCV 2025
Who Controls the Authorization? Invertible Networks for Copyright Protection in Text-to-Image Synthesis
ICCV 2025
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
AAAI 2025
Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning
AAAI 2025
DGMIR: Dual-Guided Multimodal Medical Image Registration based on Multi-view Augmentation and On-site Modality Removal
MICCAI 2025
Transferable 3D Adversarial Shape Completion using Diffusion Models
ECCV 2024
AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models
ECCV 2024
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
NAACL 2024
Efficient Modulation for Vision Networks
ICLR 2024
Focus Stacking with High Fidelity and Superior Visual Effects
AAAI 2024
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
CVPR 2024
Using My Artistic Style? You Must Obtain My Authorization
ECCV 2024
DLBD: A Self-Supervised Direct-Learned Binary Descriptor
CVPR 2023
Self-Supervised Image Local Forgery Detection by JPEG Compression Trace
AAAI 2023
i-Code: An Integrative and Composable Multimodal Learning Framework
AAAI 2023
Physical-World Optical Adversarial Attacks on 3D Face Recognition
CVPR 2023
MCF: Mutual Correction Framework for Semi-Supervised Medical Image Segmentation
CVPR 2023
StyLess: Boosting the Transferability of Adversarial Examples
CVPR 2023
DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer
CVPR 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
ICCV 2023
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
ECCV 2022
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
ECCV 2022
Detecting Generated Images by Real Images
ECCV 2022
MiniViT: Compressing Vision Transformers With Weight Multiplexing
CVPR 2022
Unified Contrastive Learning in Image-Text-Label Space
CVPR 2022
Efficient Self-supervised Vision Transformers for Representation Learning
ICLR 2022
Semantic Cross Attention for Few-shot Learning
ACML 2022
FF-Net: An End-to-end Feature-Fusion Network for Double JPEG Detection and Localization
ACML 2022
DaViT: Dual Attention Vision Transformers
ECCV 2022
DTMNet: A Discrete Tchebichef Moments-Based Deep Neural Network for Multi-Focus Image Fusion
ICCV 2021
Focal Attention for Long-Range Interactions in Vision Transformers
NIPS 2021
Lite-HRNet: A Lightweight High-Resolution Network
CVPR 2021
Dynamic Head: Unifying Object Detection Heads With Attentions
CVPR 2021
Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression
CVPR 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
ICCV 2021
Reality Transform Adversarial Generators for Image Splicing Forgery Detection and Localization
ICCV 2021
CvT: Introducing Convolutions to Vision Transformers
ICCV 2021
3D Human Pose Estimation via Explicit Compositional Depth Maps
AAAI 2020
Proxy Network for Few Shot Learning
ACML 2020
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
CVPR 2020
Deep High-Resolution Representation Learning for Human Pose Estimation
CVPR 2019
Simple Baselines for Human Pose Estimation and Tracking
ECCV 2018
Integral Human Pose Regression
ECCV 2018
Interleaved Group Convolutions
ICCV 2017