Bin Xiao

54 papers · 2017–2026 · 11 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🌍 Conference Polyglot (11) 🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (80)

🐝 Cross-Pollinator (15) 🌍 Conference Polyglot (11) 🏃 Academic Marathon (8) 👥 Mega-Team (20) 🏆 Grand Slam 🤝 Dynamic Duo (16) 🧬 Topic Evolution 📈 Trend Setter ❓ The Questioner (2) 💎 Century Club (51) 🚀 Conference Pioneer 🗃️ Keyword Collector (196) ⚡ Prolific Year (8) 🔥 Unstoppable (9)

Conferences

CVPR (16) AAAI (9) ECCV (9) ICCV (9) ACML (3) ICLR (3) ICML (1) IJCAI (1) MICCAI (1) NAACL (1) NIPS (1)

Top co-authors

Xiuli Bi (16) Lu Yuan (16) Weisheng Li (9) Bo Liu (9) Xiyang Dai (7) Jianwei Yang (6) Jingdong Wang (6) Jianfeng Gao (5) Mengchen Liu (5) Xinbo Gao (5)

Research topics

Privacy (2)

Keywords

semantic segmentation (5) adversarial attack (5) image classification (4) vision transformer (4) human pose estimation (4) metric learning (3) contrastive learning (3) neural network (3) adversarial learning (3) knowledge distillation (3) convolutional neural network (3) object detection (3) zero-shot learning (3) self-supervised learning (3) multimodal learning (3) black-box attack (3) feature fusion (3) image forgery detection (2) representation learning (2) medical image segmentation (2)

Papers

TGDD: Trajectory Guided Dataset Distillation with Balanced Distribution AAAI 2026 SSR: Semantic and Spatial Rectification for CLIP-based Weakly Supervised Segmentation AAAI 2026 Clear Nights Ahead: Towards Multi-Weather Nighttime Image Restoration AAAI 2026 Improving Transferable Targeted Attacks with Feature Tuning Mixup CVPR 2025 PLA: Prompt Learning Attack against Text-to-Image Generative Models ICCV 2025 Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network CVPR 2025 Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion CVPR 2025 Efficient Dynamic Ensembling for Multiple LLM Experts IJCAI 2025 Test-Time Learning for Large Language Models ICML 2025 UV-Attack: Physical-World Adversarial Attacks on Person Detection via Dynamic-NeRF-based UV Mapping ICLR 2025 Breaking Grid Constraints: Dynamic Graph Reconstruction Network for Multi-organ Segmentation ICCV 2025 Who Controls the Authorization? Invertible Networks for Copyright Protection in Text-to-Image Synthesis ICCV 2025 CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training AAAI 2025 Power of Diversity: Enhancing Data-Free Black-Box Attack with Domain-Augmented Learning AAAI 2025 DGMIR: Dual-Guided Multimodal Medical Image Registration based on Multi-view Augmentation and On-site Modality Removal MICCAI 2025 Transferable 3D Adversarial Shape Completion using Diffusion Models ECCV 2024 AdvDiff: Generating Unrestricted Adversarial Examples using Diffusion Models ECCV 2024 i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data NAACL 2024 Efficient Modulation for Vision Networks ICLR 2024 Focus Stacking with High Fidelity and Superior Visual Effects AAAI 2024 Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks CVPR 2024 Using My Artistic Style? You Must Obtain My Authorization ECCV 2024 DLBD: A Self-Supervised Direct-Learned Binary Descriptor CVPR 2023 Self-Supervised Image Local Forgery Detection by JPEG Compression Trace AAAI 2023 i-Code: An Integrative and Composable Multimodal Learning Framework AAAI 2023 Physical-World Optical Adversarial Attacks on 3D Face Recognition CVPR 2023 MCF: Mutual Correction Framework for Semi-Supervised Medical Image Segmentation CVPR 2023 StyLess: Boosting the Transferability of Adversarial Examples CVPR 2023 DAA: A Delta Age AdaIN Operation for Age Estimation via Binary Code Transformer CVPR 2023 TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance ICCV 2023 TinyViT: Fast Pretraining Distillation for Small Vision Transformers ECCV 2022 Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training ECCV 2022 Detecting Generated Images by Real Images ECCV 2022 MiniViT: Compressing Vision Transformers With Weight Multiplexing CVPR 2022 Unified Contrastive Learning in Image-Text-Label Space CVPR 2022 Efficient Self-supervised Vision Transformers for Representation Learning ICLR 2022 Semantic Cross Attention for Few-shot Learning ACML 2022 FF-Net: An End-to-end Feature-Fusion Network for Double JPEG Detection and Localization ACML 2022 DaViT: Dual Attention Vision Transformers ECCV 2022 DTMNet: A Discrete Tchebichef Moments-Based Deep Neural Network for Multi-Focus Image Fusion ICCV 2021 Focal Attention for Long-Range Interactions in Vision Transformers NIPS 2021 Lite-HRNet: A Lightweight High-Resolution Network CVPR 2021 Dynamic Head: Unifying Object Detection Heads With Attentions CVPR 2021 Bottom-Up Human Pose Estimation via Disentangled Keypoint Regression CVPR 2021 Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding ICCV 2021 Reality Transform Adversarial Generators for Image Splicing Forgery Detection and Localization ICCV 2021 CvT: Introducing Convolutions to Vision Transformers ICCV 2021 3D Human Pose Estimation via Explicit Compositional Depth Maps AAAI 2020 Proxy Network for Few Shot Learning ACML 2020 HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation CVPR 2020 Deep High-Resolution Representation Learning for Human Pose Estimation CVPR 2019 Simple Baselines for Human Pose Estimation and Tracking ECCV 2018 Integral Human Pose Regression ECCV 2018 Interleaved Group Convolutions ICCV 2017