Yao Zhao

90 papers · 2017–2026 · 12 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (12) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8)

🏠 Conference Loyalist (24) 🤝 Dynamic Duo (39) 👑 Triple Crown 🏆 Grand Slam 🔬 Deep Specialist (10) 🧬 Topic Evolution 🏆 Keyword Champion (2) ⚡ Prolific Year (23) ❓ The Questioner 🗃️ Keyword Collector (400) 💎 Century Club (89) 🔥 Unstoppable (9) 🚀 Conference Pioneer

Conferences

CVPR (24) AAAI (15) ICCV (14) ECCV (8) NIPS (8) ICLR (7) EMNLP (5) ACL (3) ICML (3) IJCAI (1) NAACL (1) WACV (1)

Top co-authors

Yunchao Wei (39) Chunyu Lin (14) Kang Liao (12) Lang Nie (8) Chuangchuang Tan (7) Huihui Bai (7) Chunjie Zhang (7) Shikui Wei (7) Mohammad Saleh (7) Humphrey Shi (6)

Research topics

Computer Vision (1)

Keywords

semantic segmentation (12) vision-language model (7) object detection (5) convolutional neural network (5) attention mechanism (5) image segmentation (5) diffusion model (5) transfer learning (5) unsupervised learning (4) deepfake detection (4) neural network (3) video generation (3) image stitching (3) instance segmentation (3) transformer architecture (3) video super-resolution (3) representation learning (3) abstractive summarization (3) knowledge transfer (2) self-supervised learning (2)

Papers

RAIN: Redundancy-Aware Latent Injection for Quality-Preserving Image Watermarking AAAI 2026
NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks CVPR 2025
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos CVPR 2025
EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events CVPR 2025
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans? CVPR 2025
Collapsed Language Models Promote Fairness ICLR 2025
ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks AAAI 2025
Memory Efficient Matting with Adaptive Token Routing AAAI 2025
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning AAAI 2025
C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection AAAI 2025
Unsupervised Region-Based Image Editing of Denoising Diffusion Models AAAI 2025
CSR: Achieving 1 Bit Key-Value Cache via Sparse Representation AAAI 2025
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance ICLR 2025
Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation ICLR 2025
Making RALM Robust to Irrelevant Contexts via Layer Knowledge Guided Attention ACL 2025
Visual Relation Diffusion for Human-Object Interaction Detection ICCV 2025
ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models ICCV 2025
CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting ICCV 2025
CharaConsist: Fine-Grained Consistent Character Generation ICCV 2025
PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching ICCV 2025
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training EMNLP 2025
LiPO: Listwise Preference Optimization through Learning-to-Rank NAACL 2025
Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification ICML 2025
Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models NIPS 2024
Adaptive Experimentation When You Can't Experiment NIPS 2024
Statistical Rejection Sampling Improves Preference Optimization ICLR 2024
Diffusion for Natural Image Matting ECCV 2024
Region-Adaptive Transform with Segmentation Prior for Image Compression ECCV 2024
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation ECCV 2024
Eliminating Warping Shakes for Unsupervised Online Video Stitching ECCV 2024
PixelLM: Pixel Reasoning with Large Multimodal Model CVPR 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation CVPR 2024
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Domain Learning AAAI 2024
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution AAAI 2024
On the Unstable Convergence Regime of Gradient Descent AAAI 2024
Lyapunov-Stable Deep Equilibrium Models AAAI 2024
SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution NIPS 2024
Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation CVPR 2024
Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection CVPR 2024
Endow SAM with Keen Eyes: Temporal-spatial Prompt Learning for Video Camouflaged Object Detection CVPR 2024
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection CVPR 2024
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation CVPR 2024
Region-Native Visual Tokenization ECCV 2024
Out-of-Distribution Detection and Selective Generation for Conditional Language Models ICLR 2023
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation NIPS 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments NIPS 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process NIPS 2023
Spatiotemporal Deformation Perception for Fisheye Video Rectification AAAI 2023
Learning To Segment Every Referring Object Point by Point CVPR 2023
Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation With Cross-Scale Distortion Awareness CVPR 2023
An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions CVPR 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning CVPR 2023
Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection CVPR 2023
Investigating Efficiently Extending Transformers for Long Input Summarization EMNLP 2023
Improving the Robustness of Summarization Models by Detecting and Removing Input Noise EMNLP 2023
Parallax-Tolerant Unsupervised Deep Image Stitching ICCV 2023
Locating Noise is Halfway Denoising for Semi-Supervised Segmentation ICCV 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation ICCV 2023
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation ICCV 2023
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation ICCV 2023
RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline Model and DoF-based Curriculum Learning ICCV 2023
Innovating Real Fisheye Image Correction with Dual Diffusion Architecture ICCV 2023
SMART: Sentences as Basic Units for Text Evaluation ICLR 2023
Calibrating Sequence likelihood Improves Conditional Language Generation ICLR 2023
Revisiting Simple Regret: Fast Rates for Returning a Good Arm ICML 2023
Complementary Bi-Directional Feature Compression for Indoor 360° Semantic Segmentation With Self-Distillation WACV 2023
Implicit Relation Linking for Question Answering over Knowledge Graph ACL 2022
Mask Matching Transformer for Few-Shot Segmentation NIPS 2022
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding ECCV 2022
Slim Scissors: Segmenting Thin Object from Synthetic Background ECCV 2022
PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation ECCV 2022
Deep Rectangling for Image Stitching: A Learning Baseline CVPR 2022
A Well-Composed Text is Half Done! Composition Sampling for Diverse Conditional Generation ACL 2022
Double Low-Rank Representation With Projection Distance Penalty for Clustering CVPR 2021
GradingNet: Towards Providing Reliable Supervisions for Weakly Supervised Object Detection by Grading the Box Candidates AAAI 2021
ForumSum: A Multi-Speaker Conversation Summarization Dataset EMNLP 2021
Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline CVPR 2021
Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow CVPR 2021
Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation ICCV 2021
Multi-Level Curriculum for Training a Distortion-Aware Barrel Distortion Rectification Model ICCV 2021
Fast Template Matching and Update for Video Object Tracking and Segmentation CVPR 2020
Interactive Object Segmentation With Inside-Outside Guidance CVPR 2020
Distribution-Induced Bidirectional Generative Adversarial Network for Graph Representation Learning CVPR 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization ICML 2020
CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection NIPS 2020
Devil in the Details: Towards Accurate Single and Multiple Human Parsing AAAI 2019
Learning Heterogeneous Spatial-Temporal Representation for Bike-Sharing Demand Prediction AAAI 2019
Self-Supervised Deep Low-Rank Assignment Model for Prototype Selection IJCAI 2018
Paragraph-level Neural Question Generation with Maxout Pointer and Gated Self-attention Networks EMNLP 2018
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach CVPR 2017