Yao Zhao
90 papers · 2017–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🌍 Conference Polyglot (12) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏠
Conference Loyalist
(24)
🤝
Dynamic Duo
(39)
👑
Triple Crown
🏆
Grand Slam
🔬
Deep Specialist
(10)
🧬
Topic Evolution
🏆
Keyword Champion
(2)
⚡
Prolific Year
(23)
❓
The Questioner
🗃️
Keyword Collector
(400)
💎
Century Club
(89)
🔥
Unstoppable
(9)
🚀
Conference Pioneer
Conferences
CVPR (24)
AAAI (15)
ICCV (14)
ECCV (8)
NIPS (8)
ICLR (7)
EMNLP (5)
ACL (3)
ICML (3)
IJCAI (1)
NAACL (1)
WACV (1)
Top co-authors
Research topics
Keywords
semantic segmentation
(12)
vision-language model
(7)
object detection
(5)
convolutional neural network
(5)
attention mechanism
(5)
image segmentation
(5)
diffusion model
(5)
transfer learning
(5)
unsupervised learning
(4)
deepfake detection
(4)
neural network
(3)
video generation
(3)
image stitching
(3)
instance segmentation
(3)
transformer architecture
(3)
video super-resolution
(3)
representation learning
(3)
abstractive summarization
(3)
knowledge transfer
(2)
self-supervised learning
(2)
Papers
RAIN: Redundancy-Aware Latent Injection for Quality-Preserving Image Watermarking
AAAI 2026
NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks
CVPR 2025
VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
CVPR 2025
EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events
CVPR 2025
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans?
CVPR 2025
Collapsed Language Models Promote Fairness
ICLR 2025
ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks
AAAI 2025
Memory Efficient Matting with Adaptive Token Routing
AAAI 2025
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning
AAAI 2025
C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection
AAAI 2025
Unsupervised Region-Based Image Editing of Denoising Diffusion Models
AAAI 2025
CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation
AAAI 2025
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance
ICLR 2025
Once-for-All: Controllable Generative Image Compression with Dynamic Granularity Adaptation
ICLR 2025
Making RALM Robust to Irrelevant Contexts via Layer Knowledge Guided Attention
ACL 2025
Visual Relation Diffusion for Human-Object Interaction Detection
ICCV 2025
ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models
ICCV 2025
CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting
ICCV 2025
CharaConsist: Fine-Grained Consistent Character Generation
ICCV 2025
PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching
ICCV 2025
DIDS: Domain Impact-aware Data Sampling for Large Language Model Training
EMNLP 2025
LiPO: Listwise Preference Optimization through Learning-to-Rank
NAACL 2025
Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification
ICML 2025
Diffusion4D: Fast Spatial-temporal Consistent 4D generation via Video Diffusion Models
NIPS 2024
Adaptive Experimentation When You Can't Experiment
NIPS 2024
Statistical Rejection Sampling Improves Preference Optimization
ICLR 2024
Diffusion for Natural Image Matting
ECCV 2024
Region-Adaptive Transform with Segmentation Prior for Image Compression
ECCV 2024
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation
ECCV 2024
Eliminating Warping Shakes for Unsupervised Online Video Stitching
ECCV 2024
PixelLM: Pixel Reasoning with Large Multimodal Model
CVPR 2024
Transferable and Principled Efficiency for Open-Vocabulary Segmentation
CVPR 2024
Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Space Domain Learning
AAAI 2024
Semantic Lens: Instance-Centric Semantic Alignment for Video Super-resolution
AAAI 2024
On the Unstable Convergence Regime of Gradient Descent
AAAI 2024
Lyapunov-Stable Deep Equilibrium Models
AAAI 2024
SeeClear: Semantic Distillation Enhances Pixel Condensation for Video Super-Resolution
NIPS 2024
Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation
CVPR 2024
Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection
CVPR 2024
Endow SAM with Keen Eyes: Temporal-spatial Prompt Learning for Video Camouflaged Object Detection
CVPR 2024
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection
CVPR 2024
Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation
CVPR 2024
Region-Native Visual Tokenization
ECCV 2024
Out-of-Distribution Detection and Selective Generation for Conditional Language Models
ICLR 2023
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation
NIPS 2023
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments
NIPS 2023
SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process
NIPS 2023
Spatiotemporal Deformation Perception for Fisheye Video Rectification
AAAI 2023
Learning To Segment Every Referring Object Point by Point
CVPR 2023
Disentangling Orthogonal Planes for Indoor Panoramic Room Layout Estimation With Cross-Scale Distortion Awareness
CVPR 2023
An In-Depth Exploration of Person Re-Identification and Gait Recognition in Cloth-Changing Conditions
CVPR 2023
Progressive Semantic-Visual Mutual Adaption for Generalized Zero-Shot Learning
CVPR 2023
Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection
CVPR 2023
Investigating Efficiently Extending Transformers for Long Input Summarization
EMNLP 2023
Improving the Robustness of Summarization Models by Detecting and Removing Input Noise
EMNLP 2023
Parallax-Tolerant Unsupervised Deep Image Stitching
ICCV 2023
Locating Noise is Halfway Denoising for Semi-Supervised Segmentation
ICCV 2023
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation
ICCV 2023
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation
ICCV 2023
CTP:Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation
ICCV 2023
RecRecNet: Rectangling Rectified Wide-Angle Images by Thin-Plate Spline Model and DoF-based Curriculum Learning
ICCV 2023
Innovating Real Fisheye Image Correction with Dual Diffusion Architecture
ICCV 2023
SMART: Sentences as Basic Units for Text Evaluation
ICLR 2023
Calibrating Sequence likelihood Improves Conditional Language Generation
ICLR 2023
Revisiting Simple Regret: Fast Rates for Returning a Good Arm
ICML 2023
Complementary Bi-Directional Feature Compression for Indoor 360deg Semantic Segmentation With Self-Distillation
WACV 2023
Implicit Relation Linking for Question Answering over Knowledge Graph
ACL 2022
Mask Matching Transformer for Few-Shot Segmentation
NIPS 2022
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding
ECCV 2022
Slim Scissors: Segmenting Thin Object from Synthetic Background
ECCV 2022
PanoFormer: Panorama Transformer for Indoor 360° Depth Estimation
ECCV 2022
Deep Rectangling for Image Stitching: A Learning Baseline
CVPR 2022
A Well-Composed Text is Half Done! Composition Sampling for Diverse Conditional Generation
ACL 2022
Double Low-Rank Representation With Projection Distance Penalty for Clustering
CVPR 2021
GradingNet: Towards Providing Reliable Supervisions for Weakly Supervised Object Detection by Grading the Box Candidates
AAAI 2021
ForumSum: A Multi-Speaker Conversation Summarization Dataset
EMNLP 2021
Towards Fast and Accurate Real-World Depth Super-Resolution: Benchmark Dataset and Baseline
CVPR 2021
Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow
CVPR 2021
Towards Complete Scene and Regular Shape for Distortion Rectification by Curve-Aware Extrapolation
ICCV 2021
Multi-Level Curriculum for Training a Distortion-Aware Barrel Distortion Rectification Model
ICCV 2021
Fast Template Matching and Update for Video Object Tracking and Segmentation
CVPR 2020
Interactive Object Segmentation With Inside-Outside Guidance
CVPR 2020
Distribution-Induced Bidirectional Generative Adversarial Network for Graph Representation Learning
CVPR 2020
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization
ICML 2020
CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection
NIPS 2020
Devil in the Details: Towards Accurate Single and Multiple Human Parsing
AAAI 2019
Learning Heterogeneous Spatial-Temporal Representation for Bike-Sharing Demand Prediction
AAAI 2019
Self-Supervised Deep Low-Rank Assignment Model for Prototype Selection
IJCAI 2018
Paragraph-level Neural Question Generation with Maxout Pointer and Gated Self-attention Networks
EMNLP 2018
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach
CVPR 2017