Feng Zhao

86 papers · 2020–2026 · 11 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (15) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🌍 Conference Polyglot (11) 🐝 Cross-Pollinator (6) 🗺️ Taxonomy Completionist (15) 🌱 Topic Pioneer 🔬 Deep Specialist (14) 🧬 Topic Evolution 🤝 Dynamic Duo (26) 🏆 Grand Slam 🗃️ Keyword Collector (348) ❓ The Questioner (2) ⚡ Prolific Year (18) 📈 Trend Setter 💎 Century Club (78) 🔥 Unstoppable (5)

Conferences

CVPR (19) ECCV (11) EMNLP (10) AAAI (9) NIPS (9) ACL (8) ICCV (7) IJCAI (5) ICLR (4) COLING (3) ICML (1)

Top co-authors

Jie Huang (27) Zehui Chen (21) man zhou (19) Lin Chen (9) Hu Yu (9) Naishan Zheng (9) Dahua Lin (8) Chongyi Li (6) Ruilin Zhao (6) Cheng Yan (5)

Keywords

large language model (10) image restoration (9) knowledge graph (7) object detection (5) semantic segmentation (5) image fusion (4) remote sensing (4) domain generalization (4) vision-language model (4) diffusion model (3) self-supervised learning (3) fourier transform (3) image enhancement (3) neural network optimization (3) contrastive learning (3) multi-modal learning (3) question answering (3) attention mechanism (3) multimodal learning (3) image super-resolution (3)

Papers

Breaking Block Boundaries: Anchor-based History-stable Decoding for Diffusion Large Language Models ACL 2026 MACoT: Synthesizing Chains of Thought for Small Models via Multi-Agent Collaboration AAAI 2026 Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling AAAI 2026 Frequency-Aware Vision-Language Multimodality Generalization Network for Remote Sensing Image Classification AAAI 2026 Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models AAAI 2026 Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs ACL 2026 Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning ACL 2026 UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision ACL 2026 FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis CVPR 2025 SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling CVPR 2025 ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents EMNLP 2025 Priority on High-Quality: Selecting Instruction Data via Consistency Verification of Noise Injection EMNLP 2025 Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis ICCV 2025 PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection ICLR 2025 FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment ICCV 2025 Correcting on Graph: Faithful Semantic Parsing over Knowledge Graphs with Large Language Models ACL 2025 LGA: LLM-GNN Aggregation for Temporal Evolution Attribute Graph Prediction EMNLP 2025 Commonsense Subgraph for Inductive Relation Reasoning with Meta-learning COLING 2025 MindSearch: Mimicking Human Minds Elicits Deep AI Searcher ICLR 2025 CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios EMNLP 2025 Enhancing Large Vision-Language Models with Ultra-Detailed Image Caption Generation EMNLP 2025 Inductive Reasoning on Few-Shot Knowledge Graphs with Task-Aware Language Models EMNLP 2025 Data Center Cooling System Optimization Using Offline Reinforcement Learning ICLR 2025 VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping AAAI 2025 SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs EMNLP 2025 Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution CVPR 2025 Navigating Image Restoration with VAR's Distribution Alignment Prior CVPR 2025 Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes CVPR 2025 SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models CVPR 2025 Unmasking Bias in Diffusion Model Training ECCV 2024 Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection AAAI 2024 Graph Reasoning Transformers for Knowledge-Aware Question Answering AAAI 2024 T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step ACL 2024 PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety ACL 2024 Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models ACL 2024 Correcting Language Model Bias for Text Classification in True Zero-Shot Learning COLING 2024 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions NIPS 2024 Are We on the Right Way for Evaluating Large Vision-Language Models? NIPS 2024 GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling NIPS 2024 Revisiting Spatial-Frequency Information Integration from a Hierarchical Perspective for Panchromatic and Multi-Spectral Image Fusion CVPR 2024 Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion CVPR 2024 Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance CVPR 2024 KG-CoT: Chain-of-Thought Prompting of Large Language Models over Knowledge Graphs for Knowledge-Aware Question Answering IJCAI 2024 PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation IJCAI 2024 Discrete Latent Perspective Learning for Segmentation and Detection ICML 2024 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models ECCV 2024 ShareGPT4V: Improving Large Multi-Modal Models with Better Captions ECCV 2024 Stream Query Denoising for Vectorized HD-Map Construction ECCV 2024 Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis ECCV 2024 Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing ECCV 2024 "Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization" ECCV 2024 Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation IJCAI 2023 FouriDown: Factoring Down-Sampling into Shuffling and Superposing NIPS 2023 Transition-constant Normalization for Image Enhancement NIPS 2023 Deep Fractional Fourier Transform NIPS 2023 Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild AAAI 2023 Learning Semantic Degradation-Aware Guidance for Recognition-Driven Unsupervised Low-Light Image Enhancement AAAI 2023 Ultra-High Resolution Segmentation With Ultra-Rich Context: A Novel Benchmark CVPR 2023 Towards Domain Generalization for Multi-View 3D Object Detection in Bird-Eye-View CVPR 2023 Learning Sample Relationship for Exposure Correction CVPR 2023 Visual Recognition-Driven Image Restoration for Multiple Degradation With Intrinsic Semantics Recovery CVPR 2023 Ingredient-Oriented Multi-Degradation Learning for Image Restoration CVPR 2023 Structure-aware Knowledge Graph-to-text Generation with Planning Selection and Similarity Distinction EMNLP 2023 Exploring Temporal Frequency Spectrum in Deep Video Deblurring ICCV 2023 Learning from Noisy Data for Semi-Supervised 3D Object Detection ICCV 2023 Empowering Low-Light Image Enhancer through Customized Learnable Priors ICCV 2023 FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models ICCV 2023 DETRDistill: A Universal Knowledge Distillation Framework for DETR-families ICCV 2023 BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection ICLR 2023 Deep Fourier Up-Sampling NIPS 2022 RelCLIP: Adapting Language-Image Pretraining for Visual Relationship Detection via Relational Contrastive Learning EMNLP 2022 Frequency and Spatial Dual Guidance for Image Dehazing ECCV 2022 Deep Fourier-Based Exposure Correction Network with Spatial-Frequency Interaction ECCV 2022 AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection IJCAI 2022 MMNet: Muscle Motion-Guided Network for Micro-Expression Recognition IJCAI 2022 Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network NIPS 2022 Spatial-Frequency Domain Information Integration for Pan-Sharpening ECCV 2022 Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection ECCV 2022 Exposure Normalization and Compensation for Multiple-Exposure Correction CVPR 2022 Bijective Mapping Network for Shadow Removal CVPR 2022 Unleashing Potential of Unsupervised Pre-Training With Intra-Identity Regularization for Person Re-Identification CVPR 2022 Mutual Information-Driven Pan-Sharpening CVPR 2022 Can Language Models Serve as Temporal Knowledge Bases? EMNLP 2022 OpticE: A Coherence Theory-Based Model for Link Prediction COLING 2022 Roadblocks for Temporarily Disabling Shortcuts and Learning New Knowledge NIPS 2022 P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds CVPR 2020