Feng Zhao
86 papers · 2020–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (15) π Renaissance Researcher (6) π Interdisciplinary Bridge π£ Hot Topic Early Bird
π
Conference Polyglot
(11)
π
Cross-Pollinator
(6)
πΊοΈ
Taxonomy Completionist
(15)
π±
Topic Pioneer
π¬
Deep Specialist
(14)
π§¬
Topic Evolution
π€
Dynamic Duo
(26)
π
Grand Slam
ποΈ
Keyword Collector
(348)
β
The Questioner
(2)
β‘
Prolific Year
(18)
π
Trend Setter
π
Century Club
(78)
π₯
Unstoppable
(5)
Conferences
CVPR (19)
ECCV (11)
EMNLP (10)
AAAI (9)
NIPS (9)
ACL (8)
ICCV (7)
IJCAI (5)
ICLR (4)
COLING (3)
ICML (1)
Top co-authors
Keywords
large language model
(10)
image restoration
(9)
knowledge graph
(7)
object detection
(5)
semantic segmentation
(5)
image fusion
(4)
remote sensing
(4)
domain generalization
(4)
vision-language model
(4)
diffusion model
(3)
self-supervised learning
(3)
fourier transform
(3)
image enhancement
(3)
neural network optimization
(3)
contrastive learning
(3)
multi-modal learning
(3)
question answering
(3)
attention mechanism
(3)
multimodal learning
(3)
image super-resolution
(3)
Papers
Breaking Block Boundaries: Anchor-based History-stable Decoding for Diffusion Large Language Models
ACL 2026
MACoT: Synthesizing Chains of Thought for Small Models via Multi-Agent Collaboration
AAAI 2026
Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
AAAI 2026
Frequency-Aware Vision-Language Multimodality Generalization Network for Remote Sensing Image Classification
AAAI 2026
Token Painter: Training-Free Text-Guided Image Inpainting via Mask Autoregressive Models
AAAI 2026
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
ACL 2026
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning
ACL 2026
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
ACL 2026
FreePCA: Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Principal Component Analysis
CVPR 2025
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
CVPR 2025
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
EMNLP 2025
Priority on High-Quality: Selecting Instruction Data via Consistency Verification of Noise Injection
EMNLP 2025
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis
ICCV 2025
PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection
ICLR 2025
FreeDNA: Endowing Domain Adaptation of Diffusion-Based Dense Prediction with Training-Free Domain Noise Alignment
ICCV 2025
Correcting on Graph: Faithful Semantic Parsing over Knowledge Graphs with Large Language Models
ACL 2025
LGA: LLM-GNN Aggregation for Temporal Evolution Attribute Graph Prediction
EMNLP 2025
Commonsense Subgraph for Inductive Relation Reasoning with Meta-learning
COLING 2025
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
ICLR 2025
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
EMNLP 2025
Enhancing Large Vision-Language Models with Ultra-Detailed Image Caption Generation
EMNLP 2025
Inductive Reasoning on Few-Shot Knowledge Graphs with Task-Aware Language Models
EMNLP 2025
Data Center Cooling System Optimization Using Offline Reinforcement Learning
ICLR 2025
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping
AAAI 2025
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs
EMNLP 2025
Adaptive Dropout: Unleashing Dropout across Layers for Generalizable Image Super-Resolution
CVPR 2025
Navigating Image Restoration with VAR's Distribution Alignment Prior
CVPR 2025
Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes
CVPR 2025
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
CVPR 2025
Unmasking Bias in Diffusion Model Training
ECCV 2024
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection
AAAI 2024
Graph Reasoning Transformers for Knowledge-Aware Question Answering
AAAI 2024
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
ACL 2024
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety
ACL 2024
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
ACL 2024
Correcting Language Model Bias for Text Classification in True Zero-Shot Learning
COLING 2024
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
NIPS 2024
Are We on the Right Way for Evaluating Large Vision-Language Models?
NIPS 2024
GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling
NIPS 2024
Revisiting Spatial-Frequency Information Integration from a Hierarchical Perspective for Panchromatic and Multi-Spectral Image Fusion
CVPR 2024
Probing Synergistic High-Order Interaction in Infrared and Visible Image Fusion
CVPR 2024
Empowering Resampling Operation for Ultra-High-Definition Image Enhancement with Model-Aware Guidance
CVPR 2024
KG-CoT: Chain-of-Thought Prompting of Large Language Models over Knowledge Graphs for Knowledge-Aware Question Answering
IJCAI 2024
PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation
IJCAI 2024
Discrete Latent Perspective Learning for Segmentation and Detection
ICML 2024
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
ECCV 2024
ShareGPT4V: Improving Large Multi-Modal Models with Better Captions
ECCV 2024
Stream Query Denoising for Vectorized HD-Map Construction
ECCV 2024
Stable Preference: Redefining training paradigm of human preference model for Text-to-Image Synthesis
ECCV 2024
Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing
ECCV 2024
"Idling Neurons, Appropriately Lenient Workload During Fine-tuning Leads to Better Generalization"
ECCV 2024
Guided Patch-Grouping Wavelet Transformer with Spatial Congruence for Ultra-High Resolution Segmentation
IJCAI 2023
FouriDown: Factoring Down-Sampling into Shuffling and Superposing
NIPS 2023
Transition-constant Normalization for Image Enhancement
NIPS 2023
Deep Fractional Fourier Transform
NIPS 2023
Intensity-Aware Loss for Dynamic Facial Expression Recognition in the Wild
AAAI 2023
Learning Semantic Degradation-Aware Guidance for Recognition-Driven Unsupervised Low-Light Image Enhancement
AAAI 2023
Ultra-High Resolution Segmentation With Ultra-Rich Context: A Novel Benchmark
CVPR 2023
Towards Domain Generalization for Multi-View 3D Object Detection in Bird-Eye-View
CVPR 2023
Learning Sample Relationship for Exposure Correction
CVPR 2023
Visual Recognition-Driven Image Restoration for Multiple Degradation With Intrinsic Semantics Recovery
CVPR 2023
Ingredient-Oriented Multi-Degradation Learning for Image Restoration
CVPR 2023
Structure-aware Knowledge Graph-to-text Generation with Planning Selection and Similarity Distinction
EMNLP 2023
Exploring Temporal Frequency Spectrum in Deep Video Deblurring
ICCV 2023
Learning from Noisy Data for Semi-Supervised 3D Object Detection
ICCV 2023
Empowering Low-Light Image Enhancer through Customized Learnable Priors
ICCV 2023
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
ICCV 2023
DETRDistill: A Universal Knowledge Distillation Framework for DETR-families
ICCV 2023
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection
ICLR 2023
Deep Fourier Up-Sampling
NIPS 2022
RelCLIP: Adapting Language-Image Pretraining for Visual Relationship Detection via Relational Contrastive Learning
EMNLP 2022
Frequency and Spatial Dual Guidance for Image Dehazing
ECCV 2022
Deep Fourier-Based Exposure Correction Network with Spatial-Frequency Interaction
ECCV 2022
AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection
IJCAI 2022
MMNet: Muscle Motion-Guided Network for Micro-Expression Recognition
IJCAI 2022
Panchromatic and Multispectral Image Fusion via Alternating Reverse Filtering Network
NIPS 2022
Spatial-Frequency Domain Information Integration for Pan-Sharpening
ECCV 2022
Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection
ECCV 2022
Exposure Normalization and Compensation for Multiple-Exposure Correction
CVPR 2022
Bijective Mapping Network for Shadow Removal
CVPR 2022
Unleashing Potential of Unsupervised Pre-Training With Intra-Identity Regularization for Person Re-Identification
CVPR 2022
Mutual Information-Driven Pan-Sharpening
CVPR 2022
Can Language Models Serve as Temporal Knowledge Bases?
EMNLP 2022
OpticE: A Coherence Theory-Based Model for Link Prediction
COLING 2022
Roadblocks for Temporarily Disabling Shortcuts and Learning New Knowledge
NIPS 2022
P2B: Point-to-Box Network for 3D Object Tracking in Point Clouds
CVPR 2020