Yu Liu
196 papers · 2016–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π Academic Marathon (9) π Conference Polyglot (14) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (12)
π
Cross-Pollinator
(12)
π
Renaissance Researcher
(9)
πΊοΈ
Taxonomy Completionist
(185)
π
Conference Loyalist
(20)
π
Triple Crown
π
Grand Slam
π€
Dynamic Duo
(34)
π¬
Deep Specialist
(20)
π§¬
Topic Evolution
π
Keyword Champion
π
Century Club
(180)
π₯
Unstoppable
(10)
ποΈ
Keyword Collector
(722)
β
The Questioner
(3)
β‘
Prolific Year
(9)
π
Conference Pioneer
π
Trend Setter
Conferences
CVPR (45)
AAAI (35)
ICCV (27)
ECCV (24)
NIPS (16)
ICLR (13)
IJCAI (12)
ICML (10)
EMNLP (7)
COLING (2)
ACL (1)
AISTATS (1)
CORL (1)
JMLR (1)
RSS (1)
Top co-authors
Research topics
Keywords
diffusion model
(21)
image generation
(13)
object detection
(12)
knowledge distillation
(10)
convolutional neural network
(8)
generative model
(7)
text-to-image generation
(7)
semantic segmentation
(7)
large language model
(7)
autonomous driving
(7)
domain adaptation
(6)
representation learning
(6)
neural network
(6)
video generation
(6)
image synthesis
(5)
image classification
(4)
vision-language model
(4)
zero-shot learning
(4)
self-supervised learning
(4)
pose estimation
(4)
Papers
Two Streams, One Sarcasm: Orthogonal Expert Tuning for Holistic Multimodal Sarcasm Understanding
ACL 2026
MPMA: Preference Manipulation Attack Against Model Context Protocol
AAAI 2026
Cross-modal Proxy Evolving for OOD Detection with Vision-Language Models
AAAI 2026
OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval
AAAI 2026
Adaptive Dynamic Dehazing via Instruction-Driven and Task-Feedback Closed-Loop Optimization for Diverse Downstream Task Adaptation
AAAI 2026
FDP: A Frequency-Decomposition Preprocessing Pipeline for Unsupervised Anomaly Detection in Brain MRI
AAAI 2026
RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels
AAAI 2026
Learning 3D Occupancy from Beam Overlap in 2D Rotating mmWave Radar
AAAI 2026
EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision
AAAI 2026
PathMind: A Retrieve-Prioritize-Reason Framework for Knowledge Graph Reasoning with Large Language Models
AAAI 2026
Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning
AAAI 2026
Time Series Class-Incremental Learning via Confidence-guided Mask Distillation and Prototype-guided Contrastive Learning
AAAI 2026
IndoorUAV: Benchmarking Vision-Language UAV Navigation in Continuous Indoor Environments
AAAI 2026
Causality-Aligned Semantic Recovery for Incomplete Cross-Modal Retrieval
AAAI 2026
Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction
AAAI 2026
Gracefully Air-Written: Enhancing the Legibility and Style Consistency of In-Air Handwriting
AAAI 2026
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
CVPR 2025
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
CVPR 2025
See Further When Clear: Curriculum Consistency Model
CVPR 2025
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
CVPR 2025
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025
As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection
AAAI 2025
MMET: A Multi-Input and Multi-Scale Transformer for Efficient PDEs Solving
IJCAI 2025
Aspect-Based Sentiment Analysis with Syntax-Opinion-Sentiment Reasoning Chain
COLING 2025
Enhancing Semantic Clarity: Discriminative and Fine-grained Information Mining for Remote Sensing Image-Text Retrieval
IJCAI 2025
EfficientPIE: Real-Time Prediction on Pedestrian Crossing Intention with Sole Observation
IJCAI 2025
OT-DETECTOR: Delving into Optimal Transport for Zero-shot Out-of-Distribution Detection
IJCAI 2025
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
ICML 2025
How Distributed Collaboration Influences the Diffusion Model Training? A Theoretical Perspective
ICML 2025
PDUDT: Provable Decentralized Unlearning under Dynamic Topologies
ICML 2025
MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines
ICLR 2025
Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting
ICLR 2025
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction
ICLR 2025
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer
ICLR 2025
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
ICCV 2025
MPBR: Multimodal Progressive Bidirectional Reasoning for Open-Set Fine-Grained Recognition
ICCV 2025
UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments
ICCV 2025
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
ICCV 2025
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment
ICCV 2025
VACE: All-in-One Video Creation and Editing
ICCV 2025
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
ICCV 2025
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
ICCV 2025
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
ICCV 2025
ThinkAnswer Loss: Balancing Semantic Similarity and Exact Matching for LLM Reasoning Enhancement
EMNLP 2025
Agent-in-the-Loop: A Data Flywheel for Continuous Improvement in LLM-based Customer Support
EMNLP 2025
MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation
EMNLP 2025
Enhancing Large Language Model for Knowledge Graph Completion via Structure-Aware Alignment-Tuning
EMNLP 2025
OpenCarbon: A Contrastive Learning-based Cross-Modality Neural Approach for High-Resolution Carbon Emission Prediction Using Open Data
IJCAI 2025
IDEA-Bench: How Far are Generative Models from Professional Designing?
CVPR 2025
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
Universal Actions for Enhanced Embodied Foundation Models
CVPR 2025
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
CVPR 2025
Long-term Detection and Monitory of Chinese Urban Village Using Satellite Imagery
IJCAI 2024
Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
NIPS 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
NIPS 2024
Phased Consistency Models
NIPS 2024
Zero-shot Image Editing with Reference Imitation
NIPS 2024
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
NIPS 2024
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models
NIPS 2024
LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment
NIPS 2024
Not Just Object, But State: Compositional Incremental Learning without Forgetting
NIPS 2024
Instruction-Guided Visual Masking
NIPS 2024
CI-STHPAN: Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph
AAAI 2024
GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting
AAAI 2024
Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches
AAAI 2024
Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval
AAAI 2024
Critic-Guided Decision Transformer for Offline Reinforcement Learning
AAAI 2024
AUC Optimization from Multiple Unlabeled Datasets
AAAI 2024
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024
Estimating On-Road Transportation Carbon Emissions from Open Data of Road Network and Origin-Destination Flow Data
AAAI 2024
UV-SAM: Adapting Segment Anything Model for Urban Village Identification
AAAI 2024
ESCP: Enhancing Emotion Recognition in Conversation with Speech and Contextual Prefixes
COLING 2024
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
CVPR 2024
Multi-agent Collaborative Perception via Motion-aware Robust Communication Network
CVPR 2024
Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
CVPR 2024
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
CVPR 2024
Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
CVPR 2024
CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement
CVPR 2024
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
CVPR 2024
EasyDrag: Efficient Point-based Manipulation on Diffusion Models
CVPR 2024
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
CVPR 2024
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
CVPR 2024
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
CVPR 2024
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
CVPR 2024
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
MultiGen: Zero-shot Image Generation from Multi-modal Prompts
ECCV 2024
Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting
ECCV 2024
SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields
ECCV 2024
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis
ECCV 2024
LivePhoto: Real Image Animation with Text-guided Motion Control
ECCV 2024
Exploring Guided Sampling of Conditional GANs
ECCV 2024
Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks
ECCV 2024
Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation
ECCV 2024
ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model
ECCV 2024
Chains of Diffusion Models
ECCV 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
ECCV 2024
Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations
ECCV 2024
How Grammatical Features Impact Machine Translation: A New Test Suite for Chinese-English MT Evaluation
EMNLP 2024
The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models
ICLR 2024
Space Group Constrained Crystal Generation
ICLR 2024
Continuous Invariance Learning
ICLR 2024
Lipschitz Singularities in Diffusion Models
ICLR 2024
DreamClean: Restoring Clean Image Using Deep Diffusion Prior
ICLR 2024
ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation
ICML 2024
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
ICML 2024
StrokeNUWAβTokenizing Strokes for Vector Graphic Synthesis
ICML 2024
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
From Pixels to Progress: Generating Road Network from Satellite Imagery for Socioeconomic Insights in Impoverished Areas
IJCAI 2024
SLOTH: Structured Learning and Task-Based Optimization for Time Series Forecasting on Hierarchies
AAAI 2023
Video Diffusion Models with Local-Global Context Guidance
IJCAI 2023
Masked Autoencoders Are Stronger Knowledge Distillers
ICCV 2023
Generating Dynamic Kernels via Transformers for Lane Detection
ICCV 2023
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
ICCV 2023
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors
ICCV 2023
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers
ICCV 2023
3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability
ICCV 2023
Deep Active Contours for Real-time 6-DoF Object Tracking
ICCV 2023
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
ICCV 2023
ReasonNet: End-to-End Driving With Temporal and Global Reasoning
CVPR 2023
Dimensionality-Varying Diffusion Process
CVPR 2023
Long-Term Visual Localization With Mobile Sensors
CVPR 2023
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers
CVPR 2023
Arbitrary Virtual Try-on Network: Characteristics Representation and Trade-off between Body and Clothing
ICLR 2023
Improving Object-centric Learning with Query Optimization
ICLR 2023
GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation
ICLR 2023
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
NIPS 2023
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
NIPS 2023
Composer: Creative and Controllable Image Synthesis with Composable Conditions
ICML 2023
Cones: Concept Neurons in Diffusion Models for Customized Generation
ICML 2023
Style-Content Metric Learning for Multidomain Remote Sensing Object Recognition
AAAI 2023
ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency
AAAI 2023
Customizable Image Synthesis with Multiple Subjects
NIPS 2023
Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
RSS 2023
COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering
AAAI 2023
DETRs with Collaborative Hybrid Assignments Training
ICCV 2023
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
ICCV 2023
Memory Augmented State Space Model for Time Series Forecasting
IJCAI 2022
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes
NIPS 2022
Unifying Visual Perception by Dispersible Points Learning
ECCV 2022
Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix
ECCV 2022
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
CORL 2022
Self-Slimmed Vision Transformer
ECCV 2022
"UniNet: Unified Architecture Search with Convolution, Transformer, and MLP"
ECCV 2022
GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints
ECCV 2022
Towards Robust Face Recognition with Comprehensive Search
ECCV 2022
Rethinking Robust Representation Learning under Fine-Grained Noisy Faces
ECCV 2022
UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning
ICLR 2022
A Bayesian Model for Online Activity Sample Sizes
AISTATS 2022
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers
ECCV 2022
A Trend-Driven Fashion Design System for Rapid Response Marketing in E-commerce
AAAI 2022
Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way
CVPR 2022
Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning
AAAI 2021
SOM-NCSCM : An Efficient Neural Chinese Sentence Compression Model Enhanced with Self-Organizing Map
EMNLP 2021
Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph Embeddings
EMNLP 2021
Switchable K-Class Hyperplanes for Noise-Robust Representation Learning
ICCV 2021
Self-Supervised Video Representation Learning by Context and Motion Decoupling
CVPR 2021
Lifelong Person Re-Identification via Adaptive Knowledge Accumulation
CVPR 2021
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
CVPR 2021
Communication Efficient SGD via Gradient Sampling With Bayes Prior
CVPR 2021
Neighborhood Intervention Consistency: Measuring Confidence for Knowledge Graph Link Prediction
IJCAI 2021
Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images
CVPR 2020
DPGN: Distribution Propagation Graph Network for Few-Shot Learning
CVPR 2020
Revisiting the Sibling Head in Object Detector
CVPR 2020
Smoothed Nonparametric Derivative Estimation using Weighted Difference Quotients
JMLR 2020
Label-Attended Hashing for Multi-Label Image Retrieval
IJCAI 2020
KPNet: Towards Minimal Face Detector
AAAI 2020
Anisotropic Convolutional Networks for 3D Semantic Scene Completion
CVPR 2020
Discriminability Distillation in Group Representation Learning
ECCV 2020
Learning Where to Focus for Efficient Video Object Detection
ECCV 2020
More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning
ECCV 2020
Temporal Interlacing Network
AAAI 2020
Search to Distill: Pearls Are Everywhere but Not the Eyes
CVPR 2020
Scalable Place Recognition Under Appearance Change for Autonomous Driving
ICCV 2019
Exploiting Temporal Consistency for Real-Time Video Depth Estimation
ICCV 2019
Differentiable Kernel Evolution
ICCV 2019
Correlation Congruence for Knowledge Distillation
ICCV 2019
RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion
CVPR 2019
Talking Face Generation by Adversarially Disentangled Audio-Visual Representation
AAAI 2019
Conditional Adversarial Generative Flow for Controllable Image Synthesis
CVPR 2019
Gradient Harmonized Single-Stage Detector
AAAI 2019
Knowledge Distillation via Route Constrained Optimization
ICCV 2019
Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy
CVPR 2018
Derivative Estimation in Random Design
NIPS 2018
Exploring Disentangled Feature Representation Beyond Face Identification
CVPR 2018
MoNet: Deep Motion Exploitation for Video Object Segmentation
CVPR 2018
Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning
CVPR 2018
A Simple Convolutional Neural Network for Accurate P300 Detection and Character Spelling in Brain Computer Interface
IJCAI 2018
Transductive Centroid Projection for Semi-supervised Large-scale Recognition
ECCV 2018
Learning a Recurrent Residual Fusion Network for Multimodal Matching
ICCV 2017
Recurrent Scale Approximation for Object Detection in CNN
ICCV 2017
Scale-Aware Face Detection
CVPR 2017
Unsupervised Sequence Classification using Sequential Output Statistics
NIPS 2017
Quality Aware Network for Set to Set Recognition
CVPR 2017
K-Means Clustering with Distributed Dimensions
ICML 2016
Combinatorial Multi-Armed Bandit with General Reward Functions
NIPS 2016
Learning Relaxed Deep Supervision for Better Edge Detection
CVPR 2016