conftrace_

Yu Liu

196 papers · 2016–2026 · 15 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+17 more ↓

🏃 Academic Marathon (9) 🌍 Conference Polyglot (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (9) 🗺️ Taxonomy Completionist (185) 🏠 Conference Loyalist (20) 👑 Triple Crown 🏆 Grand Slam 🤝 Dynamic Duo (34) 🔬 Deep Specialist (20) 🧬 Topic Evolution 🏆 Keyword Champion 💎 Century Club (180) 🔥 Unstoppable (10) 🗃️ Keyword Collector (722) ❓ The Questioner (3) ⚡ Prolific Year (9) 🚀 Conference Pioneer 📈 Trend Setter

Conferences

CVPR (45) AAAI (35) ICCV (27) ECCV (24) NIPS (16) ICLR (13) IJCAI (12) ICML (10) EMNLP (7) COLING (2) ACL (1) AISTATS (1) CORL (1) JMLR (1) RSS (1)

Top co-authors

Guanglu Song (34) hongsheng Li (29) Lianghua Huang (13) Xiaogang Wang (12) Yujun Shen (12) Jihao Liu (11) Hao Shao (10) Zhuofan Zong (10) Boxiao Liu (9) Jingren Zhou (8)

Research topics

Applications (1)

Keywords

diffusion model (21) image generation (13) object detection (12) knowledge distillation (10) convolutional neural network (8) generative model (7) text-to-image generation (7) semantic segmentation (7) large language model (7) autonomous driving (7) domain adaptation (6) representation learning (6) neural network (6) video generation (6) image synthesis (5) image classification (4) vision-language model (4) zero-shot learning (4) self-supervised learning (4) pose estimation (4)

Papers

Two Streams, One Sarcasm: Orthogonal Expert Tuning for Holistic Multimodal Sarcasm Understanding ACL 2026 MPMA: Preference Manipulation Attack Against Model Context Protocol AAAI 2026 Cross-modal Proxy Evolving for OOD Detection with Vision-Language Models AAAI 2026 OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval AAAI 2026 Adaptive Dynamic Dehazing via Instruction-Driven and Task-Feedback Closed-Loop Optimization for Diverse Downstream Task Adaptation AAAI 2026 FDP: A Frequency-Decomposition Preprocessing Pipeline for Unsupervised Anomaly Detection in Brain MRI AAAI 2026 RSOD: Reliability-Guided Sonar Image Object Detection with Extremely Limited Labels AAAI 2026 Learning 3D Occupancy from Beam Overlap in 2D Rotating mmWave Radar AAAI 2026 EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision AAAI 2026 PathMind: A Retrieve-Prioritize-Reason Framework for Knowledge Graph Reasoning with Large Language Models AAAI 2026 Do LLMs Feel? Teaching Emotion Recognition with Prompts, Retrieval, and Curriculum Learning AAAI 2026 Time Series Class-Incremental Learning via Confidence-guided Mask Distillation and Prototype-guided Contrastive Learning AAAI 2026 IndoorUAV: Benchmarking Vision-Language UAV Navigation in Continuous Indoor Environments AAAI 2026 Causality-Aligned Semantic Recovery for Incomplete Cross-Modal Retrieval AAAI 2026 Intention-Aware Diffusion Model for Pedestrian Trajectory Prediction AAAI 2026 Gracefully Air-Written: Enhancing the Legibility and Style Consistency of In-Air Handwriting AAAI 2026 BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs CVPR 2025 NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics CVPR 2025 See Further When Clear: Curriculum Consistency Model CVPR 2025 MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes CVPR 2025 Improved Video VAE for Latent Video Diffusion Model CVPR 2025 As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection AAAI 2025 MMET: A Multi-Input and Multi-Scale Transformer for Efficient PDEs Solving IJCAI 2025 Aspect-Based Sentiment Analysis with Syntax-Opinion-Sentiment Reasoning Chain COLING 2025 Enhancing Semantic Clarity: Discriminative and Fine-grained Information Mining for Remote Sensing Image-Text Retrieval IJCAI 2025 EfficientPIE: Real-Time Prediction on Pedestrian Crossing Intention with Sole Observation IJCAI 2025 OT-DETECTOR: Delving into Optimal Transport for Zero-shot Out-of-Distribution Detection IJCAI 2025 EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM ICML 2025 How Distributed Collaboration Influences the Diffusion Model Training? A Theoretical Perspective ICML 2025 PDUDT: Provable Decentralized Unlearning under Dynamic Topologies ICML 2025 MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines ICLR 2025 Building Interactable Replicas of Complex Articulated Objects via Gaussian Splatting ICLR 2025 SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction ICLR 2025 ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer ICLR 2025 TACO: Taming Diffusion for in-the-wild Video Amodal Completion ICCV 2025 MPBR: Multimodal Progressive Bidirectional Reasoning for Open-Set Fine-Grained Recognition ICCV 2025 UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments ICCV 2025 ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing ICCV 2025 LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment ICCV 2025 VACE: All-in-One Video Creation and Editing ICCV 2025 DiffDoctor: Diagnosing Image Diffusion Models Before Treating ICCV 2025 Pretrained Reversible Generation as Unsupervised Visual Representation Learning ICCV 2025 Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy ICCV 2025 ThinkAnswer Loss: Balancing Semantic Similarity and Exact Matching for LLM Reasoning Enhancement EMNLP 2025 Agent-in-the-Loop: A Data Flywheel for Continuous Improvement in LLM-based Customer Support EMNLP 2025 MADS: Multi-Agent Dialogue Simulation for Diverse Persuasion Data Generation EMNLP 2025 Enhancing Large Language Model for Knowledge Graph Completion via Structure-Aware Alignment-Tuning EMNLP 2025 OpenCarbon: A Contrastive Learning-based Cross-Modality Neural Approach for High-Resolution Carbon Emission Prediction Using Open Data IJCAI 2025 IDEA-Bench: How Far are Generative Models from Professional Designing? CVPR 2025 MangaNinja: Line Art Colorization with Precise Reference Following CVPR 2025 Universal Actions for Enhanced Embodied Foundation Models CVPR 2025 Decompositional Neural Scene Reconstruction with Generative Diffusion Prior CVPR 2025 Long-term Detection and Monitory of Chinese Urban Village Using Satellite Imagery IJCAI 2024 Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning NIPS 2024 CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching NIPS 2024 Phased Consistency Models NIPS 2024 Zero-shot Image Editing with Reference Imitation NIPS 2024 MoVA: Adapting Mixture of Vision Experts to Multimodal Context NIPS 2024 Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models NIPS 2024 LoD-Loc: Aerial Visual Localization using LoD 3D Map with Neural Wireframe Alignment NIPS 2024 Not Just Object, But State: Compositional Incremental Learning without Forgetting NIPS 2024 Instruction-Guided Visual Masking NIPS 2024 CI-STHPAN: Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph AAAI 2024 GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting AAAI 2024 Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches AAAI 2024 Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval AAAI 2024 Critic-Guided Decision Transformer for Offline Reinforcement Learning AAAI 2024 AUC Optimization from Multiple Unlabeled Datasets AAAI 2024 A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning AAAI 2024 Estimating On-Road Transportation Carbon Emissions from Open Data of Road Network and Origin-Destination Flow Data AAAI 2024 UV-SAM: Adapting Segment Anything Model for Urban Village Identification AAAI 2024 ESCP: Enhancing Emotion Recognition in Conversation with Speech and Contextual Prefixes COLING 2024 SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction CVPR 2024 Multi-agent Collaborative Perception via Motion-aware Robust Communication Network CVPR 2024 Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation CVPR 2024 GLID: Pre-training a Generalist Encoder-Decoder Vision Model CVPR 2024 Novel Class Discovery for Ultra-Fine-Grained Visual Categorization CVPR 2024 CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement CVPR 2024 Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation CVPR 2024 EasyDrag: Efficient Point-based Manipulation on Diffusion Models CVPR 2024 DreamVideo: Composing Your Dream Videos with Customized Subject and Motion CVPR 2024 Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following CVPR 2024 LMDrive: Closed-Loop End-to-End Driving with Large Language Models CVPR 2024 Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance CVPR 2024 AnyDoor: Zero-shot Object-level Image Customization CVPR 2024 MultiGen: Zero-shot Image Generation from Multi-modal Prompts ECCV 2024 Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting ECCV 2024 SlotLifter: Slot-guided Feature Lifting for Learning Object-Centric Radiance Fields ECCV 2024 FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis ECCV 2024 LivePhoto: Real Image Animation with Text-guided Motion Control ECCV 2024 Exploring Guided Sampling of Conditional GANs ECCV 2024 Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediciton Tasks ECCV 2024 Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation ECCV 2024 ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model ECCV 2024 Chains of Diffusion Models ECCV 2024 Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models ECCV 2024 Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations ECCV 2024 How Grammatical Features Impact Machine Translation: A New Test Suite for Chinese-English MT Evaluation EMNLP 2024 The Devil is in the Neurons: Interpreting and Mitigating Social Biases in Language Models ICLR 2024 Space Group Constrained Crystal Generation ICLR 2024 Continuous Invariance Learning ICLR 2024 Lipschitz Singularities in Diffusion Models ICLR 2024 DreamClean: Restoring Clean Image Using Deep Diffusion Prior ICLR 2024 ReDiffuser: Reliable Decision-Making Using a Diffuser with Confidence Estimation ICML 2024 DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning ICML 2024 StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis ICML 2024 CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models ICML 2024 From Pixels to Progress: Generating Road Network from Satellite Imagery for Socioeconomic Insights in Impoverished Areas IJCAI 2024 SLOTH: Structured Learning and Task-Based Optimization for Time Series Forecasting on Hierarchies AAAI 2023 Video Diffusion Models with Local-Global Context Guidance IJCAI 2023 Masked Autoencoders Are Stronger Knowledge Distillers ICCV 2023 Generating Dynamic Kernels via Transformers for Lane Detection ICCV 2023 GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding ICCV 2023 UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors ICCV 2023 Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers ICCV 2023 3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability ICCV 2023 Deep Active Contours for Real-time 6-DoF Object Tracking ICCV 2023 Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction ICCV 2023 ReasonNet: End-to-End Driving With Temporal and Global Reasoning CVPR 2023 Dimensionality-Varying Diffusion Process CVPR 2023 Long-Term Visual Localization With Mobile Sensors CVPR 2023 MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers CVPR 2023 Arbitrary Virtual Try-on Network: Characteristics Representation and Trade-off between Body and Clothing ICLR 2023 Improving Object-centric Learning with Query Optimization ICLR 2023 GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation ICLR 2023 LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios NIPS 2023 RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths NIPS 2023 Composer: Creative and Controllable Image Synthesis with Composable Conditions ICML 2023 Cones: Concept Neurons in Diffusion Models for Customized Generation ICML 2023 Style-Content Metric Learning for Multidomain Remote Sensing Object Recognition AAAI 2023 ACE: Cooperative Multi-Agent Q-learning with Bidirectional Action-Dependency AAAI 2023 Customizable Image Synthesis with Multiple Subjects NIPS 2023 Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors RSS 2023 COCA: COllaborative CAusal Regularization for Audio-Visual Question Answering AAAI 2023 DETRs with Collaborative Hybrid Assignments Training ICCV 2023 Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection ICCV 2023 Memory Augmented State Space Model for Time Series Forecasting IJCAI 2022 Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes NIPS 2022 Unifying Visual Perception by Dispersible Points Learning ECCV 2022 Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix ECCV 2022 Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer CORL 2022 Self-Slimmed Vision Transformer ECCV 2022 "UniNet: Unified Architecture Search with Convolution, Transformer, and MLP" ECCV 2022 GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints ECCV 2022 Towards Robust Face Recognition with Comprehensive Search ECCV 2022 Rethinking Robust Representation Learning under Fine-Grained Noisy Faces ECCV 2022 UniFormer: Unified Transformer for Efficient Spatial-Temporal Representation Learning ICLR 2022 A Bayesian Model for Online Activity Sample Sizes AISTATS 2022 TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers ECCV 2022 A Trend-Driven Fashion Design System for Rapid Response Marketing in E-commerce AAAI 2022 Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way CVPR 2022 Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning AAAI 2021 SOM-NCSCM : An Efficient Neural Chinese Sentence Compression Model Enhanced with Self-Organizing Map EMNLP 2021 Hyperbolic Geometry is Not Necessary: Lightweight Euclidean-Based Models for Low-Dimensional Knowledge Graph Embeddings EMNLP 2021 Switchable K-Class Hyperplanes for Noise-Robust Representation Learning ICCV 2021 Self-Supervised Video Representation Learning by Context and Motion Decoupling CVPR 2021 Lifelong Person Re-Identification via Adaptive Knowledge Accumulation CVPR 2021 Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization CVPR 2021 Communication Efficient SGD via Gradient Sampling With Bayes Prior CVPR 2021 Neighborhood Intervention Consistency: Measuring Confidence for Knowledge Graph Link Prediction IJCAI 2021 Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images CVPR 2020 DPGN: Distribution Propagation Graph Network for Few-Shot Learning CVPR 2020 Revisiting the Sibling Head in Object Detector CVPR 2020 Smoothed Nonparametric Derivative Estimation using Weighted Difference Quotients JMLR 2020 Label-Attended Hashing for Multi-Label Image Retrieval IJCAI 2020 KPNet: Towards Minimal Face Detector AAAI 2020 Anisotropic Convolutional Networks for 3D Semantic Scene Completion CVPR 2020 Discriminability Distillation in Group Representation Learning ECCV 2020 Learning Where to Focus for Efficient Video Object Detection ECCV 2020 More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning ECCV 2020 Temporal Interlacing Network AAAI 2020 Search to Distill: Pearls Are Everywhere but Not the Eyes CVPR 2020 Scalable Place Recognition Under Appearance Change for Autonomous Driving ICCV 2019 Exploiting Temporal Consistency for Real-Time Video Depth Estimation ICCV 2019 Differentiable Kernel Evolution ICCV 2019 Correlation Congruence for Knowledge Distillation ICCV 2019 RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion CVPR 2019 Talking Face Generation by Adversarially Disentangled Audio-Visual Representation AAAI 2019 Conditional Adversarial Generative Flow for Controllable Image Synthesis CVPR 2019 Gradient Harmonized Single-Stage Detector AAAI 2019 Knowledge Distillation via Route Constrained Optimization ICCV 2019 Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy CVPR 2018 Derivative Estimation in Random Design NIPS 2018 Exploring Disentangled Feature Representation Beyond Face Identification CVPR 2018 MoNet: Deep Motion Exploitation for Video Object Segmentation CVPR 2018 Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning CVPR 2018 A Simple Convolutional Neural Network for Accurate P300 Detection and Character Spelling in Brain Computer Interface IJCAI 2018 Transductive Centroid Projection for Semi-supervised Large-scale Recognition ECCV 2018 Learning a Recurrent Residual Fusion Network for Multimodal Matching ICCV 2017 Recurrent Scale Approximation for Object Detection in CNN ICCV 2017 Scale-Aware Face Detection CVPR 2017 Unsupervised Sequence Classification using Sequential Output Statistics NIPS 2017 Quality Aware Network for Set to Set Recognition CVPR 2017 K-Means Clustering with Distributed Dimensions ICML 2016 Combinatorial Multi-Armed Bandit with General Reward Functions NIPS 2016 Learning Relaxed Deep Supervision for Better Edge Detection CVPR 2016