Zhenguo Li
151 papers · 2009–2026 · 13 top CS/AI conferences
Achievements
Taxonomy Completionist (23)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (7)
Hot Topic Early Bird
Keyword Trendsetter Combo (4)
Conference Loyalist (26)
Topic Evolution
Dynamic Duo (45)
Grand Slam
Triple Crown
Mega-Team (30)
Deep Specialist (20)
Keyword Champion (2)
The Questioner (3)
Trend Setter
Conference Pioneer
Century Club (150)
Unstoppable (8)
Keyword Collector (490)
Prolific Year (22)
Conferences
CVPR (30)
ICLR (30)
NIPS (26)
ICCV (17)
AAAI (15)
ECCV (10)
ACL (6)
ICML (5)
EMNLP (3)
IJCAI (3)
WACV (3)
NAACL (2)
COLING (1)
Research topics
Keywords
object detection (14)
neural architecture search (12)
transfer learning (11)
image generation (10)
diffusion model (9)
out-of-distribution generalization (8)
large language model (8)
contrastive learning (6)
knowledge distillation (6)
generative model (6)
representation learning (6)
domain generalization (6)
autonomous driving (5)
transformer architecture (5)
neural network optimization (5)
attention mechanism (5)
3D object detection (5)
lossless compression (5)
distribution shift (4)
domain adaptation (4)
Papers
Empowering Sparse-Input Neural Radiance Fields with Dual-Level Semantic Guidance from Dense Novel Views
AAAI 2026
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
WACV 2026
Implicit Search via Discrete Diffusion: A Study on Chess
ICLR 2025
G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model
ICLR 2025
Your Absorbing Discrete Diffusion Secretly Models the Conditional Distributions of Clean Data
ICLR 2025
Improved Diffusion-based Generative Model with Better Adversarial Robustness
ICLR 2025
Forewarned is Forearmed: Harnessing LLMs for Data Synthesis via Failure-induced Exploration
ICLR 2025
Getting More Juice Out of Your Data: Hard Pair Refinement Enhances Visual-Language Models Without Extra Data
NAACL 2025
CARTS: Advancing Neural Theorem Proving with Diversified Tactic Calibration and Bias-Resistant Tree Search
ICLR 2025
How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs
ACL 2025
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
ACL 2025
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
CVPR 2025
Mixture of insighTful Experts (MoTE): The Synergy of Reasoning Chains and Expert Mixtures in Self-Alignment
ACL 2025
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
ICML 2025
ProofAug: Efficient Neural Theorem Proving via Fine-grained Proof Structure Analysis
ICML 2025
LiT: Delving into a Simple Linear Diffusion Transformer for Image Generation
ICCV 2025
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
ICCV 2025
QuickLLaMA: Query-aware Inference Acceleration for Large Language Models
COLING 2025
MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adaptive Control
ICCV 2025
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases
WACV 2025
Self-Adjust Softmax
EMNLP 2025
Corrupted but Not Broken: Understanding and Mitigating the Negative Impacts of Corrupted Data in Visual Instruction Tuning
EMNLP 2025
Understanding the Language Model to Solve the Symbolic Multi-Step Reasoning Problem from the Perspective of Buffer Mechanism
EMNLP 2025
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training
ACL 2025
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
ICLR 2025
Jailbreaking as a Reward Misspecification Problem
ICLR 2025
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
ICLR 2025
MagicDrive: Street View Generation with Diverse 3D Geometry Control
ICLR 2024
Efficient Transferability Assessment for Selection of Pre-Trained Detectors
WACV 2024
Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
ECCV 2024
Implicit Concept Removal of Diffusion Models
ECCV 2024
PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
ECCV 2024
Fast Training of Diffusion Transformer with Extreme Masking for 3D Point Clouds Generation
ECCV 2024
DAPE: Data-Adaptive Positional Encoding for Length Extrapolation
NIPS 2024
Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model
NIPS 2024
Dual Risk Minimization: Towards Next-Level Robustness in Fine-tuning Zero-Shot Models
NIPS 2024
Proving Theorems Recursively
NIPS 2024
Diffusion of Thought: Chain-of-Thought Reasoning in Diffusion Language Models
NIPS 2024
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
NIPS 2024
DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving
AAAI 2024
Forward-Backward Reasoning in Large Language Models for Mathematical Verification
ACL 2024
GeoDiffusion: Text-Prompted Geometric Control for Object Detection Data Generation
ICLR 2024
Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis
ICLR 2024
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
ICLR 2024
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data
ICLR 2024
LEGO-Prover: Neural Theorem Proving with Growing Libraries
ICLR 2024
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
ICLR 2024
Large Language Models as Automated Aligners for benchmarking Vision-Language Models
ICLR 2024
ATG: Benchmarking Automated Theorem Generation for Generative Language Models
NAACL 2024
The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling
ICML 2024
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection
CVPR 2024
Accelerating Diffusion Sampling with Optimized Time Steps
CVPR 2024
DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception
CVPR 2024
Enhancing the Power of OOD Detection via Sample-Aware Model Selection
CVPR 2024
CVT-xRF: Contrastive In-Voxel Transformer for 3D Consistent Radiance Fields from Sparse Inputs
CVPR 2024
Task-customized Masked Autoencoder via Mixture of Cluster-conditional Experts
ICLR 2023
Breaking Correlation Shift via Conditional Invariant Regularizer
ICLR 2023
CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving
ICLR 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
NIPS 2023
Diff-Instruct: A Universal Approach for Transferring Knowledge From Pre-trained Diffusion Models
NIPS 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-efficient Fine-Tuning
ICCV 2023
DiffComplete: Diffusion-based Generative 3D Shape Completion
NIPS 2023
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
NIPS 2023
DT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value Function
ACL 2023
Fair-CDA: Continuous and Directional Augmentation for Group Fairness
AAAI 2023
DAMix: Exploiting Deep Autoregressive Model Zoo for Improving Lossless Compression Generalization
AAAI 2023
Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization
ICML 2023
T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation
NIPS 2023
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation
ICCV 2023
DDP: Diffusion Model for Dense Visual Prediction
ICCV 2023
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation
ICCV 2023
Beyond One-to-One: Rethinking the Referring Image Segmentation
ICCV 2023
Mixed Autoencoder for Self-Supervised Visual Representation Learning
CVPR 2023
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-Training via Word-Region Alignment
CVPR 2023
ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-Real Novel View Synthesis via Contrastive Learning
CVPR 2023
Complexity Matters: Rethinking the Latent Space for Generative Modeling
NIPS 2023
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning
ICLR 2023
AutoBERT-Zero: Evolving BERT Backbone from Scratch
AAAI 2022
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection
NIPS 2022
Understanding Square Loss in Training Overparametrized Neural Network Classifiers
NIPS 2022
CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds
NIPS 2022
ZooD: Exploiting Model Zoo for Out-of-Distribution Generalization
NIPS 2022
Task-Customized Self-Supervised Pre-training with Scalable Dynamic Routing
AAAI 2022
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search
CVPR 2022
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-Wise Semantic Alignment and Generation
CVPR 2022
Long-Tail Recognition via Compositional Knowledge Transfer
CVPR 2022
Semi-Supervised Object Detection via Multi-Instance Alignment With Global Class Prototypes
CVPR 2022
OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization
CVPR 2022
PILC: Practical Image Lossless Compression With an End-to-End GPU Oriented Neural Framework
CVPR 2022
Generative Negative Text Replay for Continual Vision-Language Pretraining
ECCV 2022
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving
ECCV 2022
DevNet: Self-Supervised Monocular Depth Learning via Density Volume Construction
ECCV 2022
Nonlinear ICA Using Volume-Preserving Transformations
ICLR 2022
On Redundancy and Diversity in Cell-based Neural Architecture Search
ICLR 2022
Generalizing Few-Shot NAS with Gradient Matching
ICLR 2022
FILIP: Fine-grained Interactive Language-Image Pre-Training
ICLR 2022
How Well Does Self-Supervised Pre-Training Perform with Streaming Data?
ICLR 2022
Rethinking Adversarial Transferability from a Data Distribution Perspective
ICLR 2022
Memory Replay with Data Compression for Continual Learning
ICLR 2022
Revisiting Over-smoothing in BERT from the Perspective of Graph
ICLR 2022
Towards Understanding the Generative Capability of Adversarially Robust Classifiers
ICCV 2021
iFlow: Numerically Invertible Flows for Efficient Lossless Compression via a Uniform Coder
NIPS 2021
OSOA: One-Shot Online Adaptation of Deep Generative Models for Lossless Compression
NIPS 2021
MetaAugment: Sample-Aware Data Augmentation Policy Learning
AAAI 2021
DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation
AAAI 2021
Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation
AAAI 2021
How to Save your Annotation Cost for Panoptic Segmentation?
AAAI 2021
MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps
NIPS 2021
On Effective Scheduling of Model-based Reinforcement Learning
NIPS 2021
Joint-DetNAS: Upgrade Your Detector With NAS, Pruning and Dynamic Distillation
CVPR 2021
Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search
ICLR 2021
SparseBERT: Rethinking the Importance Analysis in Self-attention
ICML 2021
DetCo: Unsupervised Contrastive Learning for Object Detection
ICCV 2021
Towards a Theoretical Framework of Out-of-Distribution Generalization
NIPS 2021
NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization
ICCV 2021
ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for Semi-Supervised Continual Learning
CVPR 2021
Transformation Invariant Few-Shot Object Detection
CVPR 2021
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search
CVPR 2021
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression
CVPR 2021
Exploring Geometry-Aware Contrast and Clustering Harmonization for Self-Supervised 3D Object Detection
ICCV 2021
G-DetKD: Towards General Distillation Framework for Object Detectors via Contrastive and Semantic-Guided Feature Imitation
ICCV 2021
Adversarial Invariant Learning
CVPR 2021
NASOA: Towards Faster Task-Oriented Online Fine-Tuning With a Zoo of Models
ICCV 2021
MultiSiam: Self-Supervised Multi-Instance Siamese Representation Learning for Autonomous Driving
ICCV 2021
Adversarial Robustness for Unsupervised Domain Adaptation
ICCV 2021
Boosting Few-Shot Learning With Adaptive Margin Loss
CVPR 2020
CATCH: Context-based Meta Reinforcement Learning for Transferrable Architecture Search
ECCV 2020
CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending
ECCV 2020
AABO: Adaptive Anchor Box Optimization for Object Detection via Bayesian Sub-sampling
ECCV 2020
DropNAS: Grouped Operation Dropout for Differentiable Architecture Search
IJCAI 2020
Locally Differentially Private (Contextual) Bandits Learning
NIPS 2020
Bridging the Gap between Sample-based and One-shot Neural Architecture Search with BONAS
NIPS 2020
SP-NAS: Serial-to-Parallel Backbone Search for Object Detection
CVPR 2020
Rethinking Performance Estimation in Neural Architecture Search
CVPR 2020
SM-NAS: Structural-to-Modular Neural Architecture Search for Object Detection
AAAI 2020
Universal-RCNN: Universal Object Detector via Transferable Graph R-CNN
AAAI 2020
EHSOD: CAM-Guided End-to-End Hybrid-Supervised Object Detection with Cascade Refinement
AAAI 2020
New Interpretations of Normalization Methods in Deep Learning
AAAI 2020
Meta-Learning PAC-Bayes Priors in Model Averaging
AAAI 2020
Meta Reinforcement Learning with Task Embedding and Shared Policy
IJCAI 2019
Reasoning-RCNN: Unifying Adaptive Global Reasoning Into Large-Scale Object Detection
CVPR 2019
Spatial-Aware Graph Relation Network for Large-Scale Object Detection
CVPR 2019
Auto-FPN: Automatic Network Architecture Adaptation for Object Detection Beyond Classification
ICCV 2019
DeepFM: A Factorization-Machine based Neural Network for CTR Prediction
IJCAI 2017
New Insights Into Laplacian Similarity Search
CVPR 2015
Locally Linear Hashing for Extracting Non-Linear Manifolds
CVPR 2014
Analyzing the Harmonic Structure in Graph-Based Learning
NIPS 2013
A Bayesian Approach to Multimodal Visual Dictionary Learning
CVPR 2013
Learning with Partially Absorbing Random Walks
NIPS 2012
Fast Graph Laplacian Regularized Kernel Learning via Semidefinite–Quadratic–Linear Programming
NIPS 2009