Li Zhang
195 papers · 2006–2026 · 21 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (26) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π
Cross-Pollinator
(6)
π
Conference Loyalist
(36)
π€
Dynamic Duo
(18)
π
Triple Crown
π
Keyword Champion
π
Grand Slam
π₯
Mega-Team
(77)
π¬
Deep Specialist
(20)
β
The Questioner
(2)
π
Conference Pioneer
β‘
Prolific Year
(41)
π₯
Unstoppable
(14)
ποΈ
Keyword Collector
(66)
π
Century Club
(179)
π
Trend Setter
Conferences
CVPR (36)
AAAI (29)
ECCV (17)
NIPS (15)
ACL (14)
ICLR (13)
ICCV (12)
EMNLP (12)
ICML (9)
MICCAI (6)
EACL (5)
NAACL (5)
COLING (4)
IJCNLP (4)
INTERSPEECH (4)
AACL (3)
ACML (2)
IJCAI (2)
COLT (1)
AISTATS (1)
WACV (1)
Top co-authors
Research topics
Keywords
large language model
(15)
few-shot learning
(10)
convolutional neural network
(9)
semantic segmentation
(9)
zero-shot learning
(8)
transfer learning
(8)
point cloud
(7)
autonomous driving
(7)
attention mechanism
(6)
differential privacy
(5)
text generation
(5)
multi-task learning
(5)
representation learning
(4)
model compression
(4)
neural architecture search
(4)
feature extraction
(4)
domain adaptation
(4)
pose estimation
(4)
metric learning
(4)
object detection
(4)
Papers
EvoFMVC: Trusted Federated Multi-View Clustering with Evolutionary Fusion
AAAI 2026
Uncertainty-Guided View-Strength-Aware Feature Utilization for Multi-View Classification
AAAI 2026
F2SST: Frequency-to-Spatial Semantic Transfer for Few-Shot Image Classification
AAAI 2026
SIAM: Towards Generalizable Articulated Object Modeling via Single Robot-Object Interaction
AAAI 2026
ELSPR: Evaluator LLM Training Data Self-Purification on Non-Transitive Preferences via Tournament Graph Reconstruction
AAAI 2026
Quantifying the Impact of Structured Output Format on Large Language Models through Causal Inference
EACL 2026
Spatio-Temporal Distortion Aware Omnidirectional Video Super-Resolution
AAAI 2026
Exploring Category-level Articulated Object Pose Tracking on SE(3) Manifolds
AAAI 2026
Perception in Plan: Coupled Perception and Planning for End-to-End Autonomous Driving
AAAI 2026
Language Model as Planner and Formalizer under Constraints
ACL 2026
VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning
AAAI 2026
Collaboratively βCopy & Pasteβ 2D-3D Features for Complex Video-to-Video Motion Editing
AAAI 2026
MIDB: Multilingual Instruction Data Booster for Enhancing Cultural Equality in Multilingual Instruction Synthesis
AAAI 2026
Think in Latent Thoughts: A New Paradigm for Gloss-Free Sign Language Translation
ACL 2026
PLAWBENCH: A Rubric-Based Benchmark for Evaluating LLMs in Real-World Legal Practice
ACL 2026
TOFA: Training-Free One-Shot Federated Adaptation for Vision-Language Models
AAAI 2026
BezierGS: Dynamic Urban Scene Reconstruction with Bezier Curve Gaussian Splatting
ICCV 2025
Rethinking Layered Graphic Design Generation with a Top-Down Approach
ICCV 2025
Natural Language Inference as a Judge: Detecting Factuality and Causality Issues in Language Model Self-Reasoning for Financial Analysis
EMNLP 2025
DroidCall: A Dataset for LLM-powered Android Intent Invocation
EMNLP 2025
TurnaboutLLM: A Deductive Reasoning Benchmark from Detective Games
EMNLP 2025
R^2-Art: Category-Level Articulation Pose Estimation from Single RGB Image via Cascade Render Strategy
AAAI 2025
Calibrating Large Language Models with Sample Consistency
AAAI 2025
LoGoFair: Post-Processing for Local and Global Fairness in Federated Learning
AAAI 2025
Documentation Retrieval Improves Planning Language Generation
AACL 2025
On the Limit of Language Models as Planning Formalizers
ACL 2025
Data Interpreter: An LLM Agent for Data Science
ACL 2025
DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation
ACL 2025
Training Language Model to Critique for Better Refinement
ACL 2025
GAVEL: Generative Attribute-Value Extraction Using LLMs on LLM-Augmented Datasets
NAACL 2025
DiscQuant: A Quantization Method for Neural Networks Inspired by Discrepancy Theory
COLT 2025
Multi-view Graph Contrastive Learning with Dynamic Self-aware and Cross-sample Topology Augmentation for Brain Disorder Diagnosis
MICCAI 2025
Metastatic Lymph Node Station Classification in Esophageal Cancer via Prior-guided Supervision and Station-Aware Mixture-of-Experts
MICCAI 2025
Lymph Node Metastasis Classification with Prototype-guided Multiple Instance Aggregation and Heterogeneous Feature Fusion
MICCAI 2025
Deep Knowledge-Infused Transformer for NSCLC Lymph Node Station Metastasis Prediction: Development of an AI-Powered Intraoperative Decision System
MICCAI 2025
Documentation Retrieval Improves Planning Language Generation
IJCNLP 2025
Pre-defined Keypoints Promote Category-level Articulation Pose Estimation via Multi-Modal Alignment
IJCAI 2025
Diffusion$^2$: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models
ICLR 2025
ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents
ICLR 2025
Reflective Gaussian Splatting
ICLR 2025
Controllable Unlearning for Image-to-Image Generative Models via $\epsilon$-Constrained Optimization
ICLR 2025
GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting
ICLR 2025
FreqPrior: Improving Video Diffusion Models with Frequency Filtering Gaussian Noise
ICLR 2025
Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning
CVPR 2025
IRGS: Inter-Reflective Gaussian Splatting with 2D Gaussian Ray Tracing
CVPR 2025
UniScene: Unified Occupancy-centric Driving Scene Generation
CVPR 2025
Improving Gaussian Splatting with Localized Points Management
CVPR 2025
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition
CVPR 2025
TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering
CVPR 2025
ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression
CVPR 2025
GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction
CVPR 2025
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans?
CVPR 2025
An Information-Theoretic Regularizer for Lossy Neural Image Compression
ICCV 2025
Driving View Synthesis on Free-form Trajectories with Generative Prior
ICCV 2025
Rethinking 3D Convolution in $\ell_p$-norm Space
NIPS 2024
Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
ICLR 2024
Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
ICLR 2024
DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization
NIPS 2024
Semi-supervised Lymph Node Metastasis Classification with Pathology-guided Label Sharpening and Two-streamed Multi-scale Fusion
MICCAI 2024
An MR-Compatible Virtual Reality System for Assessing Neuronal Plasticity of Sensorimotor Neurons and Mirror Neurons
MICCAI 2024
SCDNet: Self-supervised Learning Feature based Speaker Change Detection
INTERSPEECH 2024
Motion Forecasting in Continuous Driving
NIPS 2024
DeMo: Decoupling Motion Forecasting into Directional Intentions and Dynamic States
NIPS 2024
CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence
NIPS 2024
Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
ECCV 2024
U-COPE: Taking a Further Step to Universal 9D Category-level Object Pose Estimation
ECCV 2024
LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement
AAAI 2024
NeRF-LiDAR: Generating Realistic LiDAR Point Clouds with Neural Radiance Fields
AAAI 2024
CatmullRom Splines-Based Regression for Image Forgery Localization
AAAI 2024
UPDP: A Unified Progressive Depth Pruner for CNN and Vision Transformer
AAAI 2024
Tetrahedron Splatting for 3D Generation
NIPS 2024
Causal-IQA: Towards the Generalization of Image Quality Assessment Based on Causal Inference
ICML 2024
BLO-SAM: Bi-level Optimization Based Finetuning of the Segment Anything Model for Overfitting-Preventing Semantic Segmentation
ICML 2024
MoE-DiffIR: Task-customized Diffusion Priors for Universal Compressed Image Restoration
ECCV 2024
Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer
ECCV 2024
TimeSiam: A Pre-Training Framework for Siamese Time-Series Modeling
ICML 2024
Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing
ECCV 2024
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
ECCV 2024
TimeR4 : Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
EMNLP 2024
EfficientCAPER: An End-to-End Framework for Fast and Robust Category-Level Articulated Object Pose Estimation
NIPS 2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
NIPS 2024
Tailoring with Targeted Precision: Edit-Based Agents for Open-Domain Procedure Customization
ACL 2024
PROC2PDDL: Open-Domain Planning Representations from Texts
ACL 2024
FrameQuant: Flexible Low-Bit Quantization for Transformers
ICML 2024
SlowFocus: Enhancing Fine-grained Temporal Understanding in Video LLM
NIPS 2024
Modular Blind Video Quality Assessment
CVPR 2024
CAMixerSR: Only Details Need More "Attention"
CVPR 2024
Personalized Video Comment Generation
EMNLP 2024
Consistent4D: Consistent 360Β° Dynamic Object Generation from Monocular Video
ICLR 2024
Private Learning with Public Features
AISTATS 2024
PDDLEGO: Iterative Planning in Textual Environments
NAACL 2024
Generalizable and Stable Finetuning of Pretrained Language Models on Low-Resource Texts
NAACL 2024
Choice-75: A Dataset on Decision Branching in Script Learning
COLING 2024
STEntConv: Predicting Disagreement between Reddit Users with Stance Detection and a Signed Graph Convolutional Network
COLING 2024
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts
EACL 2024
Safety Verification of Nonlinear Systems with Bayesian Neural Network Controllers
AAAI 2023
SimMTM: A Simple Pre-Training Framework for Masked Time-Series Modeling
NIPS 2023
PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer
AAAI 2023
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance
AAAI 2023
Self-Asymmetric Invertible Network for Compression-Aware Image Rescaling
AAAI 2023
Faithful Chain-of-Thought Reasoning
AACL 2023
Human-in-the-loop Schema Induction
ACL 2023
Exploring the Curious Case of Code Prompts
ACL 2023
Generative Semantic Segmentation
CVPR 2023
Train-Once-for-All Personalization
CVPR 2023
Devil Is in the Queries: Advancing Mask Transformers for Real-World Medical Image Segmentation and Out-of-Distribution Localization
CVPR 2023
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
CVPR 2023
Causal Reasoning of Entities and Events in Procedural Texts
EACL 2023
PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
ICCV 2023
Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach
ICCV 2023
SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation
ICLR 2023
S-NeRF: Neural Radiance Fields for Street Views
ICLR 2023
Fair and Accurate Decision Making through Group-Aware Learning
ICML 2023
Multi-Task Differential Privacy Under Distribution Skew
ICML 2023
Faithful Chain-of-Thought Reasoning
IJCNLP 2023
ImpDet: Exploring Implicit Fields for 3D Object Detection
WACV 2023
Label Definitions Improve Semantic Role Labeling
NAACL 2022
Is βMy Favorite New Movieβ My Favorite Movie? Probing the Understanding of Recursive Noun Phrases
NAACL 2022
Learning Ego 3D Representation As Ray Tracing
ECCV 2022
Backend Ensemble for Speaker Verification and Spoofing Countermeasure
INTERSPEECH 2022
Region-Aware Metric Learning for Open World Semantic Segmentation via Meta-Channel Aggregation
IJCAI 2022
DeepInteraction: 3D Object Detection via Modality Interaction
NIPS 2022
ONCE-3DLanes: Building Monocular 3D Lane Detection
CVPR 2022
FashionViL: Fashion-Focused Vision-and-Language Representation Learning
ECCV 2022
RCLane: Relay Chain Prediction for Lane Detection
ECCV 2022
Learning from Mistakes β a Framework for Neural Architecture Search
AAAI 2022
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
EMNLP 2022
Accelerating Score-Based Generative Models with Preconditioned Diffusion Sampling
ECCV 2022
Memory-Augmented Model-Driven Network for Pansharpening
ECCV 2022
CelebV-HQ: A Large-Scale Video Facial Attributes Dataset
ECCV 2022
Unsupervised Entity Linking with Guided Summarization and Multiple-Choice Selection
EMNLP 2022
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data
ACL 2022
Visual Goal-Step Inference using wikiHow
EMNLP 2021
Simpler Is Better: Few-Shot Semantic Segmentation With Classifier Weight Transformer
ICCV 2021
The Devil Is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection
ICCV 2021
Boundary-Sensitive Pre-Training for Temporal Localization in Videos
ICCV 2021
DAST: Unsupervised Domain Adaptation in Semantic Segmentation Based on Discriminator Attention and Self-Training
AAAI 2021
Learning a Few-shot Embedding Model with Contrastive Learning
AAAI 2021
Complementary Evidence Identification in Open-Domain Question Answering
EACL 2021
Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection
CVPR 2021
MoViNets: Mobile Video Networks for Efficient Video Recognition
CVPR 2021
Rethinking Semantic Segmentation From a Sequence-to-Sequence Perspective With Transformers
CVPR 2021
Robust and Accurate Object Detection via Adversarial Learning
CVPR 2021
Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning
CVPR 2021
Delving into Data: Effectively Substitute Training for Black-box Attack
CVPR 2021
Private Alternating Least Squares: Practical Private Matrix Completion with Tighter Rates
ICML 2021
Oneshot Differentially Private Top-k Selection
ICML 2021
SOFT: Softmax-free Transformer with Linear Complexity
NIPS 2021
Multi-Level Gazetteer-Free Geocoding
IJCNLP 2021
Progressive Coordinate Transforms for Monocular 3D Object Detection
NIPS 2021
Multi-Level Transfer Learning from Near-Field to Far-Field Speaker Verification
INTERSPEECH 2021
Semi-Open Attribute Extraction from Chinese Functional Description Text
ACML 2021
Multi-Level Gazetteer-Free Geocoding
ACL 2021
Partial-Label and Structure-constrained Deep Coupled Factorization Network
AAAI 2021
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
EMNLP 2020
Small but Mighty: New Benchmarks for Split and Rephrase
EMNLP 2020
Rankmax: An Adaptive Projection Alternative to the Softmax Function
NIPS 2020
Style Normalization and Restitution for Generalizable Person Re-Identification
CVPR 2020
Universal Value Iteration Networks: When Spatially-Invariant Is Not Universal
AAAI 2020
What Deep CNNs Benefit From Global Covariance Pooling: An Optimization Perspective
CVPR 2020
Intent Detection with WikiHow
AACL 2020
Dynamic Graph Message Passing Networks
CVPR 2020
Few-shot Action Recognition with Permutation-invariant Attention
ECCV 2020
Reversing the cycle: self-supervised deep stereo through enhanced monocular distillation
ECCV 2020
Improving Semantic Segmentation via Decoupled Body and Edge Supervision
ECCV 2020
Instance Credibility Inference for Few-Shot Learning
CVPR 2020
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
INTERSPEECH 2020
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
CVPR 2020
Attention Scaling for Crowd Counting
CVPR 2020
Consistent Video Style Transfer via Compound Regularization
AAAI 2020
XingGAN for Person Image Generation
ECCV 2020
CAN: Constrained Attention Networks for Multi-Aspect Sentiment Analysis
IJCNLP 2019
CAN: Constrained Attention Networks for Multi-Aspect Sentiment Analysis
EMNLP 2019
Efficient Training on Very Large Corpora via Gramian Estimation
ICLR 2019
Fast Online Object Tracking and Segmentation: A Unifying Approach
CVPR 2019
A Closed-Form Solution to Universal Style Transfer
ICCV 2019
Improving Text-to-SQL Evaluation Methodology
ACL 2018
Learning Differentially Private Recurrent Language Models
ICLR 2018
Learning to Compare: Relation Network for Few-Shot Learning
CVPR 2018
Efficient Semantic Scene Completion Network with Spatial Group Convolution
ECCV 2018
End-to-End Learning of Multi-scale Convolutional Neural Network for Stereo Matching
ACML 2018
Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation
CVPR 2017
Learning a Deep Embedding Model for Zero-Shot Learning
CVPR 2017
Decoder Network Over Lightweight Reconstructed Feature for Fast Semantic Style Transfer
ICCV 2017
Spatially Adaptive Computation Time for Residual Networks
CVPR 2017
Learning a Discriminative Null Space for Person Re-Identification
CVPR 2016
Nearly Optimal Private LASSO
NIPS 2015
Discriminative Low-Rank Tracking
ICCV 2015
Nonparametric Context Modeling of Local Appearance for Pose- and Expression-Robust Facial Landmark Localization
CVPR 2014
Learning Polynomials with Neural Networks
ICML 2014
Exemplar-Based Face Parsing
CVPR 2013
Affect Detection from Semantic Interpretation of Drama Improvisation
COLING 2012
Metaphor Interpretation and Context-based Affect Detection
COLING 2010
Empirical Study on the Performance Stability of Named Entity Recognition Model across Domains
EMNLP 2006
Developments in Affect Detection in E-drama
EACL 2006