Zehui Chen
28 papers · 2022–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🐝 Cross-Pollinator (6) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(6)
🤝
Dynamic Duo
(18)
💎
Century Club
(25)
📈
Trend Setter
⚡
Prolific Year
(8)
🗃️
Keyword Collector
(102)
❓
The Questioner
Conferences
ACL (6)
AAAI (5)
ICLR (4)
ECCV (3)
EMNLP (3)
CVPR (2)
ICCV (2)
NIPS (2)
IJCAI (1)
Top co-authors
Keywords
large language model
(7)
multi-modal learning
(3)
multimodal learning
(3)
benchmark evaluation
(3)
large vision-language model
(2)
function calling
(2)
vision-language model
(2)
instruction following
(2)
domain adaptation
(2)
object detection
(2)
depth estimation
(2)
lidar point cloud
(2)
3d object detection
(2)
hallucination mitigation
(2)
semantic segmentation
(2)
point cloud
(2)
noisy label learning
(1)
video captioning
(1)
image generation
(1)
direct preference optimization
(1)
Papers
Breaking Block Boundaries: Anchor-based History-stable Decoding for Diffusion Large Language Models
ACL 2026
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision
ACL 2026
Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning
ACL 2026
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
EMNLP 2025
Enhancing Large Vision-Language Models with Ultra-Detailed Image Caption Generation
EMNLP 2025
MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
ICLR 2025
MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines
ICLR 2025
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping
AAAI 2025
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use
ACL 2025
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding
AAAI 2025
PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection
ICLR 2025
ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
EMNLP 2025
Stream Query Denoising for Vectorized HD-Map Construction
ECCV 2024
Are We on the Right Way for Evaluating Large Vision-Language Models?
NIPS 2024
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection
AAAI 2024
Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction
AAAI 2024
T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step
ACL 2024
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models
ACL 2024
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
CVPR 2024
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
NIPS 2024
Learning from Noisy Data for Semi-Supervised 3D Object Detection
ICCV 2023
DETRDistill: A Universal Knowledge Distillation Framework for DETR-families
ICCV 2023
Towards Domain Generalization for Multi-View 3D Object Detection in Bird-Eye-View
CVPR 2023
BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection
ICLR 2023
SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-training for Spatial-Aware Visual Representations
AAAI 2022
Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-Training
ECCV 2022
AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection
IJCAI 2022
Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection
ECCV 2022