Zehui Chen

28 papers · 2022–2026 · 9 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🐝 Cross-Pollinator (6) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5)

🧭 Keyword Pioneer 🐝 Cross-Pollinator (6) 🤝 Dynamic Duo (18) 💎 Century Club (25) 📈 Trend Setter ⚡ Prolific Year (8) 🗃️ Keyword Collector (102) ❓ The Questioner

Conferences

ACL (6) AAAI (5) ICLR (4) ECCV (3) EMNLP (3) CVPR (2) ICCV (2) NIPS (2) IJCAI (1)

Top co-authors

Feng Zhao (21) Lin Chen (8) Zhenyu Li (7) Jiaming Liu (6) Qinhong Jiang (5) Liangji Fang (5) Shuo Wang (4) Dahua Lin (4) Yu Zeng (4) Shanghang Zhang (4)

Keywords

large language model (7) multi-modal learning (3) multimodal learning (3) benchmark evaluation (3) large vision-language model (2) function calling (2) vision-language model (2) instruction following (2) domain adaptation (2) object detection (2) depth estimation (2) lidar point cloud (2) 3d object detection (2) hallucination mitigation (2) semantic segmentation (2) point cloud (2) noisy label learning (1) video captioning (1) image generation (1) direct preference optimization (1)

Papers

Breaking Block Boundaries: Anchor-based History-stable Decoding for Diffusion Large Language Models ACL 2026 UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision ACL 2026 Beyond Accuracy: Unveiling Inefficiency Patterns in Tool-Integrated Reasoning ACL 2026 CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios EMNLP 2025 Enhancing Large Vision-Language Models with Ultra-Detailed Image Caption Generation EMNLP 2025 MindSearch: Mimicking Human Minds Elicits Deep AI Searcher ICLR 2025 MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines ICLR 2025 VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping AAAI 2025 ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use ACL 2025 LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding AAAI 2025 PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection ICLR 2025 ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents EMNLP 2025 Stream Query Denoising for Vectorized HD-Map Construction ECCV 2024 Are We on the Right Way for Evaluating Large Vision-Language Models? NIPS 2024 Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection AAAI 2024 Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction AAAI 2024 T-Eval: Evaluating the Tool Utilization Capability of Large Language Models Step by Step ACL 2024 Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models ACL 2024 Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation CVPR 2024 ShareGPT4Video: Improving Video Understanding and Generation with Better Captions NIPS 2024 Learning from Noisy Data for Semi-Supervised 3D Object Detection ICCV 2023 DETRDistill: A Universal Knowledge Distillation Framework for DETR-families ICCV 2023 Towards Domain Generalization for Multi-View 3D Object Detection in Bird-Eye-View CVPR 2023 BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection ICLR 2023 SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-training for Spatial-Aware Visual Representations AAAI 2022 Unsupervised Domain Adaptation for Monocular 3D Object Detection via Self-Training ECCV 2022 AutoAlign: Pixel-Instance Feature Aggregation for Multi-Modal 3D Object Detection IJCAI 2022 Deformable Feature Aggregation for Dynamic Multi-modal 3D Object Detection ECCV 2022