Zheng Ge

21 papers · 2020–2026 · 8 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🏃 Academic Marathon (5) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)

🐝 Cross-Pollinator (14) 🌍 Conference Polyglot (7) 🤝 Dynamic Duo (11) 🔥 Unstoppable (6) 💎 Century Club (19) ⚡ Prolific Year (5) ❓ The Questioner 🗃️ Keyword Collector (57)

Conferences

ICLR (5) CVPR (4) ECCV (4) AAAI (2) ACL (2) ICML (2) ICCV (1) IJCAI (1)

Top co-authors

Xiangyu Zhang (13) Jianjian Sun (8) Runpei Dong (8) Liang Zhao (7) Jinrong Yang (6) Haoran Wei (6) Yuang Peng (5) Chunrui Han (5) Zeming Li (5) Zekun Qi (5)

Keywords

3d object detection (3) bird's-eye view (2) multi-view detection (2) depth estimation (2) object detection (2) video generation (1) pedestrian detection (1) domain generalization (1) multimodal learning (1) autoregressive generation (1) em algorithm (1) mathematical reasoning (1) 3d object understanding (1) embodied interaction (1) efficient computing (1) deepfake detection (1) instruction tuning (1) message passing (1) occlusion handling (1) optimal transport (1)

Papers

PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering ACL 2026 PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning ACL 2026 Taming Teacher Forcing for Masked Autoregressive Video Generation CVPR 2025 Unhackable Temporal Reward for Scalable Video MLLMs ICLR 2025 DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation ICLR 2025 Perception in Reflection ICML 2025 Reconstructive Visual Instruction Tuning ICLR 2025 ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning IJCAI 2024 DreamLLM: Synergistic Multimodal Comprehension and Creation ICLR 2024 Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models ECCV 2024 Merlin: Empowering Multimodal LLMs with Foresight Minds ECCV 2024 ShapeLLM: Universal 3D Object Understanding for Embodied Interaction ECCV 2024 MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception ICCV 2023 BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo AAAI 2023 Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization CVPR 2023 BEVDepth: Acquisition of Reliable Depth for Multi-View 3D Object Detection AAAI 2023 Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? ICLR 2023 Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining ICML 2023 Dense Teacher: Dense Pseudo-Labels for Semi-Supervised Object Detection ECCV 2022 OTA: Optimal Transport Assignment for Object Detection CVPR 2021 NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing CVPR 2020