Zheng Ge
21 papers · 2020–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🏃 Academic Marathon (5) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)
🐝
Cross-Pollinator
(14)
🌍
Conference Polyglot
(7)
🤝
Dynamic Duo
(11)
🔥
Unstoppable
(6)
💎
Century Club
(19)
⚡
Prolific Year
(5)
❓
The Questioner
🗃️
Keyword Collector
(57)
Conferences
ICLR (5)
CVPR (4)
ECCV (4)
AAAI (2)
ACL (2)
ICML (2)
ICCV (1)
IJCAI (1)
Top co-authors
Keywords
3d object detection
(3)
bird's-eye view
(2)
multi-view detection
(2)
depth estimation
(2)
object detection
(2)
video generation
(1)
pedestrian detection
(1)
domain generalization
(1)
multimodal learning
(1)
autoregressive generation
(1)
em algorithm
(1)
mathematical reasoning
(1)
3d object understanding
(1)
embodied interaction
(1)
efficient computing
(1)
deepfake detection
(1)
instruction tuning
(1)
message passing
(1)
occlusion handling
(1)
optimal transport
(1)
Papers
PRIME: A Process-Outcome Alignment Benchmark for Verifiable Reasoning in Mathematics and Engineering
ACL 2026
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning
ACL 2026
Taming Teacher Forcing for Masked Autoregressive Video Generation
CVPR 2025
Unhackable Temporal Reward for Scalable Video MLLMs
ICLR 2025
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation
ICLR 2025
Perception in Reflection
ICML 2025
Reconstructive Visual Instruction Tuning
ICLR 2025
ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning
IJCAI 2024
DreamLLM: Synergistic Multimodal Comprehension and Creation
ICLR 2024
Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models
ECCV 2024
Merlin: Empowering Multimodal LLMs with Foresight Minds
ECCV 2024
ShapeLLM: Universal 3D Object Understanding for Embodied Interaction
ECCV 2024
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception
ICCV 2023
BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo
AAAI 2023
Implicit Identity Leakage: The Stumbling Block to Improving Deepfake Detection Generalization
CVPR 2023
BEVDepth: Acquisition of Reliable Depth for Multi-View 3D Object Detection
AAAI 2023
Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?
ICLR 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
ICML 2023
Dense Teacher: Dense Pseudo-Labels for Semi-Supervised Object Detection
ECCV 2022
OTA: Optimal Transport Assignment for Object Detection
CVPR 2021
NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing
CVPR 2020