Dongzhi Jiang
10 papers · 2023–2026 · 7 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (5)
Renaissance Researcher (5)
Taxonomy Completionist (14)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (7)
Century Club (10)
Prolific Year (5)
The Questioner
Conferences
ICLR (2)
ICML (2)
NIPS (2)
ACL (1)
ECCV (1)
ICCV (1)
WACV (1)
Keywords
temporal modeling (1)
knowledge transfer (1)
chain-of-thought reasoning (1)
multimodal learning (1)
autonomous driving (1)
point cloud (1)
3d vision (1)
visual reasoning (1)
bird's eye view (1)
text-to-image generation (1)
model adaptation (1)
instruction tuning (1)
3d object detection (1)
diffusion model (1)
large multimodal model (1)
mixture of expert (1)
vision-language model (1)
model fine-tuning (1)
visual encoder (1)
diffusion model fine-tuning (1)
Papers
PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
WACV 2026
SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems
ACL 2025
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine
ICLR 2025
MMSearch: Unveiling the Potential of Large Models as Multi-modal Search Engines
ICLR 2025
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
ICML 2025
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM
ICML 2025
MoVA: Adapting Mixture of Vision Experts to Multimodal Context
NIPS 2024
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?
ECCV 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
NIPS 2024
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
ICCV 2023