Peter Grasch
7 papers · 2021–2025 · 5 conferences · across top CS/AI conferences
Achievements
🌈 Renaissance Researcher (5)
🌍 Conference Polyglot (5)
🧭 Keyword Pioneer
🐝 Cross-Pollinator (15)
🌉 Interdisciplinary Bridge
👥 Mega-Team (29)
⚡ Prolific Year (5)
Conferences
ICLR (3)
CVPR (1)
ECCV (1)
ICCV (1)
NAACL (1)
Keywords
model compression (1)
vision transformer (1)
depth perception (1)
visual question answering (1)
entity linking (1)
named entity recognition (1)
semantic parsing (1)
monocular depth estimation (1)
joint learning (1)
efficient computing (1)
vision language model (1)
multimodal large language model (1)
token reduction (1)
voice assistant (1)
domain classification (1)
vision encoding (1)
high resolution image (1)
efficient encoding (1)
text-rich image understanding (1)
encoding latency (1)
Papers
MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning
ICLR 2025
Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models
ICLR 2025
MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs
ICLR 2025
MM-Spatial: Exploring 3D Spatial Understanding in Multimodal LLMs
ICCV 2025
FastVLM: Efficient Vision Encoding for Vision Language Models
CVPR 2025
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
ECCV 2024
Noise Robust Named Entity Understanding for Voice Assistants
NAACL 2021