Jing Yu Koh

16 papers · 2018–2025 · 10 conferences · across top CS/AI conferences

Achievements


🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🏃 Academic Marathon (7) 🌍 Conference Polyglot (10) 🗺️ Taxonomy Completionist (22)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Grand Slam 🚀 Conference Pioneer 💎 Century Club (16) 🗃️ Keyword Collector (54) ⚡ Prolific Year (5) 🔥 Unstoppable (8)

Conferences

ECCV (3) ICLR (3) CVPR (2) ICCV (2) AAAI (1) ACL (1) ICML (1) IJCAI (1) NIPS (1) WACV (1)

Top co-authors

Jason Baldridge (6) Yinfei Yang (5) Honglak Lee (5) Daniel Fried (4) Han Zhang (4) Peter Anderson (3) Ruslan Salakhutdinov (3) Duc Thanh Nguyen (2) Quang-Trung Truong (2) Austin Waters (2)

Keywords

text-to-image generation (3) vision-language navigation (2) generative model (2) multimodal learning (2) depth estimation (1) scene understanding (1) image generation (1) in-context learning (1) imitation learning (1) data augmentation (1) cross-modal retrieval (1) cross-modal learning (1) visual grounding (1) text-to-image synthesis (1) probabilistic modeling (1) image-to-image translation (1) mutual information (1) mixed integer programming (1) zero-shot learning (1) contrastive learning (1)

Papers

Dissecting Adversarial Robustness of Multimodal LM Agents · ICLR 2025
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks · ACL 2024
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web · ECCV 2024
Generating Images with Multimodal Language Models · NIPS 2023
Grounding Language Models to Images for Multimodal Inputs and Outputs · ICML 2023
A New Path: Scaling Vision-and-Language Navigation With Synthetic Instructions and Imitation Learning · CVPR 2023
Simple and Effective Synthesis of Indoor 3D Scenes · AAAI 2023
VQ3D: Learning a 3D-Aware Generative Model on ImageNet · ICCV 2023
Vector-quantized Image Modeling with Improved VQGAN · ICLR 2022
Text-to-Image Generation Grounded by Fine-Grained User Attention · WACV 2021
Cross-Modal Contrastive Learning for Text-to-Image Generation · CVPR 2021
Pathdreamer: A World Model for Indoor Navigation · ICCV 2021
Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction · ICLR 2021
SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information · ECCV 2020
Improving Customer Satisfaction in Bike Sharing Systems through Dynamic Repositioning · IJCAI 2019
Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery Data · ECCV 2018