Jing Yu Koh
16 papers · 2018–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (7) π Academic Marathon (7) π Conference Polyglot (10) πΊοΈ Taxonomy Completionist (22)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(10)
π
Grand Slam
π
Conference Pioneer
π
Century Club
(16)
ποΈ
Keyword Collector
(54)
β‘
Prolific Year
(5)
π₯
Unstoppable
(8)
Conferences
ECCV (3)
ICLR (3)
CVPR (2)
ICCV (2)
AAAI (1)
ACL (1)
ICML (1)
IJCAI (1)
NIPS (1)
WACV (1)
Top co-authors
Keywords
text-to-image generation
(3)
vision-language navigation
(2)
generative model
(2)
multimodal learning
(2)
depth estimation
(1)
scene understanding
(1)
image generation
(1)
in-context learning
(1)
imitation learning
(1)
data augmentation
(1)
cross-modal retrieval
(1)
cross-modal learning
(1)
visual grounding
(1)
text-to-image synthesis
(1)
probabilistic modeling
(1)
image-to-image translation
(1)
mutual information
(1)
mixed integer programming
(1)
zero-shot learning
(1)
contrastive learning
(1)
Papers
Dissecting Adversarial Robustness of Multimodal LM Agents
ICLR 2025
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks
ACL 2024
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web
ECCV 2024
Generating Images with Multimodal Language Models
NIPS 2023
Grounding Language Models to Images for Multimodal Inputs and Outputs
ICML 2023
A New Path: Scaling Vision-and-Language Navigation With Synthetic Instructions and Imitation Learning
CVPR 2023
Simple and Effective Synthesis of Indoor 3D Scenes
AAAI 2023
VQ3D: Learning a 3D-Aware Generative Model on ImageNet
ICCV 2023
Vector-quantized Image Modeling with Improved VQGAN
ICLR 2022
Text-to-Image Generation Grounded by Fine-Grained User Attention
WACV 2021
Cross-Modal Contrastive Learning for Text-to-Image Generation
CVPR 2021
Pathdreamer: A World Model for Indoor Navigation
ICCV 2021
Revisiting Hierarchical Approach for Persistent Long-Term Video Prediction
ICLR 2021
SideInfNet: A Deep Neural Network for Semi-Automatic Semantic Segmentation with Side Information
ECCV 2020
Improving Customer Satisfaction in Bike Sharing Systems through Dynamic Repositioning
IJCAI 2019
Urban Zoning Using Higher-Order Markov Random Fields on Multi-View Imagery Data
ECCV 2018