Jixuan Chen
4 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (5) 🌉 Interdisciplinary Bridge 👥 Mega-Team (23)
❓
The Questioner
Conferences
NIPS (2)
CVPR (1)
ICLR (1)
Top co-authors
Keywords
multimodal agent
(2)
benchmark evaluation
(2)
image captioning
(1)
code generation
(1)
structured output
(1)
vision language model
(1)
autonomous agent
(1)
computer environment
(1)
open-ended task
(1)
vision-language model
(1)
concept extraction
(1)
graphical user interface
(1)
interactive environment
(1)
visual language model
(1)
large language model
(1)
workflow automation
(1)
computer automation
(1)
real computer environment
(1)
semantic parsing
(1)
Papers
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
CVPR 2025
Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
ICLR 2025
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
NIPS 2024
Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
NIPS 2024