Yiheng Xu
11 papers · 2020–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🧭 Keyword Pioneer 🐝 Cross-Pollinator (5) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (6)
🏃
Academic Marathon
(5)
🐝
Cross-Pollinator
(5)
🌈
Renaissance Researcher
(6)
🏆
Grand Slam
💎
Century Club
(11)
🗃️
Keyword Collector
(56)
Conferences
ACL (3)
ICLR (2)
AAAI (1)
COLING (1)
EMNLP (1)
ICML (1)
IJCNLP (1)
NIPS (1)
Top co-authors
Keywords
document understanding
(5)
multimodal learning
(3)
visual-language modeling
(2)
multi-modal learning
(2)
visual document understanding
(2)
multimodal agent
(1)
markov random field
(1)
semi-supervised learning
(1)
spam detection
(1)
transformer encoder
(1)
document analysis
(1)
social network
(1)
document layout analysis
(1)
autonomous agent
(1)
open-ended task
(1)
computer environment
(1)
end-to-end training
(1)
graph convolutional network
(1)
weak supervision
(1)
multilingual nlp
(1)
Papers
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials
ICLR 2025
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction
ICML 2025
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
NIPS 2024
Lemur: Harmonizing Natural Language and Code for Language Agents
ICLR 2024
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding
ACL 2022
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding
ACL 2022
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding
IJCNLP 2021
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding
ACL 2021
LayoutReader: Pre-training of Text and Layout for Reading Order Detection
EMNLP 2021
DocBank: A Benchmark Dataset for Document Layout Analysis
COLING 2020
Graph Convolutional Networks with Markov Random Field Reasoning for Social Spammer Detection
AAAI 2020