Yu Su
88 papers · 2015–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (13) πΊοΈ Taxonomy Completionist (12) π Interdisciplinary Bridge π Academic Marathon (10)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(12)
π§
Keyword Pioneer
π
Conference Loyalist
(22)
π€
Dynamic Duo
(25)
π
Triple Crown
π
Grand Slam
π₯
Mega-Team
(28)
π¬
Deep Specialist
(13)
π§¬
Topic Evolution
π
Conference Pioneer
β‘
Prolific Year
(5)
β
The Questioner
(3)
ποΈ
Keyword Collector
(353)
π
Century Club
(87)
π₯
Unstoppable
(11)
π
Trend Setter
Conferences
ACL (23)
EMNLP (22)
ICLR (9)
CVPR (7)
NIPS (7)
AAAI (5)
ICML (4)
NAACL (4)
IJCNLP (3)
COLING (1)
ICCV (1)
IJCAI (1)
SEMEVAL (1)
Top co-authors
Research topics
Keywords
large language model
(15)
semantic parsing
(13)
question answering
(9)
knowledge base
(6)
language model
(6)
domain adaptation
(5)
transfer learning
(5)
pre-trained language model
(5)
few-shot learning
(4)
in-context learning
(4)
knowledge graph
(4)
text classification
(4)
web agent
(4)
relation extraction
(4)
zero-shot learning
(4)
vision-language model
(3)
information extraction
(3)
knowledge distillation
(3)
fine-grained classification
(3)
knowledge extraction
(3)
Papers
OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
ACL 2026
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
CVPR 2025
VERSE: Verification-based Self-Play for Code Instructions
AAAI 2025
ScholarGEC: Enhancing Controllability of Large Language Model for Chinese Academic Grammatical Error Correction
AAAI 2025
MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools
NAACL 2025
From RAG to Memory: Non-Parametric Continual Learning for Large Language Models
ICML 2025
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
ICLR 2025
Distribution-Driven Dense Retrieval: Modeling Many-to-One Query-Document Relationship
AAAI 2025
PQR: Improving Dense Retrieval via Potential Query Modeling
ACL 2025
UniRAG: Unified Query Understanding Method for Retrieval Augmented Generation
ACL 2025
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
ACL 2025
Completing A Systematic Review in Hours instead of Months with Interactive AI Agents
ACL 2025
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents
ACL 2025
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
ICLR 2025
Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers
ICLR 2025
Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis
CVPR 2025
Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation
CVPR 2025
Dual-View Visual Contextualization for Web Navigation
CVPR 2024
HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models
NIPS 2024
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization
NIPS 2024
VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images
NIPS 2024
Fine-Tuning is Fine, if Calibrated
NIPS 2024
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
ICLR 2024
CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework
AAAI 2024
Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts
ICLR 2024
MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction Following
ICLR 2024
AgentBench: Evaluating LLMs as Agents
ICLR 2024
A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis
ICLR 2024
Language Agents: Foundations, Prospects, and Risks
EMNLP 2024
WebOlympus: An Open Platform for Web Agents on Live Websites
EMNLP 2024
I-AM-G: Interest Augmented Multimodal Generator for Item Personalization
EMNLP 2024
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments
EMNLP 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
ACL 2024
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
ACL 2024
RePair: Automated Program Repair with Process-based Feedback
ACL 2024
GPT-4V(ision) is a Generalist Web Agent, if Grounded
ICML 2024
MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions
ICML 2024
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
ICML 2024
BioCLIP: A Vision Foundation Model for the Tree of Life
CVPR 2024
Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms
ACL 2023
Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data
NIPS 2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
NIPS 2023
Donβt Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
ACL 2023
Privacy-Preserving Domain Adaptation of Semantic Parsers
ACL 2023
Few-shot In-context Learning on Knowledge Base Question Answering
ACL 2023
Mind2Web: Towards a Generalist Agent for the Web
NIPS 2023
Text-to-SQL Error Correction with Language Models of Code
ACL 2023
Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors
ACL 2023
Biomedical Language Models are Robust to Sub-optimal Tokenization
ACL 2023
Solving the Right Problem is Key for Translational NLP: A Case Study in UMLS Vocabulary Insertion
EMNLP 2023
Automatic Evaluation of Attribution by Large Language Models
EMNLP 2023
Error Detection for Text-to-SQL Semantic Parsing
EMNLP 2023
LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
ICCV 2023
When More Data Hurts: A Troubling Quirk in Developing Broad-Coverage Natural Language Understanding Systems
EMNLP 2022
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again
EMNLP 2022
One Step at a Time: Long-Horizon Vision-and-Language Navigation With Milestones
CVPR 2022
ArcaneQA: Dynamic Program Induction and Contextualized Encoding for Knowledge Base Question Answering
COLING 2022
Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion
ACL 2022
ITNLP at SemEval-2021 Task 11: Boosting BERT with Sampling and Adversarial Training for Knowledge Extraction
SEMEVAL 2021
ReasonBERT: Pre-trained to Reason with Distant Supervision
EMNLP 2021
An Investigation of Language Model Interpretability via Sentence Editing
EMNLP 2021
ITNLP at SemEval-2021 Task 11: Boosting BERT with Sampling and Adversarial Training for Knowledge Extraction
ACL 2021
A Systematic Investigation of KB-Text Embedding Alignment at Scale
ACL 2021
A Systematic Investigation of KB-Text Embedding Alignment at Scale
IJCNLP 2021
ITNLP at SemEval-2021 Task 11: Boosting BERT with Sampling and Adversarial Training for Knowledge Extraction
IJCNLP 2021
Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention
NAACL 2021
An Imitation Game for Learning Semantic Parsers from User Interaction
EMNLP 2020
KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation
EMNLP 2020
Document Classification for COVID-19 Literature
EMNLP 2020
Logical Natural Language Generation from Open-Domain Tables
ACL 2020
Document Classification for COVID-19 Literature
ACL 2020
Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning
AAAI 2019
How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection
NAACL 2019
Global Textual Relation Embedding for Relational Understanding
ACL 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
IJCNLP 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
EMNLP 2019
Global Relation Embedding for Relation Extraction
NAACL 2018
DialSQL: Dialogue Based Structured Query Generation
ACL 2018
What It Takes to Achieve 100% Condition Accuracy on WikiSQL
EMNLP 2018
XL-NBT: A Cross-lingual Neural Belief Tracking Framework
EMNLP 2018
An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective
EMNLP 2017
Cross-domain Semantic Parsing via Paraphrasing
EMNLP 2017
Recovering Question Answering Errors via Query Revision
EMNLP 2017
On Generating Characteristic-rich Question Sets for QA Evaluation
EMNLP 2016
Improving Semantic Parsing via Answer Type Inference
EMNLP 2016
Cognitive Modelling for Predicting Examinee Performance
IJCAI 2015