Yu Su

88 papers · 2015–2026 · 13 conferences · across top CS/AI conferences

Achievements

+17 more ↓

🧭 Keyword Pioneer 🌍 Conference Polyglot (13) 🗺️ Taxonomy Completionist (12) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (10)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🏠 Conference Loyalist (22) 🤝 Dynamic Duo (25) 👑 Triple Crown 🏆 Grand Slam 👥 Mega-Team (28) 🔬 Deep Specialist (13) 🧬 Topic Evolution 🚀 Conference Pioneer ⚡ Prolific Year (5) ❓ The Questioner (3) 🗃️ Keyword Collector (353) 💎 Century Club (87) 🔥 Unstoppable (11) 📈 Trend Setter

Conferences

ACL (23) EMNLP (22) ICLR (9) CVPR (7) NIPS (7) AAAI (5) ICML (4) NAACL (4) IJCNLP (3) COLING (1) ICCV (1) IJCAI (1) SEMEVAL (1)

Top co-authors

Huan Sun (25) Wenhu Chen (13) Xifeng Yan (11) Kai Zhang (10) Yu Gu (10) Wei-Lun Chao (9) Bernal Jiménez Gutiérrez (8) Qi Liu (8) Tanya Berger-Wolf (7) Rui Li (6)

Research topics

Robotics (1) Core AI (1)

Keywords

large language model (15) semantic parsing (13) question answering (9) knowledge base (6) language model (6) domain adaptation (5) transfer learning (5) pre-trained language model (5) few-shot learning (4) in-context learning (4) knowledge graph (4) text classification (4) web agent (4) relation extraction (4) zero-shot learning (4) vision-language model (3) information extraction (3) knowledge distillation (3) fine-grained classification (3) knowledge extraction (3)

Papers

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation ACL 2026 RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics CVPR 2025 VERSE: Verification-based Self-Play for Code Instructions AAAI 2025 ScholarGEC: Enhancing Controllability of Large Language Model for Chinese Academic Grammatical Error Correction AAAI 2025 MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools NAACL 2025 From RAG to Memory: Non-Parametric Continual Learning for Large Language Models ICML 2025 VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents ICLR 2025 Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents ICLR 2025 Distribution-Driven Dense Retrieval: Modeling Many-to-One Query-Document Relationship AAAI 2025 PQR: Improving Dense Retrieval via Potential Query Modeling ACL 2025 UniRAG: Unified Query Understanding Method for Retrieval Augmented Generation ACL 2025 MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark ACL 2025 Completing A Systematic Review in Hours instead of Months with Interactive AI Agents ACL 2025 Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents ACL 2025 ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ICLR 2025 Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers ICLR 2025 Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis CVPR 2025 Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation CVPR 2025 Dual-View Visual Contextualization for Web Navigation CVPR 2024 HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models NIPS 2024 Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization NIPS 2024 VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images NIPS 2024 Fine-Tuning is Fine, if Calibrated NIPS 2024 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning ICLR 2024 CONSIDER: Commonalities and Specialties Driven Multilingual Code Retrieval Framework AAAI 2024 Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts ICLR 2024 MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction Following ICLR 2024 AgentBench: Evaluating LLMs as Agents ICLR 2024 A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis ICLR 2024 Language Agents: Foundations, Prospects, and Risks EMNLP 2024 WebOlympus: An Open Platform for Web Agents on Live Websites EMNLP 2024 I-AM-G: Interest Augmented Multimodal Generator for Item Personalization EMNLP 2024 Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments EMNLP 2024 MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI CVPR 2024 LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error ACL 2024 When is Tree Search Useful for LLM Planning? It Depends on the Discriminator ACL 2024 RePair: Automated Program Repair with Process-based Feedback ACL 2024 GPT-4V(ision) is a Generalist Web Agent, if Grounded ICML 2024 MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions ICML 2024 TravelPlanner: A Benchmark for Real-World Planning with Language Agents ICML 2024 BioCLIP: A Vision Foundation Model for the Tree of Life CVPR 2024 Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms ACL 2023 Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data NIPS 2023 MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing NIPS 2023 Don’t Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments ACL 2023 Privacy-Preserving Domain Adaptation of Semantic Parsers ACL 2023 Few-shot In-context Learning on Knowledge Base Question Answering ACL 2023 Mind2Web: Towards a Generalist Agent for the Web NIPS 2023 Text-to-SQL Error Correction with Language Models of Code ACL 2023 Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors ACL 2023 Biomedical Language Models are Robust to Sub-optimal Tokenization ACL 2023 Solving the Right Problem is Key for Translational NLP: A Case Study in UMLS Vocabulary Insertion EMNLP 2023 Automatic Evaluation of Attribution by Large Language Models EMNLP 2023 Error Detection for Text-to-SQL Semantic Parsing EMNLP 2023 LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models ICCV 2023 When More Data Hurts: A Troubling Quirk in Developing Broad-Coverage Natural Language Understanding Systems EMNLP 2022 Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again EMNLP 2022 One Step at a Time: Long-Horizon Vision-and-Language Navigation With Milestones CVPR 2022 ArcaneQA: Dynamic Program Induction and Contextualized Encoding for Knowledge Base Question Answering COLING 2022 Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion ACL 2022 ITNLP at SemEval-2021 Task 11: Boosting BERT with Sampling and Adversarial Training for Knowledge Extraction SEMEVAL 2021 ReasonBERT: Pre-trained to Reason with Distant Supervision EMNLP 2021 An Investigation of Language Model Interpretability via Sentence Editing EMNLP 2021 ITNLP at SemEval-2021 Task 11: Boosting BERT with Sampling and Adversarial Training for Knowledge Extraction ACL 2021 A Systematic Investigation of KB-Text Embedding Alignment at Scale ACL 2021 A Systematic Investigation of KB-Text Embedding Alignment at Scale IJCNLP 2021 ITNLP at SemEval-2021 Task 11: Boosting BERT with Sampling and Adversarial Training for Knowledge Extraction IJCNLP 2021 Compositional Generalization for Neural Semantic Parsing via Span-level Supervised Attention NAACL 2021 An Imitation Game for Learning Semantic Parsers from User Interaction EMNLP 2020 KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation EMNLP 2020 Document Classification for COVID-19 Literature EMNLP 2020 Logical Natural Language Generation from Open-Domain Tables ACL 2020 Document Classification for COVID-19 Literature ACL 2020 Learning to Compose Topic-Aware Mixture of Experts for Zero-Shot Video Captioning AAAI 2019 How Large a Vocabulary Does Text Classification Need? A Variational Approach to Vocabulary Selection NAACL 2019 Global Textual Relation Embedding for Relational Understanding ACL 2019 Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study IJCNLP 2019 Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study EMNLP 2019 Global Relation Embedding for Relation Extraction NAACL 2018 DialSQL: Dialogue Based Structured Query Generation ACL 2018 What It Takes to Achieve 100% Condition Accuracy on WikiSQL EMNLP 2018 XL-NBT: A Cross-lingual Neural Belief Tracking Framework EMNLP 2018 An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective EMNLP 2017 Cross-domain Semantic Parsing via Paraphrasing EMNLP 2017 Recovering Question Answering Errors via Query Revision EMNLP 2017 On Generating Characteristic-rich Question Sets for QA Evaluation EMNLP 2016 Improving Semantic Parsing via Answer Type Inference EMNLP 2016 Cognitive Modelling for Predicting Examinee Performance IJCAI 2015