Huan Sun

64 papers · 2016–2025 · 11 conferences · across top CS/AI conferences

Achievements

+17 more ↓

🌍 Conference Polyglot (11) 🏃 Academic Marathon (9) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (11)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (11) 🤝 Dynamic Duo (25) 👑 Triple Crown 🏆 Grand Slam 👥 Mega-Team (22) 🔬 Deep Specialist (12) 🧬 Topic Evolution 🏆 Keyword Champion ❓ The Questioner (6) 📈 Trend Setter 🗃️ Keyword Collector (236) 🔥 Unstoppable (10) ⚡ Prolific Year (7) 💎 Century Club (64) 🚀 Conference Pioneer

Conferences

EMNLP (17) ACL (15) NAACL (9) ICLR (8) AAAI (3) ICML (3) IJCNLP (3) NIPS (3) COLING (1) CVPR (1) IJCAI (1)

Top co-authors

Yu Su (25) Xiang Yue (14) Xiang Deng (9) Ziru Chen (9) Boshi Wang (8) Lingbo Mo (8) Ziyu Yao (7) Shijie Chen (5) Boyuan Zheng (5) Botao Yu (5)

Keywords

large language model (13) question answering (11) semantic parsing (9) distant supervision (4) relation extraction (4) benchmark evaluation (4) neural network (4) language model (3) interactive parsing (3) in-context learning (3) agent system (3) adversarial attack (2) instruction following (2) active learning (2) knowledge base (2) vision-language model (2) few-shot learning (2) multi-task learning (2) code generation (2) multi-instance learning (2)

Papers

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents ICLR 2025 AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs ICLR 2025 ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery ICLR 2025 AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists EMNLP 2025 AdvAgent: Controllable Blackbox Red-teaming on Web Agents ICML 2025 Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving NAACL 2025 GroundCocoa: A Benchmark for Evaluating Compositional & Conditional Reasoning in Language Models NAACL 2025 AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection ACL 2025 MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark ACL 2025 EIA: ENVIRONMENTAL INJECTION ATTACK ON GENERALIST WEB AGENTS FOR PRIVACY LEAKAGE ICLR 2025 GPT-4V(ision) is a Generalist Web Agent, if Grounded ICML 2024 eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data ICML 2024 Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization NIPS 2024 Insights of a Usability Study for KBQA Interactive Semantic Parsing: Generation Yields Benefits over Templates but External Validity Remains Challenging COLING 2024 MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI CVPR 2024 Combating Security and Privacy Issues in the Era of Large Language Models NAACL 2024 A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models NAACL 2024 MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning ICLR 2024 AgentBench: Evaluating LLMs as Agents ICLR 2024 When is Tree Search Useful for LLM Planning? It Depends on the Discriminator ACL 2024 WebOlympus: An Open Platform for Web Agents on Live Websites EMNLP 2024 TableLlama: Towards Open Large Generalist Models for Tables NAACL 2024 How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities NAACL 2024 AttributionBench: How Hard is Automatic Attribution Evaluation? ACL 2024 Mind2Web: Towards a Generalist Agent for the Web NIPS 2023 MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing NIPS 2023 Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe ACL 2023 Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters ACL 2023 Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms ACL 2023 Text-to-SQL Error Correction with Language Models of Code ACL 2023 Biomedical Language Models are Robust to Sub-optimal Tokenization ACL 2023 Exploring Chain of Thought Style Prompting for Text-to-SQL EMNLP 2023 Automatic Evaluation of Attribution by Large Language Models EMNLP 2023 Error Detection for Text-to-SQL Semantic Parsing EMNLP 2023 Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate EMNLP 2023 Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning ICLR 2023 Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction ACL 2022 Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again EMNLP 2022 Synthetic Question Value Estimation for Domain Adaptation of Question Answering ACL 2022 Knowledge Transfer between Structured and Unstructured Sources for Complex Question Answering NAACL 2022 Iteratively Prompt Pre-trained Language Models for Chain of Thought EMNLP 2022 Differential Privacy for Text Analytics via Natural Text Sanitization IJCNLP 2021 Structure-Grounded Pretraining for Text-to-SQL NAACL 2021 Differential Privacy for Text Analytics via Natural Text Sanitization ACL 2021 Learning Structural Edits via Incremental Tree Transformations ICLR 2021 COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval EMNLP 2021 ReasonBERT: Pre-trained to Reason with Distant Supervision EMNLP 2021 Question-Driven Purchasing Propensity Analysis for Recommendation AAAI 2020 Adversarial Training for Code Retrieval with Question-Description Relevance Regularization EMNLP 2020 An Imitation Game for Learning Semantic Parsers from User Interaction EMNLP 2020 Learning a Cost-Effective Annotation Policy for Question Answering EMNLP 2020 EndCold: An End-to-End Framework for Cold Question Routing in Community Question Answering Services IJCAI 2020 Rationalizing Medical Relation Prediction from Corpus-level Statistics ACL 2020 Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset ACL 2020 Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study IJCNLP 2019 Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks AAAI 2019 Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning AAAI 2019 Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction IJCNLP 2019 Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction EMNLP 2019 Reinforced Dynamic Reasoning for Conversational Question Generation ACL 2019 Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study EMNLP 2019 Global Relation Embedding for Relation Extraction NAACL 2018 An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective EMNLP 2017 On Generating Characteristic-rich Question Sets for QA Evaluation EMNLP 2016