Huan Sun
64 papers · 2016–2025 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
🌍 Conference Polyglot (11) 🏃 Academic Marathon (9) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (11)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(11)
🤝
Dynamic Duo
(25)
👑
Triple Crown
🏆
Grand Slam
👥
Mega-Team
(22)
🔬
Deep Specialist
(12)
🧬
Topic Evolution
🏆
Keyword Champion
❓
The Questioner
(6)
📈
Trend Setter
🗃️
Keyword Collector
(236)
🔥
Unstoppable
(10)
⚡
Prolific Year
(7)
💎
Century Club
(64)
🚀
Conference Pioneer
Conferences
EMNLP (17)
ACL (15)
NAACL (9)
ICLR (8)
AAAI (3)
ICML (3)
IJCNLP (3)
NIPS (3)
COLING (1)
CVPR (1)
IJCAI (1)
Top co-authors
Keywords
large language model
(13)
question answering
(11)
semantic parsing
(9)
distant supervision
(4)
relation extraction
(4)
benchmark evaluation
(4)
neural network
(4)
language model
(3)
interactive parsing
(3)
in-context learning
(3)
agent system
(3)
adversarial attack
(2)
instruction following
(2)
active learning
(2)
knowledge base
(2)
vision-language model
(2)
few-shot learning
(2)
multi-task learning
(2)
code generation
(2)
multi-instance learning
(2)
Papers
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents
ICLR 2025
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs
ICLR 2025
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
ICLR 2025
AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
EMNLP 2025
AdvAgent: Controllable Blackbox Red-teaming on Web Agents
ICML 2025
Tooling or Not Tooling? The Impact of Tools on Language Agents for Chemistry Problem Solving
NAACL 2025
GroundCocoa: A Benchmark for Evaluating Compositional & Conditional Reasoning in Language Models
NAACL 2025
AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection
ACL 2025
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
ACL 2025
EIA: ENVIRONMENTAL INJECTION ATTACK ON GENERALIST WEB AGENTS FOR PRIVACY LEAKAGE
ICLR 2025
GPT-4V(ision) is a Generalist Web Agent, if Grounded
ICML 2024
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
ICML 2024
Grokking of Implicit Reasoning in Transformers: A Mechanistic Journey to the Edge of Generalization
NIPS 2024
Insights of a Usability Study for KBQA Interactive Semantic Parsing: Generation Yields Benefits over Templates but External Validity Remains Challenging
COLING 2024
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
CVPR 2024
Combating Security and Privacy Issues in the Era of Large Language Models
NAACL 2024
A Multi-Aspect Framework for Counter Narrative Evaluation using Large Language Models
NAACL 2024
MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning
ICLR 2024
AgentBench: Evaluating LLMs as Agents
ICLR 2024
When is Tree Search Useful for LLM Planning? It Depends on the Discriminator
ACL 2024
WebOlympus: An Open Platform for Web Agents on Live Websites
EMNLP 2024
TableLlama: Towards Open Large Generalist Models for Tables
NAACL 2024
How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities
NAACL 2024
AttributionBench: How Hard is Automatic Attribution Evaluation?
ACL 2024
Mind2Web: Towards a Generalist Agent for the Web
NIPS 2023
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing
NIPS 2023
Synthetic Text Generation with Differential Privacy: A Simple and Practical Recipe
ACL 2023
Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters
ACL 2023
Federated Learning for Semantic Parsing: Task Formulation, Evaluation Setup, New Algorithms
ACL 2023
Text-to-SQL Error Correction with Language Models of Code
ACL 2023
Biomedical Language Models are Robust to Sub-optimal Tokenization
ACL 2023
Exploring Chain of Thought Style Prompting for Text-to-SQL
EMNLP 2023
Automatic Evaluation of Attribution by Large Language Models
EMNLP 2023
Error Detection for Text-to-SQL Semantic Parsing
EMNLP 2023
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
EMNLP 2023
Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning
ICLR 2023
Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction
ACL 2022
Thinking about GPT-3 In-Context Learning for Biomedical IE? Think Again
EMNLP 2022
Synthetic Question Value Estimation for Domain Adaptation of Question Answering
ACL 2022
Knowledge Transfer between Structured and Unstructured Sources for Complex Question Answering
NAACL 2022
Iteratively Prompt Pre-trained Language Models for Chain of Thought
EMNLP 2022
Differential Privacy for Text Analytics via Natural Text Sanitization
IJCNLP 2021
Structure-Grounded Pretraining for Text-to-SQL
NAACL 2021
Differential Privacy for Text Analytics via Natural Text Sanitization
ACL 2021
Learning Structural Edits via Incremental Tree Transformations
ICLR 2021
COUGH: A Challenge Dataset and Models for COVID-19 FAQ Retrieval
EMNLP 2021
ReasonBERT: Pre-trained to Reason with Distant Supervision
EMNLP 2021
Question-Driven Purchasing Propensity Analysis for Recommendation
AAAI 2020
Adversarial Training for Code Retrieval with Question-Description Relevance Regularization
EMNLP 2020
An Imitation Game for Learning Semantic Parsers from User Interaction
EMNLP 2020
Learning a Cost-Effective Annotation Policy for Question Answering
EMNLP 2020
EndCold: An End-to-End Framework for Cold Question Routing in Community Question Answering Services
IJCAI 2020
Rationalizing Medical Relation Prediction from Corpus-level Statistics
ACL 2020
Clinical Reading Comprehension: A Thorough Analysis of the emrQA Dataset
ACL 2020
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
IJCNLP 2019
Answer Identification from Product Reviews for User Questions by Multi-Task Attentive Networks
AAAI 2019
Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning
AAAI 2019
Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction
IJCNLP 2019
Leveraging 2-hop Distant Supervision from Table Entity Pairs for Relation Extraction
EMNLP 2019
Reinforced Dynamic Reasoning for Conversational Question Generation
ACL 2019
Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study
EMNLP 2019
Global Relation Embedding for Relation Extraction
NAACL 2018
An End-to-End Deep Framework for Answer Triggering with a Novel Group-Level Objective
EMNLP 2017
On Generating Characteristic-rich Question Sets for QA Evaluation
EMNLP 2016