Chien-Sheng Wu
65 papers · 2016–2026 · 8 conferences · across top CS/AI conferences
Achievements
Hot Topic Early Bird
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (12)
Conference Polyglot (8)
Conference Loyalist (23)
Dynamic Duo (42)
Mega-Team (23)
Topic Evolution
Keyword Champion
Keyword Collector (268)
Prolific Year (11)
The Questioner (6)
Century Club (63)
Trend Setter
Unstoppable (8)
Conferences
EMNLP (23)
ACL (20)
NAACL (10)
ICLR (6)
COLING (2)
IJCNLP (2)
CONLL (1)
NIPS (1)
Keywords
large language model (9)
question answering (7)
text summarization (7)
dialogue system (5)
text generation (5)
retrieval-augmented generation (4)
summarization evaluation (4)
dialogue state tracking (4)
few-shot learning (4)
transfer learning (4)
natural language inference (4)
representation learning (3)
evaluation benchmark (3)
automatic evaluation (3)
knowledge base (3)
multi-task learning (3)
task-oriented dialogue (3)
self-supervised learning (3)
question generation (3)
language model (3)
Papers
Don't Stop Early: Scalable Enterprise Deep Research with Controlled Information Flow and Evidence-Aware Termination
ACL 2026
GTA: Generating Long-horizon Tasks for Web Agents at Scale
ACL 2026
SiReRAG: Indexing Similar and Related Information for Multihop Reasoning
ICLR 2025
BingoGuard: LLM Content Moderation Tools with Risk Levels
ICLR 2025
Benchmarking Deep Search over Heterogeneous Enterprise Data
EMNLP 2025
Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding
ACL 2025
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement
ICLR 2025
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments
NAACL 2025
Unanswerability Evaluation for Retrieval Augmented Generation
ACL 2025
Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents
ACL 2025
LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback
ACL 2025
Evaluating Cultural and Social Awareness of LLM Web Agents
NAACL 2025
ReIFE: Re-evaluating Instruction-Following Evaluation
NAACL 2025
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage
NAACL 2025
Prompt Leakage effect and mitigation strategies for multi-turn LLM Applications
EMNLP 2024
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
EMNLP 2024
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles
NAACL 2024
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
NAACL 2024
Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries
EMNLP 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
EMNLP 2023
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization
EMNLP 2023
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning
ACL 2023
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation
ACL 2023
SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages
ACL 2023
Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization
ACL 2023
CaPE: Contrastive Parameter Ensembling for Reducing Hallucination in Abstractive Summarization
ACL 2023
Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning
ICLR 2023
Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems
EMNLP 2023
INTELMO: Enhancing Models' Adoption of Interactive Interfaces
EMNLP 2023
Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets
EMNLP 2022
DialFact: A Benchmark for Fact-Checking in Dialogue
ACL 2022
QAConv: Question Answering on Informative Conversations
ACL 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models
EMNLP 2022
Conformal Predictor for Improving Zero-Shot Text Classification Efficiency
EMNLP 2022
Improving Factual Consistency in Summarization with Compression-Based Post-Editing
EMNLP 2022
Discord Questions: A Computational Approach To Diversity Analysis in News Coverage
EMNLP 2022
Numerical Correlation in Text
EMNLP 2022
QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization
NAACL 2022
Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation
NAACL 2022
Exploring Neural Models for Query-Focused Summarization
NAACL 2022
MixQG: Neural Question Generation with Mixed Answer Types
NAACL 2022
Controllable Abstractive Dialogue Summarization with Sketch Supervision
IJCNLP 2021
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing
ICLR 2021
Controllable Abstractive Dialogue Summarization with Sketch Supervision
ACL 2021
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading
EMNLP 2020
TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue
EMNLP 2020
Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking
COLING 2020
Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading
ACL 2020
A Simple Language Model for Task-Oriented Dialogue
NIPS 2020
Improving Limited Labeled Dialogue State Tracking with Self-Supervision
EMNLP 2020
Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference
EMNLP 2020
Probing Task-Oriented Dialogue Representation from Language Models
EMNLP 2020
Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning
IJCNLP 2019
Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems
ACL 2019
Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning
EMNLP 2019
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue
ICLR 2019
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences
CONLL 2019
Personalizing Dialogue Agents via Meta-Learning
ACL 2019
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems
ACL 2018
Emo2Vec: Learning Generalized Emotion Representation by Multi-task Training
EMNLP 2018
Improving Large-Scale Fact-Checking using Decomposable Attention Models and Lexical Tagging
EMNLP 2018
Bilingual Character Representation for Efficiently Addressing Out-of-Vocabulary Words in Code-Switching Named Entity Recognition
ACL 2018
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning
ACL 2018
Real-Time Speech Emotion and Sentiment Recognition for Interactive Dialogue Systems
EMNLP 2016
Zara: A Virtual Interactive Dialogue System Incorporating Emotion, Sentiment and Personality Recognition
COLING 2016