Chien-Sheng Wu

65 papers · 2016–2026 · 8 conferences · across top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🌍 Conference Polyglot (8) 🏠 Conference Loyalist (23) 🤝 Dynamic Duo (42) 👥 Mega-Team (23) 🧬 Topic Evolution 🏆 Keyword Champion 🗃️ Keyword Collector (268) ⚡ Prolific Year (11) ❓ The Questioner (6) 💎 Century Club (63) 📈 Trend Setter 🔥 Unstoppable (8)

Conferences

EMNLP (23) ACL (20) NAACL (10) ICLR (6) COLING (2) IJCNLP (2) CONLL (1) NIPS (1)

Top co-authors

Caiming Xiong (42) Philippe Laban (16) Pascale Fung (13) Alexander Fabbri (12) Wenhao Liu (12) Shafiq Joty (12) Prafulla Kumar Choubey (12) Andrea Madotto (10) Richard Socher (8) Kung-Hsiang Huang (8)

Research topics

Privacy (1)

Keywords

large language model (9) question answering (7) text summarization (7) dialogue system (5) text generation (5) retrieval-augmented generation (4) summarization evaluation (4) dialogue state tracking (4) few-shot learning (4) transfer learning (4) natural language inference (4) representation learning (3) evaluation benchmark (3) automatic evaluation (3) knowledge base (3) multi-task learning (3) task-oriented dialogue (3) self-supervised learning (3) question generation (3) language model (3)

Papers

Don’t Stop Early: Scalable Enterprise Deep Research with Controlled Information Flow and Evidence-Aware Termination · ACL 2026
GTA: Generating Long-horizon Tasks for Web Agents at Scale · ACL 2026
SiReRAG: Indexing Similar and Related Information for Multihop Reasoning · ICLR 2025
BingoGuard: LLM Content Moderation Tools with Risk Levels · ICLR 2025
Benchmarking Deep Search over Heterogeneous Enterprise Data · EMNLP 2025
Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding · ACL 2025
ReGenesis: LLMs can Grow into Reasoning Generalists via Self-Improvement · ICLR 2025
CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environments · NAACL 2025
Unanswerability Evaluation for Retrieval Augmented Generation · ACL 2025
Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents · ACL 2025
LAM SIMULATOR: Advancing Data Generation for Large Action Model Training via Online Exploration and Trajectory Feedback · ACL 2025
Evaluating Cultural and Social Awareness of LLM Web Agents · NAACL 2025
ReIFE: Re-evaluating Instruction-Following Evaluation · NAACL 2025
Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage · NAACL 2025
Prompt Leakage effect and mitigation strategies for multi-turn LLM Applications · EMNLP 2024
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems · EMNLP 2024
Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles · NAACL 2024
Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization · NAACL 2024
Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries · EMNLP 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation · EMNLP 2023
SummEdits: Measuring LLM Ability at Factual Reasoning Through The Lens of Summarization · EMNLP 2023
Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning · ACL 2023
Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation · ACL 2023
SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages · ACL 2023
Socratic Pretraining: Question-Driven Pretraining for Controllable Summarization · ACL 2023
CaPE: Contrastive Parameter Ensembling for Reducing Hallucination in Abstractive Summarization · ACL 2023
Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning · ICLR 2023
Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems · EMNLP 2023
INTELMO: Enhancing Models’ Adoption of Interactive Interfaces · EMNLP 2023
Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets · EMNLP 2022
DialFact: A Benchmark for Fact-Checking in Dialogue · ACL 2022
QAConv: Question Answering on Informative Conversations · ACL 2022
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models · EMNLP 2022
Conformal Predictor for Improving Zero-Shot Text Classification Efficiency · EMNLP 2022
Improving Factual Consistency in Summarization with Compression-Based Post-Editing · EMNLP 2022
Discord Questions: A Computational Approach To Diversity Analysis in News Coverage · EMNLP 2022
Numerical Correlation in Text · EMNLP 2022
QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization · NAACL 2022
Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation · NAACL 2022
Exploring Neural Models for Query-Focused Summarization · NAACL 2022
MixQG: Neural Question Generation with Mixed Answer Types · NAACL 2022
Controllable Abstractive Dialogue Summarization with Sketch Supervision · IJCNLP 2021
GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing · ICLR 2021
Controllable Abstractive Dialogue Summarization with Sketch Supervision · ACL 2021
Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading · EMNLP 2020
TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue · EMNLP 2020
Find or Classify? Dual Strategy for Slot-Value Predictions on Multi-Domain Dialog State Tracking · COLING 2020
Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading · ACL 2020
A Simple Language Model for Task-Oriented Dialogue · NIPS 2020
Improving Limited Labeled Dialogue State Tracking with Self-Supervision · EMNLP 2020
Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference · EMNLP 2020
Probing Task-Oriented Dialogue Representation from Language Models · EMNLP 2020
Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning · IJCNLP 2019
Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems · ACL 2019
Clickbait? Sensational Headline Generation with Auto-tuned Reinforcement Learning · EMNLP 2019
Global-to-local Memory Pointer Networks for Task-Oriented Dialogue · ICLR 2019
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences · CONLL 2019
Personalizing Dialogue Agents via Meta-Learning · ACL 2019
Mem2Seq: Effectively Incorporating Knowledge Bases into End-to-End Task-Oriented Dialog Systems · ACL 2018
Emo2Vec: Learning Generalized Emotion Representation by Multi-task Training · EMNLP 2018
Improving Large-Scale Fact-Checking using Decomposable Attention Models and Lexical Tagging · EMNLP 2018
Bilingual Character Representation for Efficiently Addressing Out-of-Vocabulary Words in Code-Switching Named Entity Recognition · ACL 2018
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning · ACL 2018
Real-Time Speech Emotion and Sentiment Recognition for Interactive Dialogue Systems · EMNLP 2016
Zara: A Virtual Interactive Dialogue System Incorporating Emotion, Sentiment and Personality Recognition · COLING 2016