Minlie Huang
237 papers · 2009–2026 · 12 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (20)
Hot Topic Early Bird
Renaissance Researcher (6)
Keyword Pioneer
Interdisciplinary Bridge
Keyword Trendsetter Combo (8)
Conference Loyalist (22)
Grand Slam
Mega-Team (22)
Dynamic Duo (49)
Deep Specialist (38)
Topic Evolution
Keyword Champion (11)
Triple Crown
Trend Setter
Keyword Collector (68)
Conference Pioneer
Prolific Year (8)
Unstoppable (12)
The Questioner
Century Club (221)
Conferences
ACL (93)
EMNLP (53)
IJCNLP (22)
AAAI (17)
ICLR (14)
COLING (9)
IJCAI (9)
NAACL (6)
ICML (5)
NIPS (5)
AACL (3)
ICCV (1)
Research topics
Keywords
large language model (42)
dialogue system (34)
text generation (25)
language model (16)
reinforcement learning (13)
knowledge graph (12)
dialogue generation (11)
question answering (11)
natural language generation (11)
pre-trained language model (10)
neural network (10)
benchmark evaluation (9)
story generation (8)
dialog system (8)
few-shot learning (8)
task-oriented dialog (7)
data augmentation (7)
conversational ai (7)
text classification (6)
commonsense reasoning (6)
Papers
When Smiley Turns Hostile: Interpreting How Emojis Trigger LLMs’ Toxicity
AAAI 2026
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
AAAI 2026
Ψ-Arena: Interactive Assessment and Optimization of LLM-based Psychological Counselors with Tripartite Feedback
AAAI 2026
New Terms, New Toxicity: Consensus-based Chinese Neologism Toxicity Detection via Search-Augmented LLMs
ACL 2026
The Side Effects of Being Smart: Safety Risks in MLLMs’ Multi-Image Reasoning
ACL 2026
Glyph: Scaling Context Windows via Visual-Text Compression
ACL 2026
S^4: Operationalizing Speech Act Theory for Strategic Semi-Structured Psychiatric Interview
ACL 2026
LASA: Language-Agnostic Semantic Alignment at the Semantic Bottleneck for LLM Safety
ACL 2026
Unveiling the Landscape of Clinical Depression Assessment: From Behavioral Signatures to Psychiatric Reasoning
AAAI 2026
How Should We Enhance the Safety of Large Reasoning Models: An Empirical Study
ACL 2026
IF-RewardBench: Benchmarking Judge Models for Instruction-Following Evaluation
ACL 2026
Data Efficient RLVR via Off-Policy Influence Guidance
ACL 2026
IF-CRITIC: Towards a Fine-Grained LLM Critic for Instruction-Following Evaluation
ACL 2026
HoWToBench: Holistic Evaluation for LLM’s Capability in Human-level Writing using Tree of Writing
ACL 2026
DPRM: A Dual Implicit Process Reward Model in Multi-Hop Question Answering
AAAI 2026
WALKSAFE: Risk-aware Graph Random Walk with Bi-GRPO for LLM Safety
AAAI 2026
HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
ACL 2025
Training Language Model to Critique for Better Refinement
ACL 2025
Data Selection via Optimal Control for Language Models
ICLR 2025
Language Models Learn to Mislead Humans via RLHF
ICLR 2025
CodePlan: Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning
ICLR 2025
MiniPLM: Knowledge Distillation for Pre-training Language Models
ICLR 2025
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
ICLR 2025
MAPS: Advancing Multi-Modal Reasoning in Expert-Level Physical Science
ICLR 2025
VPO: Aligning Text-to-Video Generation Models with Prompt Optimization
ICCV 2025
Adversary-Aware DPO: Enhancing Safety Alignment in Vision Language Models via Adversarial Training
EMNLP 2025
DCMKC: A Dual Consistency Matching Approach for Multi-hop Question Answering in LLMs
EMNLP 2025
Crisp: Cognitive Restructuring of Negative Thoughts through Multi-turn Supportive Dialogues
EMNLP 2025
Speculating LLMs’ Chinese Training Data Pollution from Their Tokens
EMNLP 2025
Reframe Your Life Story: Interactive Narrative Therapist and Innovative Moment Assessment with Large Language Models
EMNLP 2025
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak
EMNLP 2025
“I’ve Decided to Leak”: Probing Internals Behind Prompt Leakage Intents
EMNLP 2025
LegalAgentBench: Evaluating LLM Agents in Legal Domain
ACL 2025
Battling against Tough Resister: Strategy Planning with Adversarial Game for Non-collaborative Dialogues
ACL 2025
AGD: Adversarial Game Defense Against Jailbreak Attacks in Large Language Models
ACL 2025
Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
ACL 2025
Advancing Collaborative Debates with Role Differentiation through Multi-Agent Reinforcement Learning
ACL 2025
Understanding the Dark Side of LLMs’ Intrinsic Self-Correction
ACL 2025
Internal Value Alignment in Large Language Models through Controlled Value Vector Activation
ACL 2025
SocialEval: Evaluating Social Intelligence of Large Language Models
ACL 2025
LongSafety: Evaluating Long-Context Safety of Large Language Models
ACL 2025
LogicGame: Benchmarking Rule-Based Reasoning Abilities of Large Language Models
ACL 2025
MHALO: Evaluating MLLMs as Fine-grained Hallucination Detectors
ACL 2025
DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing
ACL 2025
Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings
ACL 2025
DPGA-TextSyn: Differentially Private Genetic Algorithm for Synthetic Text Generation
ACL 2025
DYNTEXT: Semantic-Aware Dynamic Text Sanitization for Privacy-Preserving LLM Inference
ACL 2025
MAGI: Multi-Agent Guided Interview for Psychiatric Assessment
ACL 2025
SocialSim: Towards Socialized Simulation of Emotional Support Conversation
AAAI 2025
SS-GEN: A Social Story Generation Framework with Large Language Models
AAAI 2025
CharacterBench: Benchmarking Character Customization of Large Language Models
AAAI 2025
Model Extrapolation Expedites Alignment
ACL 2025
A Survey of Post-Training Scaling in Large Language Models
ACL 2025
DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding
EMNLP 2024
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings
EMNLP 2024
Instruction Pre-Training: Language Models are Supervised Multitask Learners
EMNLP 2024
Intent-Aware and Hate-Mitigating Counterspeech Generation via Dual-Discriminator Guided LLMs
COLING 2024
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
NIPS 2024
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
NIPS 2024
AgentBench: Evaluating LLMs as Agents
ICLR 2024
Language Model Decoding as Direct Metrics Optimization
ICLR 2024
MiniLLM: Knowledge Distillation of Large Language Models
ICLR 2024
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
ICLR 2024
Large Language Models Are Not Robust Multiple Choice Selectors
ICLR 2024
Towards Efficient Exact Optimization of Language Model Alignment
ICML 2024
On Prompt-Driven Safeguarding for Large Language Models
ICML 2024
Perception of Knowledge Boundary for Large Language Models through Semi-open-ended Question Answering
NIPS 2024
ToMBench: Benchmarking Theory of Mind in Large Language Models
ACL 2024
COKE: A Cognitive Knowledge Graph for Machine Theory of Mind
ACL 2024
360°REA: Towards A Reusable Experience Accumulation with 360° Assessment for Multi-Agent System
ACL 2024
Language Models Hallucinate, but May Excel at Fact Verification
NAACL 2024
Depression Detection in Clinical Interviews with LLM-Empowered Structural Element Graph
NAACL 2024
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
ACL 2024
EmoBench: Evaluating the Emotional Intelligence of Large Language Models
ACL 2024
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
ACL 2024
AlignBench: Benchmarking Chinese Alignment of Large Language Models
ACL 2024
Learning Task Decomposition to Assist Humans in Competitive Programming
ACL 2024
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
ACL 2024
SafetyBench: Evaluating the Safety of Large Language Models
ACL 2024
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
EMNLP 2024
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
EMNLP 2024
CharacterGLM: Customizing Social Characters with Large Language Models
EMNLP 2024
Thoughts to Target: Enhance Planning for Target-driven Conversation
EMNLP 2024
Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules
EMNLP 2024
Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy
EMNLP 2023
Generating Coherent Narratives by Learning Dynamic and Discrete Entity States with a Contrastive Framework
AAAI 2023
KPT: Keyword-Guided Pre-training for Grounded Dialog Generation
AAAI 2023
Facilitating Multi-turn Emotional Support Conversation with Positive Emotion Elicitation: A Reinforcement Learning Approach
ACL 2023
MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions
ACL 2023
Pre-Training to Learn in Context
ACL 2023
CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response Generation
ACL 2023
DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering
ACL 2023
ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation
ACL 2023
StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing
ACL 2023
Self-Supervised Sentence Polishing by Adding Engaging Modifiers
ACL 2023
Goal Awareness for Conversational AI: Proactivity, Non-collaborativity, and Beyond
ACL 2023
PAL: Persona-Augmented Emotional Support Conversation Generation
ACL 2023
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning
ACL 2023
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation
ACL 2023
E-NER: Evidential Deep Learning for Trustworthy Named Entity Recognition
ACL 2023
Mitigating the Learning Bias towards Repetition by Self-Contrastive Training for Open-Ended Generation
ACL 2023
Uncertainty-Aware Unlikelihood Learning Improves Generative Aspect Sentiment Quad Prediction
ACL 2023
Unveiling the Implicit Toxicity in Large Language Models
EMNLP 2023
Re3Dial: Retrieve, Reorganize and Rescale Conversations for Long-Turn Open-Domain Dialogue Pre-training
EMNLP 2023
Multi-Source Probing for Open-Domain Conversational Understanding
EMNLP 2023
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and Beyond
EMNLP 2023
Building Multi-domain Dialog State Trackers from Single-domain Dialogs
EMNLP 2023
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
EMNLP 2023
InstructSafety: A Unified Framework for Building Multidimensional and Explainable Safety Detector through Instruction Tuning
EMNLP 2023
Tailoring Language Generation Models under Total Variation Distance
ICLR 2023
Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models
ICML 2023
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation
ACL 2022
Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework
ACL 2022
Continual Prompt Tuning for Dialog State Tracking
ACL 2022
CDConv: A Benchmark for Contradiction Detection in Chinese Conversations
EMNLP 2022
Aligning Recommendation and Conversation via Dual Imitation
EMNLP 2022
Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization
EMNLP 2022
COLD: A Benchmark for Chinese Offensive Language Detection
EMNLP 2022
Automatic Comment Generation for Chinese Student Narrative Essays
EMNLP 2022
WSpeller: Robust Word Segmentation for Enhancing Chinese Spelling Check
EMNLP 2022
AutoCAD: Automatically Generate Counterfactuals for Mitigating Shortcut Learning
EMNLP 2022
Chaining Simultaneous Thoughts for Numerical Reasoning
EMNLP 2022
Towards Identifying Social Bias in Dialog Systems: Framework, Dataset, and Benchmark
EMNLP 2022
Constructing Highly Inductive Contexts for Dialogue Safety through Controllable Reverse Generation
EMNLP 2022
A Unified Dialogue User Simulator for Few-shot Data Augmentation
EMNLP 2022
On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark
ACL 2022
Acceleration of Federated Learning with Alleviated Forgetting in Local Training
ICLR 2022
LaMemo: Language Modeling with Look-Ahead Memory
NAACL 2022
CEM: Commonsense-Aware Empathetic Response Generation
AAAI 2022
Curriculum-Based Self-Training Makes Better Few-Shot Learners for Data-to-Text Generation
IJCAI 2022
A Corpus for Understanding and Generating Moral Stories
NAACL 2022
Directed Acyclic Transformer for Non-Autoregressive Machine Translation
ICML 2022
Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation
NAACL 2022
On the Learning of Non-Autoregressive Transformers
ICML 2022
Rethinking and Refining the Distinct Metric
ACL 2022
PPT: Pre-trained Prompt Tuning for Few-shot Learning
ACL 2022
CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation
ACL 2021
KuiLeiXi: a Chinese Open-Ended Text Adventure Game
ACL 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
ACL 2021
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
ACL 2021
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering
ACL 2021
CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for Conversational Recommendation
EMNLP 2021
Stylized Story Generation with Style-Guided Planning
IJCNLP 2021
Diversifying Dialog Generation via Adaptive Label Smoothing
ACL 2021
Towards Emotional Support Dialog Systems
ACL 2021
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning
ACL 2021
A Semantic-based Method for Unsupervised Commonsense Question Answering
ACL 2021
Robustness Testing of Language Understanding in Task-Oriented Dialog
ACL 2021
Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
EMNLP 2021
HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management
IJCNLP 2021
EARL: Informative Knowledge-Grounded Conversation Generation with Entity-Agnostic Representation Learning
EMNLP 2021
Transferable Persona-Grounded Dialogues via Grounded Minimal Edits
EMNLP 2021
NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer
IJCNLP 2021
PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support
IJCNLP 2021
DiscoDVT: Generating Long Text with Discourse-Aware Discrete Variational Transformer
EMNLP 2021
CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation
IJCNLP 2021
KuiLeiXi: a Chinese Open-Ended Text Adventure Game
IJCNLP 2021
OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
IJCNLP 2021
Long Text Generation by Modeling Sentence-Level and Discourse-Level Coherence
IJCNLP 2021
A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering
IJCNLP 2021
Diversifying Dialog Generation via Adaptive Label Smoothing
IJCNLP 2021
Towards Emotional Support Dialog Systems
IJCNLP 2021
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning
IJCNLP 2021
A Semantic-based Method for Unsupervised Commonsense Question Answering
IJCNLP 2021
Independence-aware Advantage Estimation
IJCAI 2021
Robustness Testing of Language Understanding in Task-Oriented Dialog
IJCNLP 2021
When does Further Pre-training MLM Help? An Empirical Study on Task-Oriented Dialog Pre-training
EMNLP 2021
Stylized Dialogue Response Generation Using Stylized Unpaired Texts
AAAI 2021
Turn-Level User Satisfaction Estimation in E-commerce Customer Service
IJCNLP 2021
JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs
IJCNLP 2021
Turn-Level User Satisfaction Estimation in E-commerce Customer Service
ACL 2021
JointGT: Graph-Text Joint Representation Learning for Text Generation from Knowledge Graphs
ACL 2021
Stylized Story Generation with Style-Guided Planning
ACL 2021
HyKnow: End-to-End Task-Oriented Dialog Modeling with Hybrid Knowledge Management
ACL 2021
NAST: A Non-Autoregressive Generator with Word Alignment for Unsupervised Text Style Transfer
ACL 2021
PsyQA: A Chinese Dataset for Generating Long Counseling Text for Mental Health Support
ACL 2021
KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation
ACL 2020
Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph
EMNLP 2020
Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data
EMNLP 2020
SentiLARE: Sentiment-Aware Language Representation Learning with Linguistic Knowledge
EMNLP 2020
UNION: An Unreferenced Metric for Evaluating Open-ended Story Generation
EMNLP 2020
Youling: an AI-assisted Lyrics Creation System
EMNLP 2020
Difference-aware Knowledge Selection for Knowledge-grounded Conversation Generation
EMNLP 2020
Robustness to Modification with Shared Words in Paraphrase Identification
EMNLP 2020
Continual Learning for Natural Language Generation in Task-oriented Dialog Systems
EMNLP 2020
Generating Commonsense Explanation by Extracting Bridge Concepts from Reasoning Paths
AACL 2020
Learning Goal-oriented Dialogue Policy with Opposite Agent Awareness
AACL 2020
A Pre-Training Based Personalized Dialogue Generation Model with Persona-Sparse Data
AAAI 2020
Automatic Perturbation Analysis for Scalable Certified Robustness and Beyond
NIPS 2020
Robustness Verification for Transformers
ICLR 2020
ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems
ACL 2020
A Self-Training Method for Machine Reading Comprehension with Soft Evidence Extraction
ACL 2020
Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition
ACL 2020
Reinforced Molecular Optimization with Neighborhood-Controlled Grammars
NIPS 2020
ExpanRL: Hierarchical Reinforcement Learning for Course Concept Expansion in MOOCs
AACL 2020
ARAML: A Stable Adversarial Training Framework for Text Generation
EMNLP 2019
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
IJCNLP 2019
Long and Diverse Text Generation with Planning-based Hierarchical Variational Model
IJCNLP 2019
ARAML: A Stable Adversarial Training Framework for Text Generation
IJCNLP 2019
Meta-Learning for Low-resource Natural Language Generation in Task-oriented Dialogue Systems
IJCAI 2019
ChID: A Large-scale Chinese IDiom Dataset for Cloze Test
ACL 2019
ConvLab: Multi-Domain End-to-End Dialog System Platform
ACL 2019
Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog
EMNLP 2019
Story Ending Generation with Incremental Encoding and Commonsense Knowledge
AAAI 2019
A Deep Sequential Model for Discourse Parsing on Multi-Party Dialogues
AAAI 2019
A Hierarchical Framework for Relation Extraction with Reinforcement Learning
AAAI 2019
Long and Diverse Text Generation with Planning-based Hierarchical Variational Model
EMNLP 2019
Generating Informative Responses with Controlled Sentence Function
ACL 2018
An Interpretable Reasoning Network for Multi-Relation Question Answering
COLING 2018
Learning to Ask Questions in Open-domain Conversational Systems with Typed Decoders
ACL 2018
Commonsense Knowledge Aware Conversation Generation with Graph Attention
IJCAI 2018
Densely Connected CNN with Multi-scale Feature Attention for Text Classification
IJCAI 2018
A Weakly Supervised Method for Topic Segmentation and Labeling in Goal-oriented Dialogues via Reinforcement Learning
IJCAI 2018
Assigning Personality/Profile to a Chatting Machine for Coherent Conversation Generation
IJCAI 2018
An Operation Network for Abstractive Sentence Compression
COLING 2018
Linguistically Regularized LSTM for Sentiment Classification
ACL 2017
From One Point to a Manifold: Knowledge Graph Embedding for Precise Link Prediction
IJCAI 2016
TransG: A Generative Model for Knowledge Graph Embedding
ACL 2016
A Sentence Interaction Network for Modeling Dependence between Sentences
ACL 2016
Attention-based LSTM for Aspect-level Sentiment Classification
EMNLP 2016
GAKE: Graph Aware Knowledge Embedding
COLING 2016
Product Review Summarization by Exploiting Phrase Properties
COLING 2016
Context-aware Natural Language Generation for Spoken Dialogue Systems
COLING 2016
Incorporating Domain and Sentiment Supervision in Representation Learning for Domain Adaptation
IJCAI 2015
Learning Tag Embeddings and Tag-specific Composition Functions in Recursive Neural Network
ACL 2015
Learning Tag Embeddings and Tag-specific Composition Functions in Recursive Neural Network
IJCNLP 2015
New Word Detection for Sentiment Analysis
ACL 2014
Clustering Aspect-related Phrases by Leveraging Sentiment Distribution Consistency
EMNLP 2014
Fine Granular Aspect Analysis using Latent Structural Models
ACL 2012
Quality-biased Ranking of Short Texts in Microblogging Services
IJCNLP 2011
Learning to Link Entities with Knowledge Base
NAACL 2010
Structure-Aware Review Mining and Summarization
COLING 2010
Learning to Annotate Scientific Publications
COLING 2010
A Comparative Study on Ranking and Selection Strategies for Multi-Document Summarization
COLING 2010
Metadata-Aware Measures for Answer Summarization in Community Question Answering
ACL 2010
Answering Opinion Questions with Random Walks on Graphs
ACL 2009
Answering Opinion Questions with Random Walks on Graphs
IJCNLP 2009