Nanyun Peng
205 papers · 2012–2026 · 17 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
๐งญ Keyword Pioneer ๐ Conference Polyglot (17) ๐บ๏ธ Taxonomy Completionist (13) ๐ Interdisciplinary Bridge ๐ Academic Marathon (13)
๐
Interdisciplinary Bridge
๐บ๏ธ
Taxonomy Completionist
(13)
๐งญ
Keyword Pioneer
๐
Conference Loyalist
(49)
๐ค
Dynamic Duo
(56)
๐
Triple Crown
๐
Grand Slam
๐ฑ
Topic Pioneer
๐ฌ
Deep Specialist
(47)
๐งฌ
Topic Evolution
๐
Keyword Champion
(3)
๐
Trend Setter
โ
The Questioner
(11)
๐๏ธ
Keyword Collector
(752)
๐
Century Club
(201)
๐ฅ
Unstoppable
(12)
๐
Conference Pioneer
โก
Prolific Year
(43)
Conferences
EMNLP (69)
ACL (52)
NAACL (33)
IJCNLP (11)
NIPS (8)
AAAI (7)
ICML (7)
COLING (3)
CONLL (3)
CVPR (3)
EACL (2)
ICLR (2)
WACV (1)
INTERSPEECH (1)
IJCAI (1)
ICCV (1)
AACL (1)
Top co-authors
Research topics
Keywords
large language model
(31)
text generation
(26)
language model
(18)
event extraction
(15)
story generation
(13)
zero-shot learning
(12)
cross-lingual transfer
(10)
named entity recognition
(10)
vision-language model
(9)
multimodal learning
(8)
transfer learning
(8)
relation extraction
(7)
information extraction
(7)
question answering
(7)
dialogue system
(7)
text classification
(7)
gender bia
(6)
event detection
(6)
few-shot learning
(6)
structured prediction
(6)
Papers
LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs
ACL 2026
Rethinking Creativity Evaluation: A Critical Analysis of Existing Creativity Evaluations
EACL 2026
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Knowledge Poisoning Attacks
ACL 2026
Decoupling Task-Solving and Output Formatting in LLM Generation
ACL 2026
Vulnerability of Large Language Models to Output Prefix Jailbreaks: Impact of Positions on Safety
NAACL 2025
CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization
ACL 2025
DRS: Deep Question Reformulation With Structured Output
ACL 2025
Comparing Bad Apples to Good Oranges Aligning Large Language Models via Joint Preference Optimization
ACL 2025
METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling
ACL 2025
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
ACL 2025
Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures
ACL 2025
SYNTHIA: Novel Concept Design with Affordance Composition
ACL 2025
Vulnerability of LLMs to Vertically Aligned Text Manipulations
ACL 2025
Creative Planning with Language Models: Practice, Evaluation and Applications
NAACL 2025
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
NAACL 2025
Improving Faithfulness of Text-to-Image Diffusion Models through Inference Intervention
WACV 2025
Evaluating Cultural and Social Awareness of LLM Web Agents
NAACL 2025
CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners
EMNLP 2025
REFFLY: Melody-Constrained Lyrics Editing Model
NAACL 2025
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?
NAACL 2025
Model Extrapolation Expedites Alignment
ACL 2025
SkillVerse : Assessing and Enhancing LLMs with Tree Evaluation
ACL 2025
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence
ACL 2025
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning
EMNLP 2025
SNaRe: Domain-aware Data Generation for Low-Resource Event Detection
EMNLP 2025
How to Make Large Language Models Generate 100% Valid Molecules?
EMNLP 2025
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
EMNLP 2025
The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs
EMNLP 2025
Verbalized Representation Learning for Interpretable Few-Shot Generalization
ICCV 2025
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
ICLR 2025
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
CVPR 2025
Scaling Probabilistic Circuits via Monarch Matrices
ICML 2025
Contrastive Visual Data Augmentation
ICML 2025
Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding
COLING 2025
Explaining Mixtures of Sources in News Articles
EMNLP 2024
QUDSELECT: Selective Decoding for Questions Under Discussion Parsing
EMNLP 2024
Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM
EMNLP 2024
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation
EMNLP 2024
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness
EMNLP 2024
Control Large Language Models via Divide and Conquer
EMNLP 2024
Re-ReST: Reflection-Reinforced Self-Training for Language Agents
EMNLP 2024
Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking
NAACL 2024
Event Detection from Social Media for Epidemic Prediction
NAACL 2024
Contextual Label Projection for Cross-Lingual Structured Prediction
NAACL 2024
MacGyver: Are Large Language Models Creative Problem Solvers?
NAACL 2024
Mitigating Bias for Question Answering Models by Tracking Bias Influence
NAACL 2024
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation
NAACL 2024
DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation
NIPS 2024
Adaptable Logical Control for Large Language Models
NIPS 2024
SafeWorld: Geo-Diverse Safety Alignment
NIPS 2024
Improving Event Definition Following For Zero-Shot Event Detection
ACL 2024
Tracking the Newsworthiness of Public Documents
ACL 2024
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models
ACL 2024
Argument-Aware Approach To Event Linking
ACL 2024
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation
ACL 2024
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction
ACL 2024
PhonologyBench: Evaluating Phonological Skills of Large Language Models
ACL 2024
Medical Vision-Language Pre-Training for Brain Abnormalities
COLING 2024
On Prompt-Driven Safeguarding for Large Language Models
ICML 2024
ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models
ICML 2024
DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language Models
ICML 2024
Open-Domain Text Evaluation via Contrastive Distribution Methods
ICML 2024
RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment
ICLR 2024
ARMADA: Attribute-Based Multimodal Data Augmentation
EMNLP 2024
PG-Story: Taxonomy, Dataset, and Evaluation for Ensuring Child-Safe Content for Story Generation
EMNLP 2024
Uncertainty Calibration for Tool-Using Language Agents
EMNLP 2024
Detecting Machine-Generated Long-Form Content with Latent-Space Variables
EMNLP 2024
VDebugger: Harnessing Execution Feedback for Debugging Visual Programs
EMNLP 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
EMNLP 2024
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning
EMNLP 2024
Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs
EMNLP 2024
Are Large Language Models Capable of Generating Human-Level Narratives?
EMNLP 2024
Measuring Psychological Depth in Language Models
EMNLP 2024
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue
EMNLP 2024
STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models
AAAI 2024
MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways
AAAI 2024
Matryoshka Query Transformer for Large Vision-Language Models
NIPS 2024
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks
EMNLP 2023
Tractable Control for Autoregressive Language Generation
ICML 2023
Masked Path Modeling for Vision-and-Language Navigation
EMNLP 2023
Evaluating Large Language Models on Controlled Generation Tasks
EMNLP 2023
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems
EMNLP 2023
โKelly is a Warm Person, Joseph is a Role Modelโ: Gender Biases in LLM-Generated Reference Letters
EMNLP 2023
Creative Natural Language Generation
EMNLP 2023
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos
EMNLP 2023
Code-Switched Text Synthesis in Unseen Language Pairs
ACL 2023
Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning
ACL 2023
DICE: Data-Efficient Clinical Event Extraction with Generative Models
ACL 2023
TAGPRIME: A Unified Framework for Relational Structure Extraction
ACL 2023
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model
ACL 2023
Unsupervised Melody-to-Lyrics Generation
ACL 2023
Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Childrenโs Fairy Tales
ACL 2023
SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams
ACL 2023
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems
ACL 2023
Gender Biases in Automatic Evaluation Metrics for Image Captioning
EMNLP 2023
GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles
ACL 2023
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
INTERSPEECH 2023
DOC: Improving Long Story Coherence With Detailed Outline Control
ACL 2023
Learning Action Conditions from Instructional Manuals for Instruction Understanding
ACL 2023
DesCo: Learning Object Recognition with Rich Language Descriptions
NIPS 2023
Generalized Decoding for Pixel, Image, and Language
CVPR 2023
LEAF: Linguistically Enhanced Event Temporal Relation Framework
EMNLP 2023
Harnessing Black-Box Control to Boost Commonsense in LMโs Generation
EMNLP 2023
Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge
EMNLP 2023
Identifying Informational Sources in News Articles
EMNLP 2023
InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model
NIPS 2022
Controllable Text Generation with Neurally-Decomposed Oracle
NIPS 2022
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone
NIPS 2022
Zero-Shot Commonsense Question Answering with Cloze Translation and Consistency Optimization
AAAI 2022
On Measures of Biases and Harms in NLP
AACL 2022
Fantastic Questions and Where to Find Them: FairytaleQA โ An Authentic Dataset for Narrative Comprehension
ACL 2022
DEAM: Dialogue Coherence Evaluation using AMR-based Semantic Manipulations
ACL 2022
Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals
ACL 2022
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction
ACL 2022
Sibylvariant Transformations for Robust Text Classification
ACL 2022
On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark
ACL 2022
Paraphrase Generation as Unsupervised Machine Translation
COLING 2022
An Empirical Study of Training End-to-End Vision-and-Language Transformers
CVPR 2022
Re3: Generating Longer Stories With Recursive Reprompting and Revision
EMNLP 2022
ExPUNations: Augmenting Puns with Keywords and Explanations
EMNLP 2022
Context-Situated Pun Generation
EMNLP 2022
Character-centric Story Visualization via Visual Planning and Token Alignment
EMNLP 2022
A Unified Framework for Pun Generation with Humor Principles
EMNLP 2022
EnDex: Evaluation of Dialogue Engagingness at Scale
EMNLP 2022
Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts
EMNLP 2022
Sequentially Controlled Text Generation
EMNLP 2022
NewsEdits: A News Article Revision Dataset and a Novel Document-Level Reasoning Challenge
NAACL 2022
Socially Aware Bias Measurements for Hindi Language Representations
NAACL 2022
AmbiPun: Generating Humorous Puns with Ambiguous Context
NAACL 2022
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts
NAACL 2022
DEGREE: A Data-Efficient Generation-Based Event Extraction Model
NAACL 2022
Zero-shot Sonnet Generation with Discourse-level Planning and Aesthetics Features
NAACL 2022
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation
NAACL 2022
EventPlus: A Temporal Event Understanding Pipeline
NAACL 2021
DiSCoL: Toward Engaging Dialogue Systems through Conversational Line Guided Response Generation
NAACL 2021
Societal Biases in Language Generation: Progress and Challenges
IJCNLP 2021
Metaphor Generation with Conceptual Mappings
IJCNLP 2021
Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia
IJCNLP 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences
IJCNLP 2021
Plot-guided Adversarial Example Construction for Evaluating Open-domain Story Generation
NAACL 2021
MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding
NAACL 2021
โNice Try, Kiddoโ: Investigating Ad Hominems in Dialogue Responses
NAACL 2021
Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training
EMNLP 2021
AESOP: Paraphrase Generation with Adaptive Syntactic Control
EMNLP 2021
Document-level Entity-based Extraction as Template Generation
EMNLP 2021
ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning
EMNLP 2021
Improving Pre-trained Vision-and-Language Embeddings for Phrase Grounding
EMNLP 2021
ESTER: A Machine Reading Comprehension Dataset for Reasoning about Event Semantic Relations
EMNLP 2021
HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge
EMNLP 2021
HyperExpan: Taxonomy Expansion with Hyperbolic Representation Learning
EMNLP 2021
Societal Biases in Language Generation: Progress and Challenges
ACL 2021
Scientific Discourse Tagging for Evidence Extraction
EACL 2021
Metaphor Generation with Conceptual Mappings
ACL 2021
Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia
ACL 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences
ACL 2021
Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning
EMNLP 2021
GATE: Graph Attention Transformer Encoder for Cross-lingual Relation and Event Extraction
AAAI 2021
MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification
AAAI 2021
Identifying Distributional Perspectives from Colingual Groups
NAACL 2021
Document-level Event Extraction with Efficient End-to-end Learning of Cross-event Dependencies
NAACL 2021
Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems
AAAI 2020
Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering
EMNLP 2020
Towards Controllable Biases in Language Generation
EMNLP 2020
Biomedical Event Extraction with Hierarchical Knowledge Graphs
EMNLP 2020
STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story Generation
EMNLP 2020
Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation
EMNLP 2020
Domain Knowledge Empowered Structured Neural Net for End-to-End Event Temporal Relation Extraction
EMNLP 2020
Content Planning for Neural Story Generation with Aristotelian Rescoring
EMNLP 2020
TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions
EMNLP 2020
Enabling Low-Resource Transfer Learning across COVID-19 Corpora by Combining Event-Extraction and Co-Training
ACL 2020
Rห3: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge
ACL 2020
Pun Generation with Surprise
NAACL 2019
On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing
NAACL 2019
Plan, Write, and Revise: an Interactive System for Open-Domain Story Generation
NAACL 2019
What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis
IJCNLP 2019
Do Nuclear Submarines Have Nuclear Captains? A Challenge Dataset for Commonsense Reasoning over Adjectives and Objects
IJCNLP 2019
The Woman Worked as a Babysitter: On Biases in Language Generation
IJCNLP 2019
Target Language-Aware Constrained Inference for Cross-lingual Dependency Parsing
IJCNLP 2019
Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction
IJCNLP 2019
Plan-and-Write: Towards Better Automatic Storytelling
AAAI 2019
Learning a Unified Named Entity Tagger from Multiple Partially Annotated Corpora for Efficient Adaptation
CONLL 2019
Deep Structured Neural Network for Event Temporal Relation Extraction
CONLL 2019
What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis
EMNLP 2019
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019)
EMNLP 2019
Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction
EMNLP 2019
Target Language-Aware Constrained Inference for Cross-lingual Dependency Parsing
EMNLP 2019
The Woman Worked as a Babysitter: On Biases in Language Generation
EMNLP 2019
Do Nuclear Submarines Have Nuclear Captains? A Challenge Dataset for Commonsense Reasoning over Adjectives and Objects
EMNLP 2019
Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings
NAACL 2019
Cross-Lingual Dependency Parsing with Unlabeled Auxiliary Languages
CONLL 2019
Scalable Construction and Reasoning of Massive Knowledge Bases
NAACL 2018
Towards Controllable Story Generation
NAACL 2018
Stack-Pointer Networks for Dependency Parsing
ACL 2018
Learning to Converse with Noisy Data: Generation with Calibration
IJCAI 2018
A Multi-task Learning Approach to Adapting Bilingual Word Embeddings for Cross-lingual Named Entity Recognition
IJCNLP 2017
Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning
ACL 2016
An Empirical Study of Chinese Name Matching and Applications
IJCNLP 2015
Named Entity Recognition for Chinese Social Media with Jointly Trained Embeddings
EMNLP 2015
An Empirical Study of Chinese Name Matching and Applications
ACL 2015
Dual Decomposition Inference for Graphical Models over Strings
EMNLP 2015
A Concrete Chinese NLP Pipeline
NAACL 2015
Learning Polylingual Topic Models from Code-Switched Social Media Documents
ACL 2014
Stochastic Contextual Edit Distance and Probabilistic FSTs
ACL 2014
Exploiting Latent Information to Predict Diffusions of Novel Topics on Social Networks
ACL 2012
Online Plagiarized Detection Through Exploiting Lexical, Syntax, and Semantic Information
ACL 2012