conftrace_

Nanyun Peng

205 papers · 2012–2026 · 17 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌍 Conference Polyglot (17) 🗺️ Taxonomy Completionist (13) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (13)

🏠 Conference Loyalist (49) 🤝 Dynamic Duo (56) 👑 Triple Crown 🏆 Grand Slam 🌱 Topic Pioneer 🔬 Deep Specialist (47) 🧬 Topic Evolution 🏆 Keyword Champion (3) 📈 Trend Setter ❓ The Questioner (11) 🗃️ Keyword Collector (752) 💎 Century Club (201) 🔥 Unstoppable (12) 🚀 Conference Pioneer ⚡ Prolific Year (43)

Conferences

EMNLP (69) ACL (52) NAACL (33) IJCNLP (11) NIPS (8) AAAI (7) ICML (7) COLING (3) CONLL (3) CVPR (3) EACL (2) ICLR (2) WACV (1) INTERSPEECH (1) IJCAI (1) ICCV (1) AACL (1)

Top co-authors

Kai-Wei Chang (58) Yufei Tian (16) Zi-Yi Dou (16) Te-Lin Wu (14) I-Hung Hsu (13) Jiao Sun (12) Kuan-Hao Huang (11) Tagyoung Chung (11) Jonathan May (11) Rujun Han (11)

Research topics

Privacy (2) Digital Humanities (1) Learning Types (1) Education (1)

Keywords

large language model (31) text generation (26) language model (18) event extraction (15) story generation (13) zero-shot learning (12) cross-lingual transfer (10) named entity recognition (10) vision-language model (9) multimodal learning (8) transfer learning (8) relation extraction (7) information extraction (7) question answering (7) dialogue system (7) text classification (7) gender bias (6) event detection (6) few-shot learning (6) structured prediction (6)

Papers

LiveCLKTBench: Towards Reliable Evaluation of Cross-Lingual Knowledge Transfer in Multilingual LLMs ACL 2026
Rethinking Creativity Evaluation: A Critical Analysis of Existing Creativity Evaluations EACL 2026
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Knowledge Poisoning Attacks ACL 2026
Decoupling Task-Solving and Output Formatting in LLM Generation ACL 2026
Vulnerability of Large Language Models to Output Prefix Jailbreaks: Impact of Positions on Safety NAACL 2025
CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization ACL 2025
DRS: Deep Question Reformulation With Structured Output ACL 2025
Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization ACL 2025
METAL: A Multi-Agent Framework for Chart Generation with Test-Time Scaling ACL 2025
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking ACL 2025
Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures ACL 2025
SYNTHIA: Novel Concept Design with Affordance Composition ACL 2025
Vulnerability of LLMs to Vertically Aligned Text Manipulations ACL 2025
Creative Planning with Language Models: Practice, Evaluation and Applications NAACL 2025
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression NAACL 2025
Improving Faithfulness of Text-to-Image Diffusion Models through Inference Intervention WACV 2025
Evaluating Cultural and Social Awareness of LLM Web Agents NAACL 2025
CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners EMNLP 2025
REFFLY: Melody-Constrained Lyrics Editing Model NAACL 2025
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks? NAACL 2025
Model Extrapolation Expedites Alignment ACL 2025
SkillVerse: Assessing and Enhancing LLMs with Tree Evaluation ACL 2025
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence ACL 2025
DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning EMNLP 2025
SNaRe: Domain-aware Data Generation for Low-Resource Event Detection EMNLP 2025
How to Make Large Language Models Generate 100% Valid Molecules? EMNLP 2025
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline EMNLP 2025
The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs EMNLP 2025
Verbalized Representation Learning for Interpretable Few-Shot Generalization ICCV 2025
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models ICLR 2025
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning CVPR 2025
Scaling Probabilistic Circuits via Monarch Matrices ICML 2025
Contrastive Visual Data Augmentation ICML 2025
Con-ReCall: Detecting Pre-training Data in LLMs via Contrastive Decoding COLING 2025
Explaining Mixtures of Sources in News Articles EMNLP 2024
QUDSELECT: Selective Decoding for Questions Under Discussion Parsing EMNLP 2024
Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM EMNLP 2024
Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation EMNLP 2024
SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness EMNLP 2024
Control Large Language Models via Divide and Conquer EMNLP 2024
Re-ReST: Reflection-Reinforced Self-Training for Language Agents EMNLP 2024
Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking NAACL 2024
Event Detection from Social Media for Epidemic Prediction NAACL 2024
Contextual Label Projection for Cross-Lingual Structured Prediction NAACL 2024
MacGyver: Are Large Language Models Creative Problem Solvers? NAACL 2024
Mitigating Bias for Question Answering Models by Tracking Bias Influence NAACL 2024
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation NAACL 2024
DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation NIPS 2024
Adaptable Logical Control for Large Language Models NIPS 2024
SafeWorld: Geo-Diverse Safety Alignment NIPS 2024
Improving Event Definition Following For Zero-Shot Event Detection ACL 2024
Tracking the Newsworthiness of Public Documents ACL 2024
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models ACL 2024
Argument-Aware Approach To Event Linking ACL 2024
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation ACL 2024
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction ACL 2024
PhonologyBench: Evaluating Phonological Skills of Large Language Models ACL 2024
Medical Vision-Language Pre-Training for Brain Abnormalities COLING 2024
On Prompt-Driven Safeguarding for Large Language Models ICML 2024
ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models ICML 2024
DiNADO: Norm-Disentangled Neurally-Decomposed Oracles for Controlling Language Models ICML 2024
Open-Domain Text Evaluation via Contrastive Distribution Methods ICML 2024
RLCD: Reinforcement Learning from Contrastive Distillation for LM Alignment ICLR 2024
ARMADA: Attribute-Based Multimodal Data Augmentation EMNLP 2024
PG-Story: Taxonomy, Dataset, and Evaluation for Ensuring Child-Safe Content for Story Generation EMNLP 2024
Uncertainty Calibration for Tool-Using Language Agents EMNLP 2024
Detecting Machine-Generated Long-Form Content with Latent-Space Variables EMNLP 2024
VDebugger: Harnessing Execution Feedback for Debugging Visual Programs EMNLP 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints EMNLP 2024
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning EMNLP 2024
Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs EMNLP 2024
Are Large Language Models Capable of Generating Human-Level Narratives? EMNLP 2024
Measuring Psychological Depth in Language Models EMNLP 2024
Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue EMNLP 2024
STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models AAAI 2024
MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways AAAI 2024
Matryoshka Query Transformer for Large Vision-Language Models NIPS 2024
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks EMNLP 2023
Tractable Control for Autoregressive Language Generation ICML 2023
Masked Path Modeling for Vision-and-Language Navigation EMNLP 2023
Evaluating Large Language Models on Controlled Generation Tasks EMNLP 2023
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems EMNLP 2023
“Kelly is a Warm Person, Joseph is a Role Model”: Gender Biases in LLM-Generated Reference Letters EMNLP 2023
Creative Natural Language Generation EMNLP 2023
ACQUIRED: A Dataset for Answering Counterfactual Questions In Real-Life Videos EMNLP 2023
Code-Switched Text Synthesis in Unseen Language Pairs ACL 2023
Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning ACL 2023
DICE: Data-Efficient Clinical Event Extraction with Generative Models ACL 2023
TAGPRIME: A Unified Framework for Relational Structure Extraction ACL 2023
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model ACL 2023
Unsupervised Melody-to-Lyrics Generation ACL 2023
Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children’s Fairy Tales ACL 2023
SIMMC-VR: A Task-oriented Multimodal Dialog Dataset with Situated and Immersive VR Streams ACL 2023
ACCENT: An Automatic Event Commonsense Evaluation Metric for Open-Domain Dialogue Systems ACL 2023
Gender Biases in Automatic Evaluation Metrics for Image Captioning EMNLP 2023
GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles ACL 2023
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning INTERSPEECH 2023
DOC: Improving Long Story Coherence With Detailed Outline Control ACL 2023
Learning Action Conditions from Instructional Manuals for Instruction Understanding ACL 2023
DesCo: Learning Object Recognition with Rich Language Descriptions NIPS 2023
Generalized Decoding for Pixel, Image, and Language CVPR 2023
LEAF: Linguistically Enhanced Event Temporal Relation Framework EMNLP 2023
Harnessing Black-Box Control to Boost Commonsense in LM’s Generation EMNLP 2023
Localizing Active Objects from Egocentric Vision with Symbolic World Knowledge EMNLP 2023
Identifying Informational Sources in News Articles EMNLP 2023
InsNet: An Efficient, Flexible, and Performant Insertion-based Text Generation Model NIPS 2022
Controllable Text Generation with Neurally-Decomposed Oracle NIPS 2022
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone NIPS 2022
Zero-Shot Commonsense Question Answering with Cloze Translation and Consistency Optimization AAAI 2022
On Measures of Biases and Harms in NLP AACL 2022
Fantastic Questions and Where to Find Them: FairytaleQA – An Authentic Dataset for Narrative Comprehension ACL 2022
DEAM: Dialogue Coherence Evaluation using AMR-based Semantic Manipulations ACL 2022
Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals ACL 2022
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction ACL 2022
Sibylvariant Transformations for Robust Text Classification ACL 2022
On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark ACL 2022
Paraphrase Generation as Unsupervised Machine Translation COLING 2022
An Empirical Study of Training End-to-End Vision-and-Language Transformers CVPR 2022
Re3: Generating Longer Stories With Recursive Reprompting and Revision EMNLP 2022
ExPUNations: Augmenting Puns with Keywords and Explanations EMNLP 2022
Context-Situated Pun Generation EMNLP 2022
Character-centric Story Visualization via Visual Planning and Token Alignment EMNLP 2022
A Unified Framework for Pun Generation with Humor Principles EMNLP 2022
EnDex: Evaluation of Dialogue Engagingness at Scale EMNLP 2022
Towards Robust NLG Bias Evaluation with Syntactically-diverse Prompts EMNLP 2022
Sequentially Controlled Text Generation EMNLP 2022
NewsEdits: A News Article Revision Dataset and a Novel Document-Level Reasoning Challenge NAACL 2022
Socially Aware Bias Measurements for Hindi Language Representations NAACL 2022
AmbiPun: Generating Humorous Puns with Ambiguous Context NAACL 2022
Go Back in Time: Generating Flashbacks in Stories with Event Temporal Prompts NAACL 2022
DEGREE: A Data-Efficient Generation-Based Event Extraction Model NAACL 2022
Zero-shot Sonnet Generation with Discourse-level Planning and Aesthetics Features NAACL 2022
FOAM: A Follower-aware Speaker Model For Vision-and-Language Navigation NAACL 2022
EventPlus: A Temporal Event Understanding Pipeline NAACL 2021
DiSCoL: Toward Engaging Dialogue Systems through Conversational Line Guided Response Generation NAACL 2021
Societal Biases in Language Generation: Progress and Challenges IJCNLP 2021
Metaphor Generation with Conceptual Mappings IJCNLP 2021
Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia IJCNLP 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences IJCNLP 2021
Plot-guided Adversarial Example Construction for Evaluating Open-domain Story Generation NAACL 2021
MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding NAACL 2021
“Nice Try, Kiddo”: Investigating Ad Hominems in Dialogue Responses NAACL 2021
Improving Zero-Shot Cross-Lingual Transfer Learning via Robust Training EMNLP 2021
AESOP: Paraphrase Generation with Adaptive Syntactic Control EMNLP 2021
Document-level Entity-based Extraction as Template Generation EMNLP 2021
ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning EMNLP 2021
Improving Pre-trained Vision-and-Language Embeddings for Phrase Grounding EMNLP 2021
ESTER: A Machine Reading Comprehension Dataset for Reasoning about Event Semantic Relations EMNLP 2021
HypoGen: Hyperbole Generation with Commonsense and Counterfactual Knowledge EMNLP 2021
HyperExpan: Taxonomy Expansion with Hyperbolic Representation Learning EMNLP 2021
Societal Biases in Language Generation: Progress and Challenges ACL 2021
Scientific Discourse Tagging for Evidence Extraction EACL 2021
Metaphor Generation with Conceptual Mappings ACL 2021
Men Are Elected, Women Are Married: Events Gender Bias on Wikipedia ACL 2021
COM2SENSE: A Commonsense Reasoning Benchmark with Complementary Sentences ACL 2021
Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning EMNLP 2021
GATE: Graph Attention Transformer Encoder for Cross-lingual Relation and Event Extraction AAAI 2021
MELINDA: A Multimodal Dataset for Biomedical Experiment Method Classification AAAI 2021
Identifying Distributional Perspectives from Colingual Groups NAACL 2021
Document-level Event Extraction with Efficient End-to-end Learning of Cross-event Dependencies NAACL 2021
Predictive Engagement: An Efficient Metric for Automatic Evaluation of Open-Domain Dialogue Systems AAAI 2020
Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering EMNLP 2020
Towards Controllable Biases in Language Generation EMNLP 2020
Biomedical Event Extraction with Hierarchical Knowledge Graphs EMNLP 2020
STORIUM: A Dataset and Evaluation Platform for Machine-in-the-Loop Story Generation EMNLP 2020
Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation EMNLP 2020
Domain Knowledge Empowered Structured Neural Net for End-to-End Event Temporal Relation Extraction EMNLP 2020
Content Planning for Neural Story Generation with Aristotelian Rescoring EMNLP 2020
TORQUE: A Reading Comprehension Dataset of Temporal Ordering Questions EMNLP 2020
Enabling Low-Resource Transfer Learning across COVID-19 Corpora by Combining Event-Extraction and Co-Training ACL 2020
R^3: Reverse, Retrieve, and Rank for Sarcasm Generation with Commonsense Knowledge ACL 2020
Pun Generation with Surprise NAACL 2019
On Difficulties of Cross-Lingual Transfer with Order Differences: A Case Study on Dependency Parsing NAACL 2019
Plan, Write, and Revise: an Interactive System for Open-Domain Story Generation NAACL 2019
What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis IJCNLP 2019
Do Nuclear Submarines Have Nuclear Captains? A Challenge Dataset for Commonsense Reasoning over Adjectives and Objects IJCNLP 2019
The Woman Worked as a Babysitter: On Biases in Language Generation IJCNLP 2019
Target Language-Aware Constrained Inference for Cross-lingual Dependency Parsing IJCNLP 2019
Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction IJCNLP 2019
Plan-and-Write: Towards Better Automatic Storytelling AAAI 2019
Learning a Unified Named Entity Tagger from Multiple Partially Annotated Corpora for Efficient Adaptation CONLL 2019
Deep Structured Neural Network for Event Temporal Relation Extraction CONLL 2019
What Matters for Neural Cross-Lingual Named Entity Recognition: An Empirical Analysis EMNLP 2019
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019) EMNLP 2019
Joint Event and Temporal Relation Extraction with Shared Representations and Structured Prediction EMNLP 2019
Target Language-Aware Constrained Inference for Cross-lingual Dependency Parsing EMNLP 2019
The Woman Worked as a Babysitter: On Biases in Language Generation EMNLP 2019
Do Nuclear Submarines Have Nuclear Captains? A Challenge Dataset for Commonsense Reasoning over Adjectives and Objects EMNLP 2019
Better Automatic Evaluation of Open-Domain Dialogue Systems with Contextualized Embeddings NAACL 2019
Cross-Lingual Dependency Parsing with Unlabeled Auxiliary Languages CONLL 2019
Scalable Construction and Reasoning of Massive Knowledge Bases NAACL 2018
Towards Controllable Story Generation NAACL 2018
Stack-Pointer Networks for Dependency Parsing ACL 2018
Learning to Converse with Noisy Data: Generation with Calibration IJCAI 2018
A Multi-task Learning Approach to Adapting Bilingual Word Embeddings for Cross-lingual Named Entity Recognition IJCNLP 2017
Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning ACL 2016
An Empirical Study of Chinese Name Matching and Applications IJCNLP 2015
Named Entity Recognition for Chinese Social Media with Jointly Trained Embeddings EMNLP 2015
An Empirical Study of Chinese Name Matching and Applications ACL 2015
Dual Decomposition Inference for Graphical Models over Strings EMNLP 2015
A Concrete Chinese NLP Pipeline NAACL 2015
Learning Polylingual Topic Models from Code-Switched Social Media Documents ACL 2014
Stochastic Contextual Edit Distance and Probabilistic FSTs ACL 2014
Exploiting Latent Information to Predict Diffusions of Novel Topics on Social Networks ACL 2012
Online Plagiarized Detection Through Exploiting Lexical, Syntax, and Semantic Information ACL 2012