conftrace_

Chris Callison-Burch

151 papers · 2004–2026 · 15 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (17) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (15)

🌍 Conference Polyglot (15) 🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (33) 🔬 Deep Specialist (16) 🧬 Topic Evolution 🏆 Keyword Champion (2) 👥 Mega-Team (50) 🤝 Dynamic Duo (18) ❓ The Questioner (4) 🗃️ Keyword Collector (425) 💎 Century Club (148) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (22) ⚡ Prolific Year (6)

Conferences

ACL (50) EMNLP (33) NAACL (29) EACL (9) IJCNLP (6) CVPR (5) AAAI (4) COLING (4) AACL (2) CONLL (2) NIPS (2) SEMEVAL (2) ECCV (1) ICLR (1) INTERSPEECH (1)

Top co-authors

Marianna Apidianaki (18) Benjamin Van Durme (16) Li Zhang (16) Ellie Pavlick (15) Liam Dugan (14) Daphne Ippolito (13) Qing Lyu (12) Ajay Patel (12) Yue Yang (11) Mark Yatskar (10)

Research topics

Keywords

large language model (26) language model (14) text generation (10) multimodal learning (6) zero-shot learning (6) machine translation (5) text classification (5) synthetic data generation (5) text summarization (4) question generation (4) word embedding (4) few-shot learning (4) machine-generated text detection (4) natural language understanding (4) prompt engineering (4) vision-language model (4) question answering (4) semantic analysis (3) code generation (3) cross-lingual transfer (3)

Papers

Toward Beginner-Friendly LLMs for Language Learning: Controlling Difficulty in Conversation · EACL 2026
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims · ACL 2026
LaTeX2Layout: High-Fidelity, Scalable Document Layout Annotation Pipeline for Layout Detection · AAAI 2026
Calibrating Large Language Models with Sample Consistency · AAAI 2025
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims · EMNLP 2025
Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D · EMNLP 2025
mStyleDistance: Multilingual Style Embeddings and their Evaluation · ACL 2025
Multilingual Retrieval Augmented Generation for Culturally-Sensitive Tasks: A Benchmark for Cross-lingual Robustness · ACL 2025
GenAI Content Detection Task 3: Cross-Domain Machine Generated Text Detection Challenge · COLING 2025
ViUniT: Visual Unit Tests for More Robust Visual Programming · CVPR 2025
Concept Lancet: Image Editing with Compositional Representation Transplant · CVPR 2025
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation · ACL 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models · CVPR 2025
Probabilistic Soundness Guarantees in LLM Reasoning Chains · EMNLP 2025
StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples · NAACL 2025
Holodeck: Language Guided Generation of 3D Embodied AI Environments · CVPR 2024
Choice-75: A Dataset on Decision Branching in Script Learning · COLING 2024
A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis · NIPS 2024
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing · ECCV 2024
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts · EACL 2024
PDDLEGO: Iterative Planning in Textual Environments · NAACL 2024
This Land is Your, My Land: Evaluating Geopolitical Bias in Language Models through Territorial Disputes · NAACL 2024
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer · AAAI 2024
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows · ACL 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors · ACL 2024
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models · ACL 2024
Evaluating Vision-Language Models on Bistable Images · ACL 2024
PROC2PDDL: Open-Domain Planning Representations from Texts · ACL 2024
Uncovering Differences in Persuasive Language in Russian versus English Wikipedia · EMNLP 2024
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval Augmented Generation · EMNLP 2024
MiRAGeNews: Multimodal Realistic AI-Generated News Detection · EMNLP 2024
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings · EMNLP 2024
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems · EMNLP 2024
PaCE: Parsimonious Concept Engineering for Large Language Models · NIPS 2024
Human-in-the-loop Schema Induction · ACL 2023
Real or Fake Text?: Investigating Human Ability to Detect Boundaries between Human-Written and Machine-Generated Text · AAAI 2023
Faithful Chain-of-Thought Reasoning · AACL 2023
Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models · INTERSPEECH 2023
Faithful Chain-of-Thought Reasoning · IJCNLP 2023
Bidirectional Language Models Are Also Few-shot Learners · ICLR 2023
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications · EMNLP 2023
Learning Interpretable Style Embeddings via Prompting LLMs · EMNLP 2023
PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale · EMNLP 2023
FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information · ACL 2023
Explanation-based Finetuning Makes Models More Robust to Spurious Cues · ACL 2023
Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification · ACL 2023
I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons · ACL 2023
Learn With Martian: A Tool For Creating Assignments That Can Write And Re-Write Themselves · EACL 2023
CoRRPUS: Code-based Structured Prompting for Neurosymbolic Story Understanding · ACL 2023
Improving Mathematics Tutoring With A Code Scratchpad · ACL 2023
Enhancing Human Summaries for Question-Answer Generation in Education · ACL 2023
Automatically Generated Summaries of Video Lectures May Enhance Students’ Learning Experience · ACL 2023
Exploring the Curious Case of Code Prompts · ACL 2023
Representation of Lexical Stylistic Features in Language Models’ Embedding Space · ACL 2023
Multilingual Bidirectional Unsupervised Translation through Multilingual Finetuning and Back-Translation · EACL 2023
Causal Reasoning of Entities and Events in Procedural Texts · EACL 2023
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification · CVPR 2023
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data · ACL 2022
Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence · EMNLP 2022
Unsupervised Entity Linking with Guided Summarization and Multiple-Choice Selection · EMNLP 2022
Is “My Favorite New Movie” My Favorite Movie? Probing the Understanding of Recursive Noun Phrases · NAACL 2022
RESIN-11: Schema-guided Event Prediction for 11 Newsworthy Scenarios · NAACL 2022
The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank · NAACL 2022
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction · EMNLP 2022
A Feasibility Study of Answer-Agnostic Question Generation for Education · ACL 2022
A Recipe for Arbitrary Text Style Transfer with Large Language Models · ACL 2022
Deduplicating Training Data Makes Language Models Better · ACL 2022
BiSECT: Learning to Split and Rephrase Sentences with Bitexts · EMNLP 2021
“Wikily” Supervised Neural Translation Tailored to Cross-Lingual Tasks · EMNLP 2021
Visual Goal-Step Inference using wikiHow · EMNLP 2021
GooAQ: Open Question Answering with Diverse Answer Types · EMNLP 2021
Cultural and Geographical Influences on Image Translatability of Words across Languages · NAACL 2021
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System · NAACL 2021
TopGuNN: Fast NLP Training Data Augmentation using Large Corpora · NAACL 2021
Resolving Pronouns in Twitter Streams: Context can Help! · COLING 2020
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow · EMNLP 2020
Automatic Detection of Generated Text is Easiest when Humans are Fooled · ACL 2020
Intent Detection with WikiHow · AACL 2020
Toward Better Storylines with Sentence-Level Language Models · ACL 2020
RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text · EMNLP 2020
Winter is here: Summarizing Twitter Streams related to Pre-Scheduled Events · ACL 2019
Seeing Things from a Different Angle: Discovering Diverse Perspectives about Claims · NAACL 2019
Unsupervised Hierarchical Story Infilling · NAACL 2019
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification · NAACL 2019
Comparison of Diverse Decoding Methods from Conditional Language Models · ACL 2019
PerspectroScope: A Window to the World of Diverse Perspectives · ACL 2019
ChatEval: A Tool for Chatbot Evaluation · NAACL 2019
Magnitude: A Fast, Efficient Universal Vector Embedding Utility Package · EMNLP 2018
Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation · NAACL 2018
Learning Scalar Adjective Intensity from Paraphrases · EMNLP 2018
Comparing Constraints for Taxonomic Organization · NAACL 2018
Simplification Using Paraphrases and Context-Based Lexical Substitution · NAACL 2018
Learning Translations via Images with a Massively Multilingual Image Dataset · ACL 2018
KnowYourNyms? A Game of Semantic Relationships · EMNLP 2017
The Language of Place: Semantic Value from Geospatial Context · EACL 2017
Learning Translations via Matrix Completion · EMNLP 2017
Clustering Paraphrases by Word Sense · NAACL 2016
Sentential Paraphrasing as Black-Box Machine Translation · NAACL 2016
Most “babies” are “little” and most “problems” are “huge”: Compositional Entailment in Adjective-Nouns · ACL 2016
Simple PPDB: A Paraphrase Database for Simplification · ACL 2016
Tense Manages to Predict Implicative Behavior in Verbs · EMNLP 2016
The Gun Violence Database: A new task and data set for NLP · EMNLP 2016
FrameNet+: Fast Paraphrastic Tripling of FrameNet · ACL 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing · EMNLP 2015
SemEval-2015 Task 1: Paraphrase and Semantic Similarity in Twitter (PIT) · SEMEVAL 2015
PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification · ACL 2015
Domain-Specific Paraphrase Extraction · ACL 2015
Adding Semantics to Data-Driven Paraphrasing · ACL 2015
Adding Semantics to Data-Driven Paraphrasing · IJCNLP 2015
Domain-Specific Paraphrase Extraction · IJCNLP 2015
FrameNet+: Fast Paraphrastic Tripling of FrameNet · IJCNLP 2015
PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification · IJCNLP 2015
Cost Optimization in Crowdsourcing Translation: Low cost translations made even cheaper · NAACL 2015
Crowdsourcing for NLP · NAACL 2015
Are Two Heads Better than One? Crowdsourced Translation via a Two-Step Collaboration of Non-Professional Translators and Editors · ACL 2014
Hallucinating Phrase Translations for Low Resource MT · CONLL 2014
PARADIGM: Paraphrase Diagnostics through Grammar Matching · EACL 2014
Semi-Markov Phrase-Based Monolingual Alignment · EMNLP 2013
Dirt Cheap Web-Scale Parallel Text from the Common Crawl · ACL 2013
PARMA: A Predicate Argument Aligner · ACL 2013
Supervised Bilingual Lexicon Induction with Multiple Monolingual Signals · NAACL 2013
PPDB: The Paraphrase Database · NAACL 2013
Answer Extraction as Sequence Tagging with Tree Edit Distance · NAACL 2013
A Lightweight and High Performance Monolingual Word Aligner · ACL 2013
Monolingual Distributional Similarity for Text-to-Text Generation · SEMEVAL 2012
Machine Translation of Arabic Dialects · NAACL 2012
Expectations of Word Sense in Parallel Corpora · NAACL 2012
Toward Statistical Machine Translation without Parallel Corpora · EACL 2012
Crowdsourcing Translation: Professional Quality from Non-Professionals · ACL 2011
The Arabic Online Commentary Dataset: an Annotated Dataset of Informal Arabic with High Dialectal Content · ACL 2011
Incremental Syntactic Language Models for Phrase-based Translation · ACL 2011
Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text-to-Text Generation · EMNLP 2011
Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation · ACL 2010
Cheap, Fast and Good Enough: Automatic Speech Recognition with Non-Expert Transcription · NAACL 2010
Predicting Human-Targeted Translation Edit Rate via Untrained Human Annotators · NAACL 2010
Stream-based Translation Models for Statistical Machine Translation · NAACL 2010
Feasibility of Human-in-the-loop Minimum Error Rate Training · EMNLP 2009
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation · IJCNLP 2009
Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk · EMNLP 2009
Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases · EMNLP 2009
Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences · CONLL 2009
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation · ACL 2009
ParaMetric: An Automatic Evaluation Metric for Paraphrasing · COLING 2008
Syntactic Constraints on Paraphrases Extracted from Parallel Corpora · EMNLP 2008
Moses: Open Source Toolkit for Statistical Machine Translation · ACL 2007
Re-evaluating the Role of Bleu in Machine Translation Research · EACL 2006
Improved Statistical Machine Translation Using Paraphrases · NAACL 2006
Scaling Phrase-Based Statistical Machine Translation to Larger Corpora and Longer Phrases · ACL 2005
Proceedings of the ACL Student Research Workshop · ACL 2005
Paraphrasing with Bilingual Parallel Corpora · ACL 2005
Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora · ACL 2004