Chris Callison-Burch
151 papers · 2004–2026 · 15 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (17) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (15)
🌍
Conference Polyglot
(15)
🗺️
Taxonomy Completionist
(17)
🧭
Keyword Pioneer
🌟
Keyword Trendsetter Combo
(4)
🏠
Conference Loyalist
(33)
🔬
Deep Specialist
(16)
🧬
Topic Evolution
🏆
Keyword Champion
(2)
👥
Mega-Team
(50)
🤝
Dynamic Duo
(18)
❓
The Questioner
(4)
🗃️
Keyword Collector
(425)
💎
Century Club
(148)
📈
Trend Setter
🚀
Conference Pioneer
🔥
Unstoppable
(22)
⚡
Prolific Year
(6)
Conferences
ACL (50)
EMNLP (33)
NAACL (29)
EACL (9)
IJCNLP (6)
CVPR (5)
AAAI (4)
COLING (4)
AACL (2)
CONLL (2)
NIPS (2)
SEMEVAL (2)
ECCV (1)
ICLR (1)
INTERSPEECH (1)
Top co-authors
Research topics
Keywords
large language model
(26)
language model
(14)
text generation
(10)
multimodal learning
(6)
zero-shot learning
(6)
machine translation
(5)
text classification
(5)
synthetic data generation
(5)
text summarization
(4)
question generation
(4)
word embedding
(4)
few-shot learning
(4)
machine-generated text detection
(4)
natural language understanding
(4)
prompt engineering
(4)
vision-language model
(4)
question answering
(4)
semantic analysis
(3)
code generation
(3)
cross-lingual transfer
(3)
Papers
Toward Beginner-Friendly LLMs for Language Learning: Controlling Difficulty in Conversation
EACL 2026
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims
ACL 2026
LaTeX2Layout: High-Fidelity, Scalable Document Layout Annotation Pipeline for Layout Detection
AAAI 2026
Calibrating Large Language Models with Sample Consistency
AAAI 2025
NSF-SciFy: Mining the NSF Awards Database for Scientific Claims
EMNLP 2025
Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D
EMNLP 2025
mStyleDistance: Multilingual Style Embeddings and their Evaluation
ACL 2025
Multilingual Retrieval Augmented Generation for Culturally-Sensitive Tasks: A Benchmark for Cross-lingual Robustness
ACL 2025
GenAI Content Detection Task 3: Cross-Domain Machine Generated Text Detection Challenge
COLING 2025
ViUniT: Visual Unit Tests for More Robust Visual Programming
CVPR 2025
Concept Lancet: Image Editing with Compositional Representation Transplant
CVPR 2025
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
ACL 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
Probabilistic Soundness Guarantees in LLM Reasoning Chains
EMNLP 2025
StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples
NAACL 2025
Holodeck: Language Guided Generation of 3D Embodied AI Environments
CVPR 2024
Choice-75: A Dataset on Decision Branching in Script Learning
COLING 2024
A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
NIPS 2024
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing
ECCV 2024
OpenPI2.0: An Improved Dataset for Entity Tracking in Texts
EACL 2024
PDDLEGO: Iterative Planning in Textual Environments
NAACL 2024
This Land is Your, My Land: Evaluating Geopolitical Bias in Language Models through Territorial Disputes
NAACL 2024
ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer
AAAI 2024
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows
ACL 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
ACL 2024
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models
ACL 2024
Evaluating Vision-Language Models on Bistable Images
ACL 2024
PROC2PDDL: Open-Domain Planning Representations from Texts
ACL 2024
Uncovering Differences in Persuasive Language in Russian versus English Wikipedia
EMNLP 2024
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval Augmented Generation
EMNLP 2024
MiRAGeNews: Multimodal Realistic AI-Generated News Detection
EMNLP 2024
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
EMNLP 2024
ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
EMNLP 2024
PaCE: Parsimonious Concept Engineering for Large Language Models
NIPS 2024
Human-in-the-loop Schema Induction
ACL 2023
Real or Fake Text?: Investigating Human Ability to Detect Boundaries between Human-Written and Machine-Generated Text
AAAI 2023
Faithful Chain-of-Thought Reasoning
AACL 2023
Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models
INTERSPEECH 2023
Faithful Chain-of-Thought Reasoning
IJCNLP 2023
Bidirectional Language Models Are Also Few-shot Learners
ICLR 2023
Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications
EMNLP 2023
Learning Interpretable Style Embeddings via Prompting LLMs
EMNLP 2023
PAXQA: Generating Cross-lingual Question Answering Examples at Training Scale
EMNLP 2023
FIREBALL: A Dataset of Dungeons and Dragons Actual-Play with Structured Game State Information
ACL 2023
Explanation-based Finetuning Makes Models More Robust to Spurious Cues
ACL 2023
Open-Domain Hierarchical Event Schema Induction by Incremental Prompting and Verification
ACL 2023
I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons
ACL 2023
Learn With Martian: A Tool For Creating Assignments That Can Write And Re-Write Themselves
EACL 2023
CoRRPUS: Code-based Structured Prompting for Neurosymbolic Story Understanding
ACL 2023
Improving Mathematics Tutoring With A Code Scratchpad
ACL 2023
Enhancing Human Summaries for Question-Answer Generation in Education
ACL 2023
Automatically Generated Summaries of Video Lectures May Enhance Students’ Learning Experience
ACL 2023
Exploring the Curious Case of Code Prompts
ACL 2023
Representation of Lexical Stylistic Features in Language Models’ Embedding Space
ACL 2023
Multilingual Bidirectional Unsupervised Translation through Multilingual Finetuning and Back-Translation
EACL 2023
Causal Reasoning of Entities and Events in Procedural Texts
EACL 2023
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
CVPR 2023
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data
ACL 2022
Dungeons and Dragons as a Dialog Challenge for Artificial Intelligence
EMNLP 2022
Unsupervised Entity Linking with Guided Summarization and Multiple-Choice Selection
EMNLP 2022
Is “My Favorite New Movie” My Favorite Movie? Probing the Understanding of Recursive Noun Phrases
NAACL 2022
RESIN-11: Schema-guided Event Prediction for 11 Newsworthy Scenarios
NAACL 2022
The Case for a Single Model that can Both Generate Continuations and Fill-in-the-Blank
NAACL 2022
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction
EMNLP 2022
A Feasibility Study of Answer-Agnostic Question Generation for Education
ACL 2022
A Recipe for Arbitrary Text Style Transfer with Large Language Models
ACL 2022
Deduplicating Training Data Makes Language Models Better
ACL 2022
BiSECT: Learning to Split and Rephrase Sentences with Bitexts
EMNLP 2021
“Wikily” Supervised Neural Translation Tailored to Cross-Lingual Tasks
EMNLP 2021
Visual Goal-Step Inference using wikiHow
EMNLP 2021
GooAQ: Open Question Answering with Diverse Answer Types
EMNLP 2021
Cultural and Geographical Influences on Image Translatability of Words across Languages
NAACL 2021
RESIN: A Dockerized Schema-Guided Cross-document Cross-lingual Cross-media Information Extraction and Event Tracking System
NAACL 2021
TopGuNN: Fast NLP Training Data Augmentation using Large Corpora
NAACL 2021
Resolving Pronouns in Twitter Streams: Context can Help!
COLING 2020
Reasoning about Goals, Steps, and Temporal Ordering with WikiHow
EMNLP 2020
Automatic Detection of Generated Text is Easiest when Humans are Fooled
ACL 2020
Intent Detection with WikiHow
AACL 2020
Toward Better Storylines with Sentence-Level Language Models
ACL 2020
RoFT: A Tool for Evaluating Human Detection of Machine-Generated Text
EMNLP 2020
Winter is here: Summarizing Twitter Streams related to Pre-Scheduled Events
ACL 2019
Seeing Things from a Different Angle:Discovering Diverse Perspectives about Claims
NAACL 2019
Unsupervised Hierarchical Story Infilling
NAACL 2019
Complexity-Weighted Loss and Diverse Reranking for Sentence Simplification
NAACL 2019
Comparison of Diverse Decoding Methods from Conditional Language Models
ACL 2019
PerspectroScope: A Window to the World of Diverse Perspectives
ACL 2019
ChatEval: A Tool for Chatbot Evaluation
NAACL 2019
Magnitude: A Fast, Efficient Universal Vector Embedding Utility Package
EMNLP 2018
Automated Paraphrase Lattice Creation for HyTER Machine Translation Evaluation
NAACL 2018
Learning Scalar Adjective Intensity from Paraphrases
EMNLP 2018
Comparing Constraints for Taxonomic Organization
NAACL 2018
Simplification Using Paraphrases and Context-Based Lexical Substitution
NAACL 2018
Learning Translations via Images with a Massively Multilingual Image Dataset
ACL 2018
KnowYourNyms? A Game of Semantic Relationships
EMNLP 2017
The Language of Place: Semantic Value from Geospatial Context
EACL 2017
Learning Translations via Matrix Completion
EMNLP 2017
Clustering Paraphrases by Word Sense
NAACL 2016
Sentential Paraphrasing as Black-Box Machine Translation
NAACL 2016
Most “babies” are “little” and most “problems” are “huge”: Compositional Entailment in Adjective-Nouns
ACL 2016
Simple PPDB: A Paraphrase Database for Simplification
ACL 2016
Tense Manages to Predict Implicative Behavior in Verbs
EMNLP 2016
The Gun Violence Database: A new task and data set for NLP
EMNLP 2016
FrameNet+: Fast Paraphrastic Tripling of FrameNet
ACL 2015
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing
EMNLP 2015
SemEval-2015 Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)
SEMEVAL 2015
PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification
ACL 2015
Domain-Specific Paraphrase Extraction
ACL 2015
Adding Semantics to Data-Driven Paraphrasing
ACL 2015
Adding Semantics to Data-Driven Paraphrasing
IJCNLP 2015
Domain-Specific Paraphrase Extraction
IJCNLP 2015
FrameNet+: Fast Paraphrastic Tripling of FrameNet
IJCNLP 2015
PPDB 2.0: Better paraphrase ranking, fine-grained entailment relations, word embeddings, and style classification
IJCNLP 2015
Cost Optimization in Crowdsourcing Translation: Low cost translations made even cheaper
NAACL 2015
Crowdsourcing for NLP
NAACL 2015
Are Two Heads Better than One? Crowdsourced Translation via a Two-Step Collaboration of Non-Professional Translators and Editors
ACL 2014
Hallucinating Phrase Translations for Low Resource MT
CONLL 2014
PARADIGM: Paraphrase Diagnostics through Grammar Matching
EACL 2014
Semi-Markov Phrase-Based Monolingual Alignment
EMNLP 2013
Dirt Cheap Web-Scale Parallel Text from the Common Crawl
ACL 2013
PARMA: A Predicate Argument Aligner
ACL 2013
Supervised Bilingual Lexicon Induction with Multiple Monolingual Signals
NAACL 2013
PPDB: The Paraphrase Database
NAACL 2013
Answer Extraction as Sequence Tagging with Tree Edit Distance
NAACL 2013
A Lightweight and High Performance Monolingual Word Aligner
ACL 2013
Monolingual Distributional Similarity for Text-to-Text Generation
SEMEVAL 2012
Machine Translation of Arabic Dialects
NAACL 2012
Expectations of Word Sense in Parallel Corpora
NAACL 2012
Toward Statistical Machine Translation without Parallel Corpora
EACL 2012
Crowdsourcing Translation: Professional Quality from Non-Professionals
ACL 2011
The Arabic Online Commentary Dataset: an Annotated Dataset of Informal Arabic with High Dialectal Content
ACL 2011
Incremental Syntactic Language Models for Phrase-based Translation
ACL 2011
Learning Sentential Paraphrases from Bilingual Parallel Corpora for Text-to-Text Generation
EMNLP 2011
Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation
ACL 2010
Cheap, Fast and Good Enough: Automatic Speech Recognition with Non-Expert Transcription
NAACL 2010
Predicting Human-Targeted Translation Edit Rate via Untrained Human Annotators
NAACL 2010
Stream-based Translation Models for Statistical Machine Translation
NAACL 2010
Feasibility of Human-in-the-loop Minimum Error Rate Training
EMNLP 2009
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation
IJCNLP 2009
Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk
EMNLP 2009
Improved Statistical Machine Translation Using Monolingually-Derived Paraphrases
EMNLP 2009
Improving Translation Lexicon Induction from Monolingual Corpora via Dependency Contexts and Part-of-Speech Equivalences
CONLL 2009
Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation
ACL 2009
ParaMetric: An Automatic Evaluation Metric for Paraphrasing
COLING 2008
Syntactic Constraints on Paraphrases Extracted from Parallel Corpora
EMNLP 2008
Moses: Open Source Toolkit for Statistical Machine Translation
ACL 2007
Re-evaluating the Role of Bleu in Machine Translation Research
EACL 2006
Improved Statistical Machine Translation Using Paraphrases
NAACL 2006
Scaling Phrase-Based Statistical Machine Translation to Larger Corpora and Longer Phrases
ACL 2005
Proceedings of the ACL Student Research Workshop
ACL 2005
Paraphrasing with Bilingual Parallel Corpora
ACL 2005
Statistical Machine Translation with Word- and Sentence-Aligned Parallel Corpora
ACL 2004