Colin Cherry
72 papers · 2003–2025 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Academic Marathon (22) π Conference Polyglot (11) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (12)
π
Renaissance Researcher
(6)
π
Conference Polyglot
(11)
π
Academic Marathon
(22)
πΊ
Lone Wolf
(3)
π€
Dynamic Duo
(14)
π₯
Mega-Team
(27)
π¬
Deep Specialist
(25)
π§¬
Topic Evolution
ποΈ
Keyword Collector
(124)
β
The Questioner
β‘
Prolific Year
(6)
π₯
Unstoppable
(18)
π
Century Club
(72)
Conferences
ACL (18)
NAACL (18)
EMNLP (15)
COLING (4)
ICML (3)
IJCNLP (3)
INTERSPEECH (3)
CONLL (2)
EACL (2)
ICLR (2)
SEMEVAL (2)
Top co-authors
Keywords
machine translation
(15)
neural machine translation
(8)
multilingual model
(5)
low-resource language
(5)
large language model
(4)
translation evaluation
(3)
domain adaptation
(3)
text generation
(3)
in-context learning
(3)
few-shot learning
(2)
benchmark evaluation
(2)
language model
(2)
speech translation
(2)
speech recognition
(2)
transfer learning
(2)
african language
(2)
multilingual nlp
(1)
knowledge distillation
(1)
attention mechanism
(1)
metric learning
(1)
Papers
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contaminationβs Impact on Machine Translation
ICML 2025
Alligators All Around: Mitigating Lexical Confusion in Low-resource Machine Translation
NAACL 2025
SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages?
EMNLP 2025
Evaluating WMT 2025 Metrics Shared Task Submissions on the SSA-MTE African Challenge Set
EMNLP 2025
SMOL: Professionally Translated Parallel Data for 115 Under-represented Languages
EMNLP 2025
Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation
NAACL 2025
Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts
EMNLP 2024
When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
ICLR 2024
Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model
ACL 2024
Barriers to Effective Evaluation of Simultaneous Interpretation
EACL 2024
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
EMNLP 2023
Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLMβs Translation Capability
ACL 2023
Prompting PaLM for Translation: Assessing Strategies and Performance
ACL 2023
The Unreasonable Effectiveness of Few-shot Learning for Machine Translation
ICML 2023
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation
INTERSPEECH 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
INTERSPEECH 2022
Data Scaling Laws in NMT: The Effect of Noise and Architecture
ICML 2022
Scaling Laws for Neural Machine Translation
ICLR 2022
A Natural Diet: Towards Improving Naturalness of Machine Translation Output
ACL 2022
Exploring the Benefits and Limitations of Multilinguality for Non-autoregressive Machine Translation
EMNLP 2022
Inverted Projection for Robust Speech Translation
ACL 2021
Subtitle Translation as Markup Translation
INTERSPEECH 2021
Inverted Projection for Robust Speech Translation
IJCNLP 2021
Assessing Reference-Free Peer Evaluation for Machine Translation
NAACL 2021
Human-Paraphrased References Improve Neural Machine Translation
EMNLP 2020
Simultaneous Translation
EMNLP 2020
Re-translation versus Streaming for Simultaneous Translation
ACL 2020
Inference Strategies for Machine Translation with Conditional Masking
EMNLP 2020
Reinforcement Learning based Curriculum Optimization for Neural Machine Translation
NAACL 2019
Monotonic Infinite Lookback Attention for Simultaneous Machine Translation
ACL 2019
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019)
EMNLP 2019
Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP
ACL 2018
Revisiting Character-Based Neural Machine Translation with Capacity and Compression
EMNLP 2018
A Challenge Set Approach to Evaluating Machine Translation
EMNLP 2017
Integrating Morphological Desegmentation into Phrase-based Decoding
NAACL 2016
An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation
NAACL 2016
SemEval-2016 Task 6: Detecting Stance in Tweets
SEMEVAL 2016
The Unreasonable Effectiveness of Word Representations for Twitter Named Entity Recognition
NAACL 2015
Inflection Generation as Discriminative String Transduction
NAACL 2015
Lattice Desegmentation for Statistical Machine Translation
ACL 2014
NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews
SEMEVAL 2014
Improved Reordering for Phrase-Based Translation using Sparse Features
NAACL 2013
Regularized Minimum Error Rate Training
EMNLP 2013
Reversing Morphological Tokenization in English-to-Arabic SMT
NAACL 2013
Paraphrasing for Style
COLING 2012
MSR SPLAT, a language analysis toolkit
NAACL 2012
Batch Tuning Strategies for Statistical Machine Translation
NAACL 2012
Data-Driven Response Generation in Social Media
EMNLP 2011
Indexing Spoken Documents with Hierarchical Semantic Structures: Semantic Tree-to-string Alignment Models
IJCNLP 2011
Joint Training of Dependency Parsing Filters through Latent Support Vector Machines
ACL 2011
Lexically-Triggered Hidden Markov Models for Clinical Document Coding
ACL 2011
Integrating Joint n-gram Features into a Discriminative Training Framework
NAACL 2010
Unsupervised Modeling of Twitter Conversations
NAACL 2010
Imposing Hierarchical Browsing Structures onto Spoken Documents
COLING 2010
Fast and Accurate Arc Filtering for Dependency Parsing
COLING 2010
Cohesive Constraints in A Beam Search Phrase-based Decoder
NAACL 2009
A global model for joint lemmatization and part-of-speech prediction
IJCNLP 2009
Discriminative Substring Decoding for Transliteration
EMNLP 2009
A global model for joint lemmatization and part-of-speech prediction
ACL 2009
Unsupervised Morphological Segmentation with Log-Linear Models
NAACL 2009
On the Syllabification of Phonemes
NAACL 2009
Cohesive Phrase-Based Decoding for Statistical Machine Translation
ACL 2008
Automatic Syllabification with Structured SVMs for Letter-to-Phoneme Conversion
ACL 2008
Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion
ACL 2008
Improved Large Margin Dependency Parsing via Local Constraints and Laplacian Regularization
CONLL 2006
A Comparison of Syntactically Motivated Word Alignment Spaces
EACL 2006
Soft Syntactic Constraints for Word Alignment through Discriminative Training
ACL 2006
Soft Syntactic Constraints for Word Alignment through Discriminative Training
COLING 2006
An Expectation Maximization Approach to Pronoun Resolution
CONLL 2005
Dependency Treelet Translation: Syntactically Informed Phrasal SMT
ACL 2005
Word Alignment with Cohesion Constraint
NAACL 2003
A Probability Model to Improve Word Alignment
ACL 2003