conftrace_

Colin Cherry

72 papers · 2003–2025 · 11 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+13 more ↓

🏃 Academic Marathon (22) 🌍 Conference Polyglot (11) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12)

🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (11) 🏃 Academic Marathon (22) 🐺 Lone Wolf (3) 🤝 Dynamic Duo (14) 👥 Mega-Team (27) 🔬 Deep Specialist (25) 🧬 Topic Evolution 🗃️ Keyword Collector (124) ❓ The Questioner ⚡ Prolific Year (6) 🔥 Unstoppable (18) 💎 Century Club (72)

Conferences

ACL (18) NAACL (18) EMNLP (15) COLING (4) ICML (3) IJCNLP (3) INTERSPEECH (3) CONLL (2) EACL (2) ICLR (2) SEMEVAL (2)

Top co-authors

George Foster (14) Grzegorz Kondrak (8) Markus Freitag (8) Orhan Firat (6) Jiaming Luo (6) Kristina Toutanova (5) Dekang Lin (5) Eleftheria Briakou (5) Xiaodan Zhu (4) Ankur Bapna (4)

Keywords

machine translation (15) neural machine translation (8) multilingual model (5) low-resource language (5) large language model (4) translation evaluation (3) domain adaptation (3) text generation (3) in-context learning (3) few-shot learning (2) benchmark evaluation (2) language model (2) speech translation (2) speech recognition (2) transfer learning (2) african language (2) multilingual nlp (1) knowledge distillation (1) attention mechanism (1) metric learning (1)

Papers

Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination’s Impact on Machine Translation ICML 2025 Alligators All Around: Mitigating Lexical Confusion in Low-resource Machine Translation NAACL 2025 SSA-COMET: Do LLMs Outperform Learned Metrics in Evaluating MT for Under-Resourced African Languages? EMNLP 2025 Evaluating WMT 2025 Metrics Shared Task Submissions on the SSA-MTE African Challenge Set EMNLP 2025 SMOL: Professionally Translated Parallel Data for 115 Under-represented Languages EMNLP 2025 Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation NAACL 2025 Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts EMNLP 2024 When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method ICLR 2024 Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model ACL 2024 Barriers to Effective Evaluation of Simultaneous Interpretation EACL 2024 XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages EMNLP 2023 Searching for Needles in a Haystack: On the Role of Incidental Bilingualism in PaLM’s Translation Capability ACL 2023 Prompting PaLM for Translation: Assessing Strategies and Performance ACL 2023 The Unreasonable Effectiveness of Few-shot Learning for Machine Translation ICML 2023 Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation INTERSPEECH 2022 XTREME-S: Evaluating Cross-lingual Speech Representations INTERSPEECH 2022 Data Scaling Laws in NMT: The Effect of Noise and Architecture ICML 2022 Scaling Laws for Neural Machine Translation ICLR 2022 A Natural Diet: Towards Improving Naturalness of Machine Translation Output ACL 2022 Exploring the Benefits and Limitations of Multilinguality for Non-autoregressive Machine Translation EMNLP 2022 Inverted Projection for Robust Speech Translation ACL 2021 Subtitle Translation as Markup Translation INTERSPEECH 2021 Inverted Projection for Robust Speech Translation IJCNLP 2021 Assessing Reference-Free Peer Evaluation for Machine Translation NAACL 2021 Human-Paraphrased References Improve Neural Machine Translation EMNLP 2020 Simultaneous Translation EMNLP 2020 Re-translation versus Streaming for Simultaneous Translation ACL 2020 Inference Strategies for Machine Translation with Conditional Masking EMNLP 2020 Reinforcement Learning based Curriculum Optimization for Neural Machine Translation NAACL 2019 Monotonic Infinite Lookback Attention for Simultaneous Machine Translation ACL 2019 Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP (DeepLo 2019) EMNLP 2019 Proceedings of the Workshop on Deep Learning Approaches for Low-Resource NLP ACL 2018 Revisiting Character-Based Neural Machine Translation with Capacity and Compression EMNLP 2018 A Challenge Set Approach to Evaluating Machine Translation EMNLP 2017 Integrating Morphological Desegmentation into Phrase-based Decoding NAACL 2016 An Empirical Evaluation of Noise Contrastive Estimation for the Neural Network Joint Model of Translation NAACL 2016 SemEval-2016 Task 6: Detecting Stance in Tweets SEMEVAL 2016 The Unreasonable Effectiveness of Word Representations for Twitter Named Entity Recognition NAACL 2015 Inflection Generation as Discriminative String Transduction NAACL 2015 Lattice Desegmentation for Statistical Machine Translation ACL 2014 NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews SEMEVAL 2014 Improved Reordering for Phrase-Based Translation using Sparse Features NAACL 2013 Regularized Minimum Error Rate Training EMNLP 2013 Reversing Morphological Tokenization in English-to-Arabic SMT NAACL 2013 Paraphrasing for Style COLING 2012 MSR SPLAT, a language analysis toolkit NAACL 2012 Batch Tuning Strategies for Statistical Machine Translation NAACL 2012 Data-Driven Response Generation in Social Media EMNLP 2011 Indexing Spoken Documents with Hierarchical Semantic Structures: Semantic Tree-to-string Alignment Models IJCNLP 2011 Joint Training of Dependency Parsing Filters through Latent Support Vector Machines ACL 2011 Lexically-Triggered Hidden Markov Models for Clinical Document Coding ACL 2011 Integrating Joint n-gram Features into a Discriminative Training Framework NAACL 2010 Unsupervised Modeling of Twitter Conversations NAACL 2010 Imposing Hierarchical Browsing Structures onto Spoken Documents COLING 2010 Fast and Accurate Arc Filtering for Dependency Parsing COLING 2010 Cohesive Constraints in A Beam Search Phrase-based Decoder NAACL 2009 A global model for joint lemmatization and part-of-speech prediction IJCNLP 2009 Discriminative Substring Decoding for Transliteration EMNLP 2009 A global model for joint lemmatization and part-of-speech prediction ACL 2009 Unsupervised Morphological Segmentation with Log-Linear Models NAACL 2009 On the Syllabification of Phonemes NAACL 2009 Cohesive Phrase-Based Decoding for Statistical Machine Translation ACL 2008 Automatic Syllabification with Structured SVMs for Letter-to-Phoneme Conversion ACL 2008 Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion ACL 2008 Improved Large Margin Dependency Parsing via Local Constraints and Laplacian Regularization CONLL 2006 A Comparison of Syntactically Motivated Word Alignment Spaces EACL 2006 Soft Syntactic Constraints for Word Alignment through Discriminative Training ACL 2006 Soft Syntactic Constraints for Word Alignment through Discriminative Training COLING 2006 An Expectation Maximization Approach to Pronoun Resolution CONLL 2005 Dependency Treelet Translation: Syntactically Informed Phrasal SMT ACL 2005 Word Alignment with Cohesion Constraint NAACL 2003 A Probability Model to Improve Word Alignment ACL 2003