Leshem Choshen

66 papers · 2018–2026 · 11 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🗺️ Taxonomy Completionist (11) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7) 🌟 Keyword Trendsetter Combo (3) 🏆 Grand Slam 👑 Triple Crown 🧬 Topic Evolution 🏆 Keyword Champion 🤝 Dynamic Duo (25) 👥 Mega-Team (28) ❓ The Questioner (3) 💎 Century Club (62) 🚀 Conference Pioneer 📈 Trend Setter 🗃️ Keyword Collector (238) ⚡ Prolific Year (9) 🔥 Unstoppable (8)

Conferences

ACL (18) EMNLP (16) CONLL (7) NAACL (6) ICLR (5) ICML (5) COLING (3) EACL (2) NIPS (2) AAAI (1) SEMEVAL (1)

Top co-authors

Omri Abend (26) Noam Slonim (12) Eyal Shnarch (10) Ariel Gera (8) Ranit Aharonov (7) Shachar Don-Yehiya (7) Yoav Katz (6) Lena Dankin (6) Mikhail Yurochkin (5) Alex Warstadt (5)

Research topics

Applications (1) Privacy (1)

Keywords

language model (10) machine translation (7) large language model (7) grammatical error correction (6) text classification (6) benchmark evaluation (5) transfer learning (5) universal dependencies (4) neural machine translation (4) natural language processing (3) active learning (3) model selection (3) pretrained language model (3) text generation (3) evaluation metric (3) reference-based evaluation (3) cognitive modeling (2) language modeling (2) bert model (2) argument mining (2)

Papers

Pretraining Language Models for Diachronic Linguistic Change Discovery EACL 2026
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data EACL 2026
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data ACL 2026
Mediocrity is the key for LLM as a Judge Anchor Selection ACL 2026
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation ACL 2025
Beneath the Surface of Consistency: Exploring Cross-lingual Knowledge Representation Sharing in LLMs NAACL 2025
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead ICML 2025
A Hitchhiker’s Guide to Scaling Law Estimation ICML 2025
Model merging with SVD to tie the Knots ICLR 2025
LiveXiv - A Multi-Modal live benchmark based on Arxiv papers content ICLR 2025
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation ACL 2025
Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data EMNLP 2025
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community ACL 2025
Jump to Conclusions: Short-Cutting Transformers with Linear Transformations COLING 2024
Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs) COLING 2024
tinyBenchmarks: evaluating LLMs with fewer examples ICML 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora CONLL 2024
NumeroLogic: Number Encoding for Enhanced LLMs’ Numerical Reasoning EMNLP 2024
Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion EMNLP 2024
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI NAACL 2024
Efficient Benchmarking (of Language Models) NAACL 2024
Efficient multi-prompt evaluation of LLMs NIPS 2024
Achieving Human Parity in Content-Grounded Datasets Generation ICLR 2024
Label-Efficient Model Selection for Text Generation ACL 2024
Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability ACL 2024
Data Contamination Report from the 2024 CONDA Shared Task ACL 2024
Asymmetry in Low-Rank Adapters of Foundation Models ICML 2024
TIES-Merging: Resolving Interference When Merging Models NIPS 2023
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning ACL 2023
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering ACL 2023
MuLER: Detailed and Scalable Reference-based Evaluation CONLL 2023
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora CONLL 2023
Where to start? Analyzing the potential value of intermediate models EMNLP 2023
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney EMNLP 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models EMNLP 2023
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora EMNLP 2023
MuLER: Detailed and Scalable Reference-based Evaluation EMNLP 2023
Enhancing the Transformer Decoder with Transition-based Syntax CONLL 2022
Semantics-aware Attention Improves Neural Machine Translation NAACL 2022
On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation EMNLP 2022
Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours EMNLP 2022
Cluster & Tune: Boost Cold Start Performance in Text Classification ACL 2022
The Grammar-Learning Trajectories of Neural Language Models ACL 2022
Reinforcement Learning with Large Action Spaces for Neural Machine Translation COLING 2022
Enhancing the Transformer Decoder with Transition-based Syntax EMNLP 2022
PreQuEL: Quality Estimation of Machine Translation Outputs in Advance EMNLP 2022
On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation CONLL 2022
Mediators in Determining what Processing BERT Performs First NAACL 2021
Q2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering EMNLP 2021
Classifying Syntactic Errors in Learner Language CONLL 2020
Let’s Agree to Agree: Neural Networks Share Classification Order on Real Datasets ICML 2020
Active Learning for BERT: An Empirical Study EMNLP 2020
Unsupervised Expressive Rules Provide Explainability and Assist Human Experts Grasping New Domains EMNLP 2020
Classifying Syntactic Errors in Learner Language EMNLP 2020
Corpus Wide Argument Mining—A Working Solution AAAI 2020
On the Weaknesses of Reinforcement Learning for Neural Machine Translation ICLR 2020
Automatically Extracting Challenge Sets for Non-Local Phenomena in Neural Machine Translation CONLL 2019
SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA SEMEVAL 2019
Learning to combine Grammatical Error Corrections ACL 2019
The Language of Legal and Illegal Activity on the Darknet ACL 2019
Are You Convinced? Choosing the More Convincing Evidence with a Siamese Network ACL 2019
Reference-less Measure of Faithfulness for Grammatical Error Correction NAACL 2018
DORA The Explorer: Directed Outreaching Reinforcement Action-Selection ICLR 2018
Inherent Biases in Reference-based Evaluation for Grammatical Error Correction ACL 2018
Automatic Metric Validation for Grammatical Error Correction ACL 2018
Will it Blend? Blending Weak and Strong Labeled Data in a Neural Network for Argumentation Mining ACL 2018