Leshem Choshen
66 papers · 2018–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (10) πΊοΈ Taxonomy Completionist (11) π Interdisciplinary Bridge π Academic Marathon (7)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
πΊοΈ
Taxonomy Completionist
(11)
π
Keyword Trendsetter Combo
(3)
π
Grand Slam
π
Triple Crown
π§¬
Topic Evolution
π
Keyword Champion
π€
Dynamic Duo
(25)
π₯
Mega-Team
(28)
β
The Questioner
(3)
π
Century Club
(62)
π
Conference Pioneer
π
Trend Setter
ποΈ
Keyword Collector
(238)
β‘
Prolific Year
(9)
π₯
Unstoppable
(8)
Conferences
ACL (18)
EMNLP (16)
CONLL (7)
NAACL (6)
ICLR (5)
ICML (5)
COLING (3)
EACL (2)
NIPS (2)
AAAI (1)
SEMEVAL (1)
Top co-authors
Research topics
Keywords
language model
(10)
machine translation
(7)
large language model
(7)
grammatical error correction
(6)
text classification
(6)
benchmark evaluation
(5)
transfer learning
(5)
universal dependencies
(4)
neural machine translation
(4)
natural language processing
(3)
active learning
(3)
model selection
(3)
pretrained language model
(3)
text generation
(3)
evaluation metric
(3)
reference-based evaluation
(3)
cognitive modeling
(2)
language modeling
(2)
bert model
(2)
argument mining
(2)
Papers
Pretraining Language Models for Diachronic Linguistic Change Discovery
EACL 2026
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
EACL 2026
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
ACL 2026
Mediocrity is the key for LLM as a Judge Anchor Selection
ACL 2026
DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation
ACL 2025
Beneath the Surface of Consistency: Exploring Cross-lingual Knowledge Representation Sharing in LLMs
NAACL 2025
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead
ICML 2025
A Hitchhikerβs Guide to Scaling Law Estimation
ICML 2025
Model merging with SVD to tie the Knots
ICLR 2025
LiveXiv - A Multi-Modal live benchmark based on Arxiv papers content
ICLR 2025
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
ACL 2025
Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data
EMNLP 2025
The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community
ACL 2025
Jump to Conclusions: Short-Cutting Transformers with Linear Transformations
COLING 2024
Navigating the Modern Evaluation Landscape: Considerations in Benchmarks and Frameworks for Large Language Models (LLMs)
COLING 2024
tinyBenchmarks: evaluating LLMs with fewer examples
ICML 2024
Findings of the Second BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
CONLL 2024
NumeroLogic: Number Encoding for Enhanced LLMsβ Numerical Reasoning
EMNLP 2024
Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion
EMNLP 2024
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI
NAACL 2024
Efficient Benchmarking (of Language Models)
NAACL 2024
Efficient multi-prompt evaluation of LLMs
NIPS 2024
Achieving Human Parity in Content-Grounded Datasets Generation
ICLR 2024
Label-Efficient Model Selection for Text Generation
ACL 2024
Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability
ACL 2024
Data Contamination Report from the 2024 CONDA Shared Task
ACL 2024
Asymmetry in Low-Rank Adapters of Foundation Models
ICML 2024
TIES-Merging: Resolving Interference When Merging Models
NIPS 2023
ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning
ACL 2023
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering
ACL 2023
MuLER: Detailed and Scalable Reference-based Evaluation
CONLL 2023
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
CONLL 2023
Where to start? Analyzing the potential value of intermediate models
EMNLP 2023
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney
EMNLP 2023
Knowledge is a Region in Weight Space for Fine-tuned Language Models
EMNLP 2023
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
EMNLP 2023
MuLER: Detailed and Scalable Reference-based Evaluation
EMNLP 2023
Enhancing the Transformer Decoder with Transition-based Syntax
CONLL 2022
Semantics-aware Attention Improves Neural Machine Translation
NAACL 2022
On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation
EMNLP 2022
Label Sleuth: From Unlabeled Text to a Classifier in a Few Hours
EMNLP 2022
Cluster & Tune: Boost Cold Start Performance in Text Classification
ACL 2022
The Grammar-Learning Trajectories of Neural Language Models
ACL 2022
Reinforcement Learning with Large Action Spaces for Neural Machine Translation
COLING 2022
Enhancing the Transformer Decoder with Transition-based Syntax
EMNLP 2022
PreQuEL: Quality Estimation of Machine Translation Outputs in Advance
EMNLP 2022
On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation
CONLL 2022
Mediators in Determining what Processing BERT Performs First
NAACL 2021
Q2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering
EMNLP 2021
Classifying Syntactic Errors in Learner Language
CONLL 2020
Letβs Agree to Agree: Neural Networks Share Classification Order on Real Datasets
ICML 2020
Active Learning for BERT: An Empirical Study
EMNLP 2020
Unsupervised Expressive Rules Provide Explainability and Assist Human Experts Grasping New Domains
EMNLP 2020
Classifying Syntactic Errors in Learner Language
EMNLP 2020
Corpus Wide Argument MiningβA Working Solution
AAAI 2020
On the Weaknesses of Reinforcement Learning for Neural Machine Translation
ICLR 2020
Automatically Extracting Challenge Sets for Non-Local Phenomena in Neural Machine Translation
CONLL 2019
SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA
SEMEVAL 2019
Learning to combine Grammatical Error Corrections
ACL 2019
The Language of Legal and Illegal Activity on the Darknet
ACL 2019
Are You Convinced? Choosing the More Convincing Evidence with a Siamese Network
ACL 2019
Reference-less Measure of Faithfulness for Grammatical Error Correction
NAACL 2018
DORA The Explorer: Directed Outreaching Reinforcement Action-Selection
ICLR 2018
Inherent Biases in Reference-based Evaluation for Grammatical Error Correction
ACL 2018
Automatic Metric Validation for Grammatical Error Correction
ACL 2018
Will it Blend? Blending Weak and Strong Labeled Data in a Neural Network for Argumentation Mining
ACL 2018