Pavel Stepachev
5 papers · 2018–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10)
🐝
Cross-Pollinator
(12)
👥
Mega-Team
(35)
❓
The Questioner
Conferences
ACL (2)
EMNLP (2)
NAACL (1)
Top co-authors
Keywords
machine translation
(2)
multilingual corpus
(2)
low-resource translation
(2)
parallel datum
(2)
continued pretraining
(1)
language model
(1)
parameter-efficient fine-tuning
(1)
supervised fine-tuning
(1)
instruction fine-tuning
(1)
human annotation
(1)
multilingual model
(1)
checkpoint averaging
(1)
corpus quality
(1)
multilingual language model
(1)
language model pretraining
(1)
cross-lingual dependency parsing
(1)
large language model
(1)
web datum
(1)
spanning tree algorithm
(1)
synthetic treebank
(1)
Papers
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
ACL 2026
An Expanded Massive Multilingual Dataset for High-Performance Language Technologies (HPLT)
ACL 2025
Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
EMNLP 2024
Exploring Very Low-Resource Translation with LLMs: The University of Edinburgh’s Submission to AmericasNLP 2024 Translation Task
NAACL 2024
Multi-source synthetic treebank creation for improved cross-lingual dependency parsing
EMNLP 2018