David Dale
21 papers · 2021–2025 · 6 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (14)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (6)
Renaissance Researcher (5)
Taxonomy Completionist (37)
Deep Specialist (10)
Keyword Collector (90)
Century Club (21)
Unstoppable (5)
Prolific Year (6)
Conferences
ACL (8)
EMNLP (7)
COLING (2)
IJCNLP (2)
AACL (1)
SEMEVAL (1)
Keywords
machine translation (7)
low-resource language (5)
parallel corpus (4)
toxicity detection (4)
multilingual nlp (3)
sequence tagging (3)
toxic span detection (3)
text detoxification (2)
zero-shot learning (2)
multilingual speech (2)
text style transfer (2)
cross-lingual transfer (2)
speech translation (2)
multilingual translation (2)
language model (2)
benchmark evaluation (2)
neural machine translation (2)
paraphrase generation (2)
hallucination detection (2)
span detection (2)
Papers
Less Mature is More Adaptable for Sentence-level Language Modeling
ACL 2025
Improving Language and Modality Transfer in Translation by Character-level Modeling
ACL 2025
LCFO: Long Context and Long Form Output Dataset and Benchmarking
ACL 2025
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
EMNLP 2025
Findings of the WMT 2025 Shared Task of the Open Language Data Initiative
EMNLP 2025
Translate, Then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
EMNLP 2025
FLORES+ Translation and Machine Translation Evaluation for the Erzya Language
EMNLP 2024
BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation
EMNLP 2024
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
ACL 2024
SpeechAlign: A Framework for Speech Translation Alignment Evaluation
COLING 2024
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification
AACL 2023
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better
ACL 2023
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
EMNLP 2023
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification
IJCNLP 2023
The first neural machine translation system for the Erzya language
COLING 2022
A large-scale computational study of content preservation measures for text style transfer and paraphrase generation
ACL 2022
ParaDetox: Detoxification with Parallel Data
ACL 2022
SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection
ACL 2021
SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection
IJCNLP 2021
SkoltechNLP at SemEval-2021 Task 5: Leveraging Sentence-level Pre-training for Toxic Span Detection
SEMEVAL 2021
Text Detoxification using Large Pre-trained Neural Models
EMNLP 2021