Co-occurring keywords
Papers
TEDxTN: A Three-way Speech Translation Corpus for Code-Switched Tunisian Arabic - English
EMNLP 2025
BOUQuET : dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
EMNLP 2025
JHU WMT 2025 CreoleMT System Description: Data for Belizean Kriol and French Guianese Creole MT
EMNLP 2025
Parallel Corpora for Machine Translation in Low-Resource Indic Languages: A Comprehensive Review
NAACL 2025
Sentence-Alignment in Semi-parallel Datasets
NAACL 2025
EduCSW: Building a Mandarin-English Code-Switched Generation Pipeline for Computer Science Learning
ACL 2025
Unsupervised Sentence Readability Estimation Based on Parallel Corpora for Text Simplification
ACL 2025
OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages
EMNLP 2025
Developing an Informal-Formal Persian Corpus: Highlighting the Differences between Two Writing Styles
COLING 2025