Arturo Oncevay
31 papers · 2018–2026 · 7 top CS/AI conferences
Achievements
🌈 Renaissance Researcher (5)
🐝 Cross-Pollinator (9)
🌍 Conference Polyglot (7)
🏃 Academic Marathon (7)
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (52)
🧭 Keyword Pioneer
🔬 Deep Specialist (18)
👥 Mega-Team (58)
🏆 Keyword Champion (2)
❓ The Questioner
📈 Trend Setter
🔥 Unstoppable (8)
🗃️ Keyword Collector (123)
⚡ Prolific Year (7)
💎 Century Club (30)
Conferences
NAACL (10)
ACL (9)
EMNLP (5)
EACL (3)
IJCNLP (2)
AACL (1)
COLING (1)
Research topics
Keywords
low-resource language (12)
machine translation (12)
indigenous language (7)
shared task (5)
neural machine translation (4)
data augmentation (3)
subword segmentation (3)
endangered language (3)
morphological segmentation (3)
cross-lingual transfer (2)
polysynthetic language (2)
multilingual nlp (2)
natural language inference (2)
terminology translation (2)
multilingual model (2)
evaluation metric (2)
neural model (2)
human evaluation (2)
morphological reinflection (2)
multilingual system (2)
Papers
Distill and Align Decomposition for Enhanced Claim Verification
EACL 2026
Findings of the AmericasNLP 2025 Shared Tasks on Machine Translation, Creation of Educational Material, and Translation Metrics for Indigenous Languages of the Americas
NAACL 2025
The Impact of Domain-Specific Terminology on Machine Translation for Finance in European Languages
NAACL 2025
AfroCS-xs: Creating a Compact, High-Quality, Human-Validated Code-Switched Dataset for African Languages
ACL 2025
Translating Domain-Specific Terminology in Typologically-Diverse Languages: A Study in Tax and Financial Education
EMNLP 2025
Findings of the WMT25 Terminology Translation Task: Terminology is Useful Especially for Good MTs
EMNLP 2025
Findings of the AmericasNLP 2024 Shared Task on the Creation of Educational Materials for Indigenous Languages
NAACL 2024
Awajun-OP: Multi-domain dataset for Spanish–Awajun Machine Translation
NAACL 2024
Unlocking Knowledge with OCR-Driven Document Digitization for Peruvian Indigenous Languages
NAACL 2024
Findings of the AmericasNLP 2024 Shared Task on Machine Translation into Indigenous Languages
NAACL 2024
Exploring Enhanced Code-Switched Noising for Pretraining in Neural Machine Translation
EACL 2023
Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
EACL 2023
Findings of the AmericasNLP 2023 Shared Task on Machine Translation into Indigenous Languages
ACL 2023
SchAman: Spell-Checking Resources and Benchmark for Endangered Languages from Amazonia
AACL 2022
AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages
ACL 2022
BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages
ACL 2022
CLD² Language Documentation Meets Natural Language Processing for Revitalising Endangered Languages
ACL 2022
Revisiting Syllables in Language Modelling and Their Application on Low-Resource Machine Translation
COLING 2022
SchAman: Spell-Checking Resources and Benchmark for Endangered Languages from Amazonia
IJCNLP 2022
Quantifying Synthesis and Fusion and their Impact on Machine Translation
NAACL 2022
Representation of Yine [Arawak] Morphology by Finite State Transducer Formalism
NAACL 2021
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
ACL 2021
Findings of the AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas
NAACL 2021
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
IJCNLP 2021
Peru is Multilingual, Its Machine Translation Should Be Too?
NAACL 2021
Bridging Linguistic Typology and Multilingual Machine Translation with Multi-View Language Representations
EMNLP 2020
The University of Edinburgh’s English-Tamil and English-Inuktitut Submissions to the WMT20 News Translation Task
EMNLP 2020
Efficient Strategies for Hierarchical Text Classification: External Knowledge and Auxiliary Tasks
ACL 2020
CSI Peru News: finding the culprit, victim and location in news articles
ACL 2019
Assessing Back-Translation as a Corpus Generation Strategy for non-English Tasks: A Study in Reading Comprehension and Word Sense Disambiguation
ACL 2019
Toward Universal Dependencies for Shipibo-Konibo
EMNLP 2018