conftrace_

Antonios Anastasopoulos

131 papers · 2014–2026 · 12 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (16) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (11) 🗺️ Taxonomy Completionist (16) 🧭 Keyword Pioneer 🏠 Conference Loyalist (39) 🤝 Dynamic Duo (26) 🧬 Topic Evolution 👥 Mega-Team (62) 🌱 Topic Pioneer 🔬 Deep Specialist (47) 🏆 Keyword Champion (3) ❓ The Questioner (6) 💎 Century Club (128) 🗃️ Keyword Collector (53) ⚡ Prolific Year (13) 🔥 Unstoppable (8)

Conferences

EMNLP (39) ACL (35) NAACL (19) IJCNLP (10) COLING (8) EACL (7) AACL (5) INTERSPEECH (4) AAAI (1) ICML (1) SEMEVAL (1) WACV (1)

Top co-authors

Graham Neubig (26) Fahim Faisal (19) Md Mahfuz Ibn Alam (12) Yulia Tsvetkov (9) Marcos Zampieri (9) Marcello Federico (8) Matteo Negri (7) Dhiman Goswami (7) David Chiang (7) Roldano Cattoni (6)

Research topics

Education (2) Linguistics (1)

Keywords

low-resource language (38) machine translation (25) cross-lingual transfer (19) large language model (15) neural machine translation (14) multilingual nlp (11) multilingual model (8) data augmentation (8) domain adaptation (7) speech recognition (7) transfer learning (7) question answering (6) language identification (6) speech translation (6) morphological inflection (6) multilingual language model (6) automatic speech recognition (5) neural network (5) zero-shot learning (5) few-shot learning (4)

Papers

A RAG Approach for Typological Database Completion · EACL 2026
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models · ACL 2026
Extending ASR Evaluation Resources for Modern Greek Dialects · EACL 2026
Follow the Beaten Path: The Role of Route Patterns on Vision-Language Navigation Agents Generalization Abilities · NAACL 2025
Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models · WACV 2025
VMWE identification with models trained on GUD (a UDv.2 treebank of Standard Modern Greek) · NAACL 2025
Script-Agnosticism and its Impact on Language Identification for Dravidian Languages · NAACL 2025
Large Language Models as a Normalizer for Transliteration and Dialectal Translation · COLING 2025
Machine Translation Using Grammar Materials for LLM Post-Correction · NAACL 2025
Towards Ancient Meroitic Decipherment: A Computational Approach · NAACL 2025
Cross-Lingual Representation Alignment Through Contrastive Image-Caption Tuning · ACL 2025
Dialect Normalization using Large Language Models and Morphological Rules · ACL 2025
Costs and Benefits of AI-Enabled Topic Modeling in P-20 Research: The Case of School Improvement Plans · ACL 2025
GMU Systems for the IWSLT 2025 Low-Resource Speech Translation Shared Task · ACL 2025
Findings of the IWSLT 2025 Evaluation Campaign · ACL 2025
Testing the Boundaries of LLMs: Dialectal and Language-Variety Tasks · COLING 2025
Machine Translation Metrics for Indigenous Languages Using Fine-tuned Semantic Embeddings · NAACL 2025
Multilingual Native Language Identification with Large Language Models · NAACL 2025
Dialectal Toxicity Detection: Evaluating LLM-as-a-Judge Consistency Across Language Varieties · EMNLP 2025
Findings of the WMT 2025 Shared Task of the Open Language Data Initiative · EMNLP 2025
Tracing L1 Interference in English Learner Writing: A Longitudinal Corpus with Error Annotations · EMNLP 2025
mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation · NAACL 2025
BiasDora: Exploring Hidden Biased Associations in Vision-Language Models · EMNLP 2024
Language and Speech Technology for Central Kurdish Varieties · COLING 2024
EmoMix-3L: A Code-Mixed Dataset for Bangla-English-Hindi for Emotion Detection · COLING 2024
Back to School: Translation Using Grammar Books · EMNLP 2024
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation · EACL 2024
Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives · EMNLP 2024
Data-Augmentation-Based Dialectal Adaptation for LLMs · NAACL 2024
A Concise Survey of OCR for Low-Resource Languages · NAACL 2024
A Study on Scaling Up Multilingual News Framing Analysis · NAACL 2024
Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers · NAACL 2024
Global Gallery: The Fine Art of Painting Culture Portraits through Multilingual Instruction Tuning · NAACL 2024
Speech Recognition for Greek Dialects: A Challenging Benchmark · INTERSPEECH 2024
Findings of the WMT 2024 Shared Task of the Open Language Data Initiative · EMNLP 2024
From Text to Maps: LLM-Driven Extraction and Geotagging of Epidemiological Data · EMNLP 2024
DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages · ACL 2024
Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages · ACL 2024
Unlearning Climate Misinformation in Large Language Models · ACL 2024
Findings of the IWSLT 2024 Evaluation Campaign · ACL 2024
An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models · EMNLP 2024
Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing · EMNLP 2024
The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? · EMNLP 2024
Noisy Parallel Data Alignment · EACL 2023
SentMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Sentiment Analysis · AACL 2023
OffMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Offensive Language Identification · AACL 2023
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba · ACL 2023
Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities · ACL 2023
Findings of the IWSLT 2023 Evaluation Campaign · ACL 2023
GMU Systems for the IWSLT 2023 Dialect and Low-resource Speech Translation Tasks · ACL 2023
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters · ACL 2023
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey · EACL 2023
Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki · EACL 2023
PALI: A Language Identification Benchmark for Perso-Arabic Scripts · EACL 2023
GlobalBench: A Benchmark for Global Progress in Natural Language Processing · EMNLP 2023
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages · EMNLP 2023
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages · EMNLP 2023
Mitigating Societal Harms in Large Language Models · EMNLP 2023
Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning · EMNLP 2023
Offensive Language Identification in Transliterated and Code-Mixed Bangla · EMNLP 2023
To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer · EMNLP 2023
Geographic and Geopolitical Biases of Language Models · EMNLP 2023
SentMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Sentiment Analysis · IJCNLP 2023
OffMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Offensive Language Identification · IJCNLP 2023
Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages · INTERSPEECH 2023
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters · SEMEVAL 2023
Systematic Inequalities in Language Technology Performance across the World’s Languages · ACL 2022
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages · AACL 2022
Revisiting the Effects of Leakage on Dependency Parsing · ACL 2022
Findings of the IWSLT 2022 Evaluation Campaign · ACL 2022
Findings of the VarDial Evaluation Campaign 2022 · COLING 2022
Findings of the WMT’22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages · EMNLP 2022
Language Adapters for Large-Scale MT: The GMU System for the WMT 2022 Large-Scale Machine Translation Evaluation for African Languages Shared Task · EMNLP 2022
SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection · NAACL 2022
Educational Tools for Mapuzugun · NAACL 2022
The SUMEval 2022 Shared Task on Performance Prediction of Multilingual Pre-trained Language Models · AACL 2022
The GMU System Submission for the SUMEval 2022 Shared Task · AACL 2022
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages · IJCNLP 2022
Dataset Geography: Mapping Language Data to Language Users · ACL 2022
Machine Translation into Low-resource Language Varieties · ACL 2021
SD-QA: Spoken Dialectal Question Answering for the Real World · EMNLP 2021
Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling · EMNLP 2021
Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering · EMNLP 2021
Findings of the WMT Shared Task on Machine Translation Using Terminologies · EMNLP 2021
Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors · ACL 2021
Machine Translation into Low-resource Language Varieties · IJCNLP 2021
Towards more equitable question answering systems: How much more data do you need? · IJCNLP 2021
Findings of the IWSLT 2021 Evaluation Campaign · IJCNLP 2021
Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors · IJCNLP 2021
Phoneme Recognition Through Fine Tuning of Phonetic Representations: A Case Study on Luhya Language Varieties · INTERSPEECH 2021
Findings of the IWSLT 2021 Evaluation Campaign · ACL 2021
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models · NAACL 2021
Towards more equitable question answering systems: How much more data do you need? · ACL 2021
Evaluating the Morphosyntactic Well-formedness of Generated Texts · EMNLP 2021
When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection · EMNLP 2021
Predicting Performance for Natural Language Processing Tasks · ACL 2020
TICO-19: the Translation Initiative for COvid-19 · EMNLP 2020
Fine-Tuning MT systems for Robustness to Second-Language Speaker Variations · EMNLP 2020
Transliteration for Cross-Lingual Morphological Inflection · ACL 2020
The CMU-LTI submission to the SIGMORPHON 2020 Shared Task 0: Language-Specific Cross-Lingual Transfer · ACL 2020
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection · ACL 2020
Should All Cross-Lingual Embeddings Speak English? · ACL 2020
Dynamic Data Selection and Weighting for Iterative Back-Translation · EMNLP 2020
Towards Minimal Supervision BERT-Based Grammar Error Correction (Student Abstract) · AAAI 2020
OCR Post Correction for Endangered Language Texts · EMNLP 2020
Automatic Extraction of Rules Governing Morphological Agreement · EMNLP 2020
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models · EMNLP 2020
Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations · COLING 2020
Endangered Languages meet Modern NLP · COLING 2020
Optimizing Data Usage via Differentiable Rewards · ICML 2020
It’s not a Non-Issue: Negation as a Source of Error in Machine Translation · EMNLP 2020
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information · ACL 2020
Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks · IJCNLP 2019
Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks · EMNLP 2019
Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings · EMNLP 2019
An Analysis of Source-Side Grammatical Errors in NMT · ACL 2019
Findings of the First Shared Task on Machine Translation Robustness · ACL 2019
Improving Robustness of Neural Machine Translation with Multi-task Learning · ACL 2019
Neural Machine Translation of Text from Non-Native Speakers · NAACL 2019
Choosing Transfer Languages for Cross-Lingual Learning · ACL 2019
Generalized Data Augmentation for Low-Resource Translation · ACL 2019
Pushing the Limits of Low-Resource Morphological Inflection · EMNLP 2019
Pushing the Limits of Low-Resource Morphological Inflection · IJCNLP 2019
Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings · IJCNLP 2019
Tied Multitask Learning for Neural Speech Translation · NAACL 2018
Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation · EMNLP 2018
Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource · COLING 2018
Leveraging Translations for Speech Transcription in Low-resource Settings · INTERSPEECH 2018
An Attentional Model for Speech Translation Without Transcription · NAACL 2016
An Unsupervised Probability Model for Speech-to-Translation Alignment of Low-Resource Languages · EMNLP 2016
Adaptive Quality Estimation for Machine Translation · ACL 2014