Antonios Anastasopoulos
131 papers · 2014–2026 · 12 top CS/AI conferences
Achievements
🗺️ Taxonomy Completionist (16)
🧭 Keyword Pioneer
🌈 Renaissance Researcher (5)
🌉 Interdisciplinary Bridge
🐣 Hot Topic Early Bird
🐝 Cross-Pollinator (11)
🏠 Conference Loyalist (39)
🤝 Dynamic Duo (26)
🧬 Topic Evolution
👥 Mega-Team (62)
🌱 Topic Pioneer
🔬 Deep Specialist (47)
🏆 Keyword Champion (3)
❓ The Questioner (6)
💎 Century Club (128)
🗃️ Keyword Collector (53)
⚡ Prolific Year (13)
🔥 Unstoppable (8)
Conferences
EMNLP (39)
ACL (35)
NAACL (19)
IJCNLP (10)
COLING (8)
EACL (7)
AACL (5)
INTERSPEECH (4)
AAAI (1)
ICML (1)
SEMEVAL (1)
WACV (1)
Keywords
low-resource language (38)
machine translation (25)
cross-lingual transfer (19)
large language model (15)
neural machine translation (14)
multilingual nlp (11)
multilingual model (8)
data augmentation (8)
domain adaptation (7)
speech recognition (7)
transfer learning (7)
question answering (6)
language identification (6)
speech translation (6)
morphological inflection (6)
multilingual language model (6)
automatic speech recognition (5)
neural network (5)
zero-shot learning (5)
few-shot learning (4)
Papers
A RAG Approach for Typological Database Completion
EACL 2026
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language Models
ACL 2026
Extending ASR Evaluation Resources for Modern Greek Dialects
EACL 2026
Follow the Beaten Path: The Role of Route Patterns on Vision-Language Navigation Agents Generalization Abilities
NAACL 2025
Crossroads of Continents: Automated Artifact Extraction for Cultural Adaptation with Large Multimodal Models
WACV 2025
VMWE identification with models trained on GUD (a UDv.2 treebank of Standard Modern Greek)
NAACL 2025
Script-Agnosticism and its Impact on Language Identification for Dravidian Languages
NAACL 2025
Large Language Models as a Normalizer for Transliteration and Dialectal Translation
COLING 2025
Machine Translation Using Grammar Materials for LLM Post-Correction
NAACL 2025
Towards Ancient Meroitic Decipherment: A Computational Approach
NAACL 2025
Cross-Lingual Representation Alignment Through Contrastive Image-Caption Tuning
ACL 2025
Dialect Normalization using Large Language Models and Morphological Rules
ACL 2025
Costs and Benefits of AI-Enabled Topic Modeling in P-20 Research: The Case of School Improvement Plans
ACL 2025
GMU Systems for the IWSLT 2025 Low-Resource Speech Translation Shared Task
ACL 2025
Findings of the IWSLT 2025 Evaluation Campaign
ACL 2025
Testing the Boundaries of LLMs: Dialectal and Language-Variety Tasks
COLING 2025
Machine Translation Metrics for Indigenous Languages Using Fine-tuned Semantic Embeddings
NAACL 2025
Multilingual Native Language Identification with Large Language Models
NAACL 2025
Dialectal Toxicity Detection: Evaluating LLM-as-a-Judge Consistency Across Language Varieties
EMNLP 2025
Findings of the WMT 2025 Shared Task of the Open Language Data Initiative
EMNLP 2025
Tracing L1 Interference in English Learner Writing: A Longitudinal Corpus with Error Annotations
EMNLP 2025
mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation
NAACL 2025
BiasDora: Exploring Hidden Biased Associations in Vision-Language Models
EMNLP 2024
Language and Speech Technology for Central Kurdish Varieties
COLING 2024
EmoMix-3L: A Code-Mixed Dataset for Bangla-English-Hindi for Emotion Detection
COLING 2024
Back to School: Translation Using Grammar Books
EMNLP 2024
CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation
EACL 2024
Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives
EMNLP 2024
Data-Augmentation-Based Dialectal Adaptation for LLMs
NAACL 2024
A Concise Survey of OCR for Low-Resource Languages
NAACL 2024
A Study on Scaling Up Multilingual News Framing Analysis
NAACL 2024
Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
NAACL 2024
Global Gallery: The Fine Art of Painting Culture Portraits through Multilingual Instruction Tuning
NAACL 2024
Speech Recognition for Greek Dialects: A Challenging Benchmark
INTERSPEECH 2024
Findings of the WMT 2024 Shared Task of the Open Language Data Initiative
EMNLP 2024
From Text to Maps: LLM-Driven Extraction and Geotagging of Epidemiological Data
EMNLP 2024
DIALECTBENCH: An NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
ACL 2024
Dictionary-Aided Translation for Handling Multi-Word Expressions in Low-Resource Languages
ACL 2024
Unlearning Climate Misinformation in Large Language Models
ACL 2024
Findings of the IWSLT 2024 Evaluation Campaign
ACL 2024
An Efficient Approach for Studying Cross-Lingual Transfer in Multilingual Language Models
EMNLP 2024
Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing
EMNLP 2024
The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead?
EMNLP 2024
Noisy Parallel Data Alignment
EACL 2023
SentMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Sentiment Analysis
AACL 2023
OffMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Offensive Language Identification
AACL 2023
BIG-C: a Multimodal Multi-Purpose Dataset for Bemba
ACL 2023
Script Normalization for Unconventional Writing of Under-Resourced Languages in Bilingual Communities
ACL 2023
Findings of the IWSLT 2023 Evaluation Campaign
ACL 2023
GMU Systems for the IWSLT 2023 Dialect and Low-resource Speech Translation Tasks
ACL 2023
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters
ACL 2023
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey
EACL 2023
Approaches to Corpus Creation for Low-Resource Language Technology: the Case of Southern Kurdish and Laki
EACL 2023
PALI: A Language Identification Benchmark for Perso-Arabic Scripts
EACL 2023
GlobalBench: A Benchmark for Global Progress in Natural Language Processing
EMNLP 2023
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages
EMNLP 2023
Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
EMNLP 2023
Mitigating Societal Harms in Large Language Models
EMNLP 2023
Teacher Perception of Automatically Extracted Grammar Concepts for L2 Language Learning
EMNLP 2023
Offensive Language Identification in Transliterated and Code-Mixed Bangla
EMNLP 2023
To token or not to token: A Comparative Study of Text Representations for Cross-Lingual Transfer
EMNLP 2023
Geographic and Geopolitical Biases of Language Models
EMNLP 2023
SentMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Sentiment Analysis
IJCNLP 2023
OffMix-3L: A Novel Code-Mixed Test Dataset in Bangla-English-Hindi for Offensive Language Identification
IJCNLP 2023
Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
INTERSPEECH 2023
GMNLP at SemEval-2023 Task 12: Sentiment Analysis with Phylogeny-Based Adapters
SEMEVAL 2023
Systematic Inequalities in Language Technology Performance across the World’s Languages
ACL 2022
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages
AACL 2022
Revisiting the Effects of Leakage on Dependency Parsing
ACL 2022
Findings of the IWSLT 2022 Evaluation Campaign
ACL 2022
Findings of the VarDial Evaluation Campaign 2022
COLING 2022
Findings of the WMT’22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages
EMNLP 2022
Language Adapters for Large-Scale MT: The GMU System for the WMT 2022 Large-Scale Machine Translation Evaluation for African Languages Shared Task
EMNLP 2022
SIGMORPHON–UniMorph 2022 Shared Task 0: Generalization and Typologically Diverse Morphological Inflection
NAACL 2022
Educational Tools for Mapuzugun
NAACL 2022
The SUMEval 2022 Shared Task on Performance Prediction of Multilingual Pre-trained Language Models
AACL 2022
The GMU System Submission for the SUMEval 2022 Shared Task
AACL 2022
Phylogeny-Inspired Adaptation of Multilingual Models to New Languages
IJCNLP 2022
Dataset Geography: Mapping Language Data to Language Users
ACL 2022
Machine Translation into Low-resource Language Varieties
ACL 2021
SD-QA: Spoken Dialectal Question Answering for the Real World
EMNLP 2021
Multilingual Code-Switching for Zero-Shot Cross-Lingual Intent Prediction and Slot Filling
EMNLP 2021
Investigating Post-pretraining Representation Alignment for Cross-Lingual Question Answering
EMNLP 2021
Findings of the WMT Shared Task on Machine Translation Using Terminologies
EMNLP 2021
Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors
ACL 2021
Machine Translation into Low-resource Language Varieties
IJCNLP 2021
Towards more equitable question answering systems: How much more data do you need?
IJCNLP 2021
Findings of the IWSLT 2021 Evaluation Campaign
IJCNLP 2021
Code to Comment Translation: A Comparative Study on Model Effectiveness & Errors
IJCNLP 2021
Phoneme Recognition Through Fine Tuning of Phonetic Representations: A Case Study on Luhya Language Varieties
INTERSPEECH 2021
Findings of the IWSLT 2021 Evaluation Campaign
ACL 2021
When Being Unseen from mBERT is just the Beginning: Handling New Languages With Multilingual Language Models
NAACL 2021
Towards more equitable question answering systems: How much more data do you need?
ACL 2021
Evaluating the Morphosyntactic Well-formedness of Generated Texts
EMNLP 2021
When is Wall a Pared and when a Muro?: Extracting Rules Governing Lexical Selection
EMNLP 2021
Predicting Performance for Natural Language Processing Tasks
ACL 2020
TICO-19: the Translation Initiative for COvid-19
EMNLP 2020
Fine-Tuning MT systems for Robustness to Second-Language Speaker Variations
EMNLP 2020
Transliteration for Cross-Lingual Morphological Inflection
ACL 2020
The CMU-LTI submission to the SIGMORPHON 2020 Shared Task 0: Language-Specific Cross-Lingual Transfer
ACL 2020
SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection
ACL 2020
Should All Cross-Lingual Embeddings Speak English?
ACL 2020
Dynamic Data Selection and Weighting for Iterative Back-Translation
EMNLP 2020
Towards Minimal Supervision BERT-Based Grammar Error Correction (Student Abstract)
AAAI 2020
OCR Post Correction for Endangered Language Texts
EMNLP 2020
Automatic Extraction of Rules Governing Morphological Agreement
EMNLP 2020
X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models
EMNLP 2020
Automatic Interlinear Glossing for Under-Resourced Languages Leveraging Translations
COLING 2020
Endangered Languages meet Modern NLP
COLING 2020
Optimizing Data Usage via Differentiable Rewards
ICML 2020
It’s not a Non-Issue: Negation as a Source of Error in Machine Translation
EMNLP 2020
It’s Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information
ACL 2020
Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks
IJCNLP 2019
Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks
EMNLP 2019
Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings
EMNLP 2019
An Analysis of Source-Side Grammatical Errors in NMT
ACL 2019
Findings of the First Shared Task on Machine Translation Robustness
ACL 2019
Improving Robustness of Neural Machine Translation with Multi-task Learning
ACL 2019
Neural Machine Translation of Text from Non-Native Speakers
NAACL 2019
Choosing Transfer Languages for Cross-Lingual Learning
ACL 2019
Generalized Data Augmentation for Low-Resource Translation
ACL 2019
Pushing the Limits of Low-Resource Morphological Inflection
EMNLP 2019
Pushing the Limits of Low-Resource Morphological Inflection
IJCNLP 2019
Unsupervised Domain Adaptation for Neural Machine Translation with Domain-Aware Feature Embeddings
IJCNLP 2019
Tied Multitask Learning for Neural Speech Translation
NAACL 2018
Freezing Subnetworks to Analyze Domain Adaptation in Neural Machine Translation
EMNLP 2018
Part-of-Speech Tagging on an Endangered Language: a Parallel Griko-Italian Resource
COLING 2018
Leveraging Translations for Speech Transcription in Low-resource Settings
INTERSPEECH 2018
An Attentional Model for Speech Translation Without Transcription
NAACL 2016
An Unsupervised Probability Model for Speech-to-Translation Alignment of Low-Resource Languages
EMNLP 2016
Adaptive Quality Estimation for Machine Translation
ACL 2014