Muhammad Abdul-Mageed
105 papers · 2011–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🌍 Conference Polyglot (9)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(14)
🏠
Conference Loyalist
(37)
🤝
Dynamic Duo
(28)
👥
Mega-Team
(44)
🔬
Deep Specialist
(40)
🧬
Topic Evolution
🏆
Keyword Champion
(3)
❓
The Questioner
(4)
🗃️
Keyword Collector
(290)
💎
Century Club
(104)
🔥
Unstoppable
(10)
📈
Trend Setter
⚡
Prolific Year
(6)
Conferences
ACL (38)
EMNLP (31)
NAACL (12)
EACL (7)
COLING (5)
INTERSPEECH (4)
IJCNLP (3)
SEMEVAL (3)
AACL (2)
Top co-authors
Research topics
Keywords
large language model
(23)
arabic language
(19)
machine translation
(15)
multilingual nlp
(14)
text classification
(14)
arabic dialect
(13)
multilingual model
(13)
low-resource language
(12)
dialect identification
(12)
transfer learning
(9)
african language
(8)
benchmark evaluation
(7)
zero-shot learning
(7)
few-shot learning
(7)
neural machine translation
(7)
dialectal arabic
(7)
natural language processing
(6)
sentiment analysis
(5)
speech recognition
(5)
knowledge distillation
(5)
Papers
Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs
ACL 2026
Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset
EMNLP 2025
EduAdapt: A Question Answer Benchmark Dataset for Evaluating Grade-Level Adaptability in LLMs
EMNLP 2025
Voice of a Continent: Mapping Africa’s Speech Technology Frontier
EMNLP 2025
NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities
EMNLP 2025
Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models
EMNLP 2025
PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture
EMNLP 2025
NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task
EMNLP 2025
Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks
NAACL 2025
Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs
NAACL 2025
JAWAHER: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking
NAACL 2025
uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes
NAACL 2025
Where Are We? Evaluating LLM Performance on African Languages
ACL 2025
Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs
ACL 2025
AraHealthQA 2025: The First Shared Task on Arabic Health Question Answering
EMNLP 2025
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic
ACL 2024
NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task
ACL 2024
WojoodNER 2024: The Second Arabic Named Entity Recognition Shared Task
ACL 2024
Fumbling in Babel: An Investigation into ChatGPT’s Language Identification Ability
NAACL 2024
Distilling Text Style Transfer With Self-Explanation From LLMs
NAACL 2024
To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation
ACL 2024
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks
ACL 2024
Cheetah: Natural Language Generation for 517 African Languages
ACL 2024
Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts
ACL 2024
LLM Performance Predictors are good initializers for Architecture Search
ACL 2024
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
ACL 2024
Toucan: Many-to-Many Translation for 150 African Language Pairs
ACL 2024
Towards Zero-Shot Text-To-Speech for Arabic Dialects
ACL 2024
Arabic Automatic Story Generation with Large Language Models
ACL 2024
John vs. Ahmed: Debate-Induced Bias in Multilingual LLMs
ACL 2024
Qalam: A Multimodal LLM for Arabic Optical Character and Handwriting Recognition
ACL 2024
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
EMNLP 2024
DetoxLLM: A Framework for Detoxification with Explanations
EMNLP 2024
Interplay of Machine Translation, Diacritics, and Diacritization
NAACL 2024
What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark
INTERSPEECH 2024
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
EACL 2024
On the Utility of Pretraining Language Models on Synthetic Data
ACL 2024
Benchmarking LLaMA-3 on Arabic Language Generation Tasks
ACL 2024
Gazelle: An Instruction Dataset for Arabic Writing Assistance
EMNLP 2024
From Nile Sands to Digital Hands: Machine Translation of Coptic Texts
ACL 2024
SERENGETI: Massively Multilingual Language Models for Africa
ACL 2023
PACT: Pretraining with Adversarial Contrastive Learning for Text Classification
AACL 2023
ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting
AACL 2023
Contrastive Learning of Sociopragmatic Meaning in Social Media
ACL 2023
AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation
ACL 2023
ORCA: A Challenging Benchmark for Arabic Language Understanding
ACL 2023
UBC-DLNLP at SemEval-2023 Task 12: Impact of Transfer Learning on African Sentiment Analysis
ACL 2023
Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints
ACL 2023
Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning
ACL 2023
Improving Neural Machine Translation of Indigenous Languages with Multilingual Transfer Learning
EACL 2023
SIDLR: Slot and Intent Detection Models for Low-Resource Language Varieties
EACL 2023
GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP
EMNLP 2023
The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages
EMNLP 2023
JASMINE: Arabic GPT Models for Few-Shot Learning
EMNLP 2023
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG
EMNLP 2023
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder
EMNLP 2023
TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties
EMNLP 2023
Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction
EMNLP 2023
Octopus: A Multitask Model and Toolkit for Arabic Natural Language Generation
EMNLP 2023
Arabic Fine-Grained Entity Recognition
EMNLP 2023
VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System
EMNLP 2023
NADI 2023: The Fourth Nuanced Arabic Dialect Identification Shared Task
EMNLP 2023
WojoodNER 2023: The First Arabic Named Entity Recognition Shared Task
EMNLP 2023
ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting
IJCNLP 2023
PACT: Pretraining with Adversarial Contrastive Learning for Text Classification
IJCNLP 2023
N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition
INTERSPEECH 2023
On the Robustness of Arabic Speech Dialect Identification
INTERSPEECH 2023
UBC-DLNLP at SemEval-2023 Task 12: Impact of Transfer Learning on African Sentiment Analysis
SEMEVAL 2023
A Benchmark Study of Contrastive Learning for Arabic Social Meaning
EMNLP 2022
AfroLID: A Neural Language Identification Tool for African Languages
EMNLP 2022
Linguistically-Motivated Yorùbá-English Machine Translation
COLING 2022
AraT5: Text-to-Text Transformers for Arabic Language Generation
ACL 2022
Improving Social Meaning Detection with Pragmatic Masking and Surrogate Fine-Tuning
ACL 2022
Automatic Detection of Entity-Manipulated Text using Factual Knowledge
ACL 2022
Towards Afrocentric NLP for African Languages: Where We Are and Where We Can Go
ACL 2022
NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task
EMNLP 2022
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
ACL 2021
Investigating Code-Mixed Modern Standard Arabic-Egyptian to English Machine Translation
NAACL 2021
NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task
EACL 2021
AraStance: A Multi-Country and Multi-Domain Dataset of Arabic Stance Detection for Fact Checking
NAACL 2021
Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing
NAACL 2021
IndT5: A Text-to-Text Transformer for 10 Indigenous Languages
NAACL 2021
DiaLex: A Benchmark for Evaluating Multidialectal Arabic Word Embeddings
EACL 2021
Improving Similar Language Translation With Transfer Learning
EMNLP 2021
Machine Translation of Low-Resource Indo-European Languages
EMNLP 2021
Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19
EACL 2021
Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling
EACL 2021
ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic
IJCNLP 2021
Growing Together: Modeling Human Language Learning With n-Best Multi-Checkpoint Machine Translation
ACL 2020
NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task
COLING 2020
Machine Generation and Detection of Arabic Manipulated and Fake News
COLING 2020
Automatic Detection of Machine Generated Text: A Critical Survey
COLING 2020
Toward Micro-Dialect Identification in Diaglossic and Code-Switched Environments
EMNLP 2020
One Model to Pronounce Them All: Multilingual Grapheme-to-Phoneme Conversion With a Transformer Ensemble
ACL 2020
UBC-NLP at SemEval-2019 Task 6: Ensemble Learning of Offensive Content With Enhanced Training Data
SEMEVAL 2019
No Army, No Navy: BERT Semi-Supervised Learning of Arabic Dialects
ACL 2019
UBC-NLP at SemEval-2019 Task 4: Hyperpartisan News Detection With Attention-Based Bi-LSTMs
SEMEVAL 2019
Neural Machine Translation of Low-Resource and Similar Languages with Backtranslation
ACL 2019
SPEAK YOUR MIND! Towards Imagined Speech Recognition with Hierarchical Deep Learning
INTERSPEECH 2019
UBC-NLP at IEST 2018: Learning Implicit Emotion With an Ensemble of Language Models
EMNLP 2018
Enabling Deep Learning of Emotion With First-Person Seed Expressions
NAACL 2018
Deep Models for Arabic Dialect Identification on Benchmarked Data
COLING 2018
EmoNet: Fine-Grained Emotion Detection with Gated Recurrent Neural Networks
ACL 2017
Does ‘well-being’ translate on Twitter?
EMNLP 2016
Subjectivity and Sentiment Analysis of Modern Standard Arabic
ACL 2011