conftrace_

Muhammad Abdul-Mageed

105 papers · 2011–2026 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+15 more ↓

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🌍 Conference Polyglot (9)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏃 Academic Marathon (14) 🏠 Conference Loyalist (37) 🤝 Dynamic Duo (28) 👥 Mega-Team (44) 🔬 Deep Specialist (40) 🧬 Topic Evolution 🏆 Keyword Champion (3) ❓ The Questioner (4) 🗃️ Keyword Collector (290) 💎 Century Club (104) 🔥 Unstoppable (10) 📈 Trend Setter ⚡ Prolific Year (6)

Conferences

ACL (38) EMNLP (31) NAACL (12) EACL (7) COLING (5) INTERSPEECH (4) IJCNLP (3) SEMEVAL (3) AACL (2)

Top co-authors

AbdelRahim Elmadany (28) El Moatez Billah Nagoudi (26) Chiyu Zhang (15) Fakhraddin Alwajih (13) Ife Adebara (12) Laks Lakshmanan (11) Abdellah El Mekki (11) Md Tawkat Islam Khondaker (9) Gagan Bhatia (9) V.S. (8)

Research topics

Linguistics (2) Understanding (1) Applications (1) Education (1)

Keywords

large language model (23) arabic language (19) machine translation (15) multilingual nlp (14) text classification (14) arabic dialect (13) multilingual model (13) low-resource language (12) dialect identification (12) transfer learning (9) african language (8) benchmark evaluation (7) zero-shot learning (7) few-shot learning (7) neural machine translation (7) dialectal arabic (7) natural language processing (6) sentiment analysis (5) speech recognition (5) knowledge distillation (5)

Papers

Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs ACL 2026 Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset EMNLP 2025 EduAdapt: A Question Answer Benchmark Dataset for Evaluating Grade-Level Adaptability in LLMs EMNLP 2025 Voice of a Continent: Mapping Africa’s Speech Technology Frontier EMNLP 2025 NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities EMNLP 2025 Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models EMNLP 2025 PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture EMNLP 2025 NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task EMNLP 2025 Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks NAACL 2025 Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs NAACL 2025 JAWAHER: A Multidialectal Dataset of Arabic Proverbs for LLM Benchmarking NAACL 2025 uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes NAACL 2025 Where Are We? Evaluating LLM Performance on African Languages ACL 2025 Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs ACL 2025 AraHealthQA 2025: The First Shared Task on Arabic Health Question Answering EMNLP 2025 Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic ACL 2024 NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task ACL 2024 WojoodNER 2024: The Second Arabic Named Entity Recognition Shared Task ACL 2024 Fumbling in Babel: An Investigation into ChatGPT’s Language Identification Ability NAACL 2024 Distilling Text Style Transfer With Self-Explanation From LLMs NAACL 2024 To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation ACL 2024 Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks ACL 2024 Cheetah: Natural Language Generation for 517 African Languages ACL 2024 Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts ACL 2024 LLM Performance Predictors are good initializers for Architecture Search ACL 2024 FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models ACL 2024 Toucan: Many-to-Many Translation for 150 African Language Pairs ACL 2024 Towards Zero-Shot Text-To-Speech for Arabic Dialects ACL 2024 Arabic Automatic Story Generation with Large Language Models ACL 2024 John vs. Ahmed: Debate-Induced Bias in Multilingual LLMs ACL 2024 Qalam: A Multimodal LLM for Arabic Optical Character and Handwriting Recognition ACL 2024 Casablanca: Data and Models for Multidialectal Arabic Speech Recognition EMNLP 2024 DetoxLLM: A Framework for Detoxification with Explanations EMNLP 2024 Interplay of Machine Translation, Diacritics, and Diacritization NAACL 2024 What Does it Take to Generalize SER Model Across Datasets? A Comprehensive Benchmark INTERSPEECH 2024 LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions EACL 2024 On the Utility of Pretraining Language Models on Synthetic Data ACL 2024 Benchmarking LLaMA-3 on Arabic Language Generation Tasks ACL 2024 Gazelle: An Instruction Dataset for Arabic Writing Assistance EMNLP 2024 From Nile Sands to Digital Hands: Machine Translation of Coptic Texts ACL 2024 SERENGETI: Massively Multilingual Language Models for Africa ACL 2023 PACT: Pretraining with Adversarial Contrastive Learning for Text Classification AACL 2023 ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting AACL 2023 Contrastive Learning of Sociopragmatic Meaning in Social Media ACL 2023 AutoMoE: Heterogeneous Mixture-of-Experts with Adaptive Computation for Efficient Neural Machine Translation ACL 2023 ORCA: A Challenging Benchmark for Arabic Language Understanding ACL 2023 UBC-DLNLP at SemEval-2023 Task 12: Impact of Transfer Learning on African Sentiment Analysis ACL 2023 Small Character Models Match Large Word Models for Autocomplete Under Memory Constraints ACL 2023 Cross-Platform and Cross-Domain Abusive Language Detection with Supervised Contrastive Learning ACL 2023 Improving Neural Machine Translation of Indigenous Languages with Multilingual Transfer Learning EACL 2023 SIDLR: Slot and Intent Detection Models for Low-Resource Language Varieties EACL 2023 GPTAraEval: A Comprehensive Evaluation of ChatGPT on Arabic NLP EMNLP 2023 The Skipped Beat: A Study of Sociopragmatic Understanding in LLMs for 64 Languages EMNLP 2023 JASMINE: Arabic GPT Models for Few-Shot Learning EMNLP 2023 Dolphin: A Challenging and Diverse Benchmark for Arabic NLG EMNLP 2023 Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder EMNLP 2023 TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties EMNLP 2023 Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction EMNLP 2023 Octopus: A Multitask Model and Toolkit for Arabic Natural Language Generation EMNLP 2023 Arabic Fine-Grained Entity Recognition EMNLP 2023 VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System EMNLP 2023 NADI 2023: The Fourth Nuanced Arabic Dialect Identification Shared Task EMNLP 2023 WojoodNER 2023: The First Arabic Named Entity Recognition Shared Task EMNLP 2023 ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting IJCNLP 2023 PACT: Pretraining with Adversarial Contrastive Learning for Text Classification IJCNLP 2023 N-Shot Benchmarking of Whisper on Diverse Arabic Speech Recognition INTERSPEECH 2023 On the Robustness of Arabic Speech Dialect Identification INTERSPEECH 2023 UBC-DLNLP at SemEval-2023 Task 12: Impact of Transfer Learning on African Sentiment Analysis SEMEVAL 2023 A Benchmark Study of Contrastive Learning for Arabic Social Meaning EMNLP 2022 AfroLID: A Neural Language Identification Tool for African Languages EMNLP 2022 Linguistically-Motivated Yorùbá-English Machine Translation COLING 2022 AraT5: Text-to-Text Transformers for Arabic Language Generation ACL 2022 Improving Social Meaning Detection with Pragmatic Masking and Surrogate Fine-Tuning ACL 2022 Automatic Detection of Entity-Manipulated Text using Factual Knowledge ACL 2022 Towards Afrocentric NLP for African Languages: Where We Are and Where We Can Go ACL 2022 NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task EMNLP 2022 ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic ACL 2021 Investigating Code-Mixed Modern Standard Arabic-Egyptian to English Machine Translation NAACL 2021 NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task EACL 2021 AraStance: A Multi-Country and Multi-Domain Dataset of Arabic Stance Detection for Fact Checking NAACL 2021 Exploring Text-to-Text Transformers for English to Hinglish Machine Translation with Synthetic Code-Mixing NAACL 2021 IndT5: A Text-to-Text Transformer for 10 Indigenous Languages NAACL 2021 DiaLex: A Benchmark for Evaluating Multidialectal Arabic Word Embeddings EACL 2021 Improving Similar Language Translation With Transfer Learning EMNLP 2021 Machine Translation of Low-Resource Indo-European Languages EMNLP 2021 Mega-COV: A Billion-Scale Dataset of 100+ Languages for COVID-19 EACL 2021 Self-Training Pre-Trained Language Models for Zero- and Few-Shot Multi-Dialectal Arabic Sequence Labeling EACL 2021 ARBERT & MARBERT: Deep Bidirectional Transformers for Arabic IJCNLP 2021 Growing Together: Modeling Human Language Learning With n-Best Multi-Checkpoint Machine Translation ACL 2020 NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task COLING 2020 Machine Generation and Detection of Arabic Manipulated and Fake News COLING 2020 Automatic Detection of Machine Generated Text: A Critical Survey COLING 2020 Toward Micro-Dialect Identification in Diaglossic and Code-Switched Environments EMNLP 2020 One Model to Pronounce Them All: Multilingual Grapheme-to-Phoneme Conversion With a Transformer Ensemble ACL 2020 UBC-NLP at SemEval-2019 Task 6: Ensemble Learning of Offensive Content With Enhanced Training Data SEMEVAL 2019 No Army, No Navy: BERT Semi-Supervised Learning of Arabic Dialects ACL 2019 UBC-NLP at SemEval-2019 Task 4: Hyperpartisan News Detection With Attention-Based Bi-LSTMs SEMEVAL 2019 Neural Machine Translation of Low-Resource and Similar Languages with Backtranslation ACL 2019 SPEAK YOUR MIND! Towards Imagined Speech Recognition with Hierarchical Deep Learning INTERSPEECH 2019 UBC-NLP at IEST 2018: Learning Implicit Emotion With an Ensemble of Language Models EMNLP 2018 Enabling Deep Learning of Emotion With First-Person Seed Expressions NAACL 2018 Deep Models for Arabic Dialect Identification on Benchmarked Data COLING 2018 EmoNet: Fine-Grained Emotion Detection with Gated Recurrent Neural Networks ACL 2017 Does ‘well-being’ translate on Twitter? EMNLP 2016 Subjectivity and Sentiment Analysis of Modern Standard Arabic ACL 2011