Hamdy Mubarak

47 papers · 2014–2026 · 9 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🏃 Academic Marathon (11) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (7)

🌍 Conference Polyglot (9) 🏃 Academic Marathon (11) 🌈 Renaissance Researcher (8) 👥 Mega-Team (43) 🤝 Dynamic Duo (21) 🔬 Deep Specialist (15) 🧬 Topic Evolution ⚡ Prolific Year (7) 💎 Century Club (45) 🔥 Unstoppable (7) 🗃️ Keyword Collector (156)

Conferences

EMNLP (13) EACL (10) ACL (7) COLING (6) SEMEVAL (5) IJCNLP (2) NAACL (2) CONLL (1) INTERSPEECH (1)

Top co-authors

Ahmed Abdelali (21) Kareem Darwish (15) Preslav Nakov (10) Younes Samih (10) Majd Hawasly (9) Firoj Alam (8) Sabit Hassan (7) Ahmed Ali (7) Nadir Durrani (6) Hassan Sajjad (6)

Research topics

Education (1)

Keywords

text classification (13) arabic language (11) large language model (9) social media (5) offensive language detection (5) dialect identification (5) deep neural network (4) neural network (4) hate speech detection (4) transformer model (4) offensive language identification (3) automatic speech recognition (3) arabic diacritization (3) dialectal arabic (3) social media analysis (3) benchmark evaluation (3) sentiment analysis (2) hallucination detection (2) zero-shot learning (2) multilingual nlp (2)

Papers

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA ACL 2026 Nahw: A Comprehensive Benchmark of Arabic Grammar Understanding, Error Detection, Correction, and Explanation EACL 2026 BALSAM: A Platform for Benchmarking Arabic Large Language Models EMNLP 2025 PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture EMNLP 2025 DialG2P: Dialectal Grapheme-to-Phoneme. Arabic as a Case Study EMNLP 2025 Advancing Arabic Diacritization: Improved Datasets, Benchmarking, and State-of-the-Art Models EMNLP 2025 IslamicEval 2025: The First Shared Task of Capturing LLMs Hallucination in Islamic Content EMNLP 2025 AraSafe: Benchmarking Safety in Arabic LLMs EMNLP 2025 ArabicWeb-Edu: Educational Quality Data for Arabic LLM Training EMNLP 2025 LAraBench: Benchmarking Arabic AI with Large Language Models EACL 2024 Halwasa: Quantify and Analyze Hallucinations in Large Language Models: Arabic as a Case Study COLING 2024 So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset COLING 2024 Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic ACL 2024 Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification ACL 2024 Wikidata as a Source of Demographic Information ACL 2024 LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking EACL 2024 ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text EMNLP 2023 QVoice: Arabic Speech Pronunciation Learning Application INTERSPEECH 2023 Overview of the WANLP 2022 Shared Task on Propaganda Detection in Arabic EMNLP 2022 ArabGend: Gender Analysis and Inference on Arabic Twitter COLING 2022 NatiQ: An End-to-end Text-to-Speech System for Arabic EMNLP 2022 QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus IJCNLP 2021 QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus ACL 2021 ASAD: Arabic Social media Analytics and unDerstanding EACL 2021 ArCorona: Analyzing Arabic Tweets in the Early Days of Coronavirus (COVID-19) Pandemic EACL 2021 QADI: Arabic Dialect Identification in the Wild EACL 2021 Arabic Offensive Language on Twitter: Analysis and Experiments EACL 2021 Adult Content Detection on Arabic Twitter: Analysis and Experiments EACL 2021 UL2C: Mapping User Locations to Countries on Arabic Twitter EACL 2021 Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society EMNLP 2021 Arabic Curriculum Analysis COLING 2020 ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media SEMEVAL 2020 SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020) COLING 2020 SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020) SEMEVAL 2020 ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media COLING 2020 A System for Diacritizing Four Varieties of Arabic EMNLP 2019 A System for Diacritizing Four Varieties of Arabic IJCNLP 2019 POS Tagging for Improving Code-Switching Identification in Arabic ACL 2019 Highly Effective Arabic Diacritization using Sequence to Sequence Modeling NAACL 2019 QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification ACL 2019 Learning from Relatives: Unified Dialectal Arabic Segmentation CONLL 2017 SemEval-2017 Task 3: Community Question Answering SEMEVAL 2017 QCRI Live Speech Translation System EACL 2017 SemEval-2016 Task 3: Community Question Answering SEMEVAL 2016 Farasa: A Fast and Furious Segmenter for Arabic NAACL 2016 QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English SEMEVAL 2015 Verifiably Effective Arabic Dialect Identification EMNLP 2014