Hamdy Mubarak
47 papers · 2014–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🏃 Academic Marathon (11) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (9) 🐝 Cross-Pollinator (7)
🌍
Conference Polyglot
(9)
🏃
Academic Marathon
(11)
🌈
Renaissance Researcher
(8)
👥
Mega-Team
(43)
🤝
Dynamic Duo
(21)
🔬
Deep Specialist
(15)
🧬
Topic Evolution
⚡
Prolific Year
(7)
💎
Century Club
(45)
🔥
Unstoppable
(7)
🗃️
Keyword Collector
(156)
Conferences
EMNLP (13)
EACL (10)
ACL (7)
COLING (6)
SEMEVAL (5)
IJCNLP (2)
NAACL (2)
CONLL (1)
INTERSPEECH (1)
Top co-authors
Research topics
Keywords
text classification
(13)
arabic language
(11)
large language model
(9)
social media
(5)
offensive language detection
(5)
dialect identification
(5)
deep neural network
(4)
neural network
(4)
hate speech detection
(4)
transformer model
(4)
offensive language identification
(3)
automatic speech recognition
(3)
arabic diacritization
(3)
dialectal arabic
(3)
social media analysis
(3)
benchmark evaluation
(3)
sentiment analysis
(2)
hallucination detection
(2)
zero-shot learning
(2)
multilingual nlp
(2)
Papers
Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA
ACL 2026
Nahw: A Comprehensive Benchmark of Arabic Grammar Understanding, Error Detection, Correction, and Explanation
EACL 2026
BALSAM: A Platform for Benchmarking Arabic Large Language Models
EMNLP 2025
PalmX 2025: The First Shared Task on Benchmarking LLMs on Arabic and Islamic Culture
EMNLP 2025
DialG2P: Dialectal Grapheme-to-Phoneme. Arabic as a Case Study
EMNLP 2025
Advancing Arabic Diacritization: Improved Datasets, Benchmarking, and State-of-the-Art Models
EMNLP 2025
IslamicEval 2025: The First Shared Task of Capturing LLMs Hallucination in Islamic Content
EMNLP 2025
AraSafe: Benchmarking Safety in Arabic LLMs
EMNLP 2025
ArabicWeb-Edu: Educational Quality Data for Arabic LLM Training
EMNLP 2025
LAraBench: Benchmarking Arabic AI with Large Language Models
EACL 2024
Halwasa: Quantify and Analyze Hallucinations in Large Language Models: Arabic as a Case Study
COLING 2024
So Hateful! Building a Multi-Label Hate Speech Annotated Arabic Dataset
COLING 2024
Beyond Orthography: Automatic Recovery of Short Vowels and Dialectal Sounds in Arabic
ACL 2024
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
ACL 2024
Wikidata as a Source of Demographic Information
ACL 2024
LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking
EACL 2024
ArAIEval Shared Task: Persuasion Techniques and Disinformation Detection in Arabic Text
EMNLP 2023
QVoice: Arabic Speech Pronunciation Learning Application
INTERSPEECH 2023
Overview of the WANLP 2022 Shared Task on Propaganda Detection in Arabic
EMNLP 2022
ArabGend: Gender Analysis and Inference on Arabic Twitter
COLING 2022
NatiQ: An End-to-end Text-to-Speech System for Arabic
EMNLP 2022
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus
IJCNLP 2021
QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus
ACL 2021
ASAD: Arabic Social media Analytics and unDerstanding
EACL 2021
ArCorona: Analyzing Arabic Tweets in the Early Days of Coronavirus (COVID-19) Pandemic
EACL 2021
QADI: Arabic Dialect Identification in the Wild
EACL 2021
Arabic Offensive Language on Twitter: Analysis and Experiments
EACL 2021
Adult Content Detection on Arabic Twitter: Analysis and Experiments
EACL 2021
UL2C: Mapping User Locations to Countries on Arabic Twitter
EACL 2021
Fighting the COVID-19 Infodemic: Modeling the Perspective of Journalists, Fact-Checkers, Social Media Platforms, Policy Makers, and the Society
EMNLP 2021
Arabic Curriculum Analysis
COLING 2020
ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media
SEMEVAL 2020
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)
COLING 2020
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)
SEMEVAL 2020
ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media
COLING 2020
A System for Diacritizing Four Varieties of Arabic
EMNLP 2019
A System for Diacritizing Four Varieties of Arabic
IJCNLP 2019
POS Tagging for Improving Code-Switching Identification in Arabic
ACL 2019
Highly Effective Arabic Diacritization using Sequence to Sequence Modeling
NAACL 2019
QC-GO Submission for MADAR Shared Task: Arabic Fine-Grained Dialect Identification
ACL 2019
Learning from Relatives: Unified Dialectal Arabic Segmentation
CONLL 2017
SemEval-2017 Task 3: Community Question Answering
SEMEVAL 2017
QCRI Live Speech Translation System
EACL 2017
SemEval-2016 Task 3: Community Question Answering
SEMEVAL 2016
Farasa: A Fast and Furious Segmenter for Arabic
NAACL 2016
QCRI: Answer Selection for Community Question Answering - Experiments for Arabic and English
SEMEVAL 2015
Verifiably Effective Arabic Dialect Identification
EMNLP 2014