Nizar Habash
154 papers · 2003–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (12) π Conference Polyglot (10)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Academic Marathon
(22)
π
Keyword Trendsetter Combo
(9)
π
Conference Loyalist
(41)
π¬
Deep Specialist
(38)
π
Keyword Champion
(8)
π₯
Mega-Team
(62)
π€
Dynamic Duo
(20)
π
Century Club
(149)
ποΈ
Keyword Collector
(318)
β‘
Prolific Year
(17)
π₯
Unstoppable
(21)
π
Trend Setter
β
The Questioner
Conferences
ACL (42)
EMNLP (36)
COLING (27)
NAACL (18)
EACL (16)
IJCNLP (6)
CONLL (5)
SEMEVAL (2)
INTERSPEECH (1)
MLHC (1)
Top co-authors
Research topics
Keywords
arabic language
(34)
morphological analysis
(25)
text classification
(20)
arabic dialect
(13)
machine translation
(13)
natural language processing
(11)
dialect identification
(11)
part-of-speech tagging
(10)
multilingual nlp
(8)
readability assessment
(8)
arabic nlp
(8)
sequence-to-sequence model
(8)
morphological disambiguation
(7)
low-resource language
(7)
large language model
(7)
dependency parsing
(7)
dialectal arabic
(7)
shared task
(6)
grammatical error correction
(6)
automatic speech recognition
(5)
Papers
Do Diacritics Matter? Evaluating the Impact of Arabic Diacritics on Tokenization and LLM Benchmarks
EACL 2026
Computational Benchmarks for Egyptian Arabic Child Directed Speech
EACL 2026
A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic
EACL 2026
Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI
ACL 2026
Cross-Lingual Empirical Evaluation of Large Language Models for Arabic Medical Tasks
EACL 2026
ARWI: Arabic Write and Improve
NAACL 2025
The Impact of Code-switched Synthetic Data Quality is Task Dependent: Insights from MT and ASR
NAACL 2025
Enhancing Text Editing for Grammatical Error Correction: Arabic as a Case Study
ACL 2025
A Large and Balanced Corpus for Fine-grained Arabic Readability Assessment
ACL 2025
A Derivational ChainBank for Modern Standard Arabic
COLING 2025
GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human
COLING 2025
MedArabiQ: Benchmarking Large Language Models on Arabic Medical Tasks
MLHC 2025
NADI 2025: The First Multidialectal Arabic Speech Processing Shared Task
EMNLP 2025
BAREC Shared Task 2025 on Arabic Readability Assessment
EMNLP 2025
AraHealthQA 2025: The First Shared Task on Arabic Health Question Answering
EMNLP 2025
BALSAM: A Platform for Benchmarking Arabic Large Language Models
EMNLP 2025
Evaluating Prompt Relevance in Arabic Automatic Essay Scoring: Insights from Synthetic and Real-World Data
EMNLP 2025
Lemmatizing Dialectal Arabic with Sequence-to-Sequence Models
EMNLP 2025
Data Augmentation for Maltese NLP using Transliterated and Machine Translated Arabic Data
EMNLP 2025
Radical Allomorphy: Phonological Surface Forms without Phonology
EMNLP 2025
BAREC Demo: Resources and Tools for Sentence-level Arabic Readability Assessment
EMNLP 2025
Lemmatization as a Classification Task: Results from Arabic across Multiple Genres
EMNLP 2025
The Arabic Generality Score: Another Dimension of Modeling Arabic Dialectness
EMNLP 2025
A Survey of Code-switched Arabic NLP: Progress, Challenges, and Future Directions
COLING 2025
Proper Noun Diacritization for Arabic Wikipedia: A Benchmark Dataset
ACL 2025
Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection
ACL 2025
Guidelines for Fine-grained Sentence-level Arabic Readability Annotation
ACL 2025
From Multiple-Choice to Extractive QA: A Case Study for English and Arabic
COLING 2025
Beyond Cairo: Saβidi Egyptian Arabic Literary Corpus Construction and Analysis
NAACL 2025
Strategies for Arabic Readability Modeling
ACL 2024
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection
EMNLP 2024
M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
ACL 2024
Arabic Diacritics in the Wild: Exploiting Opportunities for Improved Diacritization
ACL 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
ACL 2024
Exploiting Dialect Identification in Automatic Dialectal Text Normalization
ACL 2024
The FIGNEWS Shared Task on News Media Narratives
ACL 2024
NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task
ACL 2024
Investigating Gender Bias in STEM Job Advertisements
ACL 2024
Computational Morphology and Lexicography Modeling of Modern Standard Arabic Nominals
EACL 2024
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
EACL 2024
Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching
EACL 2024
Camel Morph MSA: A Large-Scale Open-Source Morphological Analyzer for Modern Standard Arabic
COLING 2024
EMAD: A Bridge Tagset for Unifying Arabic POS Annotations
COLING 2024
Palmyra 3.0: A User-Friendly Cloud-Based Platform for Morphology and Dependency Syntax Annotation
COLING 2024
The SAMER Arabic Text Simplification Corpus
COLING 2024
ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus
COLING 2024
Exploring Segmentation Approaches for Neural Machine Translation of Code-Switched Egyptian Arabic-English Text
EACL 2023
NADI 2023: The Fourth Nuanced Arabic Dialect Identification Shared Task
EMNLP 2023
CamelParser2.0: A State-of-the-Art Dependency Parser for Arabic
EMNLP 2023
Data Augmentation Techniques for Machine Translation of Code-Switched Texts: A Comparative Study
EMNLP 2023
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation
EMNLP 2023
Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation
EACL 2023
Exploring the Impact of Transliteration on NLP Performance: Treating Maltese as an Arabic Dialect
ACL 2023
Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator
NAACL 2022
Morphosyntactic Tagging with Pre-trained Language Models for Arabic and its Dialects
ACL 2022
Maknuune: A Large Open Palestinian Arabic Lexicon
EMNLP 2022
The Shared Task on Gender Rewriting
EMNLP 2022
ArzEn-ST: A Three-way Speech Translation Corpus for Code-Switched Egyptian Arabic-English
EMNLP 2022
NADI 2022: The Third Nuanced Arabic Dialect Identification Shared Task
EMNLP 2022
AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization
EMNLP 2022
Camelira: An Arabic Multi-Dialect Morphological Disambiguator
EMNLP 2022
Arabic Word-level Readability Visualization for Assisted Text Simplification
EMNLP 2022
Arabic Natural Language Processing
EMNLP 2022
User-Centric Gender Rewriting
NAACL 2022
Automatic Error Type Annotation for Arabic
EMNLP 2021
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
ACL 2021
Automatic Error Type Annotation for Arabic
CONLL 2021
A View From the Crowd: Evaluation Challenges for Time-Offset Interaction Applications
EACL 2021
The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models
EACL 2021
Automatic Romanization of Arabic Bibliographic Records
EACL 2021
NADI 2021: The Second Nuanced Arabic Dialect Identification Shared Task
EACL 2021
SIGMORPHON 2021 Shared Task on Morphological Reinflection: Generalization Across Languages
IJCNLP 2021
Gender-Aware Reinflection using Linguistically Enhanced Neural Models
COLING 2020
An Online Readability Leveled Arabic Thesaurus
COLING 2020
Utilizing Subword Entities in Character-Level Sequence-to-Sequence Lemmatization Models
COLING 2020
Multitask Easy-First Dependency Parsing: Exploiting Complementarities of Different Dependency Representations
COLING 2020
The Paradigm Discovery Problem
ACL 2020
NADI 2020: The First Nuanced Arabic Dialect Identification Shared Task
COLING 2020
Joint Diacritization, Lemmatization, Normalization, and Fine-Grained Morphological Tagging
ACL 2020
A Unified Model for Arabizi Detection and Transliteration using Sequence-to-Sequence Models
COLING 2020
PALMYRA 2.0: A Configurable Multilingual Platform Independent Tool for Morphology and Syntax Annotation
COLING 2020
A Little Linguistics Goes a Long Way: Unsupervised Segmentation with Limited Language Specific Guidance
ACL 2019
The Effectiveness of Simple Hybrid Systems for Hypernym Discovery
ACL 2019
Automatic Gender Identification and Reinflection in Arabic
ACL 2019
The MADAR Shared Task on Arabic Fine-Grained Dialect Identification
ACL 2019
ADIDA: Automatic Dialect Identification for Arabic
NAACL 2019
Morphologically Annotated Corpora for Seven Arabic Dialects: Taizi, Sanaani, Najdi, Jordanian, Syrian, Iraqi and Moroccan
ACL 2019
Towards Variability Resistant Dialectal Speech Evaluation
INTERSPEECH 2019
Adversarial Multitask Learning for Joint Multi-Feature and Multi-Dialect Morphological Modeling
ACL 2019
Improving Domain Independent Question Parsing with Synthetic Treebanks
COLING 2018
Fine-Grained Arabic Dialect Identification
COLING 2018
Noise-Robust Morphological Disambiguation for Dialectal Arabic
NAACL 2018
Addressing Noise in Multidialectal Word Embeddings
ACL 2018
Feature Optimization for Predicting Readability of Arabic L1 and L2
ACL 2018
Utilizing Character and Word Embeddings for Text Normalization with Sequence-to-Sequence Models
EMNLP 2018
Complementary Strategies for Low Resourced Morphological Modeling
EMNLP 2018
An Arabic Morphological Analyzer and Generator with Copious Features
EMNLP 2018
A Cross-lingual Messenger with Keyword Searchable Phrases for the Travel Domain
COLING 2018
A Parallel Corpus for Evaluating Machine Translation between Arabic and European Languages
EACL 2017
OMAM at SemEval-2017 Task 4: Evaluation of English State-of-the-Art Sentiment Analysis Models for Arabic and a New Topic-based Model
SEMEVAL 2017
Donβt Throw Those Morphological Analyzers Away Just Yet: Neural Morphological Disambiguation for Arabic
EMNLP 2017
CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies
CONLL 2017
OMAM at SemEval-2017 Task 4: English Sentiment Analysis with Conditional Random Fields
SEMEVAL 2017
Creating Resources for Dialectal Arabic from a Single Annotation: A Case Study on Egyptian and Levantine
COLING 2016
CamelParser: A system for Arabic Syntactic Analysis and Morphological Disambiguation
COLING 2016
YAMAMA: Yet Another Multi-Dialect Arabic Morphological Analyzer
COLING 2016
Botta: An Arabic Dialect Chatbot
COLING 2016
Machine Translation Evaluation for Arabic using Morphologically-enriched Embeddings
COLING 2016
Improving Arabic Diacritization through Syntactic Analysis
EMNLP 2015
Predicting the Structure of Cooking Recipes
EMNLP 2015
Unsupervised Morphology-Based Vocabulary Expansion
ACL 2014
The Illinois-Columbia System in the CoNLL-2014 Shared Task
CONLL 2014
Sentence Level Dialect Identification for Machine Translation System Selection
ACL 2014
Automatic Transliteration of Romanized Dialectal Arabic
CONLL 2014
Natural Language Processing of Arabic and its Dialects
EMNLP 2014
Generalized Character-Level Spelling Error Correction
ACL 2014
A Web-based Annotation Framework For Large-Scale Text Correction
IJCNLP 2013
DIRA: Dialectal Arabic Information Retrieval Assistant
IJCNLP 2013
Dialectal Arabic to English Machine Translation: Pivoting through Modern Standard Arabic
NAACL 2013
Morphological Analysis and Disambiguation for Dialectal Arabic
NAACL 2013
Automatic Morphological Enrichment of a Morphologically Underspecified Treebank
NAACL 2013
Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages
EMNLP 2013
SPMRLβ13 Shared Task System: The CADIM Arabic Dependency Parser
EMNLP 2013
Automatic Extraction of Morphological Lexicons from Morphologically Annotated Corpora
EMNLP 2013
Orthographic and Morphological Processing for Persian-to-English Statistical Machine Translation
IJCNLP 2013
Selective Combination of Pivot and Direct Statistical Machine Translation Models
IJCNLP 2013
Reranking with Linguistic and Semantic Features for Arabic Optical Character Recognition
ACL 2013
Language Independent Connectivity Strength Features for Phrase Pivot Statistical Machine Translation
ACL 2013
Processing Spontaneous Orthography
NAACL 2013
Elissa: A Dialectal to Standard Arabic Machine Translation System
COLING 2012
Identifying Broken Plurals, Irregular Gender, and Rationality in Arabic Text
EACL 2012
Arabic Dialect Processing Tutorial
NAACL 2012
Using Deep Morphology to Improve Automatic Error Detection in Arabic Handwriting Recognition
ACL 2011
Improving Arabic Dependency Parsing with Form-based and Functional Morphological Features
ACL 2011
A Corpus for Modeling Morpho-Syntactic Agreement in Arabic: Gender, Number and Rationality
ACL 2011
Improving Arabic-to-English Statistical Machine Translation by Reordering Post-Verbal Subjects for Alignment
ACL 2010
CATiB: The Columbia Arabic Treebank
ACL 2009
CATiB: The Columbia Arabic Treebank
IJCNLP 2009
Improving the Arabic Pronunciation Dictionary for Phone and Word Recognition with Linguistically-Based Pronunciation Rules
NAACL 2009
Arabic Morphological Tagging, Diacritization, and Lemmatization Using Lexeme Models and Feature Ranking
ACL 2008
Four Techniques for Online Handling of Out-of-Vocabulary Words in Arabic-English Statistical Machine Translation
ACL 2008
Combination of Statistical Word Alignments Based on Multiple Preprocessing Schemes
NAACL 2007
Arabic Dialect Processing Tutorial
NAACL 2007
Arabic Diacritization through Full Morphological Tagging
NAACL 2007
Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features
EMNLP 2007
Determining Case in Arabic: Learning Complex Linguistic Behavior Requires Complex Linguistic Features
CONLL 2007
MAGEAD: A Morphological Analyzer and Generator for the Arabic Dialects
COLING 2006
MAGEAD: A Morphological Analyzer and Generator for the Arabic Dialects
ACL 2006
Combination of Arabic Preprocessing Schemes for Statistical Machine Translation
COLING 2006
Combination of Arabic Preprocessing Schemes for Statistical Machine Translation
ACL 2006
Parsing Arabic Dialects
EACL 2006
Arabic Preprocessing Schemes for Statistical Machine Translation
NAACL 2006
Arabic Tokenization, Part-of-Speech Tagging and Morphological Disambiguation in One Fell Swoop
ACL 2005
A Categorial Variation Database for English
NAACL 2003