Saied Alshahrani
10 papers · 2022–2025 · 4 top CS/AI conferences
Achievements
Hot Topic Early Bird
Interdisciplinary Bridge
Taxonomy Completionist
(18)
Keyword Pioneer
Conference Polyglot
(4)
Cross-Pollinator
(6)
Mega-Team
(43)
Century Club
(10)
Keyword Collector
(51)
Conferences
ACL (4)
EMNLP (4)
COLING (1)
EACL (1)
Keywords
arabic language
(4)
large language model
(4)
corpus quality
(3)
multilingual nlp
(2)
arabic wikipedia
(2)
instruction tuning
(2)
dataset evaluation
(1)
data quality
(1)
arabic dialect
(1)
arabic language model
(1)
adversarial defense
(1)
adversarial example
(1)
bert model
(1)
data contamination
(1)
vocabulary learning
(1)
adversarial attack
(1)
instruction dataset
(1)
word representation
(1)
gender bias
(1)
word embedding
(1)
Papers
Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
ACL 2025
Mind the Gap: A Review of Arabic Post-Training Datasets and Their Limitations
EMNLP 2025
BALSAM: A Platform for Benchmarking Arabic Large Language Models
EMNLP 2025
Arabic Synonym BERT-based Adversarial Examples for Text Classification
EACL 2024
CIDAR: Culturally Relevant Instruction Dataset For Arabic
ACL 2024
Leveraging Corpus Metadata to Detect Template-based Translation: An Exploratory Case Study of the Egyptian Arabic Wikipedia Edition
COLING 2024
Performance Implications of Using Unrepresentative Corpora in Arabic Natural Language Processing
EMNLP 2023
DEPTH+: An Enhanced Depth Metric for Wikipedia Corpora Quality
ACL 2023
Learning From Arabic Corpora But Not Always From Arabic Speakers: A Case Study of the Arabic Wikipedia Editions
EMNLP 2022
Roadblocks in Gender Bias Measurement for Diachronic Corpora
ACL 2022