Khalid Almubarak
11 papers · 2022–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (6) πΊοΈ Taxonomy Completionist (22) π£ Hot Topic Early Bird
π
Conference Polyglot
(3)
π₯
Mega-Team
(54)
π
Century Club
(11)
ποΈ
Keyword Collector
(56)
Conferences
ACL (8)
EMNLP (2)
NIPS (1)
Top co-authors
Keywords
large language model
(7)
arabic language
(5)
instruction tuning
(2)
prompt engineering
(2)
cross-lingual transfer
(2)
multilingual nlp
(2)
multilingual language model
(2)
knowledge distillation
(1)
image retrieval
(1)
benchmark evaluation
(1)
language model training
(1)
language model evaluation
(1)
cross-modal learning
(1)
responsible ai
(1)
question answering
(1)
text classification
(1)
benchmark dataset
(1)
continued pretraining
(1)
zero-shot generalization
(1)
multitask learning
(1)
Papers
Mind the Gap: A Review of Arabic Post-Training Datasets and Their Limitations
EMNLP 2025
Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion
ACL 2025
BALSAM: A Platform for Benchmarking Arabic Large Language Models
EMNLP 2025
Commonsense Reasoning in Arab Culture
ACL 2025
CIDAR: Culturally Relevant Instruction Dataset For Arabic
ACL 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
ACL 2024
AraCLIP: Cross-Lingual Learning for Effective Arabic Image Retrieval
ACL 2024
Crosslingual Generalization through Multitask Finetuning
ACL 2023
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
ACL 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NIPS 2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
ACL 2022