Alham Fikri Aji
85 papers · 2017–2026 · 9 top CS/AI conferences
Achievements
🌍 Conference Polyglot (9)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (13)
🏃 Academic Marathon (8)
🐣 Hot Topic Early Bird
🐝 Cross-Pollinator (8)
🏠 Conference Loyalist (27)
🔬 Deep Specialist (31)
🧬 Topic Evolution
🏆 Keyword Champion (19)
🤝 Dynamic Duo (20)
👥 Mega-Team (92)
🗃️ Keyword Collector (305)
❓ The Questioner (6)
⚡ Prolific Year (14)
💎 Century Club (81)
🔥 Unstoppable (9)
📈 Trend Setter
Conferences
ACL (28)
EMNLP (27)
COLING (8)
IJCNLP (6)
NAACL (6)
AACL (4)
EACL (3)
SEMEVAL (2)
NeurIPS (1)
Top co-authors
Research topics
Keywords
multilingual nlp (19)
large language model (18)
low-resource language (17)
cross-lingual transfer (16)
neural machine translation (10)
machine translation (8)
text classification (8)
multilingual model (8)
multilingual language model (6)
zero-shot learning (6)
model compression (6)
knowledge distillation (6)
stochastic gradient descent (5)
distributed learning (4)
transfer learning (4)
zero-shot prompting (4)
prompt engineering (4)
language model (4)
multi-label classification (3)
efficient inference (3)
Papers
Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI
ACL 2026
Macaron: Controlled, Human-Written Benchmark for Multilingual and Multicultural Reasoning via Template-Filling
ACL 2026
Afri-MCQA: Multimodal Cultural Question Answering for African Languages
ACL 2026
Multilingual Iterative Model Pruning: What Matters?
AACL 2025
Unveiling the Influence of Amplifying Language-Specific Neurons
AACL 2025
SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
SEMEVAL 2025
Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning
NAACL 2025
MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding
NAACL 2025
Thank You, Stingray: Multilingual Large Language Models Can Not (Yet) Disambiguate Cross-Lingual Word Senses
NAACL 2025
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
NAACL 2025
Unveiling the Influence of Amplifying Language-Specific Neurons
IJCNLP 2025
Multilingual Iterative Model Pruning: What Matters?
IJCNLP 2025
Language Surgery in Multilingual Large Language Models
EMNLP 2025
Entropy2Vec: Crosslingual Language Modeling Entropy as End-to-End Learnable Language Representations
EMNLP 2025
MoMentS: A Comprehensive Multimodal Benchmark for Theory of Mind
EMNLP 2025
Style Over Substance: Evaluation Biases for Large Language Models
COLING 2025
Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
EMNLP 2025
From Surveys to Narratives: Rethinking Cultural Value Adaptation in LLMs
EMNLP 2025
Data Laundering: Artificially Boosting Benchmark Results through Knowledge Distillation
ACL 2025
BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages
ACL 2025
KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan
ACL 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
ACL 2025
Do Language Models Understand Honorific Systems in Javanese?
ACL 2025
NusaAksara: A Multimodal and Multilingual Benchmark for Preserving Indonesian Indigenous Scripts
ACL 2025
Statement-Tuning Enables Efficient Cross-lingual Generalization in Encoder-only Models
ACL 2025
A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information
ACL 2025
SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
ACL 2025
LORAXBENCH: A Multitask, Multilingual Benchmark Suite for 20 Indonesian Languages
EMNLP 2025
WangchanThaiInstruct: An instruction-following Dataset for Culture-Aware, Multitask, and Multi-domain Evaluation in Thai
EMNLP 2025
NusaDialogue: Dialogue Summarization and Generation for Underrepresented and Extremely Low-Resource Languages
COLING 2025
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
EMNLP 2025
From Multiple-Choice to Extractive QA: A Case Study for English and Arabic
COLING 2025
GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human
COLING 2025
A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models
COLING 2024
Cendol: Open Instruction-tuned Generative Large Language Models for Indonesian Languages
ACL 2024
CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark
NeurIPS 2024
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
EACL 2024
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
EACL 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
EMNLP 2024
Towards Measuring and Modeling “Culture” in LLMs: A Survey
EMNLP 2024
Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting
EMNLP 2024
Re-Evaluating Evaluation for Multilingual Summarization
EMNLP 2024
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection
EMNLP 2024
LinguAlchemy: Fusing Typological and Geographical Elements for Unseen Language Generalization
EMNLP 2024
Efficient and Interpretable Grammatical Error Correction with Mixture of Experts
EMNLP 2024
SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages
NAACL 2024
SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages
SEMEVAL 2024
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
IJCNLP 2023
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity
IJCNLP 2023
Direct Fact Retrieval from Knowledge Graphs without Entity Linking
ACL 2023
GlobalBench: A Benchmark for Global Progress in Natural Language Processing
EMNLP 2023
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages
EMNLP 2023
Crosslingual Generalization through Multitask Finetuning
ACL 2023
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
ACL 2023
On “Scientific Debt” in NLP: A Case for More Rigour in Language Model Pre-Training Research
ACL 2023
Multilingual Large Language Models Are Not (Yet) Code-Switchers
EMNLP 2023
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance
EMNLP 2023
WebIE: Faithful and Robust Information Extraction on the Web
ACL 2023
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
AACL 2023
Current Status of NLP in South East Asia with Insights from Multilingualism and Language Diversity
AACL 2023
NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
EACL 2023
Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering
ACL 2023
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
ACL 2023
Multi-lingual and Multi-cultural Figurative Language Understanding
ACL 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges
ACL 2023
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
ACL 2022
A Relation Extraction Dataset for Knowledge Extraction from Web Tables
COLING 2022
Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering
COLING 2022
Towards better structured and less noisy Web data: Oscar with Register annotations
COLING 2022
The University of Edinburgh’s Bengali-Hindi Submissions to the WMT21 News Translation Task
EMNLP 2021
IndoCollex: A Testbed for Morphological Transformation of Indonesian Colloquial Words
ACL 2021
IndoCollex: A Testbed for Morphological Transformation of Indonesian Colloquial Words
IJCNLP 2021
BERT Goes Brrr: A Venture Towards the Lesser Error in Classifying Medical Self-Reporters on Twitter
NAACL 2021
IndoNLI: A Natural Language Inference Dataset for Indonesian
EMNLP 2021
Efficient Machine Translation with Model Pruning and Quantization
EMNLP 2021
In Neural Machine Translation, What Does Transfer Learning Transfer?
ACL 2020
Compressing Neural Machine Translation Models with 4-bit Precision
ACL 2020
Edinburgh’s Submissions to the 2020 Machine Translation Efficiency Task
ACL 2020
Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training
EMNLP 2019
Making Asynchronous Stochastic Gradient Descent Work for Transformers
EMNLP 2019
Combining Global Sparse Gradients with Local Gradients in Distributed Neural Network Training
IJCNLP 2019
From Research to Production and Back: Ludicrously Fast Neural Machine Translation
EMNLP 2019
Accelerating Asynchronous Stochastic Gradient Descent for Neural Machine Translation
EMNLP 2018
Marian: Fast Neural Machine Translation in C++
ACL 2018
Sparse Communication for Distributed Gradient Descent
EMNLP 2017