Sebastian Ruder
86 papers · 2016–2025 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+19 more ↓ Show less ↑
π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (18) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (13)
π
Conference Polyglot
(13)
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(18)
π
Keyword Trendsetter Combo
(6)
π
Conference Loyalist
(24)
π
Keyword Champion
π€
Dynamic Duo
(15)
π
Grand Slam
π₯
Mega-Team
(61)
π±
Topic Pioneer
π¬
Deep Specialist
(38)
π§¬
Topic Evolution
β
The Questioner
(5)
π
Century Club
(86)
β‘
Prolific Year
(16)
ποΈ
Keyword Collector
(290)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(10)
Conferences
EMNLP (31)
ACL (24)
ICLR (5)
NAACL (5)
IJCNLP (4)
NIPS (4)
AAAI (3)
SEMEVAL (3)
COLING (2)
EACL (2)
CONLL (1)
ICML (1)
INTERSPEECH (1)
Top co-authors
Research topics
Keywords
cross-lingual transfer
(17)
multilingual nlp
(17)
low-resource language
(16)
transfer learning
(15)
language model
(8)
benchmark evaluation
(7)
zero-shot learning
(7)
large language model
(7)
parameter-efficient fine-tuning
(6)
multilingual model
(6)
multilingual language model
(6)
sentiment analysis
(6)
named entity recognition
(6)
adapter module
(6)
domain adaptation
(5)
question answering
(5)
bilingual lexicon induction
(5)
multi-task learning
(5)
machine translation
(5)
pretrained language model
(5)
Papers
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
ACL 2025
M-RewardBench: Evaluating Reward Models in Multilingual Settings
ACL 2025
AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in Dialectal Arabic
ACL 2025
Arbiters of Ambivalence: Challenges of using LLMs in No-Consensus tasks
ACL 2025
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
NAACL 2024
Understanding and Mitigating Language Confusion in LLMs
EMNLP 2024
LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives
EMNLP 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages
EMNLP 2024
Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization
EMNLP 2024
How Does Quantization Affect Multilingual LLMs?
EMNLP 2024
Connecting Language Technologies with Rich, Diverse Data Sources Covering Thousands of Languages
COLING 2024
BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
NIPS 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
ACL 2024
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages
EMNLP 2023
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning
EMNLP 2023
TaTA: A Multilingual Table-to-Text Dataset for African Languages
EMNLP 2023
XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages
EMNLP 2023
mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations
EMNLP 2023
Romanization-based Large-scale Adaptation of Multilingual Language Models
EMNLP 2023
AfriQA: Cross-lingual Open-Retrieval Question Answering for African Languages
EMNLP 2023
Language models are multilingual chain-of-thought reasoners
ICLR 2023
Empowering Cross-lingual Behavioral Testing of NLP Models with Typological Features
ACL 2023
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
ACL 2023
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
ACL 2023
NusaX: Multilingual Parallel Sentiment Dataset for 10 Indonesian Local Languages
EACL 2023
Evaluating the Diversity, Equity, and Inclusion of NLP Technology: A Case Study for Indian Languages
EACL 2023
SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)
SEMEVAL 2023
Evaluating and Modeling Attribution for Cross-Lingual Question Answering
EMNLP 2023
Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation
ACL 2022
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
EMNLP 2022
Hyper-X: A Unified Hypernetwork for Multi-Task Multilingual Transfer
EMNLP 2022
ExT5: Towards Extreme Multi-Task Scaling for Transfer Learning
ICLR 2022
Charformer: Fast Character Transformers via Gradient-based Subword Tokenization
ICLR 2022
Modular and Parameter-Efficient Fine-Tuning for NLP Models
EMNLP 2022
XTREME-S: Evaluating Cross-lingual Speech Representations
INTERSPEECH 2022
Memorisation versus Generalisation in Pre-trained Language Models
ACL 2022
Square One Bias in NLP: Towards a Multi-Dimensional Exploration of the Research Manifold
ACL 2022
One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia
ACL 2022
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding
ACL 2022
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks
IJCNLP 2021
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
IJCNLP 2021
MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer
EMNLP 2021
Multi-Domain Multilingual Question Answering
EMNLP 2021
Multi-view Subword Regularization
NAACL 2021
Analogy Training Multilingual Encoders
AAAI 2021
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
ACL 2021
Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks
ACL 2021
Mind the Gap: Assessing Temporal Generalization in Neural Language Models
NIPS 2021
Compacter: Efficient Low-Rank Hypercomplex Adapter Layers
NIPS 2021
Long Range Arena : A Benchmark for Efficient Transformers
ICLR 2021
IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation
EMNLP 2021
UNKs Everywhere: Adapting Multilingual Language Models to New Scripts
EMNLP 2021
Rethinking Embedding Coupling in Pre-trained Language Models
ICLR 2021
XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation
EMNLP 2021
Efficient Test Time Adapter Ensembling for Low-resource Language Varieties
EMNLP 2021
AxCell: Automatic Extraction of Results from Machine Learning Papers
EMNLP 2020
On the Cross-lingual Transferability of Monolingual Representations
ACL 2020
A Call for More Rigor in Unsupervised Cross-lingual Learning
ACL 2020
Morphologically Aware Word-Level Translation
COLING 2020
Are All Good Word Vector Spaces Isomorphic?
EMNLP 2020
MAD-X: An Adapter-Based Framework for Multi-Task Cross-Lingual Transfer
EMNLP 2020
AdapterHub: A Framework for Adapting Transformers
EMNLP 2020
XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation
ICML 2020
Episodic Memory in Lifelong Language Learning
NIPS 2019
Donβt Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction
EMNLP 2019
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks
ACL 2019
Proceedings of the 4th Workshop on Representation Learning for NLP (RepL4NLP-2019)
ACL 2019
Unsupervised Cross-Lingual Representation Learning
ACL 2019
Latent Multi-Task Architecture Learning
AAAI 2019
Donβt Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction
IJCNLP 2019
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
IJCNLP 2019
How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions
ACL 2019
Transfer Learning in Natural Language Processing
NAACL 2019
A Hierarchical Multi-Task Approach for Learning Embeddings from Semantic Tasks
AAAI 2019
MultiFiT: Efficient Multi-lingual Language Model Fine-tuning
EMNLP 2019
Universal Language Model Fine-tuning for Text Classification
ACL 2018
A Discriminative Latent-Variable Model for Bilingual Lexicon Induction
EMNLP 2018
Strong Baselines for Neural Semi-Supervised Learning under Domain Shift
ACL 2018
On the Limitations of Unsupervised Bilingual Dictionary Induction
ACL 2018
Multi-Task Learning of Pairwise Sequence Classification Tasks over Disparate Label Spaces
NAACL 2018
360Β° Stance Detection
NAACL 2018
Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction
CONLL 2018
Learning to select data for transfer learning with Bayesian Optimization
EMNLP 2017
A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis
EMNLP 2016
INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis
SEMEVAL 2016
INSIGHT-1 at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification and Quantification
SEMEVAL 2016