Goran Glavaš
113 papers · 2012–2026 · 9 conferences · across top CS/AI conferences
Achievements
🧭 Keyword Pioneer
🌍 Conference Polyglot (9)
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (13)
🏃 Academic Marathon (13)
🌟 Keyword Trendsetter Combo (4)
🏠 Conference Loyalist (35)
🔬 Deep Specialist (33)
🏆 Keyword Champion (2)
🤝 Dynamic Duo (52)
💎 Century Club (110)
❓ The Questioner (10)
🗃️ Keyword Collector (356)
⚡ Prolific Year (15)
🔥 Unstoppable (9)
📈 Trend Setter
Conferences
ACL (36)
EMNLP (33)
EACL (11)
NAACL (11)
IJCNLP (9)
COLING (6)
SEMEVAL (4)
AAAI (2)
ICLR (1)
Research topics
Keywords
cross-lingual transfer (31)
word embedding (14)
zero-shot learning (10)
pretrained language model (9)
low-resource language (8)
zero-shot transfer (8)
transfer learning (7)
domain adaptation (7)
few-shot learning (6)
bilingual lexicon induction (6)
cross-lingual word embedding (6)
multilingual transformer (6)
word vector (6)
lexical entailment (5)
language model (5)
multilingual language model (5)
lexical knowledge (5)
distributional semantics (5)
machine translation (5)
named entity recognition (5)
Papers
Mind Your Special Tokens! On the Importance of Dedicated Sequence-End Tokens in Vision-Language Embedding Models
EACL 2026
Compositional Steering of Large Language Models with Steering Tokens
ACL 2026
Don’t Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints
AAAI 2026
MVL-SIB: A Massively Multilingual Vision-Language Benchmark for Cross-Modal Topical Matching
ACL 2025
Large Language Models are Miscalibrated In-Context Learners
ACL 2025
BabelEdits: A Benchmark and a Modular Approach for Robust Cross-lingual Knowledge Editing of Large Language Models
ACL 2025
How Much Do LLMs Hallucinate across Languages? On Realistic Multilingual Estimation of LLM Hallucination
EMNLP 2025
ReCoVeR the Target Language: Language Steering without Sacrificing Task Performance
EMNLP 2025
On Generalization across Measurement Systems: LLMs Entail More Test-Time Compute for Underrepresented Cultures
ACL 2025
On Synthesizing Data for Context Attribution in Question Answering
ACL 2025
Centurio: On Drivers of Multilingual Ability of Large Vision-Language Model
ACL 2025
ObscuraCoder: Powering Efficient Code LM Pre-Training Via Obfuscation Grounding
ICLR 2025
TransAlign: Machine Translation Encoders are Strong Word Aligners, Too
EMNLP 2025
Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
ACL 2025
The Devil Is in the Word Alignment Details: On Translation-Based Cross-Lingual Transfer for Token Classification Tasks
ACL 2025
Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models?
EMNLP 2024
SQATIN: Supervised Instruction Tuning Meets Question Answering for Improved Dialogue NLU
NAACL 2024
To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages
NAACL 2024
Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations
ACL 2024
IRCoder: Intermediate Representations Make Language Models Robust Multilingual Code Generators
ACL 2024
ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection
ACL 2024
mBLIP: Efficient Bootstrapping of Multilingual Vision-LLMs
ACL 2024
Improving Vision-Language Cross-Lingual Transfer with Scheduled Unfreezing
ACL 2024
Train Once, Use Flexibly: A Modular Framework for Multi-Aspect Neural News Recommendation
EMNLP 2024
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages
EMNLP 2024
African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification
EMNLP 2024
JSI and WüNLP at the DIALECT-COPA Shared Task: In-Context Learning From Just a Few Dialectal Examples Gets You Quite Far
NAACL 2024
VarDial Evaluation Campaign 2024: Commonsense Reasoning in Dialects and Multi-Label Similar Language Identification
NAACL 2024
Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget
NAACL 2024
Kardeş-NLU: Transfer to Low-Resource Languages with the Help of a High-Resource Cousin – A Benchmark and Evaluation for Turkic Languages
EACL 2024
Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection
EACL 2024
Free Lunch: Robust Cross-Lingual Transfer via Model Checkpoint Averaging
ACL 2023
Massively Multilingual Lexical Specialization of Multilingual Transformers
ACL 2023
One For All & All For One: Bypassing Hyperparameter Tuning with Model Averaging for Cross-Lingual Transfer
EMNLP 2023
NewsRecLib: A PyTorch-Lightning Library for Neural News Recommendation
EMNLP 2023
The Devil is in the Details: On Models and Training Regimes for Few-Shot Intent Classification
EACL 2023
Improving Cross-Lingual Transfer for Open Information Extraction with Linguistic Feature Projection
EMNLP 2023
Probing Cross-Lingual Lexical Knowledge from Multilingual Sentence Encoders
EACL 2023
Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers
EACL 2023
A General-Purpose Multilingual Document Encoder
EMNLP 2023
AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classification
EMNLP 2023
Vicinal Risk Minimization for Few-Shot Cross-lingual Transfer in Abusive Language Detection
EMNLP 2023
Linking Surface Facts to Large-Scale Knowledge Graphs
EMNLP 2023
Multi2WOZ: A Robust Multilingual Dataset and Conversational Pretraining for Task-Oriented Dialog
NAACL 2022
Don’t Stop Fine-Tuning: On Training Regimes for Few-Shot Cross-Lingual Transfer with Multilingual Language Models
EMNLP 2022
SLICER: Sliced Fine-Tuning for Low-Resource Cross-Lingual Transfer for Named Entity Recognition
EMNLP 2022
AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark
ACL 2022
Natural Language Processing for Multilingual Task-Oriented Dialogue
ACL 2022
DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog
ACL 2022
Parameter-Efficient Neural Reranking for Cross-Lingual and Multilingual Retrieval
COLING 2022
BenchIE: A Framework for Multi-Faceted Fact-Based Open Information Extraction Evaluation
ACL 2022
BAD-X: Bilingual Adapters Improve Zero-Shot Cross-Lingual Transfer
NAACL 2022
ZusammenQA: Data Augmentation with Specialized Models for Cross-lingual Open-retrieval Question Answering System
NAACL 2022
LexFit: Lexical Fine-Tuning of Pretrained Language Models
ACL 2021
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models
ACL 2021
Verb Knowledge Injection for Multilingual Event Processing
ACL 2021
Climbing the Tower of Treebanks: Improving Low-Resource Dependency Parsing via Hierarchical Source Selection
ACL 2021
Is Supervised Syntactic Parsing Beneficial for Language Understanding Tasks? An Empirical Investigation
EACL 2021
DebIE: A Platform for Implicit and Explicit Debiasing of Word Embedding Spaces
EACL 2021
Training and Domain Adaptation for Supervised Text Segmentation
EACL 2021
MAD-G: Multilingual Adapter Generation for Efficient Cross-Lingual Transfer
EMNLP 2021
Sustainable Modular Debiasing of Language Models
EMNLP 2021
RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models
IJCNLP 2021
LexFit: Lexical Fine-Tuning of Pretrained Language Models
IJCNLP 2021
Verb Knowledge Injection for Multilingual Event Processing
IJCNLP 2021
Climbing the Tower of Treebanks: Improving Low-Resource Dependency Parsing via Hierarchical Source Selection
IJCNLP 2021
XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages
COLING 2020
Towards Instance-Level Parser Selection for Cross-Lingual Transfer of Dependency Parsers
COLING 2020
Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity
COLING 2020
Probing Pretrained Language Models for Lexical Semantics
EMNLP 2020
From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers
EMNLP 2020
XCOPA: A Multilingual Dataset for Causal Commonsense Reasoning
EMNLP 2020
Common Sense or World Knowledge? Investigating Adapter-Based Knowledge Injection into Pretrained Transformers
EMNLP 2020
On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation
ACL 2020
Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction
ACL 2020
Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces
ACL 2020
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
SEMEVAL 2020
Improving Bilingual Lexicon Induction with Unsupervised Post-Processing of Monolingual Word Vector Spaces
ACL 2020
A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces
AAAI 2020
AraWEAT: Multidimensional Analysis of Biases in Arabic Word Embeddings
COLING 2020
SemEval-2020 Task 2: Predicting Multilingual and Cross-Lingual (Graded) Lexical Entailment
COLING 2020
Semantic Specialization of Distributional Word Vectors
EMNLP 2019
Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
EMNLP 2019
Cross-lingual Semantic Specialization via Lexical Relation Induction
EMNLP 2019
Cross-lingual Semantic Specialization via Lexical Relation Induction
IJCNLP 2019
Do We Really Need Fully Unsupervised Cross-Lingual Embeddings?
IJCNLP 2019
Semantic Specialization of Distributional Word Vectors
IJCNLP 2019
SEAGLE: A Platform for Comparative Evaluation of Semantic Encoders for Information Retrieval
IJCNLP 2019
Specializing Distributional Vectors of All Words for Lexical Entailment
ACL 2019
Computational Analysis of Political Texts: Bridging Research Efforts Across Communities
ACL 2019
Multilingual and Cross-Lingual Graded Lexical Entailment
ACL 2019
Generalized Tuning of Distributional Word Vectors for Monolingual and Cross-Lingual Lexical Entailment
ACL 2019
How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions
ACL 2019
SEAGLE: A Platform for Comparative Evaluation of Semantic Encoders for Information Retrieval
EMNLP 2019
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-13)
EMNLP 2019
ArguminSci: A Tool for Analyzing Argumentation and Rhetorical Aspects in Scientific Writing
EMNLP 2018
Proceedings of the Twelfth Workshop on Graph-Based Methods for Natural Language Processing (TextGraphs-12)
NAACL 2018
An Argument-Annotated Corpus of Scientific Publications
EMNLP 2018
Explicit Retrofitting of Distributional Word Vectors
ACL 2018
Post-Specialisation: Retrofitting Vectors of Words Unseen in Lexical Resources
NAACL 2018
Discriminating between Lexico-Semantic Relations with the Specialization Tensor Model
NAACL 2018
Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization
EMNLP 2018
Investigating the Role of Argumentation in the Rhetorical Analysis of Scientific Publications with Neural Multi-Task Learning Models
EMNLP 2018
Dual Tensor Model for Detecting Asymmetric Lexico-Semantic Relations
EMNLP 2017
Unsupervised Cross-Lingual Scaling of Political Texts
EACL 2017
Improving Neural Knowledge Base Completion with Cross-Lingual Projections
EACL 2017
Simplifying Lexical Simplification: Do We Need Simplified Corpora?
IJCNLP 2015
TAKELAB: Medical Information Extraction and Linking with MINERAL
SEMEVAL 2015
TKLBLIIR: Detecting Twitter Paraphrases with TweetingJay
SEMEVAL 2015
Simplifying Lexical Simplification: Do We Need Simplified Corpora?
ACL 2015
Recognizing Identical Events with Graph Kernels
ACL 2013
Event-Centered Information Retrieval Using Kernels on Event Graphs
EMNLP 2013
TakeLab: Systems for Measuring Semantic Text Similarity
SEMEVAL 2012