Marta Villegas
20 papers · 2019–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
๐ Renaissance Researcher (7) ๐ Cross-Pollinator (12) ๐ Academic Marathon (6) ๐ Conference Polyglot (6) ๐ Interdisciplinary Bridge
๐
Academic Marathon
(6)
๐
Renaissance Researcher
(7)
๐ค
Dynamic Duo
(16)
๐ฌ
Deep Specialist
(11)
๐๏ธ
Keyword Collector
(92)
๐
Century Club
(18)
โก
Prolific Year
(7)
โ
The Questioner
(3)
๐ฅ
Unstoppable
(5)
Conferences
COLING (6)
EMNLP (4)
ACL (3)
EACL (3)
IJCNLP (2)
AACL (1)
NAACL (1)
Top co-authors
Research topics
Keywords
named entity recognition
(5)
low-resource language
(4)
large language model
(4)
benchmark evaluation
(3)
clinical text
(2)
text classification
(2)
image-induced fidelity loss
(2)
multilingual benchmark
(2)
language resource
(2)
model merging
(2)
visual language model
(2)
multilingual alignment
(2)
question answering
(1)
clinical named entity recognition
(1)
multilingual nlp
(1)
natural language inference
(1)
sequence labeling
(1)
few-shot learning
(1)
knowledge editing
(1)
privacy preservation
(1)
Papers
Vinclat: Evaluating Reasoning, Cognition and Culture in One Game
EACL 2026
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
AACL 2025
VeritasQA: A Truthfulness Benchmark Aimed at Multilingual Transferability
COLING 2025
IberoBench: A Benchmark for LLM Evaluation in Iberian Languages
COLING 2025
Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization
IJCNLP 2025
Multi-LMentry: Can Multilingual LLMs Solve Elementary Tasks Across Languages?
EMNLP 2025
Extending Off-the-shelf NER Systems to Personal Information Detection in Dialogues with a Virtual Agent: Findings from a Real-Life Use Case
EACL 2024
Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
ACL 2024
A CURATEd CATalog: Rethinking the Extraction of Pretraining Corpora for Mid-Resourced Languages
COLING 2024
Becoming a High-Resource Language in Speech: The Catalan Case in the Common Voice Corpus
COLING 2024
Building a Data Infrastructure for a Mid-Resource Language: The Case of Catalan
COLING 2024
FLOR: On the Effectiveness of Language Adaptation
COLING 2024
Community OSCAR: A Community Effort for Multilingual Web Data
EMNLP 2024
A weakly supervised textual entailment approach to zero-shot text classification
EACL 2023
Pretrained Biomedical Language Models for Clinical NLP in Spanish
ACL 2022
Assessing the Limits of Straightforward Models for Nested Named Entity Recognition in Spanish Clinical Narratives
EMNLP 2022
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan
ACL 2021
Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan
IJCNLP 2021
Medical Word Embeddings for Spanish: Development and Evaluation
NAACL 2019
PharmaCoNER: Pharmacological Substances, Compounds and proteins Named Entity Recognition track
EMNLP 2019