Marta R. Costa-jussà
77 papers · 2006–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (19)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🌈
Renaissance Researcher
(8)
🌟
Keyword Trendsetter Combo
(3)
🏠
Conference Loyalist
(29)
🤝
Dynamic Duo
(23)
👥
Mega-Team
(36)
🔬
Deep Specialist
(42)
🏆
Keyword Champion
(4)
🗃️
Keyword Collector
(242)
📈
Trend Setter
🚀
Conference Pioneer
⚡
Prolific Year
(8)
🔥
Unstoppable
(8)
💎
Century Club
(77)
Conferences
ACL (30)
EMNLP (29)
COLING (5)
EACL (3)
IJCNLP (3)
IJCAI (2)
NAACL (2)
AAAI (1)
CONLL (1)
INTERSPEECH (1)
Top co-authors
Research topics
Keywords
neural machine translation
(20)
machine translation
(18)
speech translation
(9)
gender bia
(9)
transformer model
(8)
multilingual nlp
(8)
attention mechanism
(7)
multilingual translation
(6)
transfer learning
(5)
low-resource language
(5)
toxicity detection
(5)
speech encoder
(4)
transformer architecture
(4)
text classification
(3)
human evaluation
(3)
zero-shot learning
(3)
domain adaptation
(3)
word embedding
(3)
knowledge distillation
(3)
lifelong learning
(3)
Papers
Translate, Then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification
EMNLP 2025
On the Role of Speech Data in Reducing Toxicity Detection Bias
NAACL 2025
2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset Download PDF
ACL 2025
Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension with Open-Ended Questions
ACL 2025
Towards Massive Multilingual Holistic Bias
ACL 2025
LCFO: Long Context and Long Form Output Dataset and Benchmarking
ACL 2025
BOUQuET : dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
EMNLP 2025
Improving Language and Modality Transfer in Translation by Character-level Modeling
ACL 2025
Overview of the Shared Task on Machine Translation Gender Bias Evaluation with Multilingual Holistic Bias
ACL 2024
Gender-specific Machine Translation with Large Language Models
EMNLP 2024
BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation
EMNLP 2024
On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task
EMNLP 2024
Unveiling the Role of Pretraining in Direct Speech Translation
EMNLP 2024
SpeechAlign: A Framework for Speech Translation Alignment Evaluation
COLING 2024
MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector
ACL 2024
Pushing the Limits of Zero-shot End-to-End Speech Translation
ACL 2024
SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations
EMNLP 2023
Toxicity in Multilingual Machine Translation at Scale
EMNLP 2023
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages
EMNLP 2023
Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better
ACL 2023
Explaining How Transformers Use Context to Build Predictions
ACL 2023
BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric
ACL 2023
Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23
ACL 2023
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation
EMNLP 2023
Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale
EMNLP 2023
Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters
AAAI 2022
On the Locality of Attention in Direct Speech Translation
ACL 2022
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022
ACL 2022
Measuring the Mixing of Contextual Information in the Transformer
EMNLP 2022
Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer
EMNLP 2022
Findings of the WMT’22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages
EMNLP 2022
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
INTERSPEECH 2022
Findings of the 2021 Conference on Machine Translation (WMT21)
EMNLP 2021
The TALP-UPC Participation in WMT21 News Translation Task: an mBART-based NMT Approach
EMNLP 2021
High Frequent In-domain Words Segmentation and Forward Translation for the WMT21 Biomedical Task
EMNLP 2021
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021
IJCNLP 2021
Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders
EACL 2021
Impact of COVID-19 in Natural Language Processing Publications: a Disaggregated Study in Gender, Contribution and Experience
EACL 2021
End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021
ACL 2021
Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions
EMNLP 2021
Towards Mitigating Gender Bias in a decoder-based Neural Machine Translation model by Adding Contextual Information
ACL 2020
Multilingual Neural Machine Translation: Case-study for Catalan, Spanish and Portuguese Romance Languages
EMNLP 2020
E-Commerce Content and Collaborative-based Recommendation using K-Nearest Neighbors and Enriched Weighted Vectors
COLING 2020
Fine-tuning Neural Machine Translation on Gender-Balanced Datasets
COLING 2020
Combining Subword Representations into Word-level Representations in the Transformer Architecture
ACL 2020
Syntax-driven Iterative Expansion Language Models for Controllable Text Generation
EMNLP 2020
Findings of the 2020 Conference on Machine Translation (WMT20)
EMNLP 2020
Findings of the First Shared Task on Lifelong Learning Machine Translation
EMNLP 2020
Continual Lifelong Learning in Natural Language Processing: A Survey
COLING 2020
Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources
ACL 2020
The TALP-UPC System Description for WMT20 News Translation Task: Multilingual Adaptation for Low Resource MT
EMNLP 2020
The IPN-CIC team system submission for the WMT 2020 similar language task
EMNLP 2020
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
ACL 2019
Proceedings of the First Workshop on Gender Bias in Natural Language Processing
ACL 2019
Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations
ACL 2019
Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations
EMNLP 2019
Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations
IJCNLP 2019
From Bilingual to Multilingual Neural Machine Translation by Incremental Training
ACL 2019
The TALP-UPC System for the WMT Similar Language Task: Statistical vs Neural Machine Translation
ACL 2019
Terminology-Aware Segmentation and Domain Feature for the WMT19 Biomedical Translation Task
ACL 2019
The TALP-UPC Machine Translation Systems for WMT19 News Translation Task: Pivoting Techniques for Low Resource MT
ACL 2019
Findings of the 2019 Conference on Machine Translation (WMT19)
ACL 2019
Equalizing Gender Bias in Neural Machine Translation with Word Embeddings Techniques
ACL 2019
BERT Masked Language Modeling for Co-reference Resolution
ACL 2019
Gendered Ambiguous Pronoun (GAP) Shared Task at the Gender Bias in NLP Workshop 2019
ACL 2019
A Neural Approach to Language Variety Translation
COLING 2018
The TALP-UPC Machine Translation Systems for WMT18 News Shared Translation Task
EMNLP 2018
Neural Machine Translation with the Transformer and Multi-Source Romance Languages for the Biomedical WMT 2018 task
EMNLP 2018
From Feature to Paradigm: Deep Learning in Machine Translation (Extended Abstract)
IJCAI 2018
Character-based Neural Machine Translation
ACL 2016
CHISPA on the GO: A mobile Chinese-Spanish translation service for travellers in trouble
EACL 2014
Evaluating Indirect Strategies for Chinese–Spanish Statistical Machine Translation: Extended Abstract
IJCAI 2013
Enhancing scarce-resource language translation through pivot combinations
IJCNLP 2011
Analysis and System Combination of Phrase- and N-Gram-Based Statistical Machine Translation Systems
NAACL 2007
Smooth Bilingual N-Gram Translation
CONLL 2007
Smooth Bilingual N-Gram Translation
EMNLP 2007
Statistical Machine Reordering
EMNLP 2006