conftrace_

Marta R. Costa-jussà

77 papers · 2006–2025 · 10 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+15 more ↓

🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (19)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌈 Renaissance Researcher (8) 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (29) 🤝 Dynamic Duo (23) 👥 Mega-Team (36) 🔬 Deep Specialist (42) 🏆 Keyword Champion (4) 🗃️ Keyword Collector (242) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (8) 🔥 Unstoppable (8) 💎 Century Club (77)

Conferences

ACL (30) EMNLP (29) COLING (5) EACL (3) IJCNLP (3) IJCAI (2) NAACL (2) AAAI (1) CONLL (1) INTERSPEECH (1)

Top co-authors

José A. R. Fonollosa (23) Carlos Escolano (14) Gerard I. Gállego (12) Ioannis Tsiamas (11) Christophe Ropers (11) Pierre Andrews (9) Javier Ferrando (9) David Dale (9) Eduardo Sánchez (7) Christine Basta (7)

Research topics

Keywords

neural machine translation (20) machine translation (18) speech translation (9) gender bia (9) transformer model (8) multilingual nlp (8) attention mechanism (7) multilingual translation (6) transfer learning (5) low-resource language (5) toxicity detection (5) speech encoder (4) transformer architecture (4) text classification (3) human evaluation (3) zero-shot learning (3) domain adaptation (3) word embedding (3) knowledge distillation (3) lifelong learning (3)

Papers

Translate, Then Detect: Leveraging Machine Translation for Cross-Lingual Toxicity Classification EMNLP 2025 On the Role of Speech Data in Reducing Toxicity Detection Bias NAACL 2025 2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset Download PDF ACL 2025 Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension with Open-Ended Questions ACL 2025 Towards Massive Multilingual Holistic Bias ACL 2025 LCFO: Long Context and Long Form Output Dataset and Benchmarking ACL 2025 BOUQuET : dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation EMNLP 2025 Improving Language and Modality Transfer in Translation by Character-level Modeling ACL 2025 Overview of the Shared Task on Machine Translation Gender Bias Evaluation with Multilingual Holistic Bias ACL 2024 Gender-specific Machine Translation with Large Language Models EMNLP 2024 BLASER 2.0: a metric for evaluation and quality estimation of massively multilingual speech and text translation EMNLP 2024 On the Similarity of Circuits across Languages: a Case Study on the Subject-verb Agreement Task EMNLP 2024 Unveiling the Role of Pretraining in Direct Speech Translation EMNLP 2024 SpeechAlign: A Framework for Speech Translation Alignment Evaluation COLING 2024 MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector ACL 2024 Pushing the Limits of Zero-shot End-to-End Speech Translation ACL 2024 SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations EMNLP 2023 Toxicity in Multilingual Machine Translation at Scale EMNLP 2023 The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 Languages EMNLP 2023 Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better ACL 2023 Explaining How Transformers Use Context to Build Predictions ACL 2023 BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric ACL 2023 Speech Translation with Foundation Models and Optimal Transport: UPC at IWSLT23 ACL 2023 HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation EMNLP 2023 Multilingual Holistic Bias: Extending Descriptors and Patterns to Unveil Demographic Biases in Languages at Scale EMNLP 2023 Interpreting Gender Bias in Neural Machine Translation: Multilingual Architecture Matters AAAI 2022 On the Locality of Attention in Direct Speech Translation ACL 2022 Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022 ACL 2022 Measuring the Mixing of Contextual Information in the Transformer EMNLP 2022 Towards Opening the Black Box of Neural Machine Translation: Source and Target Interpretations of the Transformer EMNLP 2022 Findings of the WMT’22 Shared Task on Large-Scale Machine Translation Evaluation for African Languages EMNLP 2022 SHAS: Approaching optimal Segmentation for End-to-End Speech Translation INTERSPEECH 2022 Findings of the 2021 Conference on Machine Translation (WMT21) EMNLP 2021 The TALP-UPC Participation in WMT21 News Translation Task: an mBART-based NMT Approach EMNLP 2021 High Frequent In-domain Words Segmentation and Forward Translation for the WMT21 Biomedical Task EMNLP 2021 End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 IJCNLP 2021 Multilingual Machine Translation: Closing the Gap between Shared and Language-specific Encoder-Decoders EACL 2021 Impact of COVID-19 in Natural Language Processing Publications: a Disaggregated Study in Gender, Contribution and Experience EACL 2021 End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 ACL 2021 Attention Weights in Transformer NMT Fail Aligning Words Between Sequences but Largely Explain Model Predictions EMNLP 2021 Towards Mitigating Gender Bias in a decoder-based Neural Machine Translation model by Adding Contextual Information ACL 2020 Multilingual Neural Machine Translation: Case-study for Catalan, Spanish and Portuguese Romance Languages EMNLP 2020 E-Commerce Content and Collaborative-based Recommendation using K-Nearest Neighbors and Enriched Weighted Vectors COLING 2020 Fine-tuning Neural Machine Translation on Gender-Balanced Datasets COLING 2020 Combining Subword Representations into Word-level Representations in the Transformer Architecture ACL 2020 Syntax-driven Iterative Expansion Language Models for Controllable Text Generation EMNLP 2020 Findings of the 2020 Conference on Machine Translation (WMT20) EMNLP 2020 Findings of the First Shared Task on Lifelong Learning Machine Translation EMNLP 2020 Continual Lifelong Learning in Natural Language Processing: A Survey COLING 2020 Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources ACL 2020 The TALP-UPC System Description for WMT20 News Translation Task: Multilingual Adaptation for Low Resource MT EMNLP 2020 The IPN-CIC team system submission for the WMT 2020 similar language task EMNLP 2020 Evaluating the Underlying Gender Bias in Contextualized Word Embeddings ACL 2019 Proceedings of the First Workshop on Gender Bias in Natural Language Processing ACL 2019 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: System Demonstrations ACL 2019 Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations EMNLP 2019 Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations IJCNLP 2019 From Bilingual to Multilingual Neural Machine Translation by Incremental Training ACL 2019 The TALP-UPC System for the WMT Similar Language Task: Statistical vs Neural Machine Translation ACL 2019 Terminology-Aware Segmentation and Domain Feature for the WMT19 Biomedical Translation Task ACL 2019 The TALP-UPC Machine Translation Systems for WMT19 News Translation Task: Pivoting Techniques for Low Resource MT ACL 2019 Findings of the 2019 Conference on Machine Translation (WMT19) ACL 2019 Equalizing Gender Bias in Neural Machine Translation with Word Embeddings Techniques ACL 2019 BERT Masked Language Modeling for Co-reference Resolution ACL 2019 Gendered Ambiguous Pronoun (GAP) Shared Task at the Gender Bias in NLP Workshop 2019 ACL 2019 A Neural Approach to Language Variety Translation COLING 2018 The TALP-UPC Machine Translation Systems for WMT18 News Shared Translation Task EMNLP 2018 Neural Machine Translation with the Transformer and Multi-Source Romance Languages for the Biomedical WMT 2018 task EMNLP 2018 From Feature to Paradigm: Deep Learning in Machine Translation (Extended Abstract) IJCAI 2018 Character-based Neural Machine Translation ACL 2016 CHISPA on the GO: A mobile Chinese-Spanish translation service for travellers in trouble EACL 2014 Evaluating Indirect Strategies for Chinese–Spanish Statistical Machine Translation: Extended Abstract IJCAI 2013 Enhancing scarce-resource language translation through pivot combinations IJCNLP 2011 Analysis and System Combination of Phrase- and N-Gram-Based Statistical Machine Translation Systems NAACL 2007 Smooth Bilingual N-Gram Translation CONLL 2007 Smooth Bilingual N-Gram Translation EMNLP 2007 Statistical Machine Reordering EMNLP 2006