Karen Livescu

61 papers · 2004–2026 · 10 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (23) 🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🤝 Dynamic Duo (18) 👥 Mega-Team (76) 🔬 Deep Specialist (13) 🏆 Keyword Champion (2) 🔥 Unstoppable (14) 📈 Trend Setter ⚡ Prolific Year (6) 💎 Century Club (60) 🗃️ Keyword Collector (51) 🚀 Conference Pioneer

Conferences

ACL (16) INTERSPEECH (16) EMNLP (12) NAACL (7) ICML (3) ICCV (2) ICLR (2) AAAI (1) CVPR (1) IJCNLP (1)

Top co-authors

Kevin Gimpel (18) Shinji Watanabe (13) Shubham Toshniwal (9) Bowen Shi (8) Greg Shakhnarovich (7) Ankita Pasad (6) Weiran Wang (6) William Chen (6) Suwon Shon (6) Diane Brentari (6)

Research topics

Speech & Audio (1)

Keywords

self-supervised learning (10) speech recognition (9) representation learning (6) multimodal learning (5) spoken language understanding (5) sign language translation (5) sign language recognition (5) automatic speech recognition (5) zero-shot learning (4) transfer learning (4) canonical correlation analysis (4) video understanding (4) speech representation (4) multi-task learning (3) constituency parsing (3) visual grounding (3) latent variable model (3) variational inference (3) coreference resolution (3) question answering (3)

Papers

Cross-Modal Taxonomic Generalization in (Vision-) Language Models ACL 2026
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks ICLR 2025
Chunk-Distilled Language Modeling ICLR 2025
SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction ACL 2025
SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale ACL 2025
Towards Robust Speech Representation Learning for Thousands of Languages EMNLP 2024
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding INTERSPEECH 2024
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets INTERSPEECH 2024
UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions NAACL 2024
On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models INTERSPEECH 2024
Convolution-Augmented Parameter-Efficient Fine-Tuning for Speech Recognition INTERSPEECH 2024
Self-Supervised Speech Representations are More Phonetic than Semantic INTERSPEECH 2024
Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing ACL 2024
On the Evaluation of Speech Foundation Models for Spoken Language Understanding ACL 2024
CMU’s IWSLT 2024 Offline Speech Translation System: A Cascaded Approach For Long-Form Robustness ACL 2024
Toward Joint Language Modeling for Speech Units and Text EMNLP 2023
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks ACL 2023
TTIC’s Submission to WMT-SLT 23 EMNLP 2023
Chess as a Testbed for Language Model State Tracking AAAI 2022
Self-supervised Representation Learning for Speech Processing NAACL 2022
Searching for fingerspelled content in American Sign Language ACL 2022
Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing ACL 2022
On the Use of External Data for Spoken Named Entity Recognition NAACL 2022
Open-Domain Sign Language Translation Learned from Online Video EMNLP 2022
Baked-in State Probing EMNLP 2022
TTIC’s WMT-SLT 22 Sign Language Translation System EMNLP 2022
On Generalization in Coreference Resolution EMNLP 2021
Fingerspelling Detection in American Sign Language CVPR 2021
Substructure Substitution: Structured Data Augmentation for NLP ACL 2021
Learning Speech Models from Multi-Modal Data INTERSPEECH 2021
Substructure Substitution: Structured Data Augmentation for NLP IJCNLP 2021
Multilingual Jointly Trained Acoustic and Written Word Embeddings INTERSPEECH 2020
Discrete Latent Variable Representations for Low-Resource Text Classification ACL 2020
PeTra: A Sparsely Supervised Memory Model for People Tracking ACL 2020
A Cross-Task Analysis of Text Span Representations ACL 2020
On the Role of Supervision in Unsupervised Constituency Parsing EMNLP 2020
Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks EMNLP 2020
Fingerspelling Recognition in the Wild With Iterative Visual Attention ICCV 2019
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval INTERSPEECH 2019
Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis INTERSPEECH 2019
Visually Grounded Neural Syntax Acquisition ACL 2019
Pre-training on high-resource speech recognition improves low-resource speech-to-text translation NAACL 2019
Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information NAACL 2018
Low-Resource Speech-to-Text Translation INTERSPEECH 2018
Variational Sequential Labelers for Semi-Supervised Learning EMNLP 2018
Visually Grounded Learning of Keyword Prediction from Untranscribed Speech INTERSPEECH 2017
Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition INTERSPEECH 2017
Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings INTERSPEECH 2017
Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis INTERSPEECH 2017
Efficient Segmental Cascades for Speech Recognition INTERSPEECH 2016
Triphone State-Tying via Deep Canonical Correlation Analysis INTERSPEECH 2016
Nonparametric Canonical Correlation Analysis ICML 2016
Charagram: Embedding Words and Sentences via Character n-grams EMNLP 2016
Deep Multilingual Correlation for Improved Word Embeddings NAACL 2015
On Deep Multi-View Representation Learning ICML 2015
Tailoring Continuous Word Representations for Dependency Parsing ACL 2014
Fingerspelling Recognition with Semi-Markov Conditional Random Fields ICCV 2013
Deep Canonical Correlation Analysis ICML 2013
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing EMNLP 2013
Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach ACL 2012
Feature-based Pronunciation Modeling for Speech Recognition NAACL 2004