Karen Livescu
61 papers · 2004–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (23) π Renaissance Researcher (6) π Interdisciplinary Bridge π£ Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(23)
π§
Keyword Pioneer
π
Conference Polyglot
(10)
π€
Dynamic Duo
(18)
π₯
Mega-Team
(76)
π¬
Deep Specialist
(13)
π
Keyword Champion
(2)
π₯
Unstoppable
(14)
π
Trend Setter
β‘
Prolific Year
(6)
π
Century Club
(60)
ποΈ
Keyword Collector
(51)
π
Conference Pioneer
Conferences
ACL (16)
INTERSPEECH (16)
EMNLP (12)
NAACL (7)
ICML (3)
ICCV (2)
ICLR (2)
AAAI (1)
CVPR (1)
IJCNLP (1)
Top co-authors
Research topics
Keywords
self-supervised learning
(10)
speech recognition
(9)
representation learning
(6)
multimodal learning
(5)
spoken language understanding
(5)
sign language translation
(5)
sign language recognition
(5)
automatic speech recognition
(5)
zero-shot learning
(4)
transfer learning
(4)
canonical correlation analysis
(4)
video understanding
(4)
speech representation
(4)
multi-task learning
(3)
constituency parsing
(3)
visual grounding
(3)
latent variable model
(3)
variational inference
(3)
coreference resolution
(3)
question answering
(3)
Papers
Cross-Modal Taxonomic Generalization in (Vision-) Language Models
ACL 2026
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
ICLR 2025
Chunk-Distilled Language Modeling
ICLR 2025
SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction
ACL 2025
SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale
ACL 2025
Towards Robust Speech Representation Learning for Thousands of Languages
EMNLP 2024
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding
INTERSPEECH 2024
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets
INTERSPEECH 2024
UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions
NAACL 2024
On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models
INTERSPEECH 2024
Convolution-Augmented Parameter-Efficient Fine-Tuning for Speech Recognition
INTERSPEECH 2024
Self-Supervised Speech Representations are More Phonetic than Semantic
INTERSPEECH 2024
Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing
ACL 2024
On the Evaluation of Speech Foundation Models for Spoken Language Understanding
ACL 2024
CMUβs IWSLT 2024 Offline Speech Translation System: A Cascaded Approach For Long-Form Robustness
ACL 2024
Toward Joint Language Modeling for Speech Units and Text
EMNLP 2023
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks
ACL 2023
TTICβs Submission to WMT-SLT 23
EMNLP 2023
Chess as a Testbed for Language Model State Tracking
AAAI 2022
Self-supervised Representation Learning for Speech Processing
NAACL 2022
Searching for fingerspelled content in American Sign Language
ACL 2022
Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency Parsing
ACL 2022
On the Use of External Data for Spoken Named Entity Recognition
NAACL 2022
Open-Domain Sign Language Translation Learned from Online Video
EMNLP 2022
Baked-in State Probing
EMNLP 2022
TTICβs WMT-SLT 22 Sign Language Translation System
EMNLP 2022
On Generalization in Coreference Resolution
EMNLP 2021
Fingerspelling Detection in American Sign Language
CVPR 2021
Substructure Substitution: Structured Data Augmentation for NLP
ACL 2021
Learning Speech Models from Multi-Modal Data
INTERSPEECH 2021
Substructure Substitution: Structured Data Augmentation for NLP
IJCNLP 2021
Multilingual Jointly Trained Acoustic and Written Word Embeddings
INTERSPEECH 2020
Discrete Latent Variable Representations for Low-Resource Text Classification
ACL 2020
PeTra: A Sparsely Supervised Memory Model for People Tracking
ACL 2020
A Cross-Task Analysis of Text Span Representations
ACL 2020
On the Role of Supervision in Unsupervised Constituency Parsing
EMNLP 2020
Learning to Ignore: Long Document Coreference with Bounded Memory Neural Networks
EMNLP 2020
Fingerspelling Recognition in the Wild With Iterative Visual Attention
ICCV 2019
On the Contributions of Visual and Textual Supervision in Low-Resource Semantic Speech Retrieval
INTERSPEECH 2019
Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis
INTERSPEECH 2019
Visually Grounded Neural Syntax Acquisition
ACL 2019
Pre-training on high-resource speech recognition improves low-resource speech-to-text translation
NAACL 2019
Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information
NAACL 2018
Low-Resource Speech-to-Text Translation
INTERSPEECH 2018
Variational Sequential Labelers for Semi-Supervised Learning
EMNLP 2018
Visually Grounded Learning of Keyword Prediction from Untranscribed Speech
INTERSPEECH 2017
Multitask Learning with Low-Level Auxiliary Tasks for Encoder-Decoder Based Speech Recognition
INTERSPEECH 2017
Query-by-Example Search with Discriminative Neural Acoustic Word Embeddings
INTERSPEECH 2017
Acoustic Feature Learning via Deep Variational Canonical Correlation Analysis
INTERSPEECH 2017
Efficient Segmental Cascades for Speech Recognition
INTERSPEECH 2016
Triphone State-Tying via Deep Canonical Correlation Analysis
INTERSPEECH 2016
Nonparametric Canonical Correlation Analysis
ICML 2016
Charagram: Embedding Words and Sentences via Character n-grams
EMNLP 2016
Deep Multilingual Correlation for Improved Word Embeddings
NAACL 2015
On Deep Multi-View Representation Learning
ICML 2015
Tailoring Continuous Word Representations for Dependency Parsing
ACL 2014
Fingerspelling Recognition with Semi-Markov Conditional Random Fields
ICCV 2013
Deep Canonical Correlation Analysis
ICML 2013
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing
EMNLP 2013
Discriminative Pronunciation Modeling: A Large-Margin, Feature-Rich Approach
ACL 2012
Feature-based Pronunciation Modeling for Speech Recognition
NAACL 2004