Ozlem Kalinli
20 papers · 2016–2024 · across 3 top CS/AI conferences
Achievements
Conference Polyglot (3)
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (13)
Academic Marathon (8)
Cross-Pollinator (8)
Renaissance Researcher (5)
Dynamic Duo (11)
Topic Evolution
Deep Specialist (15)
Century Club (20)
Keyword Collector (88)
Conference Pioneer
Prolific Year (6)
Conferences
INTERSPEECH (18)
EMNLP (1)
NAACL (1)
Keywords
automatic speech recognition (10)
speech recognition (7)
word error rate (6)
semantic distance (3)
on-device speech recognition (3)
spoken language understanding (3)
end-to-end model (2)
semantic parsing (2)
large language model (2)
neural network (2)
acoustic modeling (2)
natural language understanding (2)
latency optimization (2)
cross-modal learning (1)
self-supervised learning (1)
text representation (1)
feature extraction (1)
semi-supervised learning (1)
speech processing (1)
acoustic model (1)
Papers
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
NAACL 2024
Evaluating Speech Recognition Performance Towards Large Language Model Based Voice Assistants
INTERSPEECH 2024
Towards measuring fairness in speech recognition: Fair-Speech dataset
INTERSPEECH 2024
Multi-Head State Space Model for Speech Recognition
INTERSPEECH 2023
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding
INTERSPEECH 2023
Federated Domain Adaptation for ASR with Full Self-Supervision
INTERSPEECH 2022
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition
EMNLP 2022
Streaming parallel transducer beam search with fast slow cascaded encoders
INTERSPEECH 2022
Deliberation Model for On-Device Spoken Language Understanding
INTERSPEECH 2022
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
INTERSPEECH 2022
Scaling ASR Improves Zero and Few Shot Learning
INTERSPEECH 2022
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
INTERSPEECH 2021
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis
INTERSPEECH 2021
Collaborative Training of Acoustic Encoders for Speech Recognition
INTERSPEECH 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
INTERSPEECH 2021
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency
INTERSPEECH 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios
INTERSPEECH 2021
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition
INTERSPEECH 2021
Bandwidth Embeddings for Mixed-Bandwidth Speech Recognition
INTERSPEECH 2019
Analysis of Multi-Lingual Emotion Recognition Using Auditory Attention Features
INTERSPEECH 2016