conftrace_

Ozlem Kalinli

20 papers · 2016–2024 · 3 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (3)
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🗺️ Taxonomy Completionist (13)
🏃 Academic Marathon (8)
🐝 Cross-Pollinator (8)
🌈 Renaissance Researcher (5)
🤝 Dynamic Duo (11)
🧬 Topic Evolution
🔬 Deep Specialist (15)
💎 Century Club (20)
🗃️ Keyword Collector (88)
🚀 Conference Pioneer
⚡ Prolific Year (6)

Conferences

INTERSPEECH (18) EMNLP (1) NAACL (1)

Papers

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs · NAACL 2024
Evaluating Speech Recognition Performance Towards Large Language Model Based Voice Assistants · INTERSPEECH 2024
Towards measuring fairness in speech recognition: Fair-Speech dataset · INTERSPEECH 2024
Multi-Head State Space Model for Speech Recognition · INTERSPEECH 2023
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding · INTERSPEECH 2023
Federated Domain Adaptation for ASR with Full Self-Supervision · INTERSPEECH 2022
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition · EMNLP 2022
Streaming parallel transducer beam search with fast slow cascaded encoders · INTERSPEECH 2022
Deliberation Model for On-Device Spoken Language Understanding · INTERSPEECH 2022
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric · INTERSPEECH 2022
Scaling ASR Improves Zero and Few Shot Learning · INTERSPEECH 2022
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding · INTERSPEECH 2021
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis · INTERSPEECH 2021
Collaborative Training of Acoustic Encoders for Speech Recognition · INTERSPEECH 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion · INTERSPEECH 2021
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency · INTERSPEECH 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios · INTERSPEECH 2021
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition · INTERSPEECH 2021
Bandwidth Embeddings for Mixed-Bandwidth Speech Recognition · INTERSPEECH 2019
Analysis of Multi-Lingual Emotion Recognition Using Auditory Attention Features · INTERSPEECH 2016