Christian Fuegen

19 papers · 2019–2024 · 3 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌍 Conference Polyglot (3) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🏃 Academic Marathon (5)

🏃 Academic Marathon (5) 🐝 Cross-Pollinator (4) 🌈 Renaissance Researcher (7) 👥 Mega-Team (85) 🔬 Deep Specialist (10) ⚡ Prolific Year (8) 💎 Century Club (19) ❓ The Questioner 🗃️ Keyword Collector (93) 🔥 Unstoppable (6)

Conferences

INTERSPEECH (16) CVPR (2) NAACL (1)

Top co-authors

Ozlem Kalinli (9) Duc Le (8) Michael L. Seltzer (7) Chunyang Wu (6) Yangyang Shi (6) Jay Mahadeokar (5) Alex Xiao (4) Yuan Shangguan (4) Niko Moritz (3) Suyoun Kim (3)

Keywords

automatic speech recognition (7) word error rate (6) semi-supervised learning (3) on-device speech recognition (3) speech recognition (3) attention mechanism (2) latency optimization (2) semantic distance (2) end-to-end speech recognition (2) transfer learning (2) natural language understanding (2) text-to-speech synthesis (2) streaming speech recognition (2) spoken language understanding (1) video understanding (1) cross-modal learning (1) self-supervised learning (1) few-shot learning (1) audio visual (1) speech processing (1)

Papers

AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs NAACL 2024 Streaming Audio-Visual Speech Recognition with Alignment Regularization INTERSPEECH 2023 SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision CVPR 2023 Directional Speech Recognition for Speaker Disambiguation and Cross-talk Suppression INTERSPEECH 2023 Ego4D: Around the World in 3,000 Hours of Egocentric Video CVPR 2022 Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric INTERSPEECH 2022 Scaling ASR Improves Zero and Few Shot Learning INTERSPEECH 2022 Transformer-Based Acoustic Modeling for Streaming Speech Synthesis INTERSPEECH 2021 A Two-Stage Approach to Speech Bandwidth Extension INTERSPEECH 2021 Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion INTERSPEECH 2021 Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding INTERSPEECH 2021 Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency INTERSPEECH 2021 Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios INTERSPEECH 2021 Dissecting User-Perceived Latency of On-Device E2E Speech Recognition INTERSPEECH 2021 Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning INTERSPEECH 2021 Interactive Text-to-Speech System via Joint Style Analysis INTERSPEECH 2020 Weak-Attention Suppression for Transformer Based Speech Recognition INTERSPEECH 2020 Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR INTERSPEECH 2020 Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR INTERSPEECH 2019