Christian Fuegen
19 papers · 2019–2024 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Conference Polyglot (3) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer π Academic Marathon (5)
π
Academic Marathon
(5)
π
Cross-Pollinator
(4)
π
Renaissance Researcher
(7)
π₯
Mega-Team
(85)
π¬
Deep Specialist
(10)
β‘
Prolific Year
(8)
π
Century Club
(19)
β
The Questioner
ποΈ
Keyword Collector
(93)
π₯
Unstoppable
(6)
Conferences
INTERSPEECH (16)
CVPR (2)
NAACL (1)
Top co-authors
Keywords
automatic speech recognition
(7)
word error rate
(6)
semi-supervised learning
(3)
on-device speech recognition
(3)
speech recognition
(3)
attention mechanism
(2)
latency optimization
(2)
semantic distance
(2)
end-to-end speech recognition
(2)
transfer learning
(2)
natural language understanding
(2)
text-to-speech synthesis
(2)
streaming speech recognition
(2)
spoken language understanding
(1)
video understanding
(1)
cross-modal learning
(1)
self-supervised learning
(1)
few-shot learning
(1)
audio visual
(1)
speech processing
(1)
Papers
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs
NAACL 2024
Streaming Audio-Visual Speech Recognition with Alignment Regularization
INTERSPEECH 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
CVPR 2023
Directional Speech Recognition for Speaker Disambiguation and Cross-talk Suppression
INTERSPEECH 2023
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
INTERSPEECH 2022
Scaling ASR Improves Zero and Few Shot Learning
INTERSPEECH 2022
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis
INTERSPEECH 2021
A Two-Stage Approach to Speech Bandwidth Extension
INTERSPEECH 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
INTERSPEECH 2021
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding
INTERSPEECH 2021
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency
INTERSPEECH 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios
INTERSPEECH 2021
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition
INTERSPEECH 2021
Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning
INTERSPEECH 2021
Interactive Text-to-Speech System via Joint Style Analysis
INTERSPEECH 2020
Weak-Attention Suppression for Transformer Based Speech Recognition
INTERSPEECH 2020
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR
INTERSPEECH 2020
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR
INTERSPEECH 2019