conftrace_

Takuya Yoshioka

29 papers · 2016–2024 · 4 top CS/AI conferences

Achievements

🌍 Conference Polyglot (4) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (19) 🏃 Academic Marathon (8) 🏠 Conference Loyalist (26) 🔬 Deep Specialist (12) 👥 Mega-Team (20) 🤝 Dynamic Duo (15) 🔥 Unstoppable (7) ⚡ Prolific Year (5) 📈 Trend Setter 🚀 Conference Pioneer 💎 Century Club (29) 🗃️ Keyword Collector (54)

Conferences

INTERSPEECH (26) AAAI (1) EMNLP (1) NAACL (1)

Papers

Target conversation extraction: Source separation using turn-taking dynamics · INTERSPEECH 2024
i-Code Studio: A Configurable and Composable Framework for Integrative AI · EMNLP 2024
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data · NAACL 2024
Knowledge boosting during low-latency inference · INTERSPEECH 2024
i-Code: An Integrative and Composable Multimodal Learning Framework · AAAI 2023
Factual Consistency Oriented Speech Recognition · INTERSPEECH 2023
Real-Time Joint Personalized Speech Enhancement and Acoustic Echo Cancellation · INTERSPEECH 2023
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers · INTERSPEECH 2023
Speaker Diarization for ASR Output with T-vectors: A Sequence Classification Approach · INTERSPEECH 2023
Streaming Multi-Talker ASR with Token-Level Serialized Output Training · INTERSPEECH 2022
Leveraging Real Conversational Data for Multi-Channel Continuous Speech Separation · INTERSPEECH 2022
Separating Long-Form Speech with Group-wise Permutation Invariant Training · INTERSPEECH 2022
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings · INTERSPEECH 2022
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation · INTERSPEECH 2022
Investigation of Practical Aspects of Single Channel Speech Separation for ASR · INTERSPEECH 2021
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone · INTERSPEECH 2021
End-to-End Speaker-Attributed ASR with Transformer · INTERSPEECH 2021
Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement · INTERSPEECH 2021
Ultra Fast Speech Separation Model with Teacher Student Learning · INTERSPEECH 2021
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers · INTERSPEECH 2020
Neural Speech Separation Using Spatially Distributed Microphones · INTERSPEECH 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition · INTERSPEECH 2020
An End-to-End Architecture of Online Multi-Channel Speech Separation · INTERSPEECH 2020
Meeting Transcription Using Asynchronous Distant Microphones · INTERSPEECH 2019
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks · INTERSPEECH 2018
Investigations on Data Augmentation and Loss Functions for Deep Learning Based Speech-Background Separation · INTERSPEECH 2018
Optimization of Speech Enhancement Front-End with Speech Recognition-Level Criterion · INTERSPEECH 2016
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement · INTERSPEECH 2016
Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models · INTERSPEECH 2016