Papers
Investigating Parameter Sharing in Multilingual Speech Translation
Qian Wang, Chen Wang, Jiajun Zhang
Investigating perception of spoken dialogue acceptability through surprisal
Sarenne Carrol Wallbridge, Catherine Lai, Peter Bell
Investigating phonetic convergence of laughter in conversation
Bogdan Ludusan, Marin Schröer, Petra Wagner
Investigating Prosodic Variation in British English Varieties using ProPer
Hae-Sung Jeon, Stephen Nichols
Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition
Lester Phillip Violeta, Wen Chin Huang, Tomoki Toda
Investigating the contribution of speaker attributes to speaker separability using disentangled speaker representations
Chau Luu, Steve Renals, Peter Bell
Investigating the Impact of Crosslingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition
Muhammad Umar Farooq, Thomas Hain
Investigating the Impact of Speech Compression on the Acoustics of Dysarthric Speech
Kelvin Tran, Lingfeng Xu, Gabriela Stegmann et al.
Investigating the influence of personality on acoustic-prosodic entrainment
Andreas Weise, Rivka Levitan
Investigation into Target Speaking Rate Adaptation for Voice Conversion
Michael Kuhlmann, Fritz Seebauer, Janek Ebbers et al.
Investigation of Ensemble features of Self-Supervised Pretrained Models for Automatic Speech Recognition
A Arunkumar, Vrunda Nileshkumar Sukhadia, Srinivasan Umesh
Investigation on the Band Importance of Phase-aware Speech Enhancement
Zhuohuang Zhang, Donald Williamson, Yi Shen
Isochronous is beautiful? Syllabic event detection in a neuro-inspired oscillatory model is facilitated by isochrony in speech
Mamady NABE, Julien Diard, Jean-Luc Schwartz
Isochrony-Aware Neural Machine Translation for Automatic Dubbing
Derek Tam, Surafel M. Lakew, Yogesh Virkar et al.
Iterative Sound Source Localization for Unknown Number of Sources
Yanjie Fu, Meng Ge, Haoran Yin et al.
Japanese ASR-Robust Pre-trained Language Model with Pseudo-Error Sentences Generated by Grapheme-Phoneme Conversion
Yasuhito Ohsugi, Itsumi Saito, Kyosuke Nishida et al.
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
Dan Lim, Sunghee Jung, Eesung Kim
J-MAC: Japanese multi-speaker audiobook corpus for speech synthesis
Shinnosuke Takamichi, Wataru Nakata, Naoko Tanji et al.
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification
Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez et al.
Joint Encoder-Decoder Self-Supervised Pre-training for ASR
A Arunkumar, Srinivasan Umesh
Joint Estimation of Direction-of-Arrival and Distance for Arrays with Directional Sensors based on Sparse Bayesian Learning
Feifei Xiong, Pengyu Wang, Zhongfu Ye et al.
Joint Modeling of Multi-Sample and Subband Signals for Fast Neural Vocoding on CPU
Hiroki Kanagawa, Yusuke Ijima, Hiroyuki Toda
Joint Neural AEC and Beamforming with Double-Talk Detection
Vinay Kothapally, YONG XU, Meng Yu et al.
Joint Optimization of Sampling Rate Offsets Based on Entire Signal Relationship Among Distributed Microphones
Yoshiki Masuyama, Kouei Yamaoka, Nobutaka Ono
Joint Optimization of the Module and Sign of the Spectral Real Part Based on CRN for Speech Denoising.
Zilu Guo, Xu Xu, Zhongfu Ye