Papers
Relationship between the acoustic time intervals and tongue movements of German diphthongs
Arne-Lukas Fietkau, Simon Stone, Peter Birkholz
Relative Acoustic Features for Distance Estimation in Smart-Homes
Francesco Nespoli, Daniel Barreda, Patrick Naylor
Reliability criterion based on learning-phase entropy for speaker recognition with neural network
Pierre-Michel Bousquet, Mickael Rouvier, Jean-Francois Bonastre
Reliable Visualization for Deep Speaker Recognition
Pengqi Li, Lantian Li, Askar Hamdulla et al.
Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification
Jin Woo Lee, Eungbeom Kim, Junghyun Koo et al.
Representing 'how you say' with 'what you say': English corpus of focused speech and text reflecting corresponding implications
Naoaki Suzuki, Satoshi Nakamura
ResectNet: An Efficient Architecture for Voice Activity Detection on Mobile Devices
Okan Köpüklü, Maja Taseska
Residual Language Model for End-to-end Speech Recognition
Emiru Tsunoo, Yosuke Kashiwagi, Chaitanya Prasad Narisetty et al.
Response Timing Estimation for Spoken Dialog System using Dialog Act Estimation
Jin Sakuma, Shinya Fujie, Tetsunori Kobayashi
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
Dacheng Yin, Chuanxin Tang, Yanqing Liu et al.
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model
Martin Kocour, Katerina Zmolikova, Lucas Ondel et al.
Revisiting visuo-spatial processing in individuals with congenital amusia
Zixia Fan, Jing Shao, Weigong Pan et al.
REYD – The First Yiddish Text-to-Speech Dataset and System
Jacob Webber, Samuel K. Lo, Isaac L. Bleaman
RNN-T lattice enhancement by grafting of pruned paths
Mirek Novak, Pavlos Papadopoulos
RNN Transducers for Named Entity Recognition with constraints on alignment for understanding medical conversations
Hagen Soltau, Izhak Shafran, Mingqiu Wang et al.
Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics
Xueshuai Zhang, Jiakun Shen, Jun Zhou et al.
Robust End-to-end Speaker Diarization with Generic Neural Clustering
Chenyu Yang, Yu Wang
Robust Pitch Estimation Using Multi-Branch CNN-LSTM and 1-Norm LP Residual
Mudit D. Batra, Jayesh, C.S. Ramalingam
Robust Self-Supervised Audio-Visual Speech Recognition
Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
Hyunjae Cho, Wonbin Jung, Junhyeok Lee et al.
SAQAM: Spatial Audio Quality Assessment Metric
Pranay Manocha, Anurag Kumar, Buye Xu et al.
SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System
Zhongwei Teng, Quchen Fu, Jules White et al.
SASV 2022: The First Spoofing-Aware Speaker Verification Challenge
Jee-weon Jung, Hemlata Tak, Hye-jin Shim et al.
SASV Based on Pre-trained ASV System and Integrated Scoring Module
Yuxiang Zhang, Zhuo Li, Wenchao Wang et al.
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate
Nabarun Goswami, Tatsuya Harada