Papers
Human-in-the-Loop Efficiency Analysis for Binary Classification in Edyson
Per Fallgren, Jens Edlund
Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement
Sefik Emre Eskimez, Xiaofei Wang, Min Tang et al.
Human Spoofing Detection Performance on Degraded Speech
Camryn Terblanche, Philip Harrison, Amelia J. Gully
Human-to-Human Conversation Dataset for Learning Fine-Grained Turn-Taking Action
Kehan Chen, Zezhong Li, Suyang Dai et al.
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform
Junyi Peng, Xiaoyang Qu, Jianzong Wang et al.
Identification of F1 and F2 in Speech Using Modified Zero Frequency Filtering
RaviShankar Prasad, Mathew Magimai-Doss
Identifying Cognitive Impairment Using Sentence Representation Vectors
Bahman Mirheidari, Yilin Pan, Daniel Blackburn et al.
Identifying Conflict Escalation and Primates by Using Ensemble X-Vectors and Fisher Vector Features
José Vicente Egas-López, Mercedes Vetráb, László Tóth et al.
Identifying Indicators of Vulnerability from Short Speech Segments Using Acoustic and Textual Features
Xia Cui, Amila Gamage, Terry Hanley et al.
Image-Based Assessment of Jaw Parameters and Jaw Kinematics for Articulatory Simulation: Preliminary Results
Ajish K. Abraham, V. Sivaramakrishnan, N. Swapna et al.
Impact of Emotional State on Estimation of Willingness to Buy from Advertising Speech
Mizuki Nagano, Yusuke Ijima, Sadao Hiroya
Impact of Encoding and Segmentation Strategies on End-to-End Simultaneous Speech Translation
Ha Nguyen, Yannick Estève, Laurent Besacier
Implicit Filter-and-Sum Network for End-to-End Multi-Channel Speech Separation
Yi Luo, Nima Mesgarani
Importance of Parasagittal Sensor Information in Tongue Motion Capture Through a Diphonic Analysis
Salvador Medina, Sarah Taylor, Mark Tiede et al.
Improve Cross-Lingual Text-To-Speech Synthesis on Monolingual Corpora with Pitch Contour Information
Haoyue Zhan, Haitong Zhang, Wenjie Ou et al.
Improved Meta-Learning Training for Speaker Verification
Yafeng Chen, Wu Guo, Bin Gu
Improved Speech Enhancement Using a Complex-Domain GAN with Fused Time-Domain and Time-Frequency Domain Constraints
Feng Dang, Pengyuan Zhang, Hangting Chen
Improved Speech Separation with Time-and-Frequency Cross-Domain Feature Selection
Tian Lan, Yuxin Qian, Yilan Lyu et al.
Improvement of Automatic English Pronunciation Assessment with Small Number of Utterances Using Sentence Speakability
Satsuki Naijo, Akinori Ito, Takashi Nose
Improving Accent Identification and Accented Speech Recognition Under a Framework of Self-Supervised Learning
Keqi Deng, Songjun Cao, Long Ma
Improving Channel Decorrelation for Multi-Channel Target Speech Extraction
Jiangyu Han, Wei Rao, Yannan Wang et al.
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio
Gakuto Kurata, George Saon, Brian Kingsbury et al.
Improving Deep CNN Architectures with Variable-Length Training Samples for Text-Independent Speaker Verification
Yanfeng Wu, Junan Zhao, Chenkai Guo et al.
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions
Eric Sun, Jinyu Li, Zhong Meng et al.
Improving Multi-Speaker TTS Prosody Variance with a Residual Encoder and Normalizing Flows
Iván Vallés-Pérez, Julian Roth, Grzegorz Beringer et al.