Papers
Speech Enhancement Using Bayesian Wavenet
Kaizhi Qian, Yang Zhang, Shiyu Chang et al.
Speech Enhancement Using Non-Negative Spectrogram Models with Mel-Generalized Cepstral Regularization
Li Li, Hirokazu Kameoka, Tomoki Toda et al.
Speech Intelligibility in Cars: The Effect of Speaking Style, Noise and Listener Age
Cassia Valentini Botinhao, Junichi Yamagishi
Speech Processing Approach for Diagnosing Dementia in an Early Stage
Roozbeh Sadeghian, J. David Schaffer, Stephen A. Zahorian
Speech Rate Comparison When Talking to a System and Talking to a Human: A Study from a Speech-to-Speech, Machine Translation Mediated Map Task
Hayakawa Akira, Carl Vogel, Saturnino Luz et al.
Speech Recognition and Understanding on Hardware-Accelerated DSP
Georg Stemmer, Munir Georges, Joachim Hofer et al.
Speech Representation Learning Using Unsupervised Data-Driven Modulation Filtering for Robust ASR
Purvi Agrawal, Sriram Ganapathy
Speech Synthesis for Mixed-Language Navigation Instructions
Khyathi Raghavi Chandu, SaiKrishna Rallabandi, Sunayana Sitaram et al.
Spoken Language Identification Using LSTM-Based Angular Proximity
G. Gelly, J.L. Gauvain
Spoof Detection Using Source, Instantaneous Frequency and Cepstral Features
Sarfaraz Jelil, Rohan Kumar Das, S.R. Mahadeva Prasanna et al.
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective
Raymond Brueckner, Maximilian Schmitt, Maja Pantic et al.
Stability of Prosodic Characteristics Across Age and Gender Groups
Jan Volín, Tereza Tykalová, Tomáš Bořil
Statistical Voice Conversion with WaveNet-Based Waveform Generation
Kazuhiro Kobayashi, Tomoki Hayashi, Akira Tamamori et al.
Stepsize Control for Acoustic Feedback Cancellation Based on the Detection of Reverberant Signal Periods and the Estimated System Distance
Philipp Bulling, Klaus Linhard, Arthur Wolf et al.
Stochastic Recurrent Neural Network for Speech Recognition
Jen-Tzung Chien, Chen Shen
Structured-Based Curriculum Learning for End-to-End English-Japanese Speech Translation
Takatomo Kano, Sakriani Sakti, Satoshi Nakamura
Student-Teacher Training with Diverse Decision Tree Ensembles
Jeremy H.M. Wong, Mark J.F. Gales
Studying the Link Between Inter-Speaker Coordination and Speech Imitation Through Human-Machine Interactions
Leonardo Lancia, Thierry Chaminade, Noël Nguyen et al.
Subband Selection for Binaural Speech Source Localization
Girija Ramesan Karthik, Prasanta Kumar Ghosh
Subject-Independent Classification of Japanese Spoken Sentences by Multiple Frequency Bands Phase Pattern of EEG Response During Speech Perception
Hiroki Watanabe, Hiroki Tanaka, Sakriani Sakti et al.
Subjective Intelligibility of Deep Neural Network-Based Speech Enhancement
Femke B. Gelderblom, Tron V. Tronstad, Erlend Magnus Viggen
Symbol Sequence Search from Telephone Conversation
Masayuki Suzuki, Gakuto Kurata, Abhinav Sethy et al.
Synthesising isiZulu-English Code-Switch Bigrams Using Word Embeddings
Ewald van der Westhuizen, Thomas Niesler
Synthesising Uncertainty: The Interplay of Vocal Effort and Hesitation Disfluencies
Éva Székely, Joseph Mendelson, Joakim Gustafson
Synthesis of VV Utterances from Muscle Activation to Sound with a 3D Model
Saeed Dabbaghchian, Marc Arnela, Olov Engwall et al.