Papers
Adjusting the Frame: Biphasic Performative Control of Speech Rhythm
Samuel Delalez, Christophe d’Alessandro
A Dual Source-Filter Model of Snore Audio for Snorer Group Classification
Achuth Rao M.V., Shivani Yadav, Prasanta Kumar Ghosh
Advances in Joint CTC-Attention Based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
Takaaki Hori, Shinji Watanabe, Yu Zhang et al.
Adversarial Auto-Encoders for Speech Based Emotion Recognition
Saurabh Sahu, Rahul Gupta, Ganesh Sivaraman et al.
Adversarial Network Bottleneck Features for Noise Robust Speaker Verification
Hong Yu, Zheng-Hua Tan, Zhanyu Ma et al.
Aerodynamic Features of French Fricatives
Rosario Signorello, Sergio Hassid, Didier Demolin
A Fast Robust 1D Flow Model for a Self-Oscillating Coupled 2D FEM Vocal Fold Simulation
Arvind Vasudevan, Victor Zappi, Peter Anderson et al.
A Fully Convolutional Neural Network for Speech Enhancement
Se Rim Park, Jin Won Lee
A Gender Bias in the Acoustic-Melodic Features of Charismatic Speech?
Eszter Novák-Tót, Oliver Niebuhr, Aoju Chen
A Generative Model for Score Normalization in Speaker Recognition
Albert Swart, Niko Brümmer
A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis
Srikanth Ronanki, Oliver Watts, Simon King
Alternative Approaches to Neural Network Based Speaker Verification
Anna Silnova, Lukáš Burget, Jan Černocký
A Mask Estimation Method Integrating Data Field Model for Speech Enhancement
Xianyun Wang, Changchun Bao, Feng Bao
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation
Yannan Wang, Jun Du, Li-Rong Dai et al.
A Modulation Property of Time-Frequency Derivatives of Filtered Phase and its Application to Aperiodicity and foEstimation
Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise et al.
A Mostly Data-Driven Approach to Inverse Text Normalization
Ernest Pusateri, Bharat Ram Ambati, Elizabeth Brooks et al.
A Mouth Opening Effect Based on Pole Modification for Expressive Singing Voice Transformation
Luc Ardaillon, Axel Roebel
An Affect Prediction Approach Through Depression Severity Parameter Incorporation in Neural Networks
Rahul Gupta, Saurabh Sahu, Carol Espy-Wilson et al.
Analysis and Description of ABC Submission to NIST SRE 2016
Oldřich Plchot, Pavel Matějka, Anna Silnova et al.
Analysis of Acoustic-to-Articulatory Speech Inversion Across Different Accents and Languages
Ganesh Sivaraman, Carol Espy-Wilson, Martijn Wieling
Analysis of Engagement and User Experience with a Laughter Responsive Social Robot
Bekir Berker Türker, Zana Buçinca, Engin Erzin et al.
Analysis of Score Normalization in Multilingual Speaker Recognition
Pavel Matějka, Ondřej Novotný, Oldřich Plchot et al.
Analysis of the Relationship Between Prosodic Features of Fillers and its Forms or Occurrence Positions
Shizuka Nakamura, Ryosuke Nakanishi, Katsuya Takanashi et al.