Paavo Alku

28 papers · 2016–2024 · 1 conference · across top CS/AI conferences

Achievements

+12 more ↓

🏃 Academic Marathon (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🐣 Hot Topic Early Bird

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (28) 🤝 Dynamic Duo (12) 🧬 Topic Evolution 🏆 Keyword Champion (3) 🚀 Conference Pioneer ⚡ Prolific Year (8) 📈 Trend Setter 🗃️ Keyword Collector (106) 🔥 Unstoppable (5) 💎 Century Club (28)

Conferences

INTERSPEECH (28)

Top co-authors

Lauri Juvela (12) Manu Airaksinen (8) Sudarsana Reddy Kadiri (7) Bajibabu Bollepalli (6) Junichi Yamagishi (5) Okko Rasanen (4) Manila Kodali (4) Dhananjaya Gowda (2) Emma Jokinen (2) Shreyas Seshadri (2)

Keywords

deep neural network (6) support vector machine (5) speech synthesis (5) glottal inverse filtering (5) lombard speech (4) linear prediction (3) mel-frequency cepstral coefficient (3) speech enhancement (3) spectral tilt (3) glottal excitation (3) acoustic feature (3) speech classification (3) speech analysis (3) speech signal (2) speech processing (2) generative adversarial network (2) speech intelligibility (2) text-to-speech synthesis (2) convolutional neural network (2) voice conversion (2)

Papers

Fine-tuning of Pre-trained Models for Classification of Vocal Intensity Category from Speech Signals INTERSPEECH 2024 Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features INTERSPEECH 2023 Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings INTERSPEECH 2023 Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers INTERSPEECH 2022 Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals INTERSPEECH 2022 Parkinson’s Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients INTERSPEECH 2020 Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion INTERSPEECH 2019 GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram INTERSPEECH 2019 Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System INTERSPEECH 2019 Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech INTERSPEECH 2019 Dysarthric Speech Classification Using Glottal Features Computed from Non-words, Words and Sentences INTERSPEECH 2018 Speaker-independent Raw Waveform Model for Glottal Excitation INTERSPEECH 2018 Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech INTERSPEECH 2018 Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs INTERSPEECH 2017 Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs INTERSPEECH 2017 Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System INTERSPEECH 2017 Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions INTERSPEECH 2017 Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions INTERSPEECH 2017 Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis INTERSPEECH 2017 Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network INTERSPEECH 2017 Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering INTERSPEECH 2016 The Use of Read versus Conversational Lombard Speech in Spectral Tilt Modeling for Intelligibility Enhancement in Near-End Noise Conditions INTERSPEECH 2016 Intelligibility Enhancement at the Receiving End of the Speech Transmission System — Effects of Far-End Noise Reduction INTERSPEECH 2016 GlottDNN — A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis INTERSPEECH 2016 Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks INTERSPEECH 2016 Analysis of Face Mask Effect on Speaker Recognition INTERSPEECH 2016 Time-Varying Quasi-Closed-Phase Weighted Linear Prediction Analysis of Speech for Accurate Formant Detection and Tracking INTERSPEECH 2016 Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization INTERSPEECH 2016