Paavo Alku
28 papers · 2016–2024 · 1 conference · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Academic Marathon (8) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π£ Hot Topic Early Bird
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Loyalist
(28)
π€
Dynamic Duo
(12)
π§¬
Topic Evolution
π
Keyword Champion
(3)
π
Conference Pioneer
β‘
Prolific Year
(8)
π
Trend Setter
ποΈ
Keyword Collector
(106)
π₯
Unstoppable
(5)
π
Century Club
(28)
Conferences
INTERSPEECH (28)
Top co-authors
Keywords
deep neural network
(6)
support vector machine
(5)
speech synthesis
(5)
glottal inverse filtering
(5)
lombard speech
(4)
linear prediction
(3)
mel-frequency cepstral coefficient
(3)
speech enhancement
(3)
spectral tilt
(3)
glottal excitation
(3)
acoustic feature
(3)
speech classification
(3)
speech analysis
(3)
speech signal
(2)
speech processing
(2)
generative adversarial network
(2)
speech intelligibility
(2)
text-to-speech synthesis
(2)
convolutional neural network
(2)
voice conversion
(2)
Papers
Fine-tuning of Pre-trained Models for Classification of Vocal Intensity Category from Speech Signals
INTERSPEECH 2024
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features
INTERSPEECH 2023
Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings
INTERSPEECH 2023
Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers
INTERSPEECH 2022
Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals
INTERSPEECH 2022
Parkinsonβs Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients
INTERSPEECH 2020
Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion
INTERSPEECH 2019
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram
INTERSPEECH 2019
Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System
INTERSPEECH 2019
Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech
INTERSPEECH 2019
Dysarthric Speech Classification Using Glottal Features Computed from Non-words, Words and Sentences
INTERSPEECH 2018
Speaker-independent Raw Waveform Model for Glottal Excitation
INTERSPEECH 2018
Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech
INTERSPEECH 2018
Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs
INTERSPEECH 2017
Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs
INTERSPEECH 2017
Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System
INTERSPEECH 2017
Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions
INTERSPEECH 2017
Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions
INTERSPEECH 2017
Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis
INTERSPEECH 2017
Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network
INTERSPEECH 2017
Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering
INTERSPEECH 2016
The Use of Read versus Conversational Lombard Speech in Spectral Tilt Modeling for Intelligibility Enhancement in Near-End Noise Conditions
INTERSPEECH 2016
Intelligibility Enhancement at the Receiving End of the Speech Transmission System β Effects of Far-End Noise Reduction
INTERSPEECH 2016
GlottDNN β A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis
INTERSPEECH 2016
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks
INTERSPEECH 2016
Analysis of Face Mask Effect on Speaker Recognition
INTERSPEECH 2016
Time-Varying Quasi-Closed-Phase Weighted Linear Prediction Analysis of Speech for Accurate Formant Detection and Tracking
INTERSPEECH 2016
Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization
INTERSPEECH 2016