Speech & Audio
6,389 papers (14 classified directly here)
Papers per year
1
5
6
2
7
5
3
6
3
2
2
441
383
460
560
704
742
913
936
749
360
99
'10
'15
'20
'25
Papers (including subtopics)
VoxSim: A perceptual voice similarity dataset
INTERSPEECH 2024
IndicMOS: Multilingual MOS Prediction for 7 Indian languages
INTERSPEECH 2024
Disentangling prosody and timbre embeddings via voice conversion
INTERSPEECH 2024
G2PA: G2P with Aligned Audio for Mandarin Chinese
INTERSPEECH 2024
Positional Description for Numerical Normalization
INTERSPEECH 2024
Self-Train Before You Transcribe
INTERSPEECH 2024
Hierarchical Multi-Task Learning with CTC and Recursive Operation
INTERSPEECH 2024