Papers
8,761 papers found
Novel-view Acoustic Synthesis From 3D Reconstructed Rooms
Byeongjoo Ahn, Karren Yang, Brian Hamilton et al.
NumberLie: a game-based experiment to understand the acoustics of deception and truthfulness
Alessandro De Luca, Andrew Clark, Volker Dellwo
OCEAN-AI: open multimodal framework for personality traits assessment and HR-processes automatization
Elena Ryumina, Dmitry Ryumin, Alexey Karpov
On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations
Yaqian Hao, Chenguang Hu, Yingying Gao et al.
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
Christoph Boeddeker, Tobias Cord-Landwehr, Reinhold Haeb-Umbach
On Comparing Time- and Frequency-Domain Rhythm Measures in Classifying Assamese Dialects
Joyshree Chakraborty, Leena Dihingia, Priyankoo Sarmah et al.
On Disfluency and Non-lexical Sound Labeling for End-to-end Automatic Speech Recognition
Peter Mihajlik, Yan Meng, Mate S Kadar et al.
One-class learning with adaptive centroid shift for audio deepfake detection
Hyun Myung Kim, Kangwook Jang, Hoirin Kim
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
Zhaoqing Li, Haoning Xu, Tianzi Wang et al.
On Improving Error Resilience of Neural End-to-End Speech Coders
Kishan Gupta, Nicola Pia, Srikanth Korse et al.
Online Subloop Search via Uncertainty Quantization for Efficient Test-Time Adaptation
Jae-Hong Lee, Sang-Eon Lee, Dong-Hyun Kim et al.
On the calibration of powerset speaker diarization models
Alexis Plaquet, Hervé Bredin
On the Effectiveness of Acoustic BPE in Decoder-Only TTS
Bohan Li, Feiyu Shen, Yiwei Guo et al.
On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models
Jinchuan Tian, Yifan Peng, William Chen et al.
On the Encoding of Gender in Transformer-based ASR Representations
Aravind Krishnan, Badr M. Abdullah, Dietrich Klakow
On the impact of several regularization techniques on label noise robustness of self-supervised speaker verification systems
Abderrahim Fathan, Xiaolin Zhu, Jahangir Alam
On the relationship between speech production and vocabulary size in 3-5 year olds
Alexis DeMaere, Nicole van Rootselaar, Fangfang Li et al.
On the social bias of speech self-supervised models
Yi-Cheng Lin, Tzu-Quan Lin, Hsi-Che Lin et al.
On the Success and Limitations of Auxiliary Network Based Word-Level End-to-End Neural Speaker Diarization
Yiling Huang, Weiran Wang, Guanlong Zhao et al.
On the Usefulness of Speaker Embeddings for Speaker Retrieval in the Wild: A Comparative Study of x-vector and ECAPA-TDNN Models
Erfan Loweimi, Mengjie Qian, Kate Knill et al.
On the Use of Plausible Arguments in Explainable Conversational AI
Martina Di Bratto, Maria Di Maro, Antonio Origlia
Optical Flow Guided Tongue Trajectory Generation for Diffusion-based Acoustic to Articulatory Inversion
Yudong Yang, Rongfeng Su, Rukiye Ruzi et al.
Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies
Chung-Wen Wu, Berlin Chen