Papers
8,761 papers found
A Method of Audio-Visual Person Verification by Mining Connections between Time Series
Peiwen Sun, Shanshan Zhang, Zishan Liu et al.
A Metric-Driven Approach to Conformer Layer Pruning for Efficient ASR Inference
Dhanush Bekal, Karthik Gopalakrishnan, Karel Mundnich et al.
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization
Edward Fish, Umberto Michieli, Mete Ozay
A More Accurate Internal Language Model Score Estimation for the Hybrid Autoregressive Transducer
Kyungmin Lee, Haeri Kim, Sichen Jin et al.
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
Pin-Jui Ku, Chao-Han Huck Yang, Sabato Siniscalchi et al.
A Multimodal Investigation of Speech, Text, Cognitive and Facial Video Features for Characterizing Depression With and Without Medication
Michael Neumann, Hardik Kothare, Doug Habberstad et al.
A multimodal prototypical approach for unsupervised sound classification
Saksham Singh Kushwaha, Magdalena Fuentes
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting
Haotian Wang, Jun Du, Hengshun Zhou et al.
A Multi-Scale Attentive Transformer for Multi-Instrument Symbolic Music Generation
Xipin Wei, Junhui Chen, Zirui Zheng et al.
A Multi-Task Learning Framework for Sound Event Detection using High-level Acoustic Characteristics of Sounds
Tanmay Khandelwal, Rohan Kumar Das
An Acoustic Analysis of Fricative Variation in Three Accents of English
Roland Adams, Calbert Graham
Analysis and automatic prediction of exertion from speech: Contrasting objective and subjective measures collected while running
Andreas Triantafyllopoulos, Alexander Gebhard, Alexander Kathan et al.
Analysis of Acoustic information in End-to-End Spoken Language Translation
Gerard Sant, Carlos Escolano
Analysis of Mean Opinion Scores in Subjective Evaluation of Synthetic Speech Based on Tail Probabilities
Yusuke Yasuda, Tomoki Toda
An Analysis of Glottal Features of Chronic Kidney Disease Speech and Its Application to CKD Detection
Jihyun Mun, Sunhee Kim, Myeong Ju Kim et al.
An Analysis of Goodness of Pronunciation for Child Speech
Xinwei Cao, Zijian Fan, Torbjørn Svendsen et al.
An ASR-enabled Reading Tutor: Investigating Feedback to Optimize Interaction for Learning to Read
Yu Bai, Ferdy Hubers, Catia Cucchiarini et al.
An Automatic Multimodal Approach to Analyze Linguistic and Acoustic Cues on Parkinson's Disease Patients
Daniel Escobar-Grisales, Tomás Arias-Vergara, Cristian David Ríos-Urrego et al.
An Autoregressive Conversational Dynamics Model for Dialogue Systems
Matthew McNeill, Rivka Levitan
An Efficient and Noise-Robust Audiovisual Encoder for Audiovisual Speech Recognition
Zhengyang Li, Chenwei Liang, Timo Lohrenz et al.
An Efficient Approach for the Automated Segmentation and Transcription of the People's Speech Sorpus
Astik Biswas, Abdelmoumene Boumadane, Stephane Peillon et al.
An Enhanced Res2Net with Local and Global Feature Fusion for Speaker Verification
Yafeng Chen, Siqi Zheng, Hui Wang et al.
An Equitable Framework for Automatically Assessing Children's Oral Narrative Language Abilities
Alexander Johnson, Hariram Veeramani, Natarajan Balaji Shankar et al.
A neural architecture for selective attention to speech features
Nika Jurov, William Idsardi, Naomi H. Feldman