conftrace_

← Application Areas

Machine Learning › Application Areas ›

Efficient Computing

6,876 papers

Papers per year

Papers

Improving Audio Classification with Low-Sampled Microphone Input: An Empirical Study Using Model Self-Distillation INTERSPEECH 2024

Study Selectively: An Adaptive Knowledge Distillation based on a Voting Network for Heart Sound Classification INTERSPEECH 2024

DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion INTERSPEECH 2024

SEQ-former: A context-enhanced and efficient automatic speech recognition framework INTERSPEECH 2024

Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask INTERSPEECH 2024

Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm INTERSPEECH 2024

Parameter-Efficient Adapter Based on Pre-trained Models for Speech Translation INTERSPEECH 2024

Low-Complexity Acoustic Scene Classification Using Parallel Attention-Convolution Network INTERSPEECH 2024

Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds INTERSPEECH 2024

Sub-PNWR: Speech Enhancement Based on Signal Sub-Band Splitting and Pseudo Noisy Waveform Reconstruction Loss INTERSPEECH 2024

SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR INTERSPEECH 2024

Automatic Detection of Hearing Loss from Children's Speech using wav2vec 2.0 Features INTERSPEECH 2024

Faster Vocoder: a multi threading approach to achieve low latency during TTS Inference INTERSPEECH 2024

Mobile PresenTra: NICT fast neural text-to-speech system on smartphones with incremental inference of MS-FC-HiFi-GAN for law-latency synthesis INTERSPEECH 2024

Streaming Audio Transformers for Online Audio Tagging INTERSPEECH 2024

Efficient CNNs with Quaternion Transformations and Pruning for Audio Tagging INTERSPEECH 2024

Efficient Audio Captioning with Encoder-Level Knowledge Distillation INTERSPEECH 2024

tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models INTERSPEECH 2024

A Low-Bitrate Neural Audio Codec Framework with Bandwidth Reduction and Recovery for High-Sampling-Rate Waveforms INTERSPEECH 2024

Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions INTERSPEECH 2024

Edged based audio-visual speech enhancement demonstrator INTERSPEECH 2024

Adapter Learning from Pre-trained Model for Robust Spoof Speech Detection INTERSPEECH 2024

RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement INTERSPEECH 2024

MultiStage Speech Bandwidth Extension with Flexible Sampling Rate Control INTERSPEECH 2024

Leveraging Adapter for Parameter-Efficient ASR Encoder INTERSPEECH 2024