conftrace_

Speech & Audio › Processing ›

Speech Processing

24 papers

Papers per year

1

1

1

2

1

18

Papers

SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization ACL 2026

SDiaReward: Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness ACL 2026

XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs ACL 2026

VoxMind: An End-to-End Agentic Spoken Dialogue System ACL 2026

An Exploration of Mamba for Speech Self-Supervised Models ACL 2026

Protecting Bystander Privacy via Selective Hearing in Audio LLMs ACL 2026

POWSM: A Phonetic Open Whisper-Style Speech Foundation Model ACL 2026

LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech ACL 2026

Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models ACL 2026

Difference in Task Performance on Sparse Speech Representations ACL 2026

SpeechMedAssist: Efficiently and Effectively Adapting Speech Language Models for Medical Consultation ACL 2026

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction ACL 2026

Temporal Precision Matters: Brain-Tuning Speech Language Models with Millisecond-Resolution Neural Signals ACL 2026

CIS-BWE: Chaos-Informed Speech Bandwidth Extension ACL 2026

Speculative End-Turn Detector for Efficient Speech Chatbot Assistant ACL 2026

Praat++: Multimedia Annotation System for Speech and Vocalization ACL 2026

CBAL: Context-Based Agentic Learning for Speaker Diarization Segmentation Refinement ACL 2026

Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models ACL 2026

PACHAT: Persona-Aware Speech Assistant for Multi-party Dialogue EMNLP 2025

CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing NIPS 2024

Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech ACL 2024

Unsupervised Word Segmentation using K Nearest Neighbors INTERSPEECH 2022

Token-Level Supervised Contrastive Learning for Punctuation Restoration INTERSPEECH 2021

Learning Spoken Language Representations with Neural Lattice Language Modeling ACL 2020