conftrace
Speech Processing
24 papers
Papers per year
2020: 1
2021: 1
2022: 1
2024: 2
2025: 1
2026: 18
Papers
SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization
ACL 2026
SDiaReward: Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness
ACL 2026
XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs
ACL 2026
VoxMind: An End-to-End Agentic Spoken Dialogue System
ACL 2026
An Exploration of Mamba for Speech Self-Supervised Models
ACL 2026
Protecting Bystander Privacy via Selective Hearing in Audio LLMs
ACL 2026
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model
ACL 2026
LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech
ACL 2026
Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
ACL 2026
Difference in Task Performance on Sparse Speech Representations
ACL 2026
SpeechMedAssist: Efficiently and Effectively Adapting Speech Language Models for Medical Consultation
ACL 2026
Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
ACL 2026
Temporal Precision Matters: Brain-Tuning Speech Language Models with Millisecond-Resolution Neural Signals
ACL 2026
CIS-BWE: Chaos-Informed Speech Bandwidth Extension
ACL 2026
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant
ACL 2026
Praat++: Multimedia Annotation System for Speech and Vocalization
ACL 2026
CBAL: Context-Based Agentic Learning for Speaker Diarization Segmentation Refinement
ACL 2026
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models
ACL 2026
PACHAT: Persona-Aware Speech Assistant for Multi-party Dialogue
EMNLP 2025
CA-SSLR: Condition-Aware Self-Supervised Learning Representation for Generalized Speech Processing
NeurIPS 2024
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech
ACL 2024
Unsupervised Word Segmentation using K Nearest Neighbors
INTERSPEECH 2022
Token-Level Supervised Contrastive Learning for Punctuation Restoration
INTERSPEECH 2021
Learning Spoken Language Representations with Neural Lattice Language Modeling
ACL 2020