conftrace_

Siddhant Arora

28 papers · 2021–2026 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+9 more ↓

🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (14) 🐝 Cross-Pollinator (12)

🧭 Keyword Pioneer 🤝 Dynamic Duo (25) 👥 Mega-Team (76) 🔥 Unstoppable (5) 🗃️ Keyword Collector (107) ⚡ Prolific Year (6) ❓ The Questioner 📈 Trend Setter 💎 Century Club (27)

Conferences

INTERSPEECH (13) ACL (4) NAACL (4) ICLR (3) EMNLP (2) AAAI (1) COLING (1)

Top co-authors

Shinji Watanabe (26) Yifan Peng (8) Yosuke Kashiwagi (8) Hayato Futami (8) Emiru Tsunoo (8) Jiatong Shi (8) Brian Yan (8) William Chen (7) Karen Livescu (4) Jee-weon Jung (4)

Keywords

spoken language understanding (12) automatic speech recognition (7) speech recognition (5) end-to-end model (4) benchmark evaluation (2) multi-task learning (2) connectionist temporal classification (2) speech translation (2) zero-shot learning (2) beam search (2) model compression (2) question answering (1) speech synthesis (1) topic modeling (1) neural network pruning (1) attention mechanism (1) model architecture (1) self-supervised learning (1) language modeling (1) masked language model (1)

Papers

Full-Duplex-Bench-v2: A Multi-Turn Evaluation Framework for Duplex Dialogue Systems with an Automated Examiner ACL 2026 Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks ICLR 2025 VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music NAACL 2025 Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics ICLR 2025 ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems NAACL 2025 Context-aware Dynamic Pruning for Speech Foundation Models ICLR 2025 ESPnet-SpeechLM: An Open Speech Language Model Toolkit NAACL 2025 Creation and Analysis of an International Corpus of Privacy Laws COLING 2024 Decoder-only Architecture for Streaming End-to-end Speech Recognition INTERSPEECH 2024 OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer INTERSPEECH 2024 Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model INTERSPEECH 2024 Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting INTERSPEECH 2024 To what extent can ASV systems naturally defend against spoofing attacks? INTERSPEECH 2024 UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions NAACL 2024 On the Evaluation of Speech Foundation Models for Spoken Language Understanding ACL 2024 Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding INTERSPEECH 2023 SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks ACL 2023 CMU’s IWSLT 2023 Simultaneous Speech Translation System ACL 2023 Tensor decomposition for minimization of E2E SLU model toward on-device processing INTERSPEECH 2023 Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition INTERSPEECH 2023 BASS: Block-wise Adaptation for Speech Summarization INTERSPEECH 2023 A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks INTERSPEECH 2023 Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations AAAI 2022 Two-Pass Low Latency End-to-End Spoken Language Understanding INTERSPEECH 2022 Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation INTERSPEECH 2022 BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model EMNLP 2022 Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models EMNLP 2022 Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding INTERSPEECH 2021