Siddhant Arora
28 papers · 2021–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Conference Polyglot (7) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (14) π Cross-Pollinator (12)
π§
Keyword Pioneer
π€
Dynamic Duo
(25)
π₯
Mega-Team
(76)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(107)
β‘
Prolific Year
(6)
β
The Questioner
π
Trend Setter
π
Century Club
(27)
Conferences
INTERSPEECH (13)
ACL (4)
NAACL (4)
ICLR (3)
EMNLP (2)
AAAI (1)
COLING (1)
Top co-authors
Keywords
spoken language understanding
(12)
automatic speech recognition
(7)
speech recognition
(5)
end-to-end model
(4)
benchmark evaluation
(2)
multi-task learning
(2)
connectionist temporal classification
(2)
speech translation
(2)
zero-shot learning
(2)
beam search
(2)
model compression
(2)
question answering
(1)
speech synthesis
(1)
topic modeling
(1)
neural network pruning
(1)
attention mechanism
(1)
model architecture
(1)
self-supervised learning
(1)
language modeling
(1)
masked language model
(1)
Papers
Full-Duplex-Bench-v2: A Multi-Turn Evaluation Framework for Duplex Dialogue Systems with an Automated Examiner
ACL 2026
Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks
ICLR 2025
VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music
NAACL 2025
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
ICLR 2025
ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
NAACL 2025
Context-aware Dynamic Pruning for Speech Foundation Models
ICLR 2025
ESPnet-SpeechLM: An Open Speech Language Model Toolkit
NAACL 2025
Creation and Analysis of an International Corpus of Privacy Laws
COLING 2024
Decoder-only Architecture for Streaming End-to-end Speech Recognition
INTERSPEECH 2024
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
INTERSPEECH 2024
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model
INTERSPEECH 2024
Rapid Language Adaptation for Multilingual E2E Speech Recognition Using Encoder Prompting
INTERSPEECH 2024
To what extent can ASV systems naturally defend against spoofing attacks?
INTERSPEECH 2024
UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions
NAACL 2024
On the Evaluation of Speech Foundation Models for Spoken Language Understanding
ACL 2024
Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding
INTERSPEECH 2023
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks
ACL 2023
CMUβs IWSLT 2023 Simultaneous Speech Translation System
ACL 2023
Tensor decomposition for minimization of E2E SLU model toward on-device processing
INTERSPEECH 2023
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
INTERSPEECH 2023
BASS: Block-wise Adaptation for Speech Summarization
INTERSPEECH 2023
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
INTERSPEECH 2023
Explain, Edit, and Understand: Rethinking User Study Design for Evaluating Model Explanations
AAAI 2022
Two-Pass Low Latency End-to-End Spoken Language Understanding
INTERSPEECH 2022
Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation
INTERSPEECH 2022
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
EMNLP 2022
Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models
EMNLP 2022
Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding
INTERSPEECH 2021