conftrace_

Papers

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction (ACL 2026)
EGOILLUSION: Benchmarking Hallucinations in Egocentric Video Understanding (EMNLP 2025)
MULTIVOX: A Benchmark for Evaluating Voice Assistants for Multimodal Interactions (EMNLP 2025)
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark (ICLR 2025)
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs (ICLR 2025)
ProSE: Diffusion Priors for Speech Enhancement (NAACL 2025)
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models (ICLR 2024)
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions (ACL 2024)
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations (ACL 2024)
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities (EMNLP 2024)
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition (INTERSPEECH 2024)
Do Vision-Language Models Understand Compound Nouns? (NAACL 2024)
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP (NAACL 2024)
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network (EMNLP 2023)
DALE: Generative Data Augmentation for Low-Resource Legal NLP (EMNLP 2023)
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER (ACL 2023)
AdVerb: Visually Guided Audio Dereverberation (ICCV 2023)
MMER: Multimodal Multi-task Learning for Speech Emotion Recognition (INTERSPEECH 2023)