conftrace_

Yuchen Hu

24 papers · 2021–2025 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+9 more ↓

🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🐝 Cross-Pollinator (12)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (6) 🤝 Dynamic Duo (19) 🏆 Keyword Champion (5) ⚡ Prolific Year (8) 🗃️ Keyword Collector (100) 💎 Century Club (24) 🔥 Unstoppable (5) ❓ The Questioner

Conferences

ACL (10) ICLR (4) INTERSPEECH (4) AAAI (2) NIPS (2) IJCAI (1) IJCNLP (1)

Top co-authors

Chen Chen (19) Eng Siong Chng (12) Ruizhe Li (9) Chao-Han Huck Yang (7) Chengwei Qin (6) Heqing Zou (5) Qiushi Zhu (4) EngSiong Chng (4) Pin-Yu Chen (4) Chao Zhang (3)

Research topics

Speech & Audio (1)

Keywords

multimodal learning (7) audio-visual speech recognition (5) automatic speech recognition (4) large language model (4) catastrophic forgetting (3) contrastive learning (3) multimodal fusion (3) representation learning (3) speech enhancement (2) n-best hypothesis (2) continual learning (2) speech translation (2) unsupervised domain adaptation (2) simultaneous translation (2) noise-robust speech recognition (2) in-context learning (1) data augmentation (1) machine translation (1) knowledge distillation (1) adversarial learning (1)

Papers

AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting ACL 2025 Beyond Output Matching: Bidirectional Alignment for Enhanced In-Context Learning ACL 2025 GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling ICLR 2025 Audio Large Language Models Can Be Descriptive Speech Quality Evaluators ICLR 2025 Relevant or Random: Can LLMs Truly Perform Analogical Reasoning? ACL 2025 Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models ACL 2024 Noise-aware Speech Enhancement using Diffusion Probabilistic Model INTERSPEECH 2024 Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models NIPS 2024 Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation AAAI 2024 GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators ACL 2024 Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System ACL 2024 Large Language Models are Efficient Learners of Noise-Robust Speech Recognition ICLR 2024 It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition ICLR 2024 Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition ACL 2023 A Neural State-Space Modeling Approach to Efficient Speech Separation INTERSPEECH 2023 HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models NIPS 2023 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition INTERSPEECH 2023 UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning ACL 2023 MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition ACL 2023 Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning AAAI 2023 Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition IJCAI 2023 Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning INTERSPEECH 2022 The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 ACL 2021 The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 IJCNLP 2021