Yuchen Hu
24 papers · 2021–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Conference Polyglot (7) π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (11) π Cross-Pollinator (12)
π
Cross-Pollinator
(12)
π
Renaissance Researcher
(6)
π€
Dynamic Duo
(19)
π
Keyword Champion
(5)
β‘
Prolific Year
(8)
ποΈ
Keyword Collector
(100)
π
Century Club
(24)
π₯
Unstoppable
(5)
β
The Questioner
Conferences
ACL (10)
ICLR (4)
INTERSPEECH (4)
AAAI (2)
NIPS (2)
IJCAI (1)
IJCNLP (1)
Top co-authors
Research topics
Keywords
multimodal learning
(7)
audio-visual speech recognition
(5)
automatic speech recognition
(4)
large language model
(4)
catastrophic forgetting
(3)
contrastive learning
(3)
multimodal fusion
(3)
representation learning
(3)
speech enhancement
(2)
n-best hypothesis
(2)
continual learning
(2)
speech translation
(2)
unsupervised domain adaptation
(2)
simultaneous translation
(2)
noise-robust speech recognition
(2)
in-context learning
(1)
data augmentation
(1)
machine translation
(1)
knowledge distillation
(1)
adversarial learning
(1)
Papers
AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting
ACL 2025
Beyond Output Matching: Bidirectional Alignment for Enhanced In-Context Learning
ACL 2025
GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling
ICLR 2025
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
ICLR 2025
Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?
ACL 2025
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models
ACL 2024
Noise-aware Speech Enhancement using Diffusion Probabilistic Model
INTERSPEECH 2024
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models
NIPS 2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation
AAAI 2024
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators
ACL 2024
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System
ACL 2024
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition
ICLR 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
ICLR 2024
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
ACL 2023
A Neural State-Space Modeling Approach to Efficient Speech Separation
INTERSPEECH 2023
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models
NIPS 2023
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition
INTERSPEECH 2023
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning
ACL 2023
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
ACL 2023
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning
AAAI 2023
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition
IJCAI 2023
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning
INTERSPEECH 2022
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
ACL 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
IJCNLP 2021