Shang-Wen Li

35 papers · 2020–2025 · 10 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (13) 🤝 Dynamic Duo (15) 👥 Mega-Team (20) 🔬 Deep Specialist (11) 🏆 Keyword Champion (2) 🗃️ Keyword Collector (130) 📈 Trend Setter ⚡ Prolific Year (6) 🔥 Unstoppable (6) 💎 Century Club (35)

Conferences

INTERSPEECH (11) ACL (7) NAACL (6) EMNLP (4) IJCNLP (2) CVPR (1) EACL (1) ICLR (1) ICML (1) NIPS (1)

Top co-authors

Hung-yi Lee (15) James Glass (8) Yung-Sung Chuang (7) Hongyin Luo (7) Abdelrahman Mohamed (7) Guan-Ting Lin (6) Henghui Zhu (5) Hu Xu (5) Po-Yao Huang (5) Shinji Watanabe (4)

Research topics

Resources & Methods (1) Speech & Audio (1)

Keywords

self-supervised learning (9) transfer learning (6) contrastive learning (5) domain adaptation (4) representation learning (4) zero-shot learning (4) automatic speech recognition (4) few-shot learning (4) natural language processing (3) language model pretraining (3) speech processing (3) knowledge distillation (3) spoken language understanding (3) unsupervised learning (2) language model (2) continual learning (2) synthetic data (2) domain generalization (2) semantic similarity (2) bias mitigation (2)

Papers

Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls EMNLP 2025
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models ICML 2025
GSQA: An End-to-End Model for Generative Spoken Question Answering INTERSPEECH 2024
Altogether: Image Captioning via Re-aligning Alt-text EMNLP 2024
Demystifying CLIP Data ICLR 2024
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild ACL 2024
MoDE: CLIP Data Experts via Clustering CVPR 2024
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model INTERSPEECH 2023
MAViL: Masked Audio-Video Learners NIPS 2023
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target INTERSPEECH 2023
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark INTERSPEECH 2023
Introducing Semantics into Speech Encoders ACL 2023
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering ACL 2023
Self-supervised Representation Learning for Speech Processing NAACL 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities ACL 2022
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora ACL 2022
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition INTERSPEECH 2022
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks INTERSPEECH 2022
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering INTERSPEECH 2022
Cooperative Self-training of Machine Reading Comprehension NAACL 2022
Meta Learning for Natural Language Processing: A Survey NAACL 2022
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings NAACL 2022
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora NAACL 2022
Mitigating Biases in Toxic Language Detection through Invariant Rationalization ACL 2021
Meta Learning and Its Applications to Natural Language Processing ACL 2021
Pairwise Supervised Contrastive Learning of Sentence Representations EMNLP 2021
SUPERB: Speech Processing Universal PERformance Benchmark INTERSPEECH 2021
Joint Retrieval-Extraction Training for Evidence-Aware Dialog Response Selection INTERSPEECH 2021
Supporting Clustering with Contrastive Learning NAACL 2021
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering EACL 2021
Meta Learning and Its Applications to Natural Language Processing IJCNLP 2021
Mitigating Biases in Toxic Language Detection through Invariant Rationalization IJCNLP 2021
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks EMNLP 2020
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption INTERSPEECH 2020
Style Attuned Pre-Training and Parameter Efficient Fine-Tuning for Spoken Language Understanding INTERSPEECH 2020