Shang-Wen Li
35 papers · 2020–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird
🌍
Conference Polyglot
(10)
🏃
Academic Marathon
(5)
🐝
Cross-Pollinator
(13)
🤝
Dynamic Duo
(15)
👥
Mega-Team
(20)
🔬
Deep Specialist
(11)
🏆
Keyword Champion
(2)
🗃️
Keyword Collector
(130)
📈
Trend Setter
⚡
Prolific Year
(6)
🔥
Unstoppable
(6)
💎
Century Club
(35)
Conferences
INTERSPEECH (11)
ACL (7)
NAACL (6)
EMNLP (4)
IJCNLP (2)
CVPR (1)
EACL (1)
ICLR (1)
ICML (1)
NIPS (1)
Top co-authors
Research topics
Keywords
self-supervised learning
(9)
transfer learning
(6)
contrastive learning
(5)
domain adaptation
(4)
representation learning
(4)
zero-shot learning
(4)
automatic speech recognition
(4)
few-shot learning
(4)
natural language processing
(3)
language model pretraining
(3)
speech processing
(3)
knowledge distillation
(3)
spoken language understanding
(3)
unsupervised learning
(2)
language model
(2)
continual learning
(2)
synthetic datum
(2)
domain generalization
(2)
semantic similarity
(2)
bias mitigation
(2)
Papers
Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls
EMNLP 2025
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models
ICML 2025
GSQA: An End-to-End Model for Generative Spoken Question Answering
INTERSPEECH 2024
Altogether: Image Captioning via Re-aligning Alt-text
EMNLP 2024
Demystifying CLIP Data
ICLR 2024
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
ACL 2024
MoDE: CLIP Data Experts via Clustering
CVPR 2024
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model
INTERSPEECH 2023
MAViL: Masked Audio-Video Learners
NIPS 2023
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target
INTERSPEECH 2023
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark
INTERSPEECH 2023
Introducing Semantics into Speech Encoders
ACL 2023
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering
ACL 2023
Self-supervised Representation Learning for Speech Processing
NAACL 2022
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities
ACL 2022
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora
ACL 2022
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition
INTERSPEECH 2022
An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
INTERSPEECH 2022
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering
INTERSPEECH 2022
Cooperative Self-training of Machine Reading Comprehension
NAACL 2022
Meta Learning for Natural Language Processing: A Survey
NAACL 2022
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings
NAACL 2022
Lifelong Pretraining: Continually Adapting Language Models to Emerging Corpora
NAACL 2022
Mitigating Biases in Toxic Language Detection through Invariant Rationalization
ACL 2021
Meta Learning and Its Applications to Natural Language Processing
ACL 2021
Pairwise Supervised Contrastive Learning of Sentence Representations
EMNLP 2021
SUPERB: Speech Processing Universal PERformance Benchmark
INTERSPEECH 2021
Joint Retrieval-Extraction Training for Evidence-Aware Dialog Response Selection
INTERSPEECH 2021
Supporting Clustering with Contrastive Learning
NAACL 2021
Zero-shot Generalization in Dialog State Tracking through Generative Question Answering
EACL 2021
Meta Learning and Its Applications to Natural Language Processing
IJCNLP 2021
Mitigating Biases in Toxic Language Detection through Invariant Rationalization
IJCNLP 2021
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks
EMNLP 2020
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption
INTERSPEECH 2020
Style Attuned Pre-Training and Parameter Efficient Fine-Tuning for Spoken Language Understanding
INTERSPEECH 2020