Yaowei Li

17 papers · 2023–2025 · 8 conferences · across top CS/AI conferences

Achievements

+6 more ↓

🐝 Cross-Pollinator (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🌈 Renaissance Researcher (8)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🤝 Dynamic Duo (13) ⚡ Prolific Year (6) 🗃️ Keyword Collector (80) 💎 Century Club (17)

Conferences

AAAI (6) INTERSPEECH (3) CVPR (2) ICCV (2) ACL (1) COLING (1) EMNLP (1) ICLR (1)

Top co-authors

Yuexian Zou (13) Xuxin Cheng (13) Hongxiang Li (11) Zhihong Zhu (11) Ziyu Yao (4) Wanshi Xu (2) Meng Cao (2) Ying Shan (2) Zhaoyang Zhang (2) Bang Yang (2)

Keywords

spoken language understanding (4) contrastive learning (4) multimodal learning (3) slot filling (3) intent detection (3) curriculum learning (2) multi-intent detection (2) multi-task learning (2) vision-language model (2) video grounding (2) cross-modal learning (2) cross-modal alignment (2) medical imaging (1) video synthesis (1) catastrophic forgetting (1) optimal transport (1) continual learning (1) representation learning (1) game theory (1) cross-lingual transfer (1)

Papers

DisPose: Disentangling Pose Guidance for Controllable Human Image Animation ICLR 2025 DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval AAAI 2025 Image Conductor: Precision Control for Interactive Video Synthesis AAAI 2025 NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images CVPR 2025 Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling COLING 2024 Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport AAAI 2024 Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup ACL 2024 Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment AAAI 2024 Exploiting Auxiliary Caption for Video Grounding AAAI 2024 Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning AAAI 2024 GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering INTERSPEECH 2023 Efficient Multimodal Fusion via Interactive Prompting CVPR 2023 Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation EMNLP 2023 Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation ICCV 2023 G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory ICCV 2023 FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding INTERSPEECH 2023 C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding INTERSPEECH 2023