Yaowei Li
17 papers · 2023–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Cross-Pollinator (12) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (8) π Renaissance Researcher (8)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
π€
Dynamic Duo
(13)
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(80)
π
Century Club
(17)
Conferences
AAAI (6)
INTERSPEECH (3)
CVPR (2)
ICCV (2)
ACL (1)
COLING (1)
EMNLP (1)
ICLR (1)
Top co-authors
Keywords
spoken language understanding
(4)
contrastive learning
(4)
multimodal learning
(3)
slot filling
(3)
intent detection
(3)
curriculum learning
(2)
multi-intent detection
(2)
multi-task learning
(2)
vision-language model
(2)
video grounding
(2)
cross-modal learning
(2)
cross-modal alignment
(2)
medical imaging
(1)
video synthesis
(1)
catastrophic forgetting
(1)
optimal transport
(1)
continual learning
(1)
representation learning
(1)
game theory
(1)
cross-lingual transfer
(1)
Papers
DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
ICLR 2025
DM-Adapter: Domain-Aware Mixture-of-Adapters for Text-Based Person Retrieval
AAAI 2025
Image Conductor: Precision Control for Interactive Video Synthesis
AAAI 2025
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images
CVPR 2025
Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling
COLING 2024
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport
AAAI 2024
Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup
ACL 2024
AlignerΒ²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment
AAAI 2024
Exploiting Auxiliary Caption for Video Grounding
AAAI 2024
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning
AAAI 2024
GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering
INTERSPEECH 2023
Efficient Multimodal Fusion via Interactive Prompting
CVPR 2023
Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation
EMNLP 2023
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
ICCV 2023
G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
ICCV 2023
FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding
INTERSPEECH 2023
CΒ²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding
INTERSPEECH 2023