Shao-Yen Tseng
10 papers · 2016–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🐣 Hot Topic Early Bird
🏃
Academic Marathon
(8)
🐣
Hot Topic Early Bird
🌉
Interdisciplinary Bridge
🧬
Topic Evolution
🗃️
Keyword Collector
(66)
🚀
Conference Pioneer
💎
Century Club
(10)
📈
Trend Setter
❓
The Questioner
Conferences
INTERSPEECH (4)
CVPR (2)
AAAI (1)
ACL (1)
EMNLP (1)
NAACL (1)
Top co-authors
Keywords
multimodal learning
(4)
language model
(3)
vision-language pretraining
(2)
recurrent neural network
(2)
transformer architecture
(2)
cross-modal alignment
(2)
vision-language model
(2)
attention mechanism
(1)
self-supervised learning
(1)
image generation
(1)
depth estimation
(1)
natural language understanding
(1)
speech analysis
(1)
multiple instance learning
(1)
language modeling
(1)
object detection
(1)
convolutional neural network
(1)
diffusion model
(1)
long short-term memory
(1)
knowledge distillation
(1)
Papers
LieCraft: A Multi-Agent Framework for Evaluating Deceptive Capabilities in Language Models
AAAI 2026
Why do LLaVA Vision-Language Models Reply to Images in English?
EMNLP 2024
L-MAGIC: Language Model Assisted Generation of Images with Coherence
CVPR 2024
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
ACL 2023
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation
NAACL 2022
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
CVPR 2022
Predicting Behavior in Cancer-Afflicted Patient and Spouse Interactions Using Speech and Language
INTERSPEECH 2019
Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection
INTERSPEECH 2018
Approaching Human Performance in Behavior Estimation in Couples Therapy Using Deep Sentence Embeddings
INTERSPEECH 2017
Couples Behavior Modeling and Annotation Using Low-Resource LSTM Language Models
INTERSPEECH 2016