Yue Yang
35 papers · 2020–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Academic Marathon (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (10) π£ Hot Topic Early Bird
π£
Hot Topic Early Bird
π
Cross-Pollinator
(9)
π
Interdisciplinary Bridge
π
Keyword Champion
π€
Dynamic Duo
(11)
π₯
Mega-Team
(50)
π
Grand Slam
π¬
Deep Specialist
(10)
ποΈ
Keyword Collector
(154)
β‘
Prolific Year
(10)
π
Century Club
(29)
π₯
Unstoppable
(6)
Conferences
CVPR (7)
EMNLP (7)
ACL (6)
AAAI (4)
ICML (3)
NIPS (3)
ICLR (2)
EACL (1)
ECCV (1)
ICCV (1)
Top co-authors
Research topics
Keywords
vision-language model
(8)
large language model
(7)
language model
(4)
multimodal learning
(4)
zero-shot learning
(3)
text-to-image generation
(3)
concept bottleneck model
(2)
image understanding
(2)
chain-of-thought prompting
(2)
transfer learning
(2)
federated learning
(2)
embodied ai
(2)
scene understanding
(2)
multimodal reasoning
(2)
semi-supervised learning
(2)
image captioning
(2)
image classification
(2)
diffusion model
(2)
concept bottleneck
(2)
graph clustering
(1)
Papers
Explain the Synth: Interpretable Evaluation of LLM Data Synthesis
ACL 2026
Safe-FedLLM: Delving into the Safety of Federated Large Language Models
ACL 2026
SAR-DisentDM: A Semantic-Disentangled Diffusion Model for Limited-Data SAR Image Synthesis
AAAI 2026
Musical Score Understanding Benchmark: Evaluating Large Language Modelsβ Comprehension of Complete Musical Scores
ACL 2026
Anchor-Driven NystrΓΆm for Deep Graph-Level Clustering
AAAI 2026
Personalized Federated Graph-Level Clustering Network
AAAI 2026
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
CVPR 2025
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
CVPR 2025
DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes
EMNLP 2025
Competitively Consistent Clustering
ICML 2025
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
ICLR 2025
Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model
ICLR 2025
InterIDEAS: Philosophical Intertextuality via LLMs
EMNLP 2025
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation
ACL 2025
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality
NIPS 2024
A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
NIPS 2024
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models
NIPS 2024
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification
AAAI 2024
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
CVPR 2024
Holodeck: Language Guided Generation of 3D Embodied AI Environments
CVPR 2024
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing
ECCV 2024
MiRAGeNews: Multimodal Realistic AI-Generated News Detection
EMNLP 2024
Position: Towards Implicit Prompt For Text-To-Image Models
ICML 2024
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
ICML 2024
Causal Reasoning of Entities and Events in Procedural Texts
EACL 2023
Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module
CVPR 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors
ACL 2023
Name Your Colour For the Task: Artificially Discover Colour Naming via Colour Quantisation Transformer
ICCV 2023
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
CVPR 2023
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction
EMNLP 2022
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data
ACL 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
EMNLP 2022
Visual Goal-Step Inference using wikiHow
EMNLP 2021
MedDialog: Large-scale Medical Dialogue Datasets
EMNLP 2020