Yue Yang

35 papers · 2020–2026 · 10 top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🐣 Hot Topic Early Bird

🐝 Cross-Pollinator (9) 🏆 Keyword Champion 🤝 Dynamic Duo (11) 👥 Mega-Team (50) 🏆 Grand Slam 🔬 Deep Specialist (10) 🗃️ Keyword Collector (154) ⚡ Prolific Year (10) 💎 Century Club (29) 🔥 Unstoppable (6)

Conferences

CVPR (7) EMNLP (7) ACL (6) AAAI (4) ICML (3) NIPS (3) ICLR (2) EACL (1) ECCV (1) ICCV (1)

Top co-authors

Chris Callison-Burch (11) Mark Yatskar (8) Kaipeng Zhang (8) Yu Qiao (7) Wenqi Shao (7) Ping Luo (6) Artemis Panagopoulou (4) Hao Zhang (4) Yuqi Lin (4) Aniruddha Kembhavi (3)

Research topics

Digital Humanities (1)

Keywords

vision-language model (8) large language model (7) language model (4) multimodal learning (4) zero-shot learning (3) text-to-image generation (3) concept bottleneck model (2) image understanding (2) chain-of-thought prompting (2) transfer learning (2) federated learning (2) embodied ai (2) scene understanding (2) multimodal reasoning (2) semi-supervised learning (2) image captioning (2) image classification (2) diffusion model (2) concept bottleneck (2) graph clustering (1)

Papers

Explain the Synth: Interpretable Evaluation of LLM Data Synthesis · ACL 2026
Safe-FedLLM: Delving into the Safety of Federated Large Language Models · ACL 2026
SAR-DisentDM: A Semantic-Disentangled Diffusion Model for Limited-Data SAR Image Synthesis · AAAI 2026
Musical Score Understanding Benchmark: Evaluating Large Language Models’ Comprehension of Complete Musical Scores · ACL 2026
Anchor-Driven Nyström for Deep Graph-Level Clustering · AAAI 2026
Personalized Federated Graph-Level Clustering Network · AAAI 2026
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models · CVPR 2025
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs · CVPR 2025
OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation · CVPR 2025
DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse Scenes · EMNLP 2025
Competitively Consistent Clustering · ICML 2025
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping · ICLR 2025
Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model · ICLR 2025
InterIDEAS: Philosophical Intertextuality via LLMs · EMNLP 2025
Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation · ACL 2025
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality · NIPS 2024
A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis · NIPS 2024
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models · NIPS 2024
Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification · AAAI 2024
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model · CVPR 2024
Holodeck: Language Guided Generation of 3D Embodied AI Environments · CVPR 2024
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing · ECCV 2024
MiRAGeNews: Multimodal Realistic AI-Generated News Detection · EMNLP 2024
Position: Towards Implicit Prompt For Text-To-Image Models · ICML 2024
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI · ICML 2024
Causal Reasoning of Entities and Events in Procedural Texts · EACL 2023
Semi-Supervised 2D Human Pose Estimation Driven by Position Inconsistency Pseudo Label Correction Module · CVPR 2023
I Spy a Metaphor: Large Language Models and Diffusion Models Co-Create Visual Metaphors · ACL 2023
Name Your Colour For the Task: Artificially Discover Colour Naming via Colour Quantisation Transformer · ICCV 2023
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification · CVPR 2023
Visualizing the Obvious: A Concreteness-based Ensemble Model for Noun Property Prediction · EMNLP 2022
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data · ACL 2022
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination · EMNLP 2022
Visual Goal-Step Inference using wikiHow · EMNLP 2021
MedDialog: Large-scale Medical Dialogue Datasets · EMNLP 2020