Yue Fan
30 papers · 2021–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🐝 Cross-Pollinator (12) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (11) 🏃 Academic Marathon (5)
🌉
Interdisciplinary Bridge
🌍
Conference Polyglot
(11)
🧬
Topic Evolution
👥
Mega-Team
(22)
🏆
Keyword Champion
(2)
🗃️
Keyword Collector
(125)
⚡
Prolific Year
(5)
💎
Century Club
(26)
🔥
Unstoppable
(5)
❓
The Questioner
Conferences
ACL (7)
EMNLP (6)
ICLR (5)
CVPR (3)
AAAI (2)
ICCV (2)
COLING (1)
ECCV (1)
IJCAI (1)
NAACL (1)
NIPS (1)
Top co-authors
Keywords
multimodal large language model
(6)
large language model
(3)
representation learning
(2)
zero-shot learning
(2)
contrastive learning
(2)
benchmark evaluation
(2)
layer selection
(2)
multimodal learning
(2)
multi-agent system
(2)
bidirectional reasoning
(2)
semi-supervised learning
(2)
domain adaptation
(2)
visual grounding
(2)
hierarchical classification
(1)
scene understanding
(1)
multi-task learning
(1)
link prediction
(1)
vision transformer
(1)
explainable question answering
(1)
unified benchmark
(1)
Papers
Learning to Generate and Extract: A Multi-Agent Collaboration Framework for Zero-Shot Document-Level Event Arguments Extraction
AAAI 2026
FlowSearch: Advancing Deep Research with Dynamic Structured Knowledge Flow
ACL 2026
Leibniz: Theory-of-Mind Driven Neuro-Symbolic Logical Reasoning via Multi-Agent Collaboration
ACL 2026
Suggest-Verify-Revise: A Three-Stage Document-Level Event Causality Identification with Narrative Consistency
ACL 2026
Dynamic Energy-Based Contrastive Learning with Multi-Stage Knowledge Verification for Event Causality Identification
EMNLP 2025
Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models
ACL 2025
Enhancing Event Causality Identification with LLM Knowledge and Concept-Level Event Relations
COLING 2025
Multi-Layer Visual Feature Fusion in Multimodal LLMs: Methods, Analysis, and Best Practices
CVPR 2025
Factored-NeuS: Reconstructing Surfaces, Illumination, and Materials of Possibly Glossy Objects
CVPR 2025
Multimodal Language Models See Better When They Look Shallower
EMNLP 2025
GUI-Bee: Align GUI Action Grounding to Novel Environments via Autonomous Exploration
EMNLP 2025
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding
ICCV 2025
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos
ICLR 2025
Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage
ICLR 2025
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters
ICLR 2025
LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models
NAACL 2025
Active Listening: Personalized Question Generation in Open-Domain Social Conversation with User Model Based Prompting
EMNLP 2024
FRVA: Fact-Retrieval and Verification Augmented Entailment Tree Generation for Explainable Question Answering
ACL 2024
Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA
ACL 2024
VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding
ECCV 2024
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
EMNLP 2024
R2H: Building Multimodal Navigation Helpers that Respond to Help Requests
EMNLP 2023
Aerial Vision-and-Dialog Navigation
ACL 2023
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning
ICLR 2023
SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning
ICLR 2023
SSB: Simple but Strong Baseline for Boosting Performance of Open-Set Semi-Supervised Learning
ICCV 2023
USB: A Unified Semi-supervised Learning Benchmark for Classification
NIPS 2022
CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning
CVPR 2022
Multi-Vector Embedding on Networks with Taxonomies
IJCAI 2022
Gene Regulatory Network Inference using 3D Convolutional Neural Network
AAAI 2021