Yibo Yan
24 papers · 2023–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🐝 Cross-Pollinator (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (8)
🌍
Conference Polyglot
(6)
🤝
Dynamic Duo
(18)
🔬
Deep Specialist
(11)
🗃️
Keyword Collector
(97)
⚡
Prolific Year
(19)
💎
Century Club
(22)
Conferences
ACL (9)
EMNLP (8)
ICML (3)
AAAI (2)
ICLR (1)
JMLR (1)
Top co-authors
Research topics
Keywords
multimodal large language model
(8)
large language model
(5)
multimodal learning
(4)
benchmark evaluation
(3)
educational technology
(3)
mathematical reasoning
(2)
tutoring system
(2)
attention pattern
(2)
machine unlearning
(2)
visual question answering
(2)
data augmentation
(1)
visual reasoning
(1)
instruction following
(1)
model safety
(1)
video understanding
(1)
text classification
(1)
privacy preservation
(1)
scene graph
(1)
confidence calibration
(1)
prompt learning
(1)
Papers
Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering
ACL 2026
Sharp Eyes and Memory for VideoLLMs: Information-Aware Visual Token Pruning for Efficient and Reliable VideoLLM Reasoning
AAAI 2026
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models
ACL 2025
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models
ACL 2025
MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models
ACL 2025
Unlocking Speech Instruction Data Potential with Query Rewriting
ACL 2025
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges
ACL 2025
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning
ACL 2025
A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models
ACL 2025
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis
EMNLP 2025
UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction
AAAI 2025
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios
EMNLP 2025
VLA-Mark: A cross modal watermark for large vision-language alignment models
EMNLP 2025
LLM Agents for Education: Advances and Applications
EMNLP 2025
PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions
EMNLP 2025
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality
ICLR 2025
OneForecast: A Universal Framework for Global and Regional Weather Forecasting
ICML 2025
RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
ICML 2025
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
ICML 2025
Position: LLMs Can be Good Tutors in English Education
EMNLP 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection
ACL 2025
FastMem: Fast Memorization of Prompt Improves Context Awareness of Large Language Models
EMNLP 2024
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model
EMNLP 2024
Confidence Intervals and Hypothesis Testing for High-dimensional Quantile Regression: Convolution Smoothing and Debiasing
JMLR 2023