Ziyue Wang

29 papers · 2019–2026 · 12 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (8) 🏃 Academic Marathon (6) 🌍 Conference Polyglot (12) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (7)

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (55) 🤝 Dynamic Duo (10) 🔬 Deep Specialist (12) 👥 Mega-Team (22) ❓ The Questioner ⚡ Prolific Year (13) 🚀 Conference Pioneer 💎 Century Club (26) 🗃️ Keyword Collector (121) 📈 Trend Setter 🔥 Unstoppable (5)

Conferences

ACL (7) EMNLP (5) AAAI (4) CVPR (2) ECCV (2) ICCV (2) MICCAI (2) ICML (1) IJCAI (1) IJCNLP (1) JMLR (1) MIDL (1)

Top co-authors

Peng Li (11) Yang Liu (11) Fuwen Luo (8) Chi Chen (7) Xiaolong Wang (4) Yongbing Zhang (4) Ming Yan (3) Fei Huang (3) Ye Zhang (3) Liang Zhao (3)

Keywords

multimodal large language model (7) multimodal learning (4) benchmark evaluation (4) large language model (4) multi-view clustering (3) vision-language model (3) question answering (3) natural language inference (2) visual question answering (2) legal document analysis (2) visual comprehension (2) multi-image understanding (2) multimodal reasoning (2) text representation (2) video understanding (2) legal nlp (2) multilingual nlp (1) semantic segmentation (1) attention mechanism (1) zero-shot learning (1)

Papers

KNNDA: A New Perspective of Alignment Recovery for Partially View-Aligned Clustering · AAAI 2026
PathFLIP: Fine-grained Language-Image Pretraining for Versatile Computational Pathology · AAAI 2026
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding · ACL 2026
MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models · EMNLP 2025
Incomplete and Unpaired Multi-View Graph Clustering with Cross-View Feature Fusion · AAAI 2025
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models · ACL 2025
Perspective Transition of Large Language Models for Solving Subjective Tasks · ACL 2025
EgoLife: Towards Egocentric Life Assistant · CVPR 2025
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models · CVPR 2025
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms · EMNLP 2025
Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models · EMNLP 2025
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game · ICCV 2025
Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation · ICCV 2025
The Four Color Theorem for Cell Instance Segmentation · ICML 2025
Dual Robust Unbiased Multi-View Clustering for Incomplete and Unpaired Information · IJCAI 2025
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking · MICCAI 2025
Octopus: Embodied Vision-Language Programmer from Environmental Feedback · ECCV 2024
CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models · ACL 2024
Graph-Structured Speculative Decoding · ACL 2024
Boundary-aware Contrastive Learning for Semi-supervised Nuclei Instance Segmentation · MIDL 2024
Model Composition for Multimodal Large Language Models · ACL 2024
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion · ACL 2024
Dynamic Pseudo Label Optimization in Point-Supervised Nuclei Segmentation · MICCAI 2024
Tractable and Near-Optimal Adversarial Algorithms for Robust Estimation in Contaminated Gaussian Models · JMLR 2023
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions · EMNLP 2023
Imperceptible Adversarial Attack via Invertible Neural Networks · AAAI 2023
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval · ECCV 2022
IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis · IJCNLP 2019
IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis · EMNLP 2019