Ziyue Wang
29 papers · 2019–2026 · across 12 top CS/AI conferences
Achievements
Cross-Pollinator (8)
Academic Marathon (6)
Conference Polyglot (12)
Keyword Pioneer
Renaissance Researcher (7)
Interdisciplinary Bridge
Taxonomy Completionist (55)
Dynamic Duo (10)
Deep Specialist (12)
Mega-Team (22)
The Questioner
Prolific Year (13)
Conference Pioneer
Century Club (26)
Keyword Collector (121)
Trend Setter
Unstoppable (5)
Conferences
ACL (7)
EMNLP (5)
AAAI (4)
CVPR (2)
ECCV (2)
ICCV (2)
MICCAI (2)
ICML (1)
IJCAI (1)
IJCNLP (1)
JMLR (1)
MIDL (1)
Keywords
multimodal large language model (7)
multimodal learning (4)
benchmark evaluation (4)
large language model (4)
multi-view clustering (3)
vision-language model (3)
question answering (3)
natural language inference (2)
visual question answering (2)
legal document analysis (2)
visual comprehension (2)
multi-image understanding (2)
multimodal reasoning (2)
text representation (2)
video understanding (2)
legal nlp (2)
multilingual nlp (1)
semantic segmentation (1)
attention mechanism (1)
zero-shot learning (1)
Papers
KNNDA: A New Perspective of Alignment Recovery for Partially View-Aligned Clustering
AAAI 2026
PathFLIP: Fine-grained Language-Image Pretraining for Versatile Computational Pathology
AAAI 2026
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding
ACL 2026
MUCAR: Benchmarking Multilingual Cross-Modal Ambiguity Resolution for Multimodal Large Language Models
EMNLP 2025
Incomplete and Unpaired Multi-View Graph Clustering with Cross-View Feature Fusion
AAAI 2025
ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models
ACL 2025
Perspective Transition of Large Language Models for Solving Subjective Tasks
ACL 2025
EgoLife: Towards Egocentric Life Assistant
CVPR 2025
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models
CVPR 2025
DongbaMIE: A Multimodal Information Extraction Dataset for Evaluating Semantic Understanding of Dongba Pictograms
EMNLP 2025
Diving into Mitigating Hallucinations from a Vision Perspective for Large Vision-Language Models
EMNLP 2025
How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in An Extensible Escape Game
ICCV 2025
Structure Matters: Revisiting Boundary Refinement in Video Object Segmentation
ICCV 2025
The Four Color Theorem for Cell Instance Segmentation
ICML 2025
Dual Robust Unbiased Multi-View Clustering for Incomplete and Unpaired Information
IJCAI 2025
ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking
MICCAI 2025
Octopus: Embodied Vision-Language Programmer from Environmental Feedback
ECCV 2024
CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models
ACL 2024
Graph-Structured Speculative Decoding
ACL 2024
Boundary-aware Contrastive Learning for Semi-supervised Nuclei Instance Segmentation
MIDL 2024
Model Composition for Multimodal Large Language Models
ACL 2024
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion
ACL 2024
Dynamic Pseudo Label Optimization in Point-Supervised Nuclei Segmentation
MICCAI 2024
Tractable and Near-Optimal Adversarial Algorithms for Robust Estimation in Contaminated Gaussian Models
JMLR 2023
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions
EMNLP 2023
Imperceptible Adversarial Attack via Invertible Neural Networks
AAAI 2023
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
ECCV 2022
IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis
IJCNLP 2019
IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis
EMNLP 2019