Zhiyuan Zhao
17 papers · 2020–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Conference Polyglot (8) π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge π Academic Marathon (5)
π
Academic Marathon
(5)
π
Cross-Pollinator
(13)
πΊοΈ
Taxonomy Completionist
(38)
π§¬
Topic Evolution
π₯
Mega-Team
(20)
π
Grand Slam
π
Triple Crown
π
Century Club
(15)
ποΈ
Keyword Collector
(70)
π₯
Unstoppable
(6)
β‘
Prolific Year
(6)
π
Conference Pioneer
Conferences
ACL (4)
INTERSPEECH (4)
CVPR (2)
ICLR (2)
AAAI (1)
ICML (1)
IJCAI (1)
MICCAI (1)
NIPS (1)
Top co-authors
Keywords
vision-language model
(3)
benchmark evaluation
(2)
time series forecasting
(2)
domain adaptation
(2)
speech enhancement
(2)
layout analysis
(2)
transfer learning
(2)
document parsing
(2)
large language model
(2)
image generation
(1)
speech recognition
(1)
voice conversion
(1)
prompt engineering
(1)
data integration
(1)
adversarial robustness
(1)
zero-shot learning
(1)
in-context learning
(1)
adversarial training
(1)
object detection
(1)
code generation
(1)
Papers
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
ACL 2026
Exploring Efficient Open-Vocabulary Segmentation in the Remote Sensing
AAAI 2026
WebUIBench: A Comprehensive Benchmark for Evaluating Multimodal Large Language Models in WebUI-to-Code
ACL 2025
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling
ICLR 2025
LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges
ACL 2025
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations
CVPR 2025
Time-Series Forecasting for Out-of-Distribution Generalization Using Invariant Learning
ICML 2024
LSTPrompt: Large Language Models as Zero-Shot Time Series Forecasters by Long-Short-Term Prompting
ACL 2024
MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation
CVPR 2024
PINNsFormer: A Transformer-Based Framework For Physics-Informed Neural Networks
ICLR 2024
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis
NIPS 2024
DPMNet: Dual-Path MLP-based Network for Aneurysm Image Segmentation
MICCAI 2024
TridentSE: Guiding Speech Enhancement with 32 Global Tokens
INTERSPEECH 2023
An Anchor-Free Detector for Continuous Speech Keyword Spotting
INTERSPEECH 2022
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
INTERSPEECH 2022
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
INTERSPEECH 2021
Joint Time-Frequency and Time Domain Learning for Speech Enhancement
IJCAI 2020