Shiyu Zhao
16 papers · 2020–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (7) π Academic Marathon (5) π Renaissance Researcher (6) πΊοΈ Taxonomy Completionist (42)
π£
Hot Topic Early Bird
π
Conference Polyglot
(7)
π
Academic Marathon
(5)
π§¬
Topic Evolution
π
Century Club
(16)
β‘
Prolific Year
(6)
π₯
Unstoppable
(6)
ποΈ
Keyword Collector
(77)
Conferences
CVPR (7)
NIPS (3)
EMNLP (2)
ACL (1)
ECCV (1)
ICCV (1)
ICLR (1)
Top co-authors
Keywords
large language model
(4)
object detection
(3)
neural machine translation
(2)
transformer architecture
(2)
open-vocabulary detection
(2)
chain-of-thought reasoning
(2)
model ensemble
(2)
open-vocabulary object detection
(2)
multimodal large language model
(2)
motion estimation
(2)
question answering
(1)
knowledge distillation
(1)
computer vision
(1)
feature learning
(1)
attention mechanism
(1)
frame interpolation
(1)
ensemble learning
(1)
referring expression
(1)
pseudo labeling
(1)
autonomous driving
(1)
Papers
Token-Budget-Aware LLM Reasoning
ACL 2025
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
ICLR 2025
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction
CVPR 2025
MLLM-as-a-Judge for Image Safety without Human Labeling
CVPR 2025
Taming Self-Training for Open-Vocabulary Object Detection
CVPR 2024
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning
NIPS 2024
AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving
CVPR 2024
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
NIPS 2024
STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases
NIPS 2024
Generating Enhanced Negatives for Training Language-Based Object Detectors
CVPR 2024
OmniLabel: A Challenging Benchmark for Language-Based Object Detection
ICCV 2023
Global Matching With Overlapping Attention for Optical Flow Estimation
CVPR 2022
Exploiting Unlabeled Data with Vision and Language Models for Object Detection
ECCV 2022
The Mininglamp Machine Translation System for WMT21
EMNLP 2021
Deep Animation Video Interpolation in the Wild
CVPR 2021
OPPOβs Machine Translation Systems for WMT20
EMNLP 2020