Yupeng Hu
16 papers · 2024–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (5) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (36)
π§
Keyword Pioneer
π
Century Club
(14)
β
The Questioner
ποΈ
Keyword Collector
(85)
β‘
Prolific Year
(5)
Conferences
AAAI (10)
ACL (3)
CVPR (1)
ICCV (1)
IJCAI (1)
Top co-authors
Keywords
composed image retrieval
(4)
multimodal learning
(3)
feature homogenization
(2)
knowledge distillation
(2)
reinforcement learning
(2)
multimodal large language model
(2)
multi-task learning
(1)
zero-shot learning
(1)
transformer architecture
(1)
attention mechanism
(1)
question answering
(1)
stochastic gradient descent
(1)
trajectory prediction
(1)
multi-modal learning
(1)
neural network optimization
(1)
visual question answering
(1)
semantic alignment
(1)
progressive learning
(1)
time series classification
(1)
visual reasoning
(1)
Papers
Resonating with RoPE: Spectral Quantization for High-Fidelity Key Cache Compression
ACL 2026
HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval
AAAI 2026
Decompose and Conquer: Compositional Reasoning for Zero-Shot Temporal Action Localization
AAAI 2026
When Eyes and Ears Disagree: Can MLLMs Discern Audio-Visual Confusion?
AAAI 2026
INTENT: Invariance and Discrimination-aware Noise Mitigation for Robust Composed Image Retrieval
AAAI 2026
ReTrack: Evidence-Driven Dual-Stream Directional Anchor Calibration Network for Composed Video Retrieval
AAAI 2026
D2MoRA: Diversity-Regulated Asymmetric MoE-LoRA Decomposition for Efficient Multi-Task Adaptation
AAAI 2026
TEMA: Anchor the Image, Follow the Text for Multi-Modification Composed Image Retrieval
ACL 2026
Circuit-Think: A Multimodal Reasoning Framework for Automated Circuit-to-Netlist Translation with Trajectory-Guided Reinforcement Learning
AAAI 2026
ENCODER: Entity Mining and Modification Relation Binding for Composed Image Retrieval
AAAI 2025
Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification
AAAI 2025
CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG
ACL 2025
Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory
CVPR 2025
Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection
ICCV 2025
Breaking Barriers of System Heterogeneity: Straggler-Tolerant Multimodal Federated Learning via Knowledge Distillation
IJCAI 2024
Exploiting the Social-Like Prior in Transformer for Visual Reasoning
AAAI 2024