Yuxin Guo
13 papers · 2023–2026 · 10 top CS/AI conferences
Achievements
Renaissance Researcher (6)
Taxonomy Completionist (29)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (8)
Cross-Pollinator (13)
Grand Slam
Keyword Collector (54)
Century Club (11)
Prolific Year (5)
Conferences
CVPR (2)
ICCV (2)
NIPS (2)
AAAI (1)
ACL (1)
CORL (1)
ECCV (1)
EMNLP (1)
ICLR (1)
ICML (1)
Keywords
representation learning (3)
diffusion transformer (2)
multimodal learning (2)
reinforcement learning (1)
sentiment analysis (1)
text classification (1)
semi-supervised learning (1)
curriculum learning (1)
video prediction (1)
image reconstruction (1)
toxicity detection (1)
content moderation (1)
autonomous driving (1)
audio-visual learning (1)
cross-modal learning (1)
image retrieval (1)
object localization (1)
responsible AI (1)
clinical prediction (1)
benchmark evaluation (1)
Papers
Toward Better EHR Reasoning in LLMs: Reinforcement Learning with Expert Attention Guidance
AAAI 2026
UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions
ACL 2026
Aligned Better, Listen Better for Audio-Visual Large Language Models
ICLR 2025
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers
ICCV 2025
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
CVPR 2025
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving
CORL 2025
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving
ICCV 2025
On the Nonlinearity of Layer Normalization
ICML 2024
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
CVPR 2024
CoReS: Orchestrating the Dance of Reasoning and Segmentation
ECCV 2024
LoTLIP: Improving Language-Image Pre-training for Long Text Understanding
NIPS 2024
ToxicChat: Unveiling Hidden Challenges of Toxicity Detection in Real-World User-AI Conversation
EMNLP 2023
Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization
NIPS 2023