Sangmin Lee
28 papers · 2013–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Renaissance Researcher (11) π Interdisciplinary Bridge π Conference Polyglot (9) π Academic Marathon (12) πΊοΈ Taxonomy Completionist (65)
πΊοΈ
Taxonomy Completionist
(65)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π§¬
Topic Evolution
π
Century Club
(28)
π
Trend Setter
β‘
Prolific Year
(12)
π₯
Unstoppable
(6)
ποΈ
Keyword Collector
(154)
β
The Questioner
(2)
Conferences
CVPR (14)
AAAI (5)
ECCV (2)
ICCV (2)
AACL (1)
EMNLP (1)
ICML (1)
IJCNLP (1)
NSDI (1)
Top co-authors
Research topics
Keywords
multimodal learning
(5)
representation learning
(3)
contrastive learning
(3)
diffusion model
(3)
score aggregation
(2)
sound source localization
(2)
large language model
(2)
explainable ai
(2)
spurious correlation
(2)
natural language generation
(2)
scene understanding
(2)
self-supervised learning
(2)
video understanding
(2)
in-context learning
(2)
prompting strategy
(2)
feature extraction
(1)
transfer learning
(1)
semantic segmentation
(1)
manifold learning
(1)
domain generalization
(1)
Papers
MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization
ICCV 2025
DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI
CVPR 2025
Object-aware Sound Source Localization via Audio-Visual Scene Understanding
CVPR 2025
Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI
CVPR 2025
LAMA-UT: Language Agnostic Multilingual ASR Through Orthography Unification and Language-Specific Transliteration
AAAI 2025
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
CVPR 2025
Question-Aware Gaussian Experts for Audio-Visual Question Answering
CVPR 2025
Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
AAAI 2025
SocialGesture: Delving into Multi-person Gesture Understanding
CVPR 2025
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders
CVPR 2025
UniCoM: A Universal Code-Switching Speech Generator
EMNLP 2025
IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution
ICCV 2025
Defining Neural Network Architecture through Polytope Structures of Datasets
ICML 2024
Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge
CVPR 2024
Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations
CVPR 2024
Self-supervised Debiasing Using Low Rank Regularization
CVPR 2024
Training Debiased Subnetworks With Contrastive Weight Pruning
CVPR 2023
Which is better? Exploring Prompting Strategy For LLM-based Metrics
IJCNLP 2023
Which is better? Exploring Prompting Strategy For LLM-based Metrics
AACL 2023
Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory
CVPR 2022
Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment
ECCV 2022
Towards a Better Understanding of VR Sickness: Physical Symptom Prediction for VR Contents
AAAI 2021
Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images
AAAI 2021
Explaining Convolutional Neural Networks through Attribution-Based Input Sampling and Block-Wise Feature Aggregation
AAAI 2021
Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning
CVPR 2021
SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-based Symptom Relation Embedding
ECCV 2020
Structure Boundary Preserving Segmentation for Medical Image With Ambiguous Boundary
CVPR 2020
ΟBox: A Platform for Privacy-Preserving Apps
NSDI 2013