Sangmin Lee

28 papers · 2013–2025 · 9 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌈 Renaissance Researcher (11) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (9) 🏃 Academic Marathon (12) 🗺️ Taxonomy Completionist (65)

🗺️ Taxonomy Completionist (65) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🧬 Topic Evolution 💎 Century Club (28) 📈 Trend Setter ⚡ Prolific Year (12) 🔥 Unstoppable (6) 🗃️ Keyword Collector (154) ❓ The Questioner (2)

Conferences

CVPR (14) AAAI (5) ECCV (2) ICCV (2) AACL (1) EMNLP (1) ICML (1) IJCNLP (1) NSDI (1)

Top co-authors

Yong Man Ro (7) Hak Gu Kim (6) Jung Uk Kim (5) James M. Rehg (4) Jong Chul Ye (4) Bolin Lai (3) Sung Jin Um (3) Fiona Ryan (3) Seongyeop Kim (3) Dongjin Kim (3)

Research topics

Applications (1)

Keywords

multimodal learning (5) representation learning (3) contrastive learning (3) diffusion model (3) score aggregation (2) sound source localization (2) large language model (2) explainable ai (2) spurious correlation (2) natural language generation (2) scene understanding (2) self-supervised learning (2) video understanding (2) in-context learning (2) prompting strategy (2) feature extraction (1) transfer learning (1) semantic segmentation (1) manifold learning (1) domain generalization (1)

Papers

MemoryTalker: Personalized Speech-Driven 3D Facial Animation via Audio-Guided Stylization ICCV 2025 DynScene: Scalable Generation of Dynamic Robotic Manipulation Scenes for Embodied AI CVPR 2025 Object-aware Sound Source Localization via Audio-Visual Scene Understanding CVPR 2025 Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI CVPR 2025 LAMA-UT: Language Agnostic Multilingual ASR Through Orthography Unification and Language-Specific Transliteration AAAI 2025 Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation CVPR 2025 Question-Aware Gaussian Experts for Audio-Visual Question Answering CVPR 2025 Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection AAAI 2025 SocialGesture: Delving into Multi-person Gesture Understanding CVPR 2025 Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders CVPR 2025 UniCoM: A Universal Code-Switching Speech Generator EMNLP 2025 IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution ICCV 2025 Defining Neural Network Architecture through Polytope Structures of Datasets ICML 2024 Learning to Visually Localize Sound Sources from Mixtures without Prior Source Knowledge CVPR 2024 Modeling Multimodal Social Interactions: New Challenges and Baselines with Densely Aligned Representations CVPR 2024 Self-supervised Debiasing Using Low Rank Regularization CVPR 2024 Training Debiased Subnetworks With Contrastive Weight Pruning CVPR 2023 Which is better? Exploring Prompting Strategy For LLM-based Metrics IJCNLP 2023 Which is better? Exploring Prompting Strategy For LLM-based Metrics AACL 2023 Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory CVPR 2022 Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment ECCV 2022 Towards a Better Understanding of VR Sickness: Physical Symptom Prediction for VR Contents AAAI 2021 Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images AAAI 2021 Explaining Convolutional Neural Networks through Attribution-Based Input Sampling and Block-Wise Feature Aggregation AAAI 2021 Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning CVPR 2021 SACA Net: Cybersickness Assessment of Individual Viewers for VR Content via Graph-based Symptom Relation Embedding ECCV 2020 Structure Boundary Preserving Segmentation for Medical Image With Ambiguous Boundary CVPR 2020 πBox: A Platform for Privacy-Preserving Apps NSDI 2013