Ser-Nam Lim

63 papers · 2019–2026 · 9 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (9) 🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (6)

🐝 Cross-Pollinator (6) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (92) 🏠 Conference Loyalist (20) 🤝 Dynamic Duo (12) 🏆 Grand Slam 🗃️ Keyword Collector (222) 🔥 Unstoppable (7) 💎 Century Club (62) ❓ The Questioner (3) ⚡ Prolific Year (11)

Conferences

CVPR (20) ECCV (13) ICCV (12) ICLR (5) ICML (4) AAAI (3) EMNLP (3) NIPS (2) WACV (1)

Top co-authors

Abhinav Shrivastava (12) Serge Belongie (8) Zuxuan Wu (7) Bor-Chun Chen (6) Philip H.S. Torr (6) Harry Yang (5) Menglin Jia (5) Antonio Torralba (5) Hengshuang Zhao (5) Young Kyun Jang (5)

Keywords

multimodal learning (6) large language model (5) representation learning (4) vision-language model (4) zero-shot learning (3) image generation (3) domain adaptation (3) graph neural network (3) generative model (3) contrastive learning (3) image classification (3) continual learning (2) data augmentation (2) vision-language alignment (2) metric learning (2) model robustness (2) self-supervised learning (2) object detection (2) image segmentation (2) embedding learning (2)

Papers

Next Patch Prediction for AutoRegressive Visual Generation AAAI 2026 Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models ICCV 2025 DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses ICCV 2025 Scaling Up Temporal Domain Generalization via Temporal Experts Averaging EMNLP 2025 Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models ICCV 2025 Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs ICLR 2025 LASER: A Neuro-Symbolic Framework for Learning Spatio-Temporal Scene Graphs with Weak Supervision ICLR 2025 Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning ICLR 2025 Metric Compatible Training for Online Backfilling in Large-Scale Retrieval WACV 2025 LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence ICML 2025 Improving Soft Unification with Knowledge Graph Embedding Methods ICML 2025 Generative Zero-Shot Composed Image Retrieval CVPR 2025 MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding CVPR 2024 Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval CVPR 2024 Label Delay in Online Continual Learning NIPS 2024 Video Decomposition Prior: Editing Videos Layer by Layer ICLR 2024 Few-Shot Object Detection with Foundation Models CVPR 2024 Object Recognition as Next Token Prediction CVPR 2024 Fast Encoding and Decoding for Implicit Video Representation ECCV 2024 Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval ECCV 2024 Composing Object Relations and Attributes for Image-Text Matching CVPR 2024 Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model CVPR 2024 On the Robustness of Large Multimodal Models Against Image Adversarial Attacks CVPR 2024 uCAP: An Unsupervised Prompting Method for Vision-Language Models ECCV 2024 AirSketch: Generative Motion to Sketch NIPS 2024 Computationally Budgeted Continual Learning: What Does Matter? CVPR 2023 Sample-Dependent Adaptive Temperature Scaling for Improved Calibration AAAI 2023 Towards Scalable Neural Representation for Diverse Videos CVPR 2023 Open Vocabulary Semantic Segmentation With Patch Aligned Contrastive Learning CVPR 2023 TIPI: Test Time Adaptation With Transformation Invariance CVPR 2023 Detecting Everything in the Open World: Towards Universal Object Detection CVPR 2023 HNeRV: A Hybrid Neural Representation for Videos CVPR 2023 Open-vocabulary Panoptic Segmentation with Embedding Modulation ICCV 2023 BT^2: Backward-compatible Training with Basis Transformation ICCV 2023 Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right? ICCV 2023 Graph Inductive Biases in Transformers without Message Passing ICML 2023 Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions ECCV 2022 AdaViT: Adaptive Vision Transformers for Efficient Image Recognition CVPR 2022 ObjectFormer for Image Manipulation Detection and Localization CVPR 2022 Visual Prompt Tuning ECCV 2022 Object-Centric Unsupervised Image Captioning ECCV 2022 MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning ECCV 2022 Totems: Physical Objects for Verifying Visual Integrity ECCV 2022 Joint Audio-Visual Deepfake Detection ICCV 2021 Deep Co-Training With Task Decomposition for Semi-Supervised Domain Adaptation ICCV 2021 Robustness and Generalization via Generative Adversarial Training ICCV 2021 Efficient Object Embedding for Spliced Image Retrieval CVPR 2021 Intentonomy: A Dataset and Study Towards Human Intent Understanding CVPR 2021 Combining Label Propagation and Simple Models out-performs Graph Neural Networks ICLR 2021 On Feature Normalization and Data Augmentation CVPR 2021 Cross-Modal Retrieval Augmentation for Multi-Modal Classification EMNLP 2021 When in Doubt: Improving Classification Performance with Alternating Normalization EMNLP 2021 Exploring Visual Engagement Signals for Representation Learning ICCV 2021 Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors ECCV 2020 Differentiating through the Fréchet Mean ICML 2020 Generate, Segment, and Refine: Towards Generic Manipulation Segmentation AAAI 2020 What makes fake images detectable? Understanding properties that generalize ECCV 2020 Curriculum Manager for Source Selection in Multi-Source Domain Adaptation ECCV 2020 Quantization Guided JPEG Artifact Correction ECCV 2020 One-Shot Domain Adaptation for Face Generation CVPR 2020 A Metric Learning Reality Check ECCV 2020 Enhancing Adversarial Example Transferability With an Intermediate Level Attack ICCV 2019 Cross-X Learning for Fine-Grained Visual Categorization ICCV 2019