Ser-Nam Lim
63 papers · 2019–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (9) π Academic Marathon (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (6)
π
Cross-Pollinator
(6)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(92)
π
Conference Loyalist
(20)
π€
Dynamic Duo
(12)
π
Grand Slam
ποΈ
Keyword Collector
(222)
π₯
Unstoppable
(7)
π
Century Club
(62)
β
The Questioner
(3)
β‘
Prolific Year
(11)
Conferences
CVPR (20)
ECCV (13)
ICCV (12)
ICLR (5)
ICML (4)
AAAI (3)
EMNLP (3)
NIPS (2)
WACV (1)
Top co-authors
Keywords
multimodal learning
(6)
large language model
(5)
representation learning
(4)
vision-language model
(4)
zero-shot learning
(3)
image generation
(3)
domain adaptation
(3)
graph neural network
(3)
generative model
(3)
contrastive learning
(3)
image classification
(3)
continual learning
(2)
data augmentation
(2)
vision-language alignment
(2)
metric learning
(2)
model robustness
(2)
self-supervised learning
(2)
object detection
(2)
image segmentation
(2)
embedding learning
(2)
Papers
Next Patch Prediction for AutoRegressive Visual Generation
AAAI 2026
Towards Cross-modal Backward-compatible Representation Learning for Vision-Language Models
ICCV 2025
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses
ICCV 2025
Scaling Up Temporal Domain Generalization via Temporal Experts Averaging
EMNLP 2025
Model Reveals What to Cache: Profiling-Based Feature Reuse for Video Diffusion Models
ICCV 2025
Intervening Anchor Token: Decoding Strategy in Alleviating Hallucinations for MLLMs
ICLR 2025
LASER: A Neuro-Symbolic Framework for Learning Spatio-Temporal Scene Graphs with Weak Supervision
ICLR 2025
Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning
ICLR 2025
Metric Compatible Training for Online Backfilling in Large-Scale Retrieval
WACV 2025
LARM: Large Auto-Regressive Model for Long-Horizon Embodied Intelligence
ICML 2025
Improving Soft Unification with Knowledge Graph Embedding Methods
ICML 2025
Generative Zero-Shot Composed Image Retrieval
CVPR 2025
MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding
CVPR 2024
Visual Delta Generator with Large Multi-modal Models for Semi-supervised Composed Image Retrieval
CVPR 2024
Label Delay in Online Continual Learning
NIPS 2024
Video Decomposition Prior: Editing Videos Layer by Layer
ICLR 2024
Few-Shot Object Detection with Foundation Models
CVPR 2024
Object Recognition as Next Token Prediction
CVPR 2024
Fast Encoding and Decoding for Implicit Video Representation
ECCV 2024
Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval
ECCV 2024
Composing Object Relations and Attributes for Image-Text Matching
CVPR 2024
Jack of All Tasks Master of Many: Designing General-Purpose Coarse-to-Fine Vision-Language Model
CVPR 2024
On the Robustness of Large Multimodal Models Against Image Adversarial Attacks
CVPR 2024
uCAP: An Unsupervised Prompting Method for Vision-Language Models
ECCV 2024
AirSketch: Generative Motion to Sketch
NIPS 2024
Computationally Budgeted Continual Learning: What Does Matter?
CVPR 2023
Sample-Dependent Adaptive Temperature Scaling for Improved Calibration
AAAI 2023
Towards Scalable Neural Representation for Diverse Videos
CVPR 2023
Open Vocabulary Semantic Segmentation With Patch Aligned Contrastive Learning
CVPR 2023
TIPI: Test Time Adaptation With Transformation Invariance
CVPR 2023
Detecting Everything in the Open World: Towards Universal Object Detection
CVPR 2023
HNeRV: A Hybrid Neural Representation for Videos
CVPR 2023
Open-vocabulary Panoptic Segmentation with Embedding Modulation
ICCV 2023
BT^2: Backward-compatible Training with Basis Transformation
ICCV 2023
Rapid Adaptation in Online Continual Learning: Are We Evaluating It Right?
ICCV 2023
Graph Inductive Biases in Transformers without Message Passing
ICML 2023
Teaching with Soft Label Smoothing for Mitigating Noisy Labels in Facial Expressions
ECCV 2022
AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
CVPR 2022
ObjectFormer for Image Manipulation Detection and Localization
CVPR 2022
Visual Prompt Tuning
ECCV 2022
Object-Centric Unsupervised Image Captioning
ECCV 2022
MTFormer: Multi-task Learning via Transformer and Cross-Task Reasoning
ECCV 2022
Totems: Physical Objects for Verifying Visual Integrity
ECCV 2022
Joint Audio-Visual Deepfake Detection
ICCV 2021
Deep Co-Training With Task Decomposition for Semi-Supervised Domain Adaptation
ICCV 2021
Robustness and Generalization via Generative Adversarial Training
ICCV 2021
Efficient Object Embedding for Spliced Image Retrieval
CVPR 2021
Intentonomy: A Dataset and Study Towards Human Intent Understanding
CVPR 2021
Combining Label Propagation and Simple Models out-performs Graph Neural Networks
ICLR 2021
On Feature Normalization and Data Augmentation
CVPR 2021
Cross-Modal Retrieval Augmentation for Multi-Modal Classification
EMNLP 2021
When in Doubt: Improving Classification Performance with Alternating Normalization
EMNLP 2021
Exploring Visual Engagement Signals for Representation Learning
ICCV 2021
Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors
ECCV 2020
Differentiating through the FrΓ©chet Mean
ICML 2020
Generate, Segment, and Refine: Towards Generic Manipulation Segmentation
AAAI 2020
What makes fake images detectable? Understanding properties that generalize
ECCV 2020
Curriculum Manager for Source Selection in Multi-Source Domain Adaptation
ECCV 2020
Quantization Guided JPEG Artifact Correction
ECCV 2020
One-Shot Domain Adaptation for Face Generation
CVPR 2020
A Metric Learning Reality Check
ECCV 2020
Enhancing Adversarial Example Transferability With an Intermediate Level Attack
ICCV 2019
Cross-X Learning for Fine-Grained Visual Categorization
ICCV 2019