Paul Hongsuck Seo
28 papers · 2016–2025 · 10 top CS/AI conferences
Achievements
Academic Marathon (9)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (10)
Cross-Pollinator (13)
Renaissance Researcher (7)
Taxonomy Completionist (71)
Topic Evolution
Topic Pioneer
Keyword Collector (139)
Century Club (28)
Trend Setter
Prolific Year (6)
Unstoppable (10)
Conference Pioneer
Conferences
CVPR (11)
ECCV (4)
NeurIPS (4)
AAAI (2)
ICCV (2)
ACL (1)
ACML (1)
EMNLP (1)
INTERSPEECH (1)
NAACL (1)
Keywords
multimodal learning (4)
video understanding (4)
zero-shot learning (3)
vision-language model (3)
semantic segmentation (3)
audiovisual speech recognition (2)
attention mechanism (2)
object tracking (2)
deep neural network (2)
instance segmentation (2)
visual grounding (2)
question answering (2)
automatic speech recognition (2)
motion estimation (1)
visual question answering (1)
vision transformer (1)
uncertainty quantification (1)
policy gradient (1)
image segmentation (1)
reinforcement learning (1)
Papers
ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision
ACL 2025
ReTAG: Retrieval-Enhanced, Topic-Augmented Graph-Based Global Sensemaking
EMNLP 2025
LCIRC: A Recurrent Compression Approach for Efficient Long-form Context and Query Dependent Modeling in LLMs
NAACL 2025
Random Conditioning for Diffusion Model Compression with Distillation
CVPR 2025
DialNav: Multi-turn Dialog Navigation with a Remote Guide
ICCV 2025
Multi-Granularity Video Object Segmentation
AAAI 2025
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
CVPR 2024
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
NeurIPS 2024
TrackIME: Enhanced Video Point Tracking via Instance Motion Estimation
NeurIPS 2024
Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
ECCV 2024
Learning Correlation Structures for Vision Transformers
CVPR 2024
AVFormer: Injecting Vision Into Frozen Speech Models for Zero-Shot AV-ASR
CVPR 2023
Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning
CVPR 2023
IFSeg: Image-Free Semantic Segmentation via Vision-Language Model
CVPR 2023
Zero-Shot Referring Image Segmentation With Global-Local Context Features
CVPR 2023
Learning Audio-Video Modalities from Image Captions
ECCV 2022
End-to-End Generative Pretraining for Multimodal Video Captioning
CVPR 2022
AVATAR: Unconstrained Audiovisual Speech Recognition
INTERSPEECH 2022
Look Before You Speak: Visually Contextualized Utterances
CVPR 2021
Reinforcing an Image Caption Generator Using Off-Line Human Feedback
AAAI 2020
Combinatorial Inference against Label Noise
NeurIPS 2019
Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences
CVPR 2019
Regularizing Neural Networks via Stochastic Branch Layers
ACML 2019
CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps
ECCV 2018
Attentive Semantic Alignment with Offset-Aware Correlation Kernels
ECCV 2018
Visual Reference Resolution using Attention Memory for Visual Dialog
NIPS 2017
MarioQA: Answering Questions by Watching Gameplay Videos
ICCV 2017
Image Question Answering Using Convolutional Neural Network With Dynamic Parameter Prediction
CVPR 2016