Jiannan Wu
9 papers · 2021–2024 · 4 top CS/AI conferences
Achievements
Conference Polyglot (4)
Cross-Pollinator (12)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (24)
Keyword Pioneer
Hot Topic Early Bird
Conferences
CVPR (3)
ICCV (3)
NIPS (2)
ECCV (1)
Keywords
object detection (3)
vision-language model (3)
referring video object segmentation (2)
object tracking (2)
instance segmentation (2)
multi-modal learning (2)
pose estimation (1)
transfer learning (1)
zero-shot learning (1)
vision-language alignment (1)
image editing (1)
referring expression (1)
object localization (1)
image generation (1)
action classification (1)
visual question answering (1)
natural language understanding (1)
zero-shot image classification (1)
foundation model (1)
semantic segmentation (1)
Papers
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks
NIPS 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
CVPR 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
ECCV 2024
Exploring Transformers for Open-world Instance Segmentation
ICCV 2023
Universal Instance Perception As Object Discovery and Retrieval
CVPR 2023
Segment Every Reference Object in Spatial and Temporal Spaces
ICCV 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
NIPS 2023
Language As Queries for Referring Video Object Segmentation
CVPR 2022
Watch Only Once: An End-to-End Video Action Detection Framework
ICCV 2021