conftrace_

Jiannan Wu

9 papers · 2021–2024 · 4 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (24)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird

Conferences

CVPR (3) ICCV (3) NIPS (2) ECCV (1)

Top co-authors

Ping Luo (8) Zehuan Yuan (5) Yi Jiang (5) Bin Yan (3) Zhe Chen (3) Huchuan Lu (3) Xizhou Zhu (3) Jifeng Dai (3) Yu Qiao (3) Wenhai Wang (3)

Keywords

object detection (3) vision-language model (3) referring video object segmentation (2) object tracking (2) instance segmentation (2) multi-modal learning (2) pose estimation (1) transfer learning (1) zero-shot learning (1) vision-language alignment (1) image editing (1) referring expression (1) object localization (1) image generation (1) action classification (1) visual question answering (1) natural language understanding (1) zero-shot image classification (1) foundation model (1) semantic segmentation (1)

Papers

VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks · NIPS 2024
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks · CVPR 2024
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models · ECCV 2024
Exploring Transformers for Open-world Instance Segmentation · ICCV 2023
Universal Instance Perception As Object Discovery and Retrieval · CVPR 2023
Segment Every Reference Object in Spatial and Temporal Spaces · ICCV 2023
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks · NIPS 2023
Language As Queries for Referring Video Object Segmentation · CVPR 2022
Watch Only Once: An End-to-End Video Action Detection Framework · ICCV 2021