Shijia Huang
9 papers · 2022–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Renaissance Researcher (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (21) π§ Keyword Pioneer
π
Conference Polyglot
(4)
π£
Hot Topic Early Bird
ποΈ
Keyword Collector
(50)
Conferences
CVPR (4)
EMNLP (3)
AAAI (1)
ECCV (1)
Top co-authors
Keywords
point cloud
(2)
multimodal learning
(2)
visual grounding
(2)
image segmentation
(1)
temporal modeling
(1)
scene understanding
(1)
multi-task learning
(1)
vision language model
(1)
3d vision
(1)
multi-modal learning
(1)
video understanding
(1)
3d scene understanding
(1)
instruction tuning
(1)
instance segmentation
(1)
instruction following
(1)
benchmark evaluation
(1)
semantic segmentation
(1)
vision-language model
(1)
multimodal large language model
(1)
zero-shot learning
(1)
Papers
Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding
CVPR 2025
Enhancing Temporal Modeling of Video LLMs via Time Gating
EMNLP 2024
Towards Learning a Generalist Model for Embodied Navigation
CVPR 2024
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
ECCV 2024
DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding
AAAI 2023
Learning Preference Model for LLMs via Automatic Preference Data Generation
EMNLP 2023
MP-Former: Mask-Piloted Transformer for Image Segmentation
CVPR 2023
CLEVA: Chinese Language Models EVAluation Platform
EMNLP 2023
Multi-View Transformer for 3D Visual Grounding
CVPR 2022