Xiaohu Qie
19 papers · 2019–2023 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (42) π Interdisciplinary Bridge π Renaissance Researcher (7) π Conference Polyglot (6) π§ Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(6)
π€
Dynamic Duo
(17)
β‘
Prolific Year
(7)
π
Conference Pioneer
π
Century Club
(19)
ποΈ
Keyword Collector
(90)
Conferences
CVPR (9)
ICCV (5)
NIPS (2)
AAAI (1)
ECCV (1)
ICLR (1)
Top co-authors
Keywords
neural radiance field
(3)
diffusion model
(3)
multimodal learning
(2)
benchmark dataset
(2)
semantic alignment
(2)
video-text retrieval
(2)
multi-modal learning
(2)
transfer learning
(2)
video-language pre-training
(2)
contrastive learning
(2)
image segmentation
(1)
knowledge distillation
(1)
information retrieval
(1)
collaborative filtering
(1)
zero-shot learning
(1)
representation learning
(1)
object detection
(1)
human-object interaction
(1)
image restoration
(1)
video retrieval
(1)
Papers
Masked Image Modeling with Denoising Contrast
ICLR 2023
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval
CVPR 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
CVPR 2023
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models
CVPR 2023
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
ICCV 2023
Order-Prompted Tag Sequence Generation for Video Tagging
ICCV 2023
MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing
ICCV 2023
HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video
ICCV 2023
OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution
ICCV 2023
Accelerating Vision-Language Pretraining With Free Language Modeling
CVPR 2023
All in One: Exploring Unified Video-Language Pre-Training
CVPR 2023
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval
ECCV 2022
DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes
NIPS 2022
Bridging Video-Text Retrieval With Multiple Choice Questions
CVPR 2022
Object-Aware Video-Language Pre-Training for Retrieval
CVPR 2022
BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild
CVPR 2022
UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection
CVPR 2022
Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems
NIPS 2022
Incorporating Semantic Similarity with Geographic Correlation for Query-POI Relevance Learning
AAAI 2019