Xiaohu Qie

19 papers · 2019–2023 · 6 conferences · across top CS/AI conferences

Achievements

+7 more ↓

🗺️ Taxonomy Completionist (42) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (7) 🌍 Conference Polyglot (6) 🧭 Keyword Pioneer

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (6) 🤝 Dynamic Duo (17) ⚡ Prolific Year (7) 🚀 Conference Pioneer 💎 Century Club (19) 🗃️ Keyword Collector (90)

Conferences

CVPR (9) ICCV (5) NIPS (2) AAAI (1) ECCV (1) ICLR (1)

Top co-authors

Ying Shan (17) Yixiao Ge (8) Mike Zheng Shou (5) Zhongang Qi (4) Jianping Wu (4) Yan-Pei Cao (4) Dian Li (3) Jinpeng Wang (3) Yuying Ge (3) Ping Luo (3)

Keywords

neural radiance field (3) diffusion model (3) multimodal learning (2) benchmark dataset (2) semantic alignment (2) video-text retrieval (2) multi-modal learning (2) transfer learning (2) video-language pre-training (2) contrastive learning (2) image segmentation (1) knowledge distillation (1) information retrieval (1) collaborative filtering (1) zero-shot learning (1) representation learning (1) object detection (1) human-object interaction (1) image restoration (1) video retrieval (1)

Papers

Masked Image Modeling with Denoising Contrast ICLR 2023 ViLEM: Visual-Language Error Modeling for Image-Text Retrieval CVPR 2023 RILS: Masked Visual Reconstruction in Language Semantic Space CVPR 2023 Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models CVPR 2023 Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation ICCV 2023 Order-Prompted Tag Sequence Generation for Video Tagging ICCV 2023 MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing ICCV 2023 HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video ICCV 2023 OmniZoomer: Learning to Move and Zoom in on Sphere at High-Resolution ICCV 2023 Accelerating Vision-Language Pretraining With Free Language Modeling CVPR 2023 All in One: Exploring Unified Video-Language Pre-Training CVPR 2023 MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval ECCV 2022 DeVRF: Fast Deformable Voxel Radiance Fields for Dynamic Scenes NIPS 2022 Bridging Video-Text Retrieval With Multiple Choice Questions CVPR 2022 Object-Aware Video-Language Pre-Training for Retrieval CVPR 2022 BTS: A Bi-Lingual Benchmark for Text Segmentation in the Wild CVPR 2022 UMT: Unified Multi-Modal Transformers for Joint Video Moment Retrieval and Highlight Detection CVPR 2022 Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems NIPS 2022 Incorporating Semantic Similarity with Geographic Correlation for Query-POI Relevance Learning AAAI 2019