Chenfei Wu

22 papers · 2018–2024 · 12 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (6) 🌍 Conference Polyglot (12) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (38)

🗺️ Taxonomy Completionist (38) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Grand Slam 🤝 Dynamic Duo (20) 💎 Century Club (22) ⚡ Prolific Year (5) 🗃️ Keyword Collector (78) 📈 Trend Setter 🚀 Conference Pioneer

Conferences

AAAI (4) ACL (3) CVPR (2) ECCV (2) ICML (2) NAACL (2) NIPS (2) EMNLP (1) ICLR (1) IJCAI (1) IJCNLP (1) WACV (1)

Top co-authors

Nan Duan (20) Lijuan Wang (6) Zicheng Liu (6) Lei Ji (6) Shengming Yin (5) Jianfeng Wang (4) Yongfei Liu (4) Minheng Ni (4) Zhengyuan Yang (4) VASUDEV LAL (4)

Keywords

multimodal learning (4) diffusion model (3) large language model (3) video generation (3) image generation (3) visual question answering (2) task planning (2) transformer architecture (2) vision-language representation learning (2) vision-language pretraining (2) multimodal fusion (2) prompt engineering (2) self-supervised learning (2) vision-language model (2) visual synthesis (2) cross-modal alignment (2) representation learning (2) depth estimation (1) autoregressive generation (1) in-context learning (1)

Papers

LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models ICLR 2024 Using Left and Right Brains Together: Towards Vision and Language Planning ICML 2024 Low-code LLM: Graphical User Interface over Large Language Models NAACL 2024 StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis ICML 2024 HORIZON: High-Resolution Semantically Controlled Panorama Synthesis AAAI 2024 ORES: Open-Vocabulary Responsible Visual Synthesis AAAI 2024 Learning to Plan by Updating Natural Language EMNLP 2024 BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning AAAI 2023 Learning 3D Photography Videos via Self-supervised Diffusion on Single Images IJCAI 2023 NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation ACL 2023 ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning ACL 2023 ReCo: Region-Controlled Text-to-Image Generation CVPR 2023 Learning Temporal Video Procedure Segmentation From an Automatically Collected Large Dataset WACV 2022 NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis NIPS 2022 VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers CVPR 2022 NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion ECCV 2022 Trace Controlled Text to Image Generation ECCV 2022 KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation NAACL 2022 GEM: A General Evaluation Benchmark for Multimodal Tasks IJCNLP 2021 GEM: A General Evaluation Benchmark for Multimodal Tasks ACL 2021 Differential Networks for Visual Question Answering AAAI 2019 Chain of Reasoning for Visual Question Answering NIPS 2018