Chenfei Wu
22 papers · 2018–2024 · 12 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Academic Marathon (6)
Conference Polyglot (12)
Renaissance Researcher (6)
Taxonomy Completionist (38)
Keyword Pioneer
Hot Topic Early Bird
Grand Slam
Dynamic Duo (20)
Century Club (22)
Prolific Year (5)
Keyword Collector (78)
Trend Setter
Conference Pioneer
Conferences
AAAI (4)
ACL (3)
CVPR (2)
ECCV (2)
ICML (2)
NAACL (2)
NIPS (2)
EMNLP (1)
ICLR (1)
IJCAI (1)
IJCNLP (1)
WACV (1)
Keywords
multimodal learning (4)
diffusion model (3)
large language model (3)
video generation (3)
image generation (3)
visual question answering (2)
task planning (2)
transformer architecture (2)
vision-language representation learning (2)
vision-language pretraining (2)
multimodal fusion (2)
prompt engineering (2)
self-supervised learning (2)
vision-language model (2)
visual synthesis (2)
cross-modal alignment (2)
representation learning (2)
depth estimation (1)
autoregressive generation (1)
in-context learning (1)
Papers
LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models
ICLR 2024
Using Left and Right Brains Together: Towards Vision and Language Planning
ICML 2024
Low-code LLM: Graphical User Interface over Large Language Models
NAACL 2024
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis
ICML 2024
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis
AAAI 2024
ORES: Open-Vocabulary Responsible Visual Synthesis
AAAI 2024
Learning to Plan by Updating Natural Language
EMNLP 2024
BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning
AAAI 2023
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images
IJCAI 2023
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
ACL 2023
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning
ACL 2023
ReCo: Region-Controlled Text-to-Image Generation
CVPR 2023
Learning Temporal Video Procedure Segmentation From an Automatically Collected Large Dataset
WACV 2022
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis
NIPS 2022
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
CVPR 2022
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion
ECCV 2022
Trace Controlled Text to Image Generation
ECCV 2022
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation
NAACL 2022
GEM: A General Evaluation Benchmark for Multimodal Tasks
IJCNLP 2021
GEM: A General Evaluation Benchmark for Multimodal Tasks
ACL 2021
Differential Networks for Visual Question Answering
AAAI 2019
Chain of Reasoning for Visual Question Answering
NIPS 2018