Papers
190 papers found
DreamBlend: Advancing Personalized Fine-Tuning of Text-to-Image Diffusion Models
Shwetha Ram, Tal Neiman, Qianli Feng et al.
Disentangling Subject-Irrelevant Elements in Personalized Text-to-Image Diffusion via Filtered Self-Distillation
Seunghwan Choi, Jooyeol Yun, Jeonghoon Park et al.
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
Sibo Dong, Ismail Shaheen, Maggie Shen et al.
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Oz Zafar, Yuval Cohen, Lior Wolf et al.
Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models
Korada Sri Vardhana, Shrikrishna Lolla, Soma Biswas
Plot’n Polish: Zero-Shot Story Visualization and Disentangled Editing with Text-to-Image Diffusion Models
Kiymet Akdemir, Jing Shi, Kushal Kafle et al.
Copyright Infringement Detection in Text-to-Image Diffusion Models via Differential Privacy
Xiafeng Man, Zhipeng Wei, Jingjing Chen
Mechanistic Dissection of Cross-Attention Subspaces in Text-to-Image Diffusion Models
Jun-Hyun Bae, Wonyong Jo, Jaehyup Lee et al.
Shifted Diffusion for Text-to-Image Generation
Yufan Zhou, Bingchen Liu, Yizhe Zhu et al.
Self-Play Fine-tuning of Diffusion Models for Text-to-image Generation
Huizhuo Yuan, Zixiang Chen, Kaixuan Ji et al.
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu, Dong Chen, Jianmin Bao et al.
On the Scalability of Diffusion-based Text-to-Image Generation
Hao Li, Yang Zou, Ying Wang et al.
Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
Weimin Qiu, Jieke Wang, Meng Tang
MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
Mingcheng Li, Xiaolu Hou, Ziyang Liu et al.
Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
Bingda Tang, Boyang Zheng, Sayak Paul et al.
Decoding Correlation-Induced Misalignment in the Stable Diffusion Workflow for Text-to-Image Generation
Yunze Tong, Fengda Zhang, Didi Zhu et al.
RAGD: Regional-Aware Diffusion Model for Text-to-Image Generation
Zhennan Chen, Yajie Li, Haofan Wang et al.
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
Xingchao Liu, Xiwen Zhang, Jianzhu Ma et al.
Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation
Zhihong Pan, Xin Zhou, Hao Tian
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu, Yang Zhao, Zhisheng Xiao et al.
SimAC: A Simple Anti-Customization Method for Protecting Face Privacy against Text-to-Image Synthesis of Diffusion Models
Feifei Wang, Zhentao Tan, Tianyi Wei et al.
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng, Xuehai He, Tsu-Jui Fu et al.
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Junsong Chen, Jincheng YU, Chongjian GE et al.
SANA: Efficient High-Resolution Text-to-Image Synthesis with Linear Diffusion Transformers
Enze Xie, Junsong Chen, Junyu Chen et al.
Rapid Diffusion: Building Domain-Specific Text-to-Image Synthesizers with Fast Inference Speed
Bingyan Liu, Weifeng Lin, Zhongjie Duan et al.