From 1d1b4d291325939dd98f6b91687af95c98c34d5c Mon Sep 17 00:00:00 2001
From: wang shuai
Date: Fri, 22 Aug 2025 10:49:18 +0800
Subject: [PATCH] Update README.md

---
 README.md | 4 +---
 1 file changed, 1 insertion(+), 3 deletions(-)

diff --git a/README.md b/README.md
index dff3748..745a8e7 100644
--- a/README.md
+++ b/README.md
@@ -13,9 +13,7 @@ We decouple diffusion transformer into encoder-decoder design, and surprisingly
 * We achieves **1.26 FID** on ImageNet256x256 Benchmark with DDT-XL/2(22en6de).
 * We achieves **1.28 FID** on ImageNet512x512 Benchmark with DDT-XL/2(22en6de).
 * As a byproduct, our DDT can reuse encoder among adjacent steps to accelerate inference.
-## Update 5/6/2025
-* PixelDDT-XXL/16-R1024-T2I achieves **66.7** without prompt rewriting and **71.2** with prompt rewriting on GenEval benchmark
-* Pixel space Text-to-image models(PixelDDT-XXL/16) will be released soon.
+
 ## Visualizations
 ![](./figs/teaser.png)
 ## Checkpoints