From b6792002e61512f977fd61139ec75c1c7596c2a0 Mon Sep 17 00:00:00 2001 From: wangshuai6 Date: Wed, 9 Apr 2025 11:49:03 +0800 Subject: [PATCH] update README --- README.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.md b/README.md index fae08b4..1bd77a5 100644 --- a/README.md +++ b/README.md @@ -1,4 +1,9 @@ # DDT: Decoupled Diffusion Transformer +arxiv link: [https://arxiv.org/abs/2504.05741](https://arxiv.org/abs/2504.05741) + +ImagNet256 leaderboard: [https://paperswithcode.com/sota/image-generation-on-imagenet-256x256](https://paperswithcode.com/sota/image-generation-on-imagenet-256x256) + +ImagNet512 leaderboard: [https://paperswithcode.com/sota/image-generation-on-imagenet-512x512](https://paperswithcode.com/sota/image-generation-on-imagenet-512x512) ## Introduction We decouple diffusion transformer into encoder-decoder design, and surpresingly that a **more substantial encoder yields performance improvements as model size increases**. ![](./figs/main.png)