ML Research Wiki / Benchmarks / Image Generation / WISE

WISE

Image Generation Benchmark

Performance Over Time

📊 Showing 13 results | 📏 Metric: Overall

Top Performing Models

Rank Model Paper Overall Date Code
1 Janus Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling 0.26 2025-01-29 📦 deepseek-ai/janus
2 Janus-pro Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling 0.37 2025-01-29 📦 deepseek-ai/janus
3 MindOmni (w/o cot) MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO 0.38 2025-05-19 📦 easonxiao-888/mindomni
4 Show-o Show-o: One Single Transformer to Unify Multimodal Understanding and Generation 0.40 2024-08-22 📦 showlab/show-o
5 Emu3-gen Emu3: Next-Token Prediction is All You Need 0.45 2024-09-27 📦 baaivision/emu3 📦 flagopen/flagscale
6 stable-diffusion-xl-base-0.9 SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis 0.48 2023-07-04 📦 stability-ai/generative-models 📦 compvis/fm-boosting 📦 yuchen413/text2image_safety
7 PixArt-XL-2-1024-MS PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis 0.50 2023-09-30 📦 PixArt-alpha/PixArt-alpha 📦 Karine-Huang/T2I-CompBench 📦 swookey-thinky/image_diffusion
8 stable-diffusion-3.5-large Scaling Rectified Flow Transformers for High-Resolution Image Synthesis 0.50 2024-03-05 📦 Karine-Huang/T2I-CompBench 📦 hxixixh/adaflow
9 UniWorld-V1 UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation 0.55 2025-06-03 📦 PKU-YuanGroup/UniWorld-V1 📦 pku-yuangroup/imgedit
10 Bagel Emerging Properties in Unified Multimodal Pretraining 0.55 2025-05-20 📦 ByteDance-Seed/Bagel 📦 neverbiasu/ComfyUI-BAGEL

All Papers (13)