| Janus | Janus-Pro: Unified Multimodal Understanding and G… | 0.26 | 2025-01-29 |
| Janus-pro | Janus-Pro: Unified Multimodal Understanding and G… | 0.37 | 2025-01-29 |
| MindOmni (w/o cot) | MindOmni: Unleashing Reasoning Generation in Visi… | 0.38 | 2025-05-19 |
| Show-o | Show-o: One Single Transformer to Unify Multimoda… | 0.40 | 2024-08-22 |
| Emu3-gen | Emu3: Next-Token Prediction is All You Need | 0.45 | 2024-09-27 |
| stable-diffusion-xl-base-0.9 | SDXL: Improving Latent Diffusion Models for High-… | 0.48 | 2023-07-04 |
| PixArt-XL-2-1024-MS | PixArt-$α$: Fast Training of Diffusion Transforme… | 0.50 | 2023-09-30 |
| stable-diffusion-3.5-large | Scaling Rectified Flow Transformers for High-Reso… | 0.50 | 2024-03-05 |
| UniWorld-V1 | UniWorld-V1: High-Resolution Semantic Encoders fo… | 0.55 | 2025-06-03 |
| Bagel | Emerging Properties in Unified Multimodal Pretrai… | 0.55 | 2025-05-20 |
| Playground-v2.5-1024px-aesthetic | Playground v2.5: Three Insights towards Enhancing… | 0.58 | 2024-02-27 |
| Bagel (w/ cot) | Emerging Properties in Unified Multimodal Pretrai… | 0.69 | 2025-05-20 |
| MindOmni (w/ cot) | MindOmni: Unleashing Reasoning Generation in Visi… | 0.70 | 2025-05-19 |