| CN-CLIP (ViT-H/14) | Chinese CLIP: Contrastive Vision-Language Pretrai… | 81.50 | 2022-11-02 |
| CN-CLIP (ViT-L/14@336px) | Chinese CLIP: Contrastive Vision-Language Pretrai… | 80.10 | 2022-11-02 |
| R2D2 (ViT-L/14) | CCMB: A Large-scale Chinese Cross-modal Benchmark | 79.10 | 2022-05-08 |
| CN-CLIP (ViT-L/14) | Chinese CLIP: Contrastive Vision-Language Pretrai… | 78.90 | 2022-11-02 |
| CN-CLIP (ViT-B/16) | Chinese CLIP: Contrastive Vision-Language Pretrai… | 77.00 | 2022-11-02 |
| R2D2 (ViT-B) | CCMB: A Large-scale Chinese Cross-modal Benchmark | 75.10 | 2022-05-08 |
| Wukong (ViT-L/14) | Wukong: A 100 Million Large-scale Chinese Cross-m… | 74.00 | 2022-02-14 |
| Wukong (ViT-B/32) | Wukong: A 100 Million Large-scale Chinese Cross-m… | 67.00 | 2022-02-14 |
| CN-CLIP (RN50) | Chinese CLIP: Contrastive Vision-Language Pretrai… | 66.80 | 2022-11-02 |