FlickrStyle10K is collected and built on Flickr30K image caption dataset. The original FlickrStyle10K dataset has 10,000 pairs of images and stylized captions including humorous and romantic styles. However, only 7,000 pairs from the official training set are now publicly accessible. The dataset can be downloaded via https://zhegan27.github.io/Papers/FlickrStyle_v0.9.zip
Variants: FlickrStyle10K
This dataset is used in 2 benchmarks:
Task | Model | Paper | Date |
---|---|---|---|
Image Captioning | CapDec | Text-Only Training for Image Captioning … | 2022-11-01 |
Semi Supervised Learning for Image Captioning | CapDec | Text-Only Training for Image Captioning … | 2022-11-01 |
Recent papers with results on this dataset: