Crisscrossed Captions
Crisscrossed Captions (CxC) contains 247,315 human-labeled annotations including positive and negative associations between image pairs, caption pairs and image-caption pairs.
Image source: Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO
Variants: CxC
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Semantic Textual Similarity | PromCSE-RoBERTa-large (0.355B) | Improved Universal Sentence Embeddings with … | 2022-03-14 |
Semantic Textual Similarity | ALIGN-L2 | MURAL: Multimodal, Multitask Retrieval Across … | 2021-09-10 |
Semantic Textual Similarity | DE-T2T+I2T | MURAL: Multimodal, Multitask Retrieval Across … | 2021-09-10 |
Semantic Textual Similarity | MURAL-large | MURAL: Multimodal, Multitask Retrieval Across … | 2021-09-10 |
Recent papers with results on this dataset: