📊 Showing 2 results | 📏 Metric: Image-to-text R@1
Rank | Model | Paper | Image-to-text R@1 | Date | Code |
---|---|---|---|---|---|
1 | VLPCook | Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval | 45.20 | 2022-12-08 | 📦 mshukor/vlpcook |
2 | Marin et al. | Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images | 17.00 | 2018-10-14 | - |