AE-110k

AliExpress - 110k

Dataset Information
Modalities
Texts
Languages
English
Introduced
2019
License
Unknown
Homepage

Overview

The dataset contains product information from AliExpress Sports & Entertainment category. Each attribute value in "Item Specific" is matched against the product title using exact string match to generate positive triples . Negative triples are randomly generated. Each triple is stored in a line and separated by \u0001.

Variants: AE-110k

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Attribute Mining T5 Large - End2End An Empirical Comparison of Generative … 2024-07-01
Attribute Value Extraction GPT-4-json-val-10-dem ExtractGPT: Exploring the Potential of … 2023-10-19
Attribute Value Extraction ft-GPT-3.5-json-val ExtractGPT: Exploring the Potential of … 2023-10-19

Research Papers

Recent papers with results on this dataset: