MAVE

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

Dataset Information
Modalities
Texts
Languages
English
Introduced
2021
Homepage

Overview

The dataset contains 3 million attribute-value annotations across 1257 unique categories created from 2.2 million cleaned Amazon product profiles.
It is a large, multi-sourced, diverse dataset for product attribute extraction study.

Variants: MAVE

Associated Benchmarks

This dataset is used in 2 benchmarks:

Recent Benchmark Submissions

Task Model Paper Date
Attribute Mining T5 Large - End2End An Empirical Comparison of Generative … 2024-07-01
Attribute Value Extraction MAVEQA MAVE: A Product Dataset for … 2021-12-16
Attribute Value Extraction AVEQA MAVE: A Product Dataset for … 2021-12-16
Attribute Value Extraction AD-Opentag MAVE: A Product Dataset for … 2021-12-16

Research Papers

Recent papers with results on this dataset: