PGDP5K

Plane Geometry Diagram Parsing Dataset

Dataset Information
Modalities: Images
Languages: English
Introduced: 2022
License: Unknown

Overview

PGDP5K is a dataset of 5,000 diagram samples. The diagrams are composed of 16 shapes and cover 5 positional relations, 22 symbol types, and 6 text types. Each sample is labeled with fine-grained, primitive-level annotations, including primitive classes, locations, and relationships. Of the 5,000 images, 1,813 non-duplicated images are selected from the Geometry3K dataset; the other 3,187 are collected from three popular textbooks across grades 6-12, captured as screenshots of PDF books on mathematics curriculum websites.
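The composition figures above can be sanity-checked with a few lines of Python. This is only an illustrative sketch: the counts come from the overview, while the class, field, and function names (`SourceSplit`, `ANNOTATION_VOCAB`, `total_images`, `source_fractions`) are hypothetical and do not correspond to any official PGDP5K loader or file format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SourceSplit:
    """One image source of the dataset (hypothetical structure)."""
    name: str
    num_images: int


# Counts taken from the overview; the labels are illustrative only.
SOURCES = [
    SourceSplit("Geometry3K (non-duplicated)", 1813),
    SourceSplit("Textbook screenshots", 3187),
]

# Sizes of the annotation vocabularies stated in the overview.
ANNOTATION_VOCAB = {
    "shapes": 16,
    "positional_relations": 5,
    "symbol_types": 22,
    "text_types": 6,
}


def total_images(sources):
    """Sum the image counts over all sources."""
    return sum(s.num_images for s in sources)


def source_fractions(sources):
    """Return each source's share of the full dataset."""
    total = total_images(sources)
    return {s.name: s.num_images / total for s in sources}


if __name__ == "__main__":
    print("Total images:", total_images(SOURCES))
    for name, frac in source_fractions(SOURCES).items():
        print(f"{name}: {frac:.1%}")
```

The two source counts sum to the stated 5,000 samples, with roughly a third of the images coming from Geometry3K and the rest from textbook screenshots.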

Variants: PGDP5K

Associated Benchmarks

This dataset is used in 1 benchmark.

Recent Benchmark Submissions

Task | Model | Paper | Date
Scene Parsing | PGDPNet | Plane Geometry Diagram Parsing | 2022-05-19
Scene Parsing | Inter-GPS | Inter-GPS: Interpretable Geometry Problem Solving … | 2021-05-10
