I2L-140K

Dataset Information
Introduced
2018
License
Unknown
Homepage

Overview

Introduced by Singh, Sumeet S.. “Teaching Machines to Code: Neural Markup Generation with Visual Attention.” ArXiv abs/1802.05415 (2018): n. pag.

A prebuilt dataset for OpenAI's task for image-2-latex system. Includes total of ~140k formulas and images splitted into train, validation and test sets. Superset of im2latex-100K dataset.

Variants: I2L-140K

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Optical Character Recognition (OCR) I2L-NOPOOL Teaching Machines to Code: Neural … 2018-02-15
Optical Character Recognition (OCR) I2L-STRIPS Teaching Machines to Code: Neural … 2018-02-15

Research Papers

Recent papers with results on this dataset: