WTW

Wired Table in the Wild

Dataset Information
Modalities: Images
Introduced: 2021

Overview

WTW (Wired Table in the Wild) is a large-scale dataset with well-annotated table structure parsing labels. It covers tables of multiple styles in several scenes, such as photos, scanned files, and web pages.

The WTW dataset has 10,970 training samples and 3,611 test samples. The test images are divided into 7 challenging categories.

Both the training and test sets contain images and labels. The labels are in XML format and give, for each cell, a bounding box and structure labels: start row, end row, start column, end column, and table ID. In addition, the test set contains a separate file that describes the sub-classification information for each image.
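The label layout described above can be read with a few lines of Python. The following is a minimal sketch, not the official loader: the tag names (`bndbox`, `xmin`, `startrow`, `tableid`, and so on), the sample annotation, and the `Cell` and `parse_cells` names are all assumptions made for illustration, so check them against the actual WTW XML files before relying on them.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass

# Hypothetical annotation: the tag names below are assumed, not confirmed
# against the real WTW schema.
SAMPLE_XML = """
<annotation>
  <object>
    <bndbox>
      <xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>60</ymax>
      <startrow>0</startrow><endrow>0</endrow>
      <startcol>0</startcol><endcol>1</endcol>
      <tableid>0</tableid>
    </bndbox>
  </object>
  <object>
    <bndbox>
      <xmin>10</xmin><ymin>60</ymin><xmax>60</xmax><ymax>100</ymax>
      <startrow>1</startrow><endrow>1</endrow>
      <startcol>0</startcol><endcol>0</endcol>
      <tableid>0</tableid>
    </bndbox>
  </object>
</annotation>
"""


@dataclass
class Cell:
    bbox: tuple  # (xmin, ymin, xmax, ymax) in pixels
    start_row: int
    end_row: int
    start_col: int
    end_col: int
    table_id: int

    @property
    def row_span(self) -> int:
        # Row indices are inclusive, so a cell on a single row has span 1.
        return self.end_row - self.start_row + 1

    @property
    def col_span(self) -> int:
        return self.end_col - self.start_col + 1


def parse_cells(xml_text: str) -> list:
    """Collect one Cell per <bndbox> element found in the annotation."""
    root = ET.fromstring(xml_text.strip())
    cells = []
    for box in root.iter("bndbox"):
        def num(tag: str) -> int:
            # Go through float first so values such as "10.0" also parse.
            return int(float(box.findtext(tag)))

        cells.append(
            Cell(
                bbox=(num("xmin"), num("ymin"), num("xmax"), num("ymax")),
                start_row=num("startrow"),
                end_row=num("endrow"),
                start_col=num("startcol"),
                end_col=num("endcol"),
                table_id=num("tableid"),
            )
        )
    return cells


cells = parse_cells(SAMPLE_XML)
for cell in cells:
    print(cell.table_id, cell.bbox, cell.row_span, cell.col_span)
```

Grouping the resulting cells by `table_id` would separate images that contain more than one table, and the inclusive row and column indices give each cell's span directly.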

Variants: WTW

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task | Model | Paper | Date
Table Recognition | StrucTexTv2 (small) | StrucTexTv2: Masked Visual-Textual Prediction for … | 2023-03-01

Research Papers

Recent papers with results on this dataset: