SheetCopilot

Dataset Information
Modalities
Tables
Languages
English
Introduced
2023
License
Unknown
Homepage

Overview

The SheetCopilot dataset contains 28 evaluation workbooks and 221 spreadsheet manipulation tasks that are applied to these workbooks. These tasks involve diverse atomic actions related to six task categories (i.e. Entry and manipulation, Formatting, Management, Charts, Pivot Table, and Formula).

Dataset statistics:

  1. Each task possesses one or more ground truth solutions.

  2. The lengths of the task instructions range from 20 to 530 characters, with most tasks between 80 and 110 characters.

  3. The number of atomic actions required by each task ranges from 1 to 9.

Evaluation metrics:

  1. Execution success rate, pass rate, and the number of used actions are evaluated to judge the functional correctness and efficiency of a method.

  2. A submitted solution is considered correct if the properties to be checked match those of any of the GT solutions of the corresponding task.

Please download the full datasets in our Github Repo:

https://github.com/BraveGroup/SheetCopilot

Thanks for using our dataset!

Variants: SheetCopilot

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Robot Task Planning SheetAgent (GPT-3.5) SheetAgent: Towards A Generalist Agent … 2024-03-06
Robot Task Planning SheetCopilot (NIPS2023) SheetCopilot: Bringing Software Productivity to … 2023-05-30

Research Papers

Recent papers with results on this dataset: