ToolBench

Name: ToolBench
Published: 2023-07-31
License: Apache-2.0 license

Dataset Information

Modalities

Texts

Languages

English

Introduced

2023

License

Apache-2.0 license

Homepage

Official Website

Contents

Overview
Associated Benchmarks
Recent Benchmark Submissions
Research Papers

Overview

ToolBench is an instruction-tuning dataset for tool use, which is created automatically using ChatGPT. Specifically, the authors collect 16,464 real-world RESTful APIs spanning 49 categories from RapidAPI Hub, then prompt ChatgPT to generate diverse human instructions involving these APIs, covering both single-tool and multi-tool scenarios.

Variants: ToolBench

Associated Benchmarks

This dataset is used in 1 benchmark:

Trajectory Planning - Metrics: Win rate

Recent Benchmark Submissions

Task	Model	Paper	Date
Trajectory Planning	GPT4-TOPGUN	SwissNYF: Tool Grounded LLM Agents …	2024-02-15
Trajectory Planning	Attention Bucket	Fortify the Shortest Stave in …	2023-12-07
Trajectory Planning	GPT4- DFSDT	ToolLLM: Facilitating Large Language Models …	2023-07-31

Research Papers

Recent papers with results on this dataset:

External Links:

ToolBench

Overview edit

Associated Benchmarks

Recent Benchmark Submissions

Research Papers

Edit Dataset Information

Overview