VulScribeR

VulScriber: 22K+ unfiltered vul samples generated with ChatGPT via Injection

Dataset Information
Modalities
Texts
Languages
English
Introduced
2024
License
Unknown

Overview

Datasets are listed in the repository's readme file. This one is extra and yields 20K+ items after filtering with a fuzzy parser.

Variants: VulScribeR

Associated Benchmarks

This dataset is used in 1 benchmark:

Recent Benchmark Submissions

Task Model Paper Date
Vulnerability Detection Reveal Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) VulScribeR: Exploring RAG-based Vulnerability Augmentation … 2024-08-07

Research Papers

Recent papers with results on this dataset: