iSarcasm is a dataset of tweets, each labelled as either sarcastic or non_sarcastic. Each sarcastic tweet is further labelled for one of the following types of ironic speech:
For each sarastic tweet, there's also:
iSarcasm contains 4,484 tweets, out of which 777 are labelled as sarcastic and 3,707 as non-sarcastic. You'll find two files, isarcasm_train.csv and isarcasm_test.csv, each containing 80% and 20% of the examples chosen at random, respectively. Each line in a file has the format tweet_id,sarcasm_label,sarcasm_type, where sarcasm_type are only defined for sarcastic tweets, as specified above.
Source: iSarcasm
Variants: iSarcasm
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Sarcasm Detection | RoBERTa + Mutation Data Augmentation | UTNLP at SemEval-2022 Task 6: … | 2022-04-18 |
Recent papers with results on this dataset: