Chengwei Qin Nanyang Technological University, ♣ Shanghai Jiao Tong University ♠ Georgia Institute of Technology ♦ Stanford University, Aston Zhang, Zhuosheng Zhang, Jiaao Chen, Michihiro Yasunaga, Diyi Yang, Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu, Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc V Le, Ed Chi, Denny 2022 Zhou, Rationale, Maarten Bosma, Vincent Y Zhao, Kelvin Guu, Adams Wei Yu, Brian Lester, Nan Du, An- Drew M Dai, Finetuned Lan-, Xuezhi Wang, Yu Wu, Wei Wu, Shuohang Wang, Yichong Xu, Yuwei Fang, Wenhao Yu, Yang Liu, Hai Zhao, Chen- Guang Zhu, Michael 2022 Zeng, Mu Li, Nathanael Schärli, Le Hou, Xiangyang Zhou, Lu Li, Daxiang Dong, Yi Liu, Ying Chen, Wayne Xin Zhao, Dianhai Yu, Hua Wu, Association for Computational Linguistics SeattleUnited States, Nathan Scales Dale Schuurmans, Olivier Bousquet, Quoc Le, and Ed Chi2022Xuezhi Wang (2023)
This paper investigates whether ChatGPT functions effectively as a general-purpose natural language processing (NLP) task solver. It examines the model's zero-shot learning capabilities across 20 diverse NLP tasks. The results indicate that while ChatGPT performs well on reasoning tasks and dialogue generation, it struggles with specific tasks like sequence tagging and summarization. The study also provides qualitative analyses and comparative evaluations against other models, highlighting both strengths and limitations of ChatGPT in NLP scenarios.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: