A Categorical Archive of ChatGPT Failures

02/06/2023
by Ali Borji, et al.

Large language models have proven valuable across many fields. ChatGPT, developed by OpenAI, was trained on massive amounts of data and simulates human conversation by comprehending context and generating appropriate responses. It has attracted significant attention for answering a broad range of human inquiries fluently and comprehensively, surpassing prior public chatbots in both security and usefulness. However, a comprehensive analysis of ChatGPT's failures is lacking, and that gap is the focus of this study. Eleven categories of failures, including reasoning, factual errors, math, coding, and bias, are presented and discussed. The risks, limitations, and societal implications of ChatGPT are also highlighted. The goal of this study is to help researchers and developers improve future language models and chatbots.


research
08/09/2023

An Empirical Study on Using Large Language Models to Analyze Software Supply Chain Security Failures

As we increasingly depend on software systems, the consequences of breac...
research
12/18/2022

Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model

The emergence of large pretrained models has enabled language models to ...
research
08/11/2023

Neural Conversation Models and How to Rein Them in: A Survey of Failures and Fixes

Recent conditional language models are able to continue any kind of text...
research
06/22/2023

Visual Adversarial Examples Jailbreak Large Language Models

Recently, there has been a surge of interest in introducing vision into ...
research
03/29/2023

Advances in apparent conceptual physics reasoning in GPT-4

ChatGPT is built on a large language model trained on an enormous corpus...
research
01/18/2023

How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection

The introduction of ChatGPT has garnered widespread attention in both ac...
research
05/17/2019

The Unexpected Unexpected and the Expected Unexpected: How People's Conception of the Unexpected is Not That Unexpected

The answers people give when asked to 'think of the unexpected' for ever...
