Putting GPT-3's Creativity to the (Alternative Uses) Test

06/10/2022
by   Claire Stevenson, et al.
0

AI large language models have (co-)produced amazing written works from newspaper articles to novels and poetry. These works meet the standards of the standard definition of creativity: being original and useful, and sometimes even the additional element of surprise. But can a large language model designed to predict the next text fragment provide creative, out-of-the-box, responses that still solve the problem at hand? We put Open AI's generative natural language model, GPT-3, to the test. Can it provide creative solutions to one of the most commonly used tests in creativity research? We assessed GPT-3's creativity on Guilford's Alternative Uses Test and compared its performance to previously collected human responses on expert ratings of originality, usefulness and surprise of responses, flexibility of each set of ideas as well as an automated method to measure creativity based on the semantic distance between a response and the AUT object in question. Our results show that – on the whole – humans currently outperform GPT-3 when it comes to creative output. But, we believe it is only a matter of time before GPT-3 catches up on this particular task. We discuss what this work reveals about human and AI creativity, creativity testing and our definition of creativity.

READ FULL TEXT
research
06/26/2023

Automatic Assessment of Divergent Thinking in Chinese Language with TransDis: A Transformer-Based Language Model Approach

Language models have been increasingly popular for automatic creativity ...
research
07/17/2023

AI for the Generation and Testing of Ideas Towards an AI Supported Knowledge Development Environment

New systems employ Machine Learning to sift through large knowledge sour...
research
01/03/2023

Large Language Models as Corporate Lobbyists

We demonstrate a proof-of-concept of a large language model conducting c...
research
10/10/2022

The Minimum Wage as an Anchor: Effects on Determinations of Fairness by Humans and AI

I study the role of minimum wage as an anchor for judgements of the fair...
research
08/18/2022

Using Large Language Models to Simulate Multiple Humans

We propose a method for using a large language model, such as GPT-3, to ...
research
08/03/2023

Large Language Model Displays Emergent Ability to Interpret Novel Literary Metaphors

Recent advances in the performance of large language models (LLMs) have ...
research
08/30/2023

Quantifying Uncertainty in Answers from any Language Model via Intrinsic and Extrinsic Confidence Assessment

We introduce BSDetector, a method for detecting bad and speculative answ...

Please sign up or login with your details

Forgot password? Click here to reset