Causal Parrots: Large Language Models May Talk Causality But Are Not Causal

08/24/2023
by   Matej Zečević, et al.
0

Some argue scale is all what is needed to achieve AI, covering even causal models. We make it clear that large language models (LLMs) cannot be causal and give reason onto why sometimes we might feel otherwise. To this end, we define and exemplify a new subgroup of Structural Causal Model (SCM) that we call meta SCM which encode causal facts about other SCM within their variables. We conjecture that in the cases where LLM succeed in doing causal inference, underlying was a respective meta SCM that exposed correlations between causal facts in natural language on whose data the LLM was ultimately trained. If our hypothesis holds true, then this would imply that LLMs are like parrots in that they simply recite the causal knowledge embedded in the data. Our empirical analysis provides favoring evidence that current LLMs are even weak `causal parrots.'

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/08/2023

Axiomatization of Interventional Probability Distributions

Causal intervention is an essential tool in causal inference. It is axio...
research
06/09/2023

Can Large Language Models Infer Causation from Correlation?

Causal inference is one of the hallmarks of human intelligence. While th...
research
03/07/2023

Can large language models build causal graphs?

Building causal graphs can be a laborious process. To ensure all relevan...
research
02/18/2023

Improving the Out-Of-Distribution Generalization Capability of Language Models: Counterfactually-Augmented Data is not Enough

Counterfactually-Augmented Data (CAD) has the potential to improve langu...
research
06/12/2020

Learning Causal Models Online

Predictive models – learned from observational data not covering the com...
research
06/14/2022

Can Foundation Models Talk Causality?

Foundation models are subject to an ongoing heated debate, leaving open ...
research
07/28/2022

Measuring Causal Effects of Data Statistics on Language Model's `Factual' Predictions

Large amounts of training data are one of the major reasons for the high...

Please sign up or login with your details

Forgot password? Click here to reset