LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

08/20/2023
by   Neel Guha, et al.
0

The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisciplinary process, in which we collected tasks designed and hand-crafted by legal professionals. Because these subject matter experts took a leading role in construction, tasks either measure legal reasoning capabilities that are practically useful, or measure reasoning skills that lawyers find interesting. To enable cross-disciplinary conversations about LLMs in the law, we additionally show how popular legal frameworks for describing legal reasoning – which distinguish between its many forms – correspond to LegalBench tasks, thus giving lawyers and LLM developers a common vocabulary. This paper describes LegalBench, presents an empirical evaluation of 20 open-source and commercial LLMs, and illustrates the types of research explorations LegalBench enables.

READ FULL TEXT

page 19

page 30

page 31

research
08/11/2023

Large Language Models in Cryptocurrency Securities Cases: Can ChatGPT Replace Lawyers?

Large Language Models (LLMs) could enhance access to the legal system. H...
research
09/13/2022

LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

Can foundation models be guided to execute tasks involving legal reasoni...
research
07/17/2023

Legal Syllogism Prompting: Teaching Large Language Models for Legal Judgment Prediction

Legal syllogism is a form of deductive reasoning commonly used by legal ...
research
12/14/2017

Passing the Brazilian OAB Exam: data preparation and some experiments

In Brazil, all legal professionals must demonstrate their knowledge of t...
research
06/12/2023

Large Language Models as Tax Attorneys: A Case Study in Legal Capabilities Emergence

Better understanding of Large Language Models' (LLMs) legal analysis abi...
research
09/21/2020

The Next Era of American Law Amid the Advent of Autonomous AI Legal Reasoning

Legal scholars have postulated that there have been three eras of Americ...
research
07/01/2022

Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

One concern with the rise of large language models lies with their poten...

Please sign up or login with your details

Forgot password? Click here to reset