Simplify Your Law: Using Information Theory to Deduplicate Legal Documents

10/02/2021
by   Corinna Coupette, et al.
0

Textual redundancy is one of the main challenges to ensuring that legal texts remain comprehensible and maintainable. Drawing inspiration from the refactoring literature in software engineering, which has developed methods to expose and eliminate duplicated code, we introduce the duplicated phrase detection problem for legal texts and propose the Dupex algorithm to solve it. Leveraging the Minimum Description Length principle from information theory, Dupex identifies a set of duplicated phrases, called patterns, that together best compress a given input text. Through an extensive set of experiments on the Titles of the United States Code, we confirm that our algorithm works well in practice: Dupex will help you simplify your law.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2021

Law Smells: Defining and Detecting Problematic Patterns in Legal Drafting

Building on the computer science concept of code smells, we initiate the...
research
12/29/2021

LeSICiN: A Heterogeneous Graph-based Approach for Automatic Legal Statute Identification from Indian Legal Documents

The task of Legal Statute Identification (LSI) aims to identify the lega...
research
11/01/2020

Lawmaps: Enabling Legal AI development through Visualisation of the Implicit Structure of Legislation and Lawyerly Process

Modelling that exploits visual elements and information visualisation ar...
research
05/10/2022

Metamorphic Testing and Debugging of Tax Preparation Software

This paper presents a data-driven debugging framework to improve the tru...
research
10/15/2019

The NAI Suite – Drafting and Reasoning over Legal Texts

A prototype for automated reasoning over legal texts, called NAI, is pre...
research
01/27/2021

Measuring Law Over Time: A Network Analytical Framework with an Application to Statutes and Regulations in the United States and Germany

How do complex social systems evolve in the modern world? This question ...
research
11/24/2011

The Network of French Legal Codes

We propose an analysis of the codified Law of France as a structured sys...

Please sign up or login with your details

Forgot password? Click here to reset