Extractive summarisation of legal texts

Ben Hachey*, Claire Grover

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

73 Citations (Scopus)

Abstract

We describe research carried out as part of a text summarisation project for the legal domain, for which we use a new XML corpus of judgments of the UK House of Lords. These judgments represent a particularly important part of public discourse due to the role that precedents play in English law. We present experimental results using a range of features and machine learning techniques for the task of predicting the rhetorical status of sentences and for the task of selecting the most summary-worthy sentences from a document. Results for these components are encouraging, as they achieve state-of-the-art accuracy using robust, automatically generated cue phrase information. Sample output from the system illustrates the potential of summarisation technology for legal information management systems and highlights the utility of our rhetorical annotation scheme as a model of legal discourse, which provides a clear means for structuring summaries and tailoring them to different types of users.

Original language: English
Pages (from-to): 305-345
Number of pages: 41
Journal: Artificial Intelligence and Law
Volume: 14
Issue number: 4
Publication status: Published - Dec 2006

Keywords

  • Automatic text summarisation
  • Knowledge management
  • Legal discourse
  • Machine learning
  • Natural language processing
  • XML
