The Limits of Annotation in Machine Learning a Documents Hohfeldian Legal Entities

Ahmed Izzidien

doi:10.33774/coe-2021-dqwvg

Language and Linguistics

Search within Language and Linguistics

The Limits of Annotation in Machine Learning a Documents Hohfeldian Legal Entities

15 November 2021, Version 1

Poster

Ahmed Izzidien

Show author details

This content is an early or alternative research output and has not been peer-reviewed by Cambridge University Press at the time of posting.

Abstract

Natural language processing (NLP) summarisers aim to capture the essential elements of a document. Yet, the ontological character of a summary can be domain specific. In legal analysis, the Hohfeldian matrix is used to summarise principle legal relations between agents, such as individuals and organisations. We test a limit of using machine learning (ML) to detect such agents. Based on training with our 2400 hand labelled annotations, an F1= 80.1 is found. Extrapolating this suggests that over one million annotations are required to capture all the agents mentioned in a document. This questions the feasibility of such an approach, one that is unable to be inclusive of all agents who are party to a legal relation. Such complete capture is an essential criteria of fair ML and accurate legal summaries. An alternative approach based on hypernymy is suggested.

Keywords

Hohfeld

Fair Machine Learning

Ontology

Contract Analysis

Legal Artificial Intelligence

Comments

Comments are not moderated before they are posted, but they can be removed by the site moderators if they are found to be in contravention of our Commenting and Discussion Policy - please read this policy before you post. Comments should be used for scholarly discussion of the content in question. You can find more information about how to use the commenting feature here .

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Version History

Nov 15, 2021 Version 1

Metrics

234

Views

Downloads

Citations

License

The content is available under CC BY 4.0

DOI

10.33774/coe-2021-dqwvg

Funding

NGI Trust

825618

Isaac Newton Trust

20.40 (I)

The Psychometrics Centre

CJBS Small Research Grant Scheme

Author’s competing interest statement

The author(s) have declared they have no conflict of interest with regard to this content

Ethics

The author(s) have declared ethics committee/IRB approval is not relevant to this content

Conference

Cambridge Language Sciences Annual Symposium 2021

The Limits of Annotation in Machine Learning a Documents Hohfeldian Legal Entities

Authors

Abstract

Keywords

Comments

Version History

Metrics

License

DOI

Funding

Author’s competing interest statement

Ethics

Conference

Share