This page provides background information for a proposed extension of schema.org dedicated to Terms of Services and Privacy Notices (i.e. legal documents). This extension is part of the overall LegiCrowd project and, more specifically, of LegiCrowd Onto, the part of the project dedicated to building an ontology for the description of such legal documents.
This extension was built by combining a top-down approach, drawing on sources such as P3P, previous work on extending P3P with GDPR-specific input for data and policies, and cases from ToS;DR, with a bottom-up approach built around the LegiCrowd annotation environment.
See also our partners and acknowledgements page.
Modeling online legal documents requires handling several types of objects:
- Legal documents themselves, which need to be placed in the schema.org type hierarchy.
- User data, which encompasses a wide range of data types, including personal data (such as a user’s first name) and sensitive data (for example, about their beliefs or health).
- Actions performed by the platform itself or by the users (e.g. collecting or sharing data, inputting content or cancelling a contract).
- (Note: introductory section to be completed).
New types: LegalDocument, PrivacyNotice
The current version of schema.org models Terms of Services as a property. We propose a more generic approach: a legal document is a type of WebPage and has several subtypes, namely Privacy Notices and Terms of Services. The model is therefore as follows (nodes with a + sign are proposed new types):
In our case, PrivacyNotice and Terms of Services do not differ, but this model leaves room for properties specific to each. Note that the properties attached to the current termsOfService can be inherited from the proposed LegalDocument type.
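The hierarchy described above can be sketched as a small parent table. This is only an illustration under stated assumptions: the chain Thing > CreativeWork > WebPage is the existing schema.org one, LegalDocument and PrivacyNotice are the proposed types, and TermsOfService is a placeholder name for the Terms of Services subtype, whose final identifier may differ.

```python
# Parent of each type. Entries marked "+" are proposed new types.
PARENT = {
    "CreativeWork": "Thing",
    "WebPage": "CreativeWork",
    "LegalDocument": "WebPage",          # + proposed
    "PrivacyNotice": "LegalDocument",    # + proposed
    "TermsOfService": "LegalDocument",   # + proposed, placeholder name (assumption)
}


def ancestors(type_name):
    """Return the supertypes of type_name, from its direct parent up to Thing."""
    chain = []
    while type_name in PARENT:
        type_name = PARENT[type_name]
        chain.append(type_name)
    return chain


def inherits_from(type_name, candidate):
    """True if candidate is one of the supertypes of type_name."""
    return candidate in ancestors(type_name)


if __name__ == "__main__":
    for t in ("PrivacyNotice", "TermsOfService"):
        print(t, "<", " < ".join(ancestors(t)))
```

Because both subtypes sit under LegalDocument, any property attached to LegalDocument (or to WebPage and CreativeWork above it) is available to both without being declared twice.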
New type: WebpageSection
We propose to create a WebpageSection type to describe the content of each section of a web page through the hasPart property inherited from the CreativeWork type. Though a section is not a creative work per se, the hasPart property provides some leeway, as it is described as “Indicat[ing] an item or CreativeWork that is part of this item, or CreativeWork (in some sense).”
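As a rough illustration of how sections could hang off a page through hasPart, the sketch below builds a JSON-LD-style description as a Python dictionary. It assumes the proposed types are resolvable from the schema.org context, and it assumes that a WebpageSection carries name and text properties; neither is stated in the proposal, and the section titles and URL are invented for the example.

```python
import json


def make_section(name, text):
    """Build one WebpageSection node (name and text are assumed properties)."""
    return {"@type": "WebpageSection", "name": name, "text": text}


# Hypothetical privacy notice split into two sections.
notice = {
    "@context": "https://schema.org",  # assumes the proposed types live here
    "@type": "PrivacyNotice",
    "url": "https://example.org/privacy",
    "hasPart": [
        make_section("Data we collect", "We collect your first name and email."),
        make_section("Data sharing", "We do not share your data with third parties."),
    ],
}

if __name__ == "__main__":
    print(json.dumps(notice, indent=2))
```

Each section stays an independent node, so annotations produced for a single section can be attached to it without touching the rest of the page description.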
New property: readability, of type ratingValue, attached to type WebPage
The existing WebPage type receives the property readability, which describes the readability level of the page's text. It is a kind of ratingValue, from which it inherits values of type text (for example: High, Low) or number (e.g. a Flesch index).
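The two kinds of value can be illustrated as follows. The sketch assumes that the index mentioned above is the Flesch Reading Ease score, computed with its standard formula from word, sentence and syllable counts that are supplied by the caller; the helper names and the example page are hypothetical.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease: higher scores mean easier text."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def with_readability(page, value):
    """Return a copy of a WebPage description with a readability value.

    The value may be text (e.g. "High", "Low") or a number.
    """
    if isinstance(value, bool) or not isinstance(value, (str, int, float)):
        raise TypeError("readability must be text or a number")
    return {**page, "readability": value}


page = {"@context": "https://schema.org", "@type": "WebPage", "name": "Terms"}

# Numeric variant: counts are made up for the example.
numeric = with_readability(page, round(flesch_reading_ease(100, 5, 150), 1))
# Text variant.
textual = with_readability(page, "Low")
```

Keeping the computation separate from the annotation means a different readability measure could be substituted without changing how the property is attached.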
Alternatively, we could use the jobTitle property of the Person type, but we think that this type of responsible person is highly relevant in the context of legal documents. See discussion here.
The Person and Organization types of schema.org come with a large set of properties (e.g. name, email address, postal address…) which are useful for annotating legal documents.
The handling (e.g. protecting, collecting, sharing) of users’ personal data is a core topic of online legal documents. A wide range of personal data can already be described with the existing types and properties, but these data are not marked as personal data. To achieve this, we propose to create a PersonalData type as a subtype of Intangible, to which existing or new personal data types are attached, rather than to create a specific property, which appears difficult to attach to some existing data types.
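One way to picture this "marking by type" is a subclass table in which a type may have more than one parent. The sketch below is an assumption-laden illustration: attaching the existing PostalAddress type to PersonalData and the new HealthData type are examples chosen here, not part of the proposal, while the other links follow the existing schema.org hierarchy.

```python
# Direct supertypes of each type; a type may have several.
SUBCLASS_OF = {
    "Intangible": ["Thing"],
    "PersonalData": ["Intangible"],                     # proposed type
    "StructuredValue": ["Intangible"],
    "ContactPoint": ["StructuredValue"],
    "PostalAddress": ["ContactPoint", "PersonalData"],  # existing type, illustrative attachment
    "HealthData": ["PersonalData"],                     # hypothetical new type
    "Offer": ["Intangible"],
}


def is_personal_data(type_name):
    """True if type_name is PersonalData or has it among its supertypes."""
    stack, seen = [type_name], set()
    while stack:
        current = stack.pop()
        if current == "PersonalData":
            return True
        if current in seen:
            continue
        seen.add(current)
        stack.extend(SUBCLASS_OF.get(current, []))
    return False


if __name__ == "__main__":
    for t in ("PostalAddress", "HealthData", "Offer"):
        print(t, is_personal_data(t))
```

With this design an existing type keeps its original place in the hierarchy and its properties; being personal data is simply one more supertype, so no per-property flag is needed.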
Several types of PersonalData are needed to describe the content of a legal document. We propose the following list of types: