Establishing a Strong Baseline for Privacy Policy Classification

Najmeh Mousavi Nejad; Pablo Jabat; Rostislav Nedelchev; Simon Scerri; Damien Graux

doi:10.1007/978-3-030-58201-2_25

Conference Papers Year : 2020

Establishing a Strong Baseline for Privacy Policy Classification

(1, 2) , (3) , (1) , (2) , (4)

1
2
3
4

Najmeh Mousavi Nejad

Function : Author
PersonId : 1117588

Universität Bonn = University of Bonn

Fraunhofer Institute for Intelligent Analysis and Information Systems

Pablo Jabat

Function : Author
PersonId : 1117589

Company Watch Ltd

Rostislav Nedelchev

Function : Author
PersonId : 1117590

Universität Bonn = University of Bonn

Simon Scerri

Function : Author
PersonId : 1117591

Fraunhofer Institute for Intelligent Analysis and Information Systems

Damien Graux

Function : Author
PersonId : 1117592

Trinity College Dublin

Abstract

Digital service users are routinely exposed to Privacy Policy consent forms, through which they enter contractual agreements consenting to the specifics of how their personal data is managed and used. Nevertheless, despite renewed importance following legislation such as the European GDPR, a majority of people still ignore policies due to their length and complexity. To counteract this potentially dangerous reality, in this paper we present three different models that are able to assign pre-defined categories to privacy policy paragraphs, using supervised machine learning. In order to train our neural networks, we exploit a dataset containing 115 privacy policies defined by US companies. An evaluation shows that our approach outperforms state-of-the-art by 5% over comparable and previously-reported F1 values. In addition, our method is completely reproducible since we provide open access to all resources. Given these two contributions, our approach can be considered as a strong baseline for privacy policy classification.

Keywords

Domains

Computer Science [cs]

Fichier principal

497034_1_En_25_Chapter.pdf (358 Ko)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-03440825

Submitted on : Monday, November 22, 2021-3:32:23 PM

Last modification on : Friday, June 23, 2023-4:24:04 PM

Long-term archiving on : Wednesday, February 23, 2022-7:57:14 PM

Dates and versions

hal-03440825 , version 1 (22-11-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03440825 , version 1
DOI : 10.1007/978-3-030-58201-2_25

Cite

Najmeh Mousavi Nejad, Pablo Jabat, Rostislav Nedelchev, Simon Scerri, Damien Graux. Establishing a Strong Baseline for Privacy Policy Classification. 35th IFIP International Conference on ICT Systems Security and Privacy Protection (SEC), Sep 2020, Maribor, Slovenia. pp.370-383, ⟨10.1007/978-3-030-58201-2_25⟩. ⟨hal-03440825⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP IFIP-AICT IFIP-TC IFIP-TC11 IFIP-SEC

114 View

201 Download

Establishing a Strong Baseline for Privacy Policy Classification

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share