Data Anonymization as a Vector Quantization Problem: Control Over Privacy for Health Data

Yoan Miche; Ian Oliver; Silke Holtmanns; Aapo Kalliola; Anton Akusok; Amaury Lendasse; Kaj-Mikael Björk

doi:10.1007/978-3-319-45507-5_13

Conference Papers Year : 2016

Authors and affiliations

Yoan Miche, Nokia Bell Labs [Espoo]
Ian Oliver, Nokia Bell Labs [Espoo]
Silke Holtmanns, Nokia Bell Labs [Espoo]
Aapo Kalliola, Aalto University
Anton Akusok, Arcada University of Applied Sciences
Amaury Lendasse, University of Iowa [Iowa City]
Kaj-Mikael Björk, Arcada University of Applied Sciences

Abstract

This paper tackles data anonymization from a vector quantization point of view. The stated goal of this work is to provide means of anonymizing a data set so that neither single individuals nor groups can be re-identified, while maintaining data integrity and structure as much as possible (and in a very specific sense). The structure of the data is first captured by clustering (with a vector quantization approach), and we propose to use the properties of this vector quantization to anonymize the data. Under some assumptions about the computations that may be performed on the data, we give a framework for identifying and “pushing back outliers in the crowd”, in this clustering sense, as well as for anonymizing cluster members while preserving cluster-level statistics and structure as defined by the assumptions (density, pairwise distances, cluster shape and members, etc.).
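
The abstract does not spell out the algorithm, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' method. It uses plain k-means as a stand-in for the vector quantization step, and it "pushes outliers back" by rescaling any point that lies farther from its centroid than an empirical distance quantile. The function names `kmeans` and `pull_outliers_in`, the quantile parameter `q`, and the Euclidean distance are all hypothetical choices made for illustration.

```python
import math
import random


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means (assumed stand-in for vector quantization).

    Returns (centroids, labels), where labels[i] is the index of the
    centroid closest to points[i].
    """
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]

    def assign():
        return [min(range(k), key=lambda j: math.dist(p, centroids[j]))
                for p in points]

    for _ in range(n_iter):
        labels = assign()
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = [sum(col) / len(members)
                                for col in zip(*members)]
    # Final assignment, consistent with the final centroids.
    return centroids, assign()


def pull_outliers_in(points, centroids, labels, q=0.9):
    """Move points beyond a per-cluster distance quantile back onto it.

    For each cluster, the radius is the empirical q-quantile of the
    member-to-centroid distances (an illustrative choice). Points
    farther than that radius are rescaled along the line to the
    centroid so that they land exactly at the radius.
    """
    out = [list(p) for p in points]
    for j, c in enumerate(centroids):
        idx = [i for i, lab in enumerate(labels) if lab == j]
        if not idx:
            continue
        dists = sorted(math.dist(points[i], c) for i in idx)
        radius = dists[min(len(dists) - 1, int(q * len(dists)))]
        for i in idx:
            d = math.dist(points[i], c)
            if d > radius:  # d > radius >= 0, so d is never zero here
                s = radius / d
                out[i] = [cj + s * (x - cj) for x, cj in zip(points[i], c)]
    return out
```

A call such as `centroids, labels = kmeans(records, k=2)` followed by `pull_outliers_in(records, centroids, labels)` returns a copy of the records in which no point lies farther from its centroid than the chosen quantile radius. Inliers are left untouched, so this sketch covers only the outlier step; anonymizing the remaining cluster members while preserving cluster-level statistics is not attempted here.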

Domains

Computer Science [cs] Library and information sciences

Main file

430962_1_En_13_Chapter.pdf (87)

Origin : Files produced by the author(s)

https://inria.hal.science/hal-01635008

Submitted on : Tuesday, November 14, 2017, 4:06:34 PM

Last modification on : Wednesday, November 3, 2021, 6:55:32 AM

Long-term archiving on : Thursday, February 15, 2018, 1:58:07 PM

Dates and versions

hal-01635008 , version 1 (14-11-2017)

Licence

Attribution

Identifiers

HAL Id : hal-01635008 , version 1
DOI : 10.1007/978-3-319-45507-5_13

Cite

Yoan Miche, Ian Oliver, Silke Holtmanns, Aapo Kalliola, Anton Akusok, et al. Data Anonymization as a Vector Quantization Problem: Control Over Privacy for Health Data. International Conference on Availability, Reliability, and Security (CD-ARES), Aug 2016, Salzburg, Austria. pp.193-203, ⟨10.1007/978-3-319-45507-5_13⟩. ⟨hal-01635008⟩

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC5 IFIP-WG IFIP-TC8 IFIP-CD-ARES IFIP-WG8-4 IFIP-WG8-9 IFIP-LNCS-9817
