Identifying Risks in Datasets for Automated Decision–Making

Mariachiara Mecati; Flavio Emanuele Cannavò; Antonio Vetrò; Marco Torchiano

doi:10.1007/978-3-030-57599-1_25

Conference Papers, Year: 2020

Affiliation (all authors): Politecnico di Torino (Polytechnic of Turin)

Abstract

Our daily life is profoundly affected by the adoption of automated decision-making (ADM) systems, driven by the ongoing human tendency to delegate decisions to machines. The widespread use of ADM systems has been facilitated by the availability of large-scale data, along with the deployment of devices and equipment. As a result, the output of ADM systems increasingly influences many aspects of our lives, with possible discriminatory consequences for certain individuals or groups. In this context, we focus on input data and investigate measurable characteristics that can lead to discriminatory automated decisions. In particular, we identified two indexes of heterogeneity and diversity and tested them on two datasets. One limitation we found is the indexes' sensitivity to a large number of categories, but on the whole the results show that the indexes reflect imbalances in the input data well. Future work is required to further assess the reliability of these indexes as indicators of discrimination risk in the context of ADM, in order to foster a more conscious and responsible use of ADM systems through early inspection of the input data.
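To make the idea of measuring imbalance in a categorical attribute concrete, the following minimal Python sketch computes two such measures. The abstract does not name the two indexes, so the choice here (a normalized Gini heterogeneity index and the inverse Simpson diversity index) is an assumption made for illustration, as are the function names and the toy data; it should not be read as the authors' implementation.

```python
from collections import Counter


def proportions(values):
    """Relative frequency of each distinct category in `values`."""
    counts = Counter(values)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("values must not be empty")
    return [count / total for count in counts.values()]


def gini_heterogeneity(values):
    """Normalized Gini heterogeneity (assumed index, for illustration).

    Returns 0.0 when all records share one category and 1.0 when the
    observed categories are perfectly balanced.
    """
    p = proportions(values)
    m = len(p)
    if m < 2:
        return 0.0
    return (m / (m - 1)) * (1.0 - sum(f * f for f in p))


def simpson_diversity(values):
    """Inverse Simpson index (assumed index, for illustration).

    Can be read as the effective number of equally frequent categories.
    """
    p = proportions(values)
    return 1.0 / sum(f * f for f in p)


# Hypothetical protected attribute with two categories.
balanced = ["a", "b"] * 50
skewed = ["a"] * 90 + ["b"] * 10

balanced_scores = (gini_heterogeneity(balanced), simpson_diversity(balanced))
skewed_scores = (gini_heterogeneity(skewed), simpson_diversity(skewed))
```

On the toy data, the balanced column scores higher on both measures than the skewed one, which is the kind of signal the abstract describes. Both measures depend on the number of observed categories, which is consistent with the sensitivity to many categories noted above.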

Keywords

Bias; Data quality; Data ethics; Imbalance measures; Algorithm fairness

Domains

Computer Science [cs]; Library and information sciences

Main file

499995_1_En_25_Chapter.pdf (295)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-03282772

Submitted on: Friday, July 9, 2021, 2:01:42 PM

Last modification on: Tuesday, November 30, 2021, 2:32:02 PM

Long-term archiving on: Sunday, October 10, 2021, 7:24:00 PM

Dates and versions

hal-03282772 , version 1 (09-07-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03282772 , version 1
DOI : 10.1007/978-3-030-57599-1_25

Cite

Mariachiara Mecati, Flavio Emanuele Cannavò, Antonio Vetrò, Marco Torchiano. Identifying Risks in Datasets for Automated Decision–Making. 19th International Conference on Electronic Government (EGOV), Aug 2020, Linköping, Sweden. pp.332-344, ⟨10.1007/978-3-030-57599-1_25⟩. ⟨hal-03282772⟩

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC8 IFIP-EGOV IFIP-WG8-5 IFIP-LNCS-12219
