Identifying Risks in Datasets for Automated Decision–Making - Electronic Government Access content directly
Conference Papers Year : 2020

Identifying Risks in Datasets for Automated Decision–Making


Our daily life is profoundly affected by the adoption of automated decision making (ADM) systems due to the ongoing tendency of humans to delegate machines to take decisions. The unleashed usage of ADM systems was facilitated by the availability of large-scale data, alongside with the deployment of devices and equipment. This trend resulted in an increasing influence of ADM systems’ output over several aspects of our life, with possible discriminatory consequences towards certain individuals or groups. In this context, we focus on input data by investigating measurable characteristics which can lead to discriminating automated decisions. In particular, we identified two indexes of heterogeneity and diversity, and tested them on two datasets. A limitation we found is the index sensitivity to a large number of categories, but on the whole results show that the indexes reflect well imbalances in the input data. Future work is required to further assess the reliability of these indexes as indicators of discrimination risks in the context of ADM, in order to foster a more conscious and responsible use of ADM systems through an immediate investigation on input data.
Fichier principal
Vignette du fichier
499995_1_En_25_Chapter.pdf (295.45 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03282772 , version 1 (09-07-2021)





Mariachiara Mecati, Flavio Emanuele Cannavò, Antonio Vetrò, Marco Torchiano. Identifying Risks in Datasets for Automated Decision–Making. 19th International Conference on Electronic Government (EGOV), Aug 2020, Linköping, Sweden. pp.332-344, ⟨10.1007/978-3-030-57599-1_25⟩. ⟨hal-03282772⟩
26 View
32 Download



Gmail Facebook X LinkedIn More