Data Governance as Success Factor for Data Science - Responsible Design, Implementation and Use of Information and Communication Technology Access content directly
Conference Papers Year : 2020

Data Governance as Success Factor for Data Science

Paul Brous
  • Function : Author
  • PersonId : 995645
Marijn Janssen
  • Function : Author
  • PersonId : 985668
Rutger Krans
  • Function : Author
  • PersonId : 1098390

Abstract

More and more, asset management organizations are introducing data science initiatives to support predictive maintenance and anomaly detection. Asset management organizations are by nature data intensive to manage their assets like bridges, dykes, railways and roads. For this, they often implement data lakes using a variety of architectures and technologies to store big data and facilitate data science initiatives. However, the decision-outcomes of data science models are often highly reliant on the quality of the data. The data in the data lake therefore has to be of sufficient quality to develop trust by decision-makers. Not surprisingly, organizations are increasingly adopting data governance as a means to ensure that the quality of data entering the data lake is and remains of sufficient quality, and to ensure the organization remains legally compliant. The objective of the case study is to understand the role of data governance as success factor for data science. For this, a case study regarding the governance of data in a data lake in the asset management domain is analyzed to test three propositions contributing to the success of using data science. The results show that unambiguous ownership of the data, monitoring the quality of the data entering the data lake, and a controlled overview of standard and specific compliance requirements are important factors for maintaining data quality and compliance and building trust in data science products.
Fichier principal
Vignette du fichier
492453_1_En_36_Chapter.pdf (532.15 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03222837 , version 1 (10-05-2021)

Licence

Attribution

Identifiers

Cite

Paul Brous, Marijn Janssen, Rutger Krans. Data Governance as Success Factor for Data Science. 19th Conference on e-Business, e-Services and e-Society (I3E), Apr 2020, Skukuza, South Africa. pp.431-442, ⟨10.1007/978-3-030-44999-5_36⟩. ⟨hal-03222837⟩
52 View
78 Download

Altmetric

Share

Gmail Facebook X LinkedIn More