Enabling Non-expert Users to Apply Data Mining for Bridging the Big Data Divide - Data-Driven Process Discovery and Analysis
Conference Papers Year : 2015

Enabling Non-expert Users to Apply Data Mining for Bridging the Big Data Divide

Abstract

Non-expert users find complex to gain richer insights into the increasingly amount of available heterogeneous data, the so called big data. Advanced data analysis techniques, such as data mining, are difficult to apply due to the fact that (i) a great number of data mining algorithms can be applied to solve the same problem, and (ii) correctly applying data mining techniques always requires dealing with the inherent features of the data source. Therefore, we are attending a novel scenario in which non-experts are unable to take advantage of big data, while data mining experts do: the big data divide. In order to bridge this gap, we propose an approach to offer non-expert miners a tool that just by uploading their data sets, return them the more accurate mining pattern without dealing with algorithms or settings, thanks to the use of a data mining algorithm recommender. We also incorporate a previous task to help non-expert users to specify data mining requirements and a later task in which users are guided in interpreting data mining results. Furthermore, we experimentally test the feasibility of our approach, in particular, the method to build recommenders in an educational context, where instructors of e-learning courses are non-expert data miners who need to discover how their courses are used in order to make informed decisions to improve them.
Fichier principal
Vignette du fichier
335156_1_En_4_Chapter.pdf (638.62 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01746410 , version 1 (29-03-2018)

Licence

Identifiers

  • HAL Id : hal-01746410 , version 1

Cite

Roberto Espinosa, Diego García-Saiz, Marta Zorrilla, Jose Jacobo Zubcoff, Jose-Norberto Mazón. Enabling Non-expert Users to Apply Data Mining for Bridging the Big Data Divide. 3rd International Symposium on Data-Driven Process Discovery and Analysis (SIMPDA), Aug 2013, Riva del Garda, Italy. pp.65-86. ⟨hal-01746410⟩
132 View
217 Download

Share

More