Arguments Against Using the 1998 DARPA Dataset for Cloud IDS Design and Evaluation and Some Alternative

Paulo Faria Quinan; Issa Traore; Isaac Woungang; Abdulaziz Aldribi; Onyekachi Nwamuo

doi:10.1007/978-3-030-45778-5_21

Conference Papers Year : 2020

Arguments Against Using the 1998 DARPA Dataset for Cloud IDS Design and Evaluation and Some Alternative

(1, 2) , (1) , (3) , (4) , (1, 2)

1
2
3
4

Paulo Faria Quinan

Function : Author
PersonId : 1102796

University of Victoria [Canada]

Department of Electrical & Computer Engineering [Victoria]

Issa Traore

Function : Author
PersonId : 1007476

University of Victoria [Canada]

Isaac Woungang

Function : Author
PersonId : 1102797

Ryerson University [Toronto]

Abdulaziz Aldribi

Function : Author

Qassim University [Kingdom of Saudi Arabia]

Onyekachi Nwamuo

Function : Author
PersonId : 1102798

University of Victoria [Canada]

Department of Electrical & Computer Engineering [Victoria]

Abstract

Due to the lack of adequate public datasets, the proponents of many existing cloud intrusion detection systems (IDS) have relied on the DARPA dataset to design and evaluate their models. In the current paper, we show empirically that the DARPA dataset by failing to meet important statistical characteristics of real world cloud traffic data center is inadequate for evaluating cloud IDS. We present, as alternative, a new public dataset collected through a cooperation between our lab and a non-profit cloud service provider, which contains benign data and a wide variety of attack data. We present a new hypervisor-based cloud IDS using instance-oriented feature model and supervised machine learning techniques. We investigate 3 different classifiers: Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms. Experimental evaluation on a diversified dataset yields a detection rate of 92.08% and a false positive rate of 1.49% for random forest, the best performing of the three classifiers.

Keywords

Cloud IDS Cloud security Machine learning IDS evaluation Hypervisor-based IDS

Domains

Computer Science [cs] Networking and Internet Architecture [cs.NI]

Fichier principal

487577_1_En_21_Chapter.pdf (1.02 Mo)

Origin	Files produced by the author(s)

Hal Ifip : Connect in order to contact the contributor

https://inria.hal.science/hal-03266464

Submitted on : Monday, June 21, 2021-5:31:55 PM

Last modification on : Wednesday, October 25, 2023-3:22:03 PM

Long-term archiving on : Wednesday, September 22, 2021-7:02:47 PM

Dates and versions

hal-03266464 , version 1 (21-06-2021)

Licence

Attribution

Identifiers

HAL Id : hal-03266464 , version 1
DOI : 10.1007/978-3-030-45778-5_21

Cite

Paulo Faria Quinan, Issa Traore, Isaac Woungang, Abdulaziz Aldribi, Onyekachi Nwamuo. Arguments Against Using the 1998 DARPA Dataset for Cloud IDS Design and Evaluation and Some Alternative. 2nd International Conference on Machine Learning for Networking (MLN), Dec 2019, Paris, France. pp.315-332, ⟨10.1007/978-3-030-45778-5_21⟩. ⟨hal-03266464⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC6 IFIP-LNCS-12081 IFIP-MLN

114 View

155 Download

Arguments Against Using the 1998 DARPA Dataset for Cloud IDS Design and Evaluation and Some Alternative

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share