2023, Volume 7 Issue 2
A Novel Intrusion Detection System Using Multiple Linear Regression
AUTHOR(S)
Koushik Paul, Sayandeep Paik, Siddhartha Kuri, Soumyadip Majumder, Dr. Avijit Kumar Chaudhuri
DOI: https://doi.org/10.46647/ijetms.2023.v07i02.010
ABSTRACT
The internet is no doubt the biggest and the most important tool of modern civilisation. But along with its numerous benefits, it also comes with its own set of risks, the most important of them being breaches in security and privacy.
An anomaly-based Intrusion Detection System (IDS) is a type of security system that is used to detect and alert on unusual or abnormal behaviour that may indicate an attack or intrusion. Unlike signature-based IDS, which rely on known patterns of attack, anomaly-based IDS is designed to detect previously unseen or unknown attacks by identifying deviations from normal patterns of behaviour.
Multiple linear regression is a statistical technique used to analyse the relationship between a dependent variable and multiple independent variables. In this technique, a linear equation is established between the dependent variable and multiple independent variables, with the aim of predicting the value of the dependent attribute for a given set of values of the independent attribute.
In this paper, we collected a data set of 125974 entries and 42 attributes from Kaggle, pre-processed the data and used logistic regression to predict the dependent variable (called xAttack) using 25 independent variables, as we found a high correlation between the aforementioned variables
The results are simulated using 10-fold cross validation, using various train test splits of the data set. The data has been split into 80-20,50-50, and 66-34. After testing the given data set in different train test splits, an accuracy of 92.73 was achieved.
Page No: 75 - 86
References:
[1] Ali H. Mirza, “Computer Network Intrusion Detection using various Classifiers and Ensemble Learning”. 978-1-5386-1501-0/18/$31.00 c 2018 IEEE.
[2] T.Saranya, S.Sridevi, C. Deisy, Tran Duc Chung and M. K. A. Ahamed Khan, “Performance Analysis of Machine Learning Algorithms in Intrusion Detection System: A Review”. Third International Conference on Computing and Network Communications (CoCoNet’19), Procedia Computer Science 171 (2020) 1251–1260
[3] Partha Ghosh and Rajarshee Mitra, “Proposed GA-BFSS and Logistic Regression based Intrusion Detection System”. 978-1-4799-4445-3/15/$31.00 ©2015 IEEE
[4] Christiana Ioannou, Vasos Vassiliou and Charalampos Sergiou. “An Intrusion Detection System for Wireless Sensor Networks”. 978-1-5386-0643-8/17/$31.00 ©2017 IEEE
[5] Anil Lamba, Satinderjeet Singh, Sachin Bhardwaj, Natasha Dutta and Sivakumar Sai Rela Muni, “USES OF ARTIFICIAL INTELLIGENT TECHNIQUES TO BUILD ACCURATE MODELS FOR INTRUSION DETECTION SYSTEM”. International Journal For Technological Research In Engineering, Volume 2, Issue 12, August-2015, ISSN (Online): 2347 – 4718
[6] Intrusion Detection System (IDS), Wikipedia https://en.wikipedia.org/wiki/Intrusion_detection_system
[7] Gilles Vandewiele, Isabelle Dehaene, Gyorgy Kovacs, Lucas Sterckx, Olivier Janssens, Femke Ongenae, Femke De Backere, Filip De Turck, Kristien Roelens, Johan Decruyenaere, Sofie Van Hoecke and Thomas Demeester, “Overly optimistic prediction results on imbalanced data: a case study of flaws and benefits when applying over-sampling”. Artificial Intelligence In Medicine 111 (2021) 101987. https://doi.org/10.1016/j.artmed.2020.101987
[8] Koushik Paul, Saheb Karan, Siddhartha Kuri, Sulekha Das and Avijit Kumar Chaudhuri, “Placement Prediction Using Multiple Logistic Regression Method”. International Journal of Advanced Research in Computer and Communication Engineering, Impact Factor 7.39ïðVol. 11, Issue 3, March 2022. ISSN (O) 2278-1021, ISSN (P) 2319-5940. DOI: 10.17148/IJARCCE.2022.11337
[9] “Handbook of Biological Statistics ̴ John H. McDonald”, Multiple Logistic Regression, https://www.biostathandbook.com/multiplelogistic.html
[10] Mary L. McHugh, “Interrater reliability: the kappa statistic”. Biochem Med (Zagreb)
. 2012;22(3):276-82. Published online 2012 Oct 15.
[11] Random Accuracy formula to find the Kappa value, tutorialspoint, https://www.tutorialspoint.com/statistics/cohen_kappa_coefficient.htm
[12] Lynn E. Eberly, “Multiple Linear Regression”. Methods in Molecular Biology, vol. 404: Topics in Biostatistics. DOI:10.1007/978-1-59745-530-5_9
How to Cite This Article:
Koushik Paul, Sayandeep Paik, Siddhartha Kuri, Soumyadip Majumder, Dr. Avijit Kumar Chaudhuri
. A Novel Intrusion Detection System Using Multiple Linear Regression
. ijetms;7(2):75-86. DOI: 10.46647/ijetms.2023.v07i02.010