1 Introduction

Intrusion detection, as a kind of multilevel and multilayer network protection measure, aims to detect various intrusion behaviors by collecting and analyzing all kinds of information on the network. In practice, intrusion detection is usually treated as a classification problem: it identifies whether a network traffic behavior is normal or belongs to one of four attack types, Denial of Service (DoS), Probe, Remote to Local (R2L) and User to Root (U2R) [1, 2], and then raises an alarm and takes appropriate measures. Therefore, there is no doubt that constructing a suitable classifier and training it to improve its generalization is the key task of intrusion detection.

Several machine learning methods, including support vector machine (SVM) [3, 4], artificial neural network (ANN) [5, 6], K-nearest neighbors (KNN) [7, 8], random forest (RF) [9] and others [10, 11], have been implemented as classifiers to improve the performance of intrusion detection and have made good progress. However, previous works based on traditional machine learning methodologies, which belong to shallow learning algorithms, have a limited ability to represent complex functions for complex classification problems [12].

More recently, with the development of deep learning, more and more researchers have explored deep learning methods to enhance the performance of classifiers for intrusion detection and have achieved remarkable results. Compared with traditional machine learning algorithms, deep learning methods are adept at representing high-dimensional spatial features and can automatically learn intrinsic features without feature engineering [13]. Experiments have demonstrated that deep networks significantly outperform shallow networks in detecting attacks [12].

However, in practice, whether classifiers for intrusion detection are based on traditional machine learning or on deep learning methods, their detection effectiveness is highly dependent on the number of training samples. After all, in supervised learning, classifiers can only extract limited useful information from a limited number of labeled samples, which usually limits their generalization. The best way to make a machine learning model generalize better is to train it on more data. Data augmentation allows more data to be generated from limited data, increasing the number and diversity of training samples. Supplementing the training samples can reduce the model's dependence on particular attributes, thereby improving its generalization ability. Unfortunately, it is difficult to generate new fake data for a density estimation task unless we have already solved the density estimation problem [14]. In addition, labeling large training datasets is expensive and time-consuming, and sometimes impossible due to emerging and fast-evolving intrusion attacks, which makes these problems particularly severe. Generative adversarial networks (GANs) can learn the probability distribution of a dataset and try to generate new ‘fake’ samples similar to the real data samples. Since GANs introduce interaction into the training stage (which is equivalent to adding a kind of ‘fake’ labeled sample alongside the original data samples), they can expand the labeled data and provide more useful information on top of the training set. Indeed, as semi-supervised models, GANs have enhanced the effectiveness of image recognition [15], anomaly detection [16], imaging markers [17], etc. Since GANs are well suited to modeling the high-dimensional complex distributions of real-world data [18], it is reasonably straightforward to utilize them to offer more useful information and improve the generalization of classifiers. Several works have begun to explore the applications of GANs for anomaly detection. As far as we know, there are three main ways in which GANs have been applied to intrusion detection.

With the purpose of generating adversarial attacks to evade intrusion detection systems, researchers have applied GANs to generate adversarial malicious examples for black-box attacks. The experiments showed that many intrusion detection systems were vulnerable to adversarial perturbations crafted with GANs [19, 20]. A GAN-based framework generated DoS attack traffic similar to normal traffic to evade network traffic classifiers [21]. Obviously, our research goal in this paper is to enhance the performance of the classifier for intrusion detection rather than to generate attacks that evade the intrusion detection system.

With the purpose of balancing previously unbalanced datasets, a GAN-based framework was proposed to generate data that captured the distribution of selected attack types from the dataset; the framework proved feasible for improving the performance of intrusion detection systems [22]. With the same purpose of addressing both data scarcity and data imbalance, a framework was developed to combine deep adversarial learning with statistical learning, and experiments indicated that it outperformed other models [23]. Indeed, class imbalance in the training set degrades detection performance, because it biases the classifier so that its predictions tend toward the majority classes in the dataset. The above methods essentially train the GAN to learn the distribution of the minority classes and then oversample them to balance the training set; ultimately, they change the inter-class distribution of the training set. In this paper, we instead leverage the GAN to generate data with a new label. This is equivalent to adding a new category of data to the training set rather than changing the number of samples in the existing categories, and is fundamentally different from those oversampling methods. It does not need to train each class separately, so the training process is simpler and the training time and overhead are reduced.

With the purpose of identifying anomaly attacks, the authors of [16] developed GAN-based models for anomaly detection and achieved good results on image and network intrusion datasets for binary classification. Compared with those methods, the application scenario in this paper is different. The above methods mainly use GANs for binary classification: the discriminative model of a GAN is itself a binary classifier that judges whether the input data are real or fake, which closely resembles anomaly detection, so GANs can easily be extended to binary classification scenarios. However, in the multiclass classification scenario, the classifier not only needs to detect an anomaly, but also needs to further judge the category of the anomaly.

Inspired by the above reasoning, in this paper we restrict our focus to complementing the labeled samples via adversarial training and thereby augmenting the training set. Specifically, in the training phase, the generative model G continually generates ‘fake’ samples that offer the classifier C useful information, which enhances the classification performance of C; in this way, the ID-GAN framework improves the generalization of the classifier.

To the best of our knowledge, we are the first to propose a supervised learning framework based on GAN for intrusion detection (ID-GAN) under the multiclass classification task. Experimental results show that the ID-GAN framework improves the performance of classifiers by exploiting complementary and helpful information obtained through adversarial training. The framework can effectively enhance the generalization of the classifier and improve the effectiveness of intrusion detection over a series of adversarial rounds, achieving state-of-the-art results on the benchmark NSL-KDD dataset.

The main contributions of this paper lie in overcoming the following challenges.

  1.

    The discriminative model D in the original GAN is a binary classifier, so it can only judge whether a sample comes from the real dataset or not; it does not have the ability to further predict the class of real samples. Hence, a novel supervised learning framework based on GAN for multiclass classification is proposed in this paper.

  2.

    Because the structure of the proposed framework differs from that of the original GAN, the loss function must be re-derived for supervised learning according to the needs of multiclass classification for intrusion detection. Therefore, we present the theoretical derivation of the loss function.

  3.

    Unlike the original GAN and its variants, the purpose of the proposed framework is to train a multiclass model with enhanced performance rather than a generative model. Therefore, the training method of the framework differs from that of the original GAN and its variants, and how to train the proposed framework needs to be studied. Several empirical strategies are proposed to improve the stability of training the framework.

  4.

    Regarding experimental verification, we compare the performance of the original classifier with that of the classifier enhanced via adversarial training on the benchmark NSL-KDD dataset for multiclass classification, and present a detailed graphical depiction of how the classifier is enhanced with the help of the proposed framework.

The remainder of the paper is organized as follows. Section 2 introduces the latest research progress in the field of intrusion detection, especially applications of deep learning in this area. We detail the original GAN and its variants in Sect. 3. Section 4 introduces our method and describes how to construct the supervised learning framework. The experimental configuration and evaluation criteria are given in Sect. 5. Section 6 explains how to train the proposed framework, and the experimental results and discussion are presented in Sect. 7. Finally, conclusions and future work are drawn.

2 Related work

Significant progress has been made in improving the performance of classifiers in the field of intrusion detection.

To improve the accuracy of decision tree (DT) and naïve Bayes (NB) classifiers for multiclass classification, two independent hybrid mining algorithms were presented [24]. The hybrid DT algorithm utilized an NB classifier to avoid overfitting, while the hybrid NB algorithm employed a DT classifier to select important features and improve efficiency. Kanakarajan and Muniasamy [25] introduced the greedy randomized adaptive search procedure with annealed randomness forest (GAR-forest) to improve the performance of multiclass classification with feature selection. Experimental results showed that GAR-forest performed better on the multiclass classification problem than random forest, C4.5, naïve Bayes and multilayer perceptron.

Some researchers reduce the dimensionality of the feature space by removing redundant or unimportant features to further improve the performance of the classifier. Kuang et al. [26] reduced the high-dimensional data using hybrid kernel principal component analysis (KPCA) and then utilized an SVM for intrusion detection. Ikram and Cherukuri [27] studied the integration of principal component analysis (PCA) and SVM for anomaly recognition. This line of research reduces the dimensionality of the input feature space of the intrusion detection system, which effectively improves the overall classification performance. Nevertheless, PCA suffers from the fact that each component is a linear combination of all the original variables, so it often cannot obtain deterministic mappings from high-dimensional spaces to low-dimensional spaces [28]. Furthermore, its nonlinear extension KPCA suffers from two major disadvantages. First, the underlying manifold structure of the data is not considered in process modeling. Second, the selection of the kernel function and kernel parameters is always problematic [29].

More recently, deep learning has become one of the most effective and popular machine learning techniques and has gained a wide range of applications in the intrusion detection community. Researchers take advantage of generative models such as deep autoencoders (DAEs), deep Boltzmann machines (DBMs) and deep belief networks (DBNs) in a pre-training stage (unsupervised learning) to improve detection performance. During this process, each lower layer is trained separately from the other layers, which allows the layers to be greedily trained one by one from the bottom up [12, 30]. The final prediction and classification are then carried out by traditional machine learning algorithms such as SVM or softmax, which avoids manual feature selection and can effectively represent high-dimensional features.

Abolhasanzadeh [31] proposed an approach to detect attacks in big data using a DAE based on dimensionality reduction and neural network bottleneck feature extraction. In terms of accuracy, the approach outperformed PCA, factor analysis and KPCA.

Gao et al. [32] successfully exploited a classifier based on DBN for intrusion detection, and concluded that the classifier achieved high accuracy when the greedy layer-by-layer learning algorithm was used for pre-training.

To optimize the basic network structure of the DBN classification model in an intrusion detection system, Wei et al. [33] designed a hybrid optimization algorithm combining the artificial fish swarm algorithm, genetic algorithm and particle swarm optimization (AFSA-GA-PSO). A framework based on this algorithm (AFSA-GA-PSO-DBN) was proposed to optimize the DBN model and was tested for multiclass classification. Compared with high-performing machine learning models such as SVM, random forest and naïve Bayes, the framework improved the average classification accuracy.

Potluri et al. [34] applied convolutional neural networks (CNNs) to intrusion detection with the purpose of identifying multiple attack classes. Performance metrics such as precision, recall and F-measure were calculated and compared with existing deep learning approaches.

Javaid et al. [35] proposed a deep learning method based on self-taught learning (STL), and improved the performance of network intrusion detection for multiclass classification.

However, the DAE, DBNs and DBMs algorithms have the difficulties of an intractable partition function or an intractable posterior distribution. Therefore, they are typically only used for pre-training a classification network [36].

Generative adversarial networks are another type of deep generative model. Different from other generative models, GANs incorporate the adversarial idea and allow interaction during training, which gives them great potential to be applied in many real-world scenarios. In the past two years, hundreds of GAN variants have appeared. Regarding GANs as semi-supervised models, the authors of [37, 38] extended GANs to semi-supervised image classification to enhance the robustness of unsupervised learning models.

Several works have begun to explore the applications of GANs for the anomaly detection task. A semi-supervised model based on GAN, consisting of two generators, three discriminators and one classifier, was proposed for detecting anomalies in communication packet streams [39]. The approach was effective for binary classification of packet flows.

In [16], the authors developed GAN-based models for anomaly detection and achieved good results on image and network intrusion datasets for binary classification. However, this GAN variant belongs to unsupervised learning, is leveraged only for anomaly detection and is not suited to multiclass classification.

Different from the above variants, in this paper, we develop a novel framework based on GAN, propose the supervised learning approach for multiclass classification, and suggest several empirical techniques for the framework training.

3 Generative adversarial networks

Generative adversarial networks [40] are a deep learning framework first proposed by Ian J. Goodfellow in 2014. The idea has attracted great interest from academics in various fields of study and shows broad application prospects in imaging, visual computing and other areas.

Generally, a standard GAN framework consists of a generative model G and a discriminative model D. During training, the samples generated by the generative model G (called ‘fake’ or generated samples) are mixed with the real data samples and then randomly fed to the discriminative model D. The goal of the discriminative model D (which is equivalent to a binary classifier) is to identify the real data samples and the generated samples as accurately as possible. Meanwhile, the goal of the generative model G is the opposite: to deceive the discriminative model D as much as possible and minimize the probability that D identifies the generated samples. Both sides constantly optimize themselves during training until they reach an equilibrium where neither side can improve and the generated samples are completely indistinguishable from the real data samples.

In summary, the original GAN contains a generative model G and a discriminative model D. G is used to capture the distribution of the dataset and generate similar samples, while D is a discriminator that determines whether the input is a data sample or a generated sample. The basic framework of the original GAN is shown in Fig. 1.

Fig. 1

The framework of original GAN

The generative model G takes noise z drawn from a noise distribution p(z) (usually a Gaussian or uniform distribution) as input and produces fake samples G(z). Meanwhile, the discriminative model D identifies whether a sample comes from the data distribution p(x) or from the generated samples G(z). The loss function of GANs can be defined as the following optimization problem [40]:

$$\mathop{\min}\limits_{G} \mathop{\max}\limits_{D} V\left( {D,G} \right) = E_{{x \sim P_{data} \left( x \right)}} \left[ {\log D\left( x \right)} \right] + E_{{z \sim P_{z} \left( z \right)}} \left[ {\log \left( {1 - D\left( {G\left( z \right)} \right)} \right)} \right]$$
(1)

Equation (1) shows that in the training process of GANs, the discriminative model is constantly revised to maximize the value of V, that is, to maximize D(x) and minimize D(G(z)). Meanwhile, the generative model G is revised to minimize the value of V; in other words, by maximizing D(G(z)), the generative model tries to generate samples that are very similar to the data samples. Finally, G and D reach a Nash equilibrium: the generative model G can estimate the probability distribution of the real data samples, and the detection accuracy of the discriminative model D drops to 50%, meaning it can no longer tell whether a sample is real or fake.
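To make the two objectives in Eq. (1) concrete, the sketch below (an illustration, not the implementation used in this paper) expresses the discriminator and generator losses with binary cross-entropy in Keras; `generator`, `discriminator`, `real_batch` and `noise_dim` are assumed placeholders for an already-built generator, an already-built discriminator, a batch of real samples and the noise dimension.

```python
import tensorflow as tf

bce = tf.keras.losses.BinaryCrossentropy()

def gan_losses(generator, discriminator, real_batch, noise_dim=100):
    """One evaluation of the two objectives in Eq. (1)."""
    z = tf.random.normal([tf.shape(real_batch)[0], noise_dim])   # z ~ p(z)
    fake_batch = generator(z)

    d_real = discriminator(real_batch)    # D(x)
    d_fake = discriminator(fake_batch)    # D(G(z))

    # D maximizes log D(x) + log(1 - D(G(z))), i.e. minimizes this cross-entropy
    d_loss = bce(tf.ones_like(d_real), d_real) + bce(tf.zeros_like(d_fake), d_fake)
    # G minimizes log(1 - D(G(z))); the common non-saturating form instead maximizes log D(G(z))
    g_loss = bce(tf.ones_like(d_fake), d_fake)
    return d_loss, g_loss
```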

Additionally, GANs can be applied in semi-supervised learning. In the original GAN, the discriminator is a binary classifier that identifies the authenticity of samples. For a K-class task, the output of the generator can be assigned to a (K+1)-th class, and the corresponding discriminator becomes a (K+1)-category classifier [15]. The advantage of this kind of processing is that it can make full use of unlabeled data to learn the probability distribution of the real data samples and thus aid the training process of supervised learning.

$$\begin{aligned} L & = - E_{{x,y \sim P_{data} \left( {x,y} \right)}} \left[ {\log P_{model} \left( {y|x} \right)} \right] - E_{x \sim G} \left[ {\log P_{model} \left( {y = K + 1|x} \right)} \right] \\ & = L_{supervised} + L_{unsupervised} ,\;{\text{where}} \\ L_{supervised} & = - E_{{x,y \sim P_{data} \left( {x,y} \right)}} \log P_{model} \left( {y|x,y < K + 1} \right),\;{\text{and}} \\ L_{unsupervised} & = - \{ E_{{x \sim P_{data} \left( x \right)}} \log \left[ {1 - P_{model} \left( {y = K + 1|x} \right)} \right] \\ & \quad + E_{x \sim G} \log \left[ {P_{model} \left( {y = K + 1|x} \right)} \right]\} \\ \end{aligned}$$
(2)

Let Pmodel(y = K+1|x) denote the probability that x is a generated sample, which corresponds to 1 − D(x) in the original GAN framework. Assuming that the dataset consists of real data and some generated samples, the loss function for training the classifier becomes Eq. (2), which can be divided into two parts according to the data source. For labeled data samples, Lsupervised is the negative log probability of the label, given that the sample comes from the real data; the goal is for the discriminative model to output the correct label on the real data distribution Pdata(x, y). The unsupervised loss Lunsupervised is the loss function defined by GANs; it is in fact the standard GAN game value, which becomes evident when we substitute D(x) = 1 − Pmodel(y = K+1|x) into Eq. (3) [15].

$$L_{unsupervised} = - E_{{x \sim P_{data} \left( x \right)}} \log D\left( x \right) - E_{z \sim noise} \log \left( {1 - D\left( {G\left( z \right)} \right)} \right)$$
(3)
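As a small numerical illustration of the substitution D(x) = 1 − Pmodel(y = K+1|x), the snippet below (with purely hypothetical logits) recovers the implied discriminator score from a (K+1)-class softmax output.

```python
import numpy as np

def softmax(logits):
    e = np.exp(logits - logits.max())
    return e / e.sum()

# hypothetical (K+1)-class logits: K = 5 real classes plus the 'fake' class
logits = np.array([2.1, 0.3, -1.0, 0.5, -0.7, -2.0])
p = softmax(logits)

p_fake = p[-1]        # P_model(y = K+1 | x)
d_x = 1.0 - p_fake    # the implied discriminator output D(x) used in Eq. (3)
print(p_fake, d_x)
```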

4 Proposed methodologies

4.1 The supervised learning framework using adversarial training for intrusion detection

A standard multiclass classifier for intrusion detection usually takes a sample x as input and outputs a 5-dimensional vector \(\left( {l_{normal} ,l_{probe} ,l_{dos} ,l_{r2l} ,l_{u2r} } \right)\) that can be turned into one of the five possible class probabilities by applying the softmax function. In supervised learning, such a model is then trained by minimizing the cross-entropy between the real labels and the predictive distribution Pmodel(y|x) to obtain the optimal parameters.

As stated before, the two core models in a GAN are the generative model G and the discriminative model D. As a binary classifier, D can only judge whether a sample comes from the real dataset or not; it does not have the ability to further predict the class of real data samples. Additionally, most classifiers for intrusion detection belong to supervised learning, so we need to reconstruct a supervised learning framework based on GAN.

In order to supply more information to the multiclass classifier C, we take the output of the generative model G, together with the original training set, as the input of the classifier C.

To improve the efficiency of the framework and further simplify it, we replace the discriminative model D with a multiclass classifier C. In this way, the classifier C not only undertakes the classification task for the training set, but also plays the role of the discriminative model D in determining whether a sample comes from the generative model G or from the real dataset. We regard the output of the generative model G as the ‘fake’ category, so the multiclass classifier becomes a 6-category classifier. The original GAN is thus transformed into a supervised learning framework for intrusion detection, as shown in Fig. 2. First, the framework trains the classifier through adversarial training, as indicated by the green arrow. Second, as indicated by the blue arrow, the framework feeds the test samples into the trained classifier for multiclass classification.

Fig. 2

The framework of ID-GAN

After the introduction of adversarial training for intrusion detection, the generative model continually generates ‘fake’ samples from a random distribution p(z). During adversarial training, the multiclass classifier identifies whether a sample is normal, fake, or one of the four attack types (DoS, Probe, R2L and U2R), while the generative model dynamically adjusts its strategy for generating more realistic fake samples according to the feedback (fake or real) from the multiclass classifier. Thus, the framework trains the classifier on an augmented training set that includes the original five-category labeled samples and the continually generated ‘fake’ samples.

For example, suppose that originally only a professor (analogous to the classifier) trained students to recognize and classify five languages (Russian, English, Arabic, French and German). An assistant professor (analogous to the generative model) is then added who trains the students to recognize whether an utterance belongs to one of the five languages at all. Although the assistant professor does not directly teach the students how to identify and classify the five languages, the practice of distinguishing ‘whether it belongs to the five languages’ is also helpful for classifying and recognizing them. Such rough feedback is better than no feedback.

In summary, the main idea of the ID-GAN framework is to train a multiclass classifier that plays two roles: performing the classification task and distinguishing generated samples from real data samples. More specifically, the classifier takes a sample as input and classifies it into one of six classes: real data samples are classified into the first five classes, and generated samples are classified into the ‘fake’ class, as shown in Fig. 2.

4.2 The derivation of the loss function

It is assumed that (xl, yl) is a sample from the training set with a 5-category classification label, where \(y_{l} \in \left\{ {normal,dos,probe,r2l,u2r} \right\}\). The generative model generates ‘fake’ samples (xf, yf) from the random noise distribution, where yf = ‘fake.’ The samples (x, y) denote the mixture of real data samples and generated samples, where the label y covers six classes (\({\text{y}} \in \left\{ {normal,dos,probe,r2l,u2r,fake} \right\}\)). For the multiclass classification problem, the classifier takes a sample x as input and, by applying the softmax function, outputs the classification probabilities for the six classes pi (i = 1, 2, 3, 4, 5, 6); the first five categories correspond to the original classes, and the last corresponds to the ‘fake’ category.

Assuming that p is the real probability distribution of the sample and q is the predicted probability distribution of the classifier, the cross-entropy for a given dataset X is defined as:

$${\text{CE}}\left( {p,q} \right) = - \mathop \sum \limits_{x \in X} p\left( x \right)\log q\left( x \right)$$
(4)

The value of Eq. (4) indicates the error between the real classification and the predicted classification. The smaller the value is, the closer the predicted probability distribution is to the real probability distribution, and the more accurate the predicted result will be.
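For concreteness, Eq. (4) can be evaluated in a few lines; the toy distributions below are illustrative only.

```python
import numpy as np

def cross_entropy(p, q, eps=1e-12):
    """CE(p, q) = -sum_x p(x) log q(x), as in Eq. (4)."""
    return -np.sum(p * np.log(np.clip(q, eps, 1.0)))

p = np.array([1.0, 0.0, 0.0])            # true distribution: the sample belongs to the first class
q_good = np.array([0.9, 0.05, 0.05])     # prediction close to the truth
q_bad = np.array([0.4, 0.3, 0.3])        # prediction far from the truth
print(cross_entropy(p, q_good))          # smaller error
print(cross_entropy(p, q_bad))           # larger error
```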

Under the multiclass classification task, the loss function is usually defined as cross-entropy loss. Let \(y_{{x_{i} }}^{j}\) represent the real probability distribution of the sample xi, and let \(P_{model} \left( {y = j |x_{i} } \right)\) represent the predicted probability distribution of the sample xi, then the corresponding loss function can be defined as:

$$L_{{x_{i} }} = - \mathop \sum \limits_{j} y_{{x_{i} }}^{j} \log P_{model} \left( {y = j |x_{i} } \right)$$
$$\forall j \in \left\{ {normal,dos,probe,r2l,u2r,fake} \right\}$$
(5)

For the dataset X, which consists of the real data samples and the generated samples, the corresponding loss function is defined as:

$$Lc = - \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \mathop \sum \limits_{j} y_{{x_{i} }}^{j} \log P_{model} \left( {y = j |x_{i} } \right)$$
$$\forall j \in \left\{ {normal,dos,probe,r2l,u2r,fake} \right\}$$
(6)

After one-hot coding, the real category of the sample \(y_{{x_{i} }}^{j}\) is mapped into a 6-dimensional vector. For example, [1, 0, 0, 0, 0, 0] indicates that the sample belongs to the ‘normal’ category, [0, 1, 0, 0, 0, 0] to the ‘dos’ category, [0, 0, 1, 0, 0, 0] to the ‘probe’ category, [0, 0, 0, 1, 0, 0] to the ‘r2l’ category, [0, 0, 0, 0, 1, 0] to the ‘u2r’ category, and [0, 0, 0, 0, 0, 1] to the ‘fake’ category. In general, if the sample xi belongs to category c, then \(y_{{x_{i} }}^{c}\) = 1 and all remaining entries are 0, that is, \(y_{{x_{i} }}^{j \ne c}\) = 0.
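A minimal sketch of this one-hot label encoding, using the class ordering given above (the helper function is illustrative):

```python
import numpy as np

CLASSES = ['normal', 'dos', 'probe', 'r2l', 'u2r', 'fake']

def one_hot(label):
    """Map a class name to the 6-dimensional one-hot vector used in Eq. (5)."""
    vec = np.zeros(len(CLASSES), dtype=int)
    vec[CLASSES.index(label)] = 1
    return vec

print(one_hot('normal'))   # [1 0 0 0 0 0]
print(one_hot('fake'))     # [0 0 0 0 0 1]
```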

Therefore, the loss function of the multiclass classifier in the proposed framework can be further expressed as follows.

$$\begin{aligned} Lc & = - \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \mathop \sum \limits_{j} y_{{x_{i} }}^{j} \log P_{model} \left( {y = j |x_{i} } \right) \\ & = - \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left[ {y_{{x_{i} }}^{j = c} \log P_{model} \left( {y = c |x_{i} } \right) + \mathop \sum \limits_{j \ne c} y_{{x_{i} }}^{j} \log P_{model} \left( {y = j |x_{i} } \right) } \right] \\ & = - \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left[ {y_{{x_{i} }}^{j = c} \log P_{model} \left( {y = c |x_{i} } \right) } \right] \\ & = - \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left[ {\log P_{model} \left( {y = c |x_{i} } \right) } \right] \\ \end{aligned}$$

5 Experiments

5.1 Dataset

The NSL-KDD (Knowledge Discovery and Data Mining) dataset [41, 42] is a benchmark dataset for network intrusion detection. It removes a large amount of redundant data from the original dataset and adjusts the proportions of normal and abnormal data to make the training and test set sizes more reasonable. It remains an ideal and widely trusted public benchmark dataset [43,44,45] for an effective and accurate assessment of different machine learning algorithms for intrusion detection.

The NSL-KDD dataset consists of a training set and a test set. The training set KDDTrain+ contains 125,973 instances, and the test set KDDTest+ contains 22,544 instances, as shown in Table 1.

Table 1 Different classifications in the NSL-KDD dataset

There are 41 features and 1 class label for every traffic record; the features include basic features (No. 1–No. 10), content features (No. 11–No. 22) and traffic features (No. 23–No. 41), as shown in Table 2. According to their characteristics, attacks in the dataset are categorized into four types: DoS, Probe, R2L and U2R. The test set contains some specific attack types that do not appear in the training set, which provides a more realistic basis for evaluating intrusion detection.

Table 2 Features of NSL-KDD dataset

Each instance in the dataset is described by 41 features and 1 label, which is either normal or one of the attack types (DoS, Probe, R2L and U2R). Because the input must be a numeric matrix, we must convert the nonnumeric features, namely ‘protocol_type,’ ‘service’ and ‘flag,’ into numeric form. For example, the feature ‘protocol_type’ has three attribute values, ‘tcp,’ ‘udp’ and ‘icmp,’ which are encoded as the binary vectors (1,0,0), (0,1,0) and (0,0,1). Similarly, the feature ‘service’ has 70 attribute values and the feature ‘flag’ has 11. Continuing in this way, after one-hot encoding and normalization of the features, each 41-dimensional feature vector is transformed into a 122-dimensional feature vector.
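A possible preprocessing sketch along these lines is given below; the exact encoder and scaler used in the experiments are not specified in the text, and `features` is assumed to be a pandas DataFrame holding only the 41 feature columns.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

CATEGORICAL = ['protocol_type', 'service', 'flag']   # 3 + 70 + 11 = 84 binary columns

def preprocess(features):
    """One-hot encode the three symbolic features and scale the remaining ones to [0, 1]."""
    features = pd.get_dummies(features, columns=CATEGORICAL)   # 38 numeric + 84 binary = 122 columns
    numeric = [c for c in features.columns
               if not any(c.startswith(p + '_') for p in CATEGORICAL)]
    # in practice the scaler should be fitted on the training set only
    features[numeric] = MinMaxScaler().fit_transform(features[numeric])
    return features
```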

5.2 Selection of generative model and classifier for ID-GAN

In theory, we can choose any generative model and any classifier as the generative model G and the multiclass classifier C of the ID-GAN framework, respectively. In practical applications, however, the generative model and the classifier are generally nonlinear mapping functions, such as the multilayer perceptron, long short-term memory (LSTM) networks and others.

We selected an LSTM network as the generative model; LSTM is a special type of recurrent neural network (RNN) that can remember long-term information and mitigate the vanishing gradient problem. In this paper, a 3-layer LSTM network was adopted as the generative model in the ID-GAN framework, consisting of an input layer, a hidden layer and an output layer. The number of neurons in the input layer was set to 120, and the number of hidden layer nodes was set to 80. The number of output nodes was 122, the same as the number of processed features described in Sect. 5.1. In addition, the time step was set to 10.
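One plausible Keras reading of this generator description is sketched below; the layer sizes follow the text, while the output activation and the exact arrangement of the three layers are assumptions.

```python
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Input, LSTM, Dense

TIME_STEPS, INPUT_DIM = 10, 120     # time step and input-layer size from the text

def build_generator():
    """Sketch of the generative model G: 120-unit input, 80-unit hidden LSTM layer, 122-unit output."""
    return Sequential([
        Input(shape=(TIME_STEPS, INPUT_DIM)),
        LSTM(80),                           # hidden layer
        Dense(122, activation='sigmoid'),   # one generated 122-dimensional record in [0, 1]
    ])
```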

The artificial neural network (ANN) is one of the most common machine learning methods for complex applications such as pattern recognition, automatic control and deep learning. It is a nonlinear structure mainly used for classification tasks such as spam identification, disease diagnosis and distinguishing images of cats and dogs. In [5], the authors proposed a model based on ANNs for binary and multiclass classification in the realm of intrusion detection and achieved good results. In this paper, we used a 4-layer neural network as the multiclass classifier C, consisting of an input layer, two hidden layers and an output layer. The number of neurons in the input layer was 122, the same as the number of features. The numbers of nodes in the first and second hidden layers were 80 and 20, respectively. For adversarial training, the number of output nodes was 6, the same as the number of classes.
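A corresponding Keras sketch of the classifier C is given below; the layer sizes follow the text, while the activations, optimizer and loss are assumptions.

```python
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Input, Dense

def build_classifier():
    """Sketch of the multiclass classifier C: 122 inputs, hidden layers of 80 and 20 nodes, 6 outputs."""
    model = Sequential([
        Input(shape=(122,)),
        Dense(80, activation='relu'),     # activation choices are assumptions
        Dense(20, activation='relu'),
        Dense(6, activation='softmax'),   # normal, dos, probe, r2l, u2r, fake
    ])
    model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])
    return model
```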

5.3 Classification metrics

In order to comprehensively and objectively evaluate the performance of the classifier in detecting intrusion behaviors, five indicators are used: accuracy, precision, recall, f1-score and the confusion matrix.

True Positive (TP) indicates the intrusion traffic that is correctly detected, True Negative (TN) indicates the legitimate traffic that is correctly detected, False Negative (FN) indicates the intrusion traffic that is incorrectly detected as legal traffic, and False Positive (FP) indicates the legal traffic that is falsely detected as illegal traffic.

As shown in Table 3, each column of the confusion matrix represents a predicted category, and the total of each column is the number of samples predicted as that category. Each row represents the true category, and the total of each row is the number of samples actually in that category.

Table 3 Confusion matrix

The accuracy, precision, recall and f1-score are, respectively, defined in Eqs. (8) to (11).

$${\text{Accuracy}} = \frac{{{\text{TP}} + {\text{TN}}}}{{{\text{TP}} + {\text{TN}} + {\text{FP}} + {\text{FN}}}}$$
(8)
$${\text{Precision}} = \frac{\text{TP}}{{{\text{TP}} + {\text{FP}}}}$$
(9)
$${\text{Recall}} = \frac{\text{TP}}{{{\text{TP}} + {\text{FN}}}}$$
(10)
$${\text{F}}1\;{\text{score}} = \frac{{2 \times {\text{Recall}} \times {\text{Precision}}}}{{{\text{Recall}} + {\text{Precision}}}}$$
(11)

To evaluate the performance of the multiclass classifier under label imbalance, we use the weighted average precision, weighted average recall and weighted average f1-score. We calculate each metric for every label and then average the values, weighted by support (the proportion of samples in each class); in other words, each class's weight is the fraction of the total samples that carry that label. Note that the overall accuracy is equal to the weighted average recall.

Hence, a good classifier for intrusion detection should have a higher accuracy, precision, recall and f1-score.
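The weighted-average metrics described above can be computed directly with scikit-learn; a minimal sketch, assuming integer-coded true and predicted labels:

```python
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def weighted_metrics(y_true, y_pred):
    """Accuracy plus support-weighted precision, recall and f1-score."""
    acc = accuracy_score(y_true, y_pred)
    prec, rec, f1, _ = precision_recall_fscore_support(
        y_true, y_pred, average='weighted', zero_division=0)
    return acc, prec, rec, f1   # note that acc equals the weighted average recall
```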

5.4 Controlled experiments

In this paper, controlled experiments were adopted to evaluate more accurately and objectively the effect of adversarial training on the multiclass classifier using our framework. As stated before, since an ANN was selected as the multiclass classifier in the ID-GAN framework, we also chose an ANN without adversarial training as the control group. The parameters of the multiclass classifier without adversarial training (denoted as Coriginal) were the same as those of the classifier trained adversarially (denoted as Cenhanced) in the ID-GAN framework: the number of input nodes was also 122, and the numbers of nodes in the first and second hidden layers were 80 and 20, respectively. However, the output layer of Coriginal had five nodes (five categories), which was the only difference from Cenhanced in the ID-GAN framework.

5.5 Experimental configuration

In this paper, we used one of the most widely used deep learning frameworks, Keras, whose architecture is flexible and supports various models and techniques such as batch normalization. The experiments were performed on a ThinkPad E450 personal notebook with an Intel Core i5-5200U CPU @ 2.20 GHz and 8 GB of memory, without GPU acceleration.

6 Training the framework

To ensure the objectivity and impartiality of the experiments and to accurately evaluate how the ID-GAN framework enhances the multiclass classifier, we selected 11 observation points, at which 100, 200, 500, 1000, 2000, 5000, 8000, 10,000, 20,000, 50,000 and 125,973 ‘fake’ samples were mixed into the adversarial training at each epoch. Their respective ratios to the total samples are 0.08%, 0.16%, 0.40%, 0.79%, 1.56%, 3.82%, 5.97%, 7.35%, 13.70%, 28.41% and 50.00%. We then observed the average accuracy of the classifier enhanced via adversarial training on the test set KDDTest+ over the epochs.

Although GANs have achieved great success in image generation, training a stable GAN is still difficult in practice. As mentioned before, we chose the generative model and the multiclass classifier for the ID-GAN framework. This section focuses on the training difficulty caused by the excessively free and hard-to-control nature of GAN frameworks. We examined the two training factors that most affect the performance of the multiclass classifier: whether prior training is used and the choice of noise distribution. This allows us to test how these factors affect the performance of the classifier and how to obtain a more stable, robust and generalized trained classifier.

  1.

    Prior training

The prior training mentioned in this paper refers to training the classifier n_times epochs in advance on the original five-category dataset before adversarial training. This gives the classifier proper guidance and prevents it from becoming too unconstrained to control. Under the same experimental conditions, the detection performance of the multiclass classifier was tested on the test set KDDTest+ with and without prior training.

As shown in Algorithm 1, we first trained the multiclass classifier C for n_times epochs. Then, we drew m generated ‘fake’ samples and n real data samples to train the classifier; since a well-trained classifier C can better guide the adjustment of the generative model G, the classifier C is generally updated more times per round [40]. Third, we froze the classifier C and drew m generated samples again to train the generative model G.

Algorithm 1

Without loss of generality, the number of prior-training epochs n_times was set to 10 and the number of ID-GAN epochs k was set to 90, so the total number of training epochs was 100. A condensed sketch of this procedure is shown below.
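The following Python sketch condenses Algorithm 1 under these settings; `classifier`, `generator` and the stacked `gan` model (the generator followed by the frozen classifier) are assumed to be built and compiled beforehand, `y_train6` is assumed to hold 6-column one-hot labels for the real training data (with the ‘fake’ column zero), and the generator target and the number of classifier steps per round are assumptions rather than the exact choices made in this paper.

```python
import numpy as np

N_TIMES, EPOCHS = 10, 90   # prior-training epochs and adversarial epochs, as in the text
C_STEPS = 2                # classifier is updated more often than G; the exact value is an assumption

def sample_noise(m):
    """Placeholder noise sampler matching the generator input shape (time step 10, width 120)."""
    return np.random.uniform(-1.0, 1.0, size=(m, 10, 120))

def train_id_gan(classifier, generator, gan, x_train, y_train6, m, n):
    """Condensed sketch of Algorithm 1: prior training, then alternating C / G updates."""
    fake_target = np.tile(np.eye(6)[5], (m, 1))       # one-hot 'fake' labels for generated samples
    not_fake = np.full((m, 6), 0.2)                   # generator target: spread mass over the
    not_fake[:, 5] = 0.0                              # five real classes (an assumption)

    for _ in range(N_TIMES):                          # 1) prior training on the real 5-class data
        classifier.fit(x_train, y_train6, epochs=1, verbose=0)

    for _ in range(EPOCHS):                           # 2) adversarial training
        for _ in range(C_STEPS):                      #    update C on n real + m generated samples
            idx = np.random.randint(0, len(x_train), n)
            x_fake = generator.predict(sample_noise(m), verbose=0)
            x_batch = np.vstack([x_train[idx], x_fake])
            y_batch = np.vstack([y_train6[idx], fake_target])
            classifier.train_on_batch(x_batch, y_batch)
        gan.train_on_batch(sample_noise(m), not_fake)  # 3) C frozen inside `gan`; update G only
```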

Figure 3 shows the average accuracy of the classifier on the test set at the 11 observation points, with and without prior training. The results reveal that, at all 11 numbers of mixed ‘fake’ samples, the average detection performance on the test set of the classifier with prior training is higher than that of the classifier without prior training. From these experiments, we concluded that prior training of the multiclass classifier helps steer the ID-GAN model in a more ‘correct’ direction.

Fig. 3

The average accuracy of the classifier using the ID-GAN at different observation points with and without prior training, respectively

  2.

    Uniform distribution versus Gaussian distribution

Either the uniform distribution or the Gaussian distribution can be used as the input of the generative model G. To train a more stable and efficient multiclass classifier, we compared the performance of the adversarially trained multiclass classifier when the uniform distribution and the Gaussian distribution were used as the generator input.

The experimental results shown in Fig. 4 indicate that the average accuracy of the classifier trained via the ID-GAN framework differs little between the Gaussian and uniform input distributions at the five observation points of 1000, 5000, 8000, 20,000 and 125,973, with a maximum difference in average accuracy of 0.0036. However, at the observation points of 500, 2000, 10,000 and 50,000, the average accuracy on the test set with the uniform distribution as the input of G is higher than with the Gaussian distribution under the same conditions, with a maximum difference of 0.0131.

Fig. 4

The average accuracy of the classifier at different observation points with the Gaussian distribution and uniform distribution, respectively

Our study of training the ID-GAN framework shows that applying prior training to the classifier and using the uniform distribution as the generator input help enhance the performance of the classifier. In this way, we can train a relatively good locally optimal classifier via the ID-GAN framework whose detection performance is high and stable. Of course, the model we obtain is not necessarily the best, but we provide a notion of how to train the framework and demonstrate the impact of prior training and of the choice of noise distribution on performance.

7 Results and discussion

7.1 Controlled group

As stated before, controlled experiments were adopted and an ANN was selected as the multiclass classifier in the control group. With 100 training epochs, the accuracy of the original classifier Coriginal over the epochs is shown in Fig. 5. The average accuracies of the original classifier on the training set and the test set are 99.67% and 78.81%, respectively, and the original classifier reaches its highest accuracy of 79.85% on the test set at the 95th epoch.

Fig. 5

The accuracy of the original classifier over epochs

7.2 Overall comparison

In Sect. 6, we trained a locally optimal classifier using the ID-GAN framework and evaluated it on the test set at the 11 observation points, as shown in Fig. 6. The vertical axis represents the average accuracy on the test set, while the horizontal axis represents the number of ‘fake’ samples mixed with the training samples at each adversarial training epoch. The blue dots indicate the average accuracy of the classifier Cenhanced enhanced by the ID-GAN framework on the test set, and the red dotted line indicates the baseline average accuracy of the original classifier Coriginal on the same test set.

Fig. 6

The average accuracy of the enhanced classifier on the test set at 11 observation points

Figure 6 indicates that the average accuracy of the enhanced classifier Cenhanced exceeds that of the original classifier Coriginal at 9 of the 11 points. The classifier trained adversarially with the ID-GAN framework thus clearly improves its detection performance and generalization. As the number of generated samples grows very large (e.g., from 50,000 to 125,973), the average detection performance of the enhanced classifier Cenhanced begins to decline, and the ID-GAN framework no longer enhances the classifier. When generated samples account for a large proportion of the synthesized data at each adversarial training epoch, the classifier focuses too much on deciding whether a sample is ‘fake,’ which decreases the classification performance on the original five classes.

Figure 7 details the accuracy of the original classifier Coriginal and the enhanced classifier Cenhanced on the test set over the epochs at the 11 observation points, and shows how the performance of the classifier is improved by the ID-GAN framework. The blue line indicates the accuracy of the original classifier Coriginal on the test set, while the red line indicates the accuracy of the enhanced classifier Cenhanced trained adversarially on the same test set over 100 epochs. In addition, the black arrow marks the optimal enhanced classifier on the test set.

Fig. 7

The accuracy of multiclass classifiers on the test set over epochs at the 11 observation points

Based on the results of the controlled experiments, the accuracy of the enhanced classifier Cenhanced is significantly higher than that of the original classifier Coriginal between approximately epoch 20 and epoch 80 (the interval [20, 80]) at the 100, 500, 2000, 8000 and 10,000 observation points.

Correspondingly, the maximum accuracies of optimal enhanced classifiers Cenhanced via ID-GAN framework at the 11 observation points are also significantly higher than those of the original classifier Coriginal, no matter how many ‘fake’ samples are mixed.

Figure 7 thus provides a detailed graphical depiction of how the classifier is enhanced with the help of the proposed framework.

7.3 Individual comparison

The accuracy and the weighted averages of the precision, recall and f1-score of the optimal classifiers (the black arrows) can be obtained using the classification_report function from the scikit-learn library.
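For reference, a minimal sketch of that call (the class names and integer label coding are assumptions):

```python
from sklearn.metrics import classification_report

CLASS_NAMES = ['normal', 'dos', 'probe', 'r2l', 'u2r']

def report(y_true, y_pred):
    """Per-class and weighted-average precision, recall and f1-score on KDDTest+."""
    return classification_report(y_true, y_pred, labels=list(range(5)),
                                 target_names=CLASS_NAMES, digits=4, zero_division=0)
```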

The performance measures of different multiclass classifiers on the same test set are shown in Table 4. The row with the orange background color indicates the performance of the original classifier Coriginal (ANN), the rows with the light-gray background color indicate the performance of multiclass classifiers on the same test set in other studies [5, 25, 35], and the rows without background color indicate the performance of enhanced classifiers Cenhanced via adversarial training at different observation points.

Table 4 Performance measures of different multiclass classifiers on the same test set KDDTest+

As shown in Table 4, the accuracy of our original classifier Coriginal based on ANN (79.85%) is almost the same as the result (79.90%) obtained in [5], which supports the validity and objectivity of our experimental results. However, the classifiers enhanced via adversarial training outperform both the original classifier Coriginal without the ID-GAN framework and the classifiers from other studies in terms of accuracy, weighted average precision and the other performance indicators.

The experimental results prove that the ID-GAN framework can enhance the original classifier’s classification performance and generalization ability via adversarial training.

In order to describe in detail how adversarial training improves the detection performance of the classifier, we take the observation point of 8000 as an example. The confusion matrices of the original classifier and the enhanced classifier are shown in Tables 5 and 6, respectively. The confusion matrices indicate that the enhanced classifier correctly classifies more samples in all five classes, and especially improves the detection of Probe and R2L attacks.

Table 5 Confusion matrix of the original classifier on KDDTest+
Table 6 Confusion matrix of the enhanced classifier on KDDTest+

Meanwhile, Table 7 shows that the recall, precision and other performance indicators of the enhanced classifier on each class, as well as their weighted averages, are improved compared with those of the original classifier.

Table 7 Classification metrics of the enhanced classifier compared with the original classifier on KDDTest+

7.4 Training time

In terms of training time, data augmentation increases the number of training samples, so the classifier takes more time to train and the training cost certainly rises. From Table 4, if the ID-GAN framework generates a small number of training samples (fewer than 8000), training the model takes no more than 500 s. As the number of training samples increases, for example when more than 10,000 generated samples are mixed in, training takes more than 700 s. Of course, these are results on a personal computer, and we believe the training time would be greatly reduced with GPU acceleration or on a high-performance server.

7.5 Discussion

Compared with binary classification for intrusion detection, multiclass classification must further judge the intrusion type, so the problem is more complicated and more difficult. As shown in Table 4, in terms of the most important multiclass performance indicator, the detection accuracy of decision tree (C4.5), naïve Bayes, random forest, GAR-forest, ANN and self-taught learning is below 80%, with the ANN model obtaining the highest detection accuracy (79.9%). In our experiments, we further enhanced the multiclass classification of the ANN model through the ID-GAN framework, which clearly improved its detection performance: the detection accuracy, precision and f1-score improved by about 3.25%, 2.87% and 2.78%, respectively. In [33], the DBN optimized for multiclass classification achieved the highest reported detection accuracy (82.36%), and the classification accuracy of the CNN algorithm was 78.42% [34]. Even so, our proposed approach is also superior to those methods.

From the above experimental results, we believe that the samples continuously generated by the generative model (as a sixth class of labeled samples) during adversarial training provide useful and complementary information for the classifier and help improve the accuracy, precision, f1-score and other performance indicators. This verifies that the ID-GAN framework can generate more useful training samples from limited data and increase the number and diversity of training samples, thereby improving the generalization of the classifier.

To give an everyday example: even if no one teaches you to recognize particular words, extra practice in deciding ‘this is not the word’ is still beneficial to recognizing words. Rough feedback is better than no feedback.

In terms of the overall performance comparison, the average detection rate of the enhanced model on the test set over the 100 training iterations exceeds that of the original classifier, especially during the training interval of approximately [20, 80] at the 100, 500, 2000, 8000 and 10,000 observation points. By exploiting complementary and helpful information from the generative model, the ID-GAN framework significantly improves the detection performance of the original classifier. Comparing the optimal models, the optimal classifier obtained through adversarial training is superior to the original classifier in terms of accuracy, precision, f1-score and other performance indicators, and especially improves the detection of Probe and R2L attacks.

Hence, the ID-GAN framework with supervised adversarial training can enhance the generalization of the original classifier in detecting attacks, and can serve as a framework for improving the performance of intrusion detection classifiers.

8 Conclusion and future work

In this paper, we transform the GAN into a supervised learning approach and propose an intrusion detection framework based on GAN that uses adversarial training to enhance the classifier. The ID-GAN framework continually generates ‘fake’ labeled samples with the generative model G, assists the classifier in improving its detection performance, and enhances the generalization of the original classifier. The approach proposed in this paper provides a feasible method for enhancing classifiers. It also offers a new technique and line of thought for research and practice in related fields, such as botnet detection.

Like other GAN-based methods, the proposed approach may suffer from training difficulties and a lack of diversity in the generated samples. As with other deep learning applications, open scientific problems of interpretability and self-adaptation remain. In future research, we will study the effect of the ID-GAN framework on other classifiers and further study optimized training methods. Furthermore, we will focus on the hyperparameters of the ID-GAN framework, such as the value of n_times in prior training and the number of update steps applied to the classifier.