A Novel Framework for Generating Personalized Network Datasets for NIDS Based on Traffic Aggregation.
Pablo VelardeHugo GonzalezRafael Martínez-PeláezLuis J MenaAlberto Ochoa-BrustEfraín Moreno-GarcíaVanessa G FélixRodolfo OstosPublished in: Sensors (Basel, Switzerland) (2022)
In this paper, we addressed the problem of dataset scarcity for the task of network intrusion detection. Our main contribution was to develop a framework that provides a complete process for generating network traffic datasets based on the aggregation of real network traces. In addition, we proposed a set of tools for attribute extraction and labeling of traffic sessions. A new dataset with botnet network traffic was generated by the framework to assess our proposed method with machine learning algorithms suitable for unbalanced data. The performance of the classifiers was evaluated in terms of macro-averages of F 1-score (0.97) and the Matthews Correlation Coefficient (0.94), showing a good overall performance average.