题名

Hybrid Big Data Architecture for High-Speed Log Anomaly Detection

DOI

10.6138/JIT.2017.18.7.20170419d

作者

Natawut Nupairoj;Pittayut Tangsatjatham

关键词

Big data ; Real-time ; Log processing ; Hybrid processing ; Lambda architecture

期刊名称

網際網路技術學刊

卷期/出版年月

18卷7期(2017 / 12 / 01)

页次

1681 - 1688

内容语文

英文

中文摘要

Anomaly detection in network traffic can be very challenging, especially for environments with high-speed networks and lots of servers. In these environments, log data of network traffic is usually large, coming at high-speed, and have various formats, the classic case of big data problem. This makes anomaly detection very difficult due to the fact that to get good accuracy, large amount of data must be processed in real-time. To solve this problem, this paper proposes a hybrid architecture for network traffic anomaly detection using popular big data framework including Apache Spark and Apache Flume. To demonstrate the capabilities of our proposed solution, we implement a SARIMA-based anomaly detection as a case study. The experimental results clearly indicated that our proposed architecture allows anomaly detection with good accuracy in large-scale environment effectively.

主题分类 基礎與應用科學 > 資訊科學