题名 |
Hybrid Big Data Architecture for High-Speed Log Anomaly Detection |
DOI |
10.6138/JIT.2017.18.7.20170419d |
作者 |
Natawut Nupairoj;Pittayut Tangsatjatham |
关键词 |
Big data ; Real-time ; Log processing ; Hybrid processing ; Lambda architecture |
期刊名称 |
網際網路技術學刊 |
卷期/出版年月 |
18卷7期(2017 / 12 / 01) |
页次 |
1681 - 1688 |
内容语文 |
英文 |
中文摘要 |
Anomaly detection in network traffic can be very challenging, especially for environments with high-speed networks and lots of servers. In these environments, log data of network traffic is usually large, coming at high-speed, and have various formats, the classic case of big data problem. This makes anomaly detection very difficult due to the fact that to get good accuracy, large amount of data must be processed in real-time. To solve this problem, this paper proposes a hybrid architecture for network traffic anomaly detection using popular big data framework including Apache Spark and Apache Flume. To demonstrate the capabilities of our proposed solution, we implement a SARIMA-based anomaly detection as a case study. The experimental results clearly indicated that our proposed architecture allows anomaly detection with good accuracy in large-scale environment effectively. |
主题分类 |
基礎與應用科學 >
資訊科學 |