Density-Based LLE Algorithm for Network Forensics Data

Full Text (PDF, 349KB), PP.52-59

Views: 0 Downloads: 0

Author(s)

Peng Tao 1 Chen Xiaosu 1,* Liu Huiyu 1 Chen Kai 1

1. School of Computer Science and Technology of Huazhong University of Science and Technology

* Corresponding author.

DOI: https://doi.org/10.5815/ijmecs.2011.01.08

Received: 16 Nov. 2010 / Revised: 20 Dec. 2010 / Accepted: 14 Jan. 2011 / Published: 8 Feb. 2011

Index Terms

Data Reduction, Network Forensics, Manifold Learning, LLE

Abstract

In a network forensic system, there are huge amounts of data that should be processed, and the data contains redundant and noisy features causing slow training and testing processes, high resource consumption as well as poor detection rate. In this paper, a schema is proposed to reduce the data of the forensics using manifold learning. Manifold learning is a popular recent approach to nonlinear dimensionality reduction. Algorithms for this task are based on the idea that the dimensionality of many data sets is only artificially high. In this paper, we reduce the forensic data with manifold learning, and test the result of the reduced data.

Cite This Paper

Peng Tao, Chen Xiaosu, Liu Huiyu, Chen Kai, "Density-Based LLE Algorithm for Network Forensics Data", International Journal of Modern Education and Computer Science(IJMECS), vol.3, no.1, pp.52-59, 2011. DOI:10.5815/ijmecs.2011.01.08

Reference

[1]Marcus Ranum, Network Flight Recorder. http://www.ranum.com/
[2]Simson Garfinkel, Web Security, Privacy & Commerce, 2nd Edition. http://www.oreillynet.com/pub/a/network /2002/04/26 /nettap.html
[3]DARPA 1998 data set, http://www.ll.mit. edu /IST/ideval/data/1998/1998_data_index. html, cited August 2003.
[4]W. Lee, S. J. Stolfo, and K. W. Mok, A Data Mining Framework for Building Intrusion Detection Models, IEEE Symposium on Security and Privacy, Oakland, California (1999), 120-132.
[5]KDD 1999 data set, http://kdd.ics.uci.edu/databases/ kddcup99/ kddcup99.html, cited August 2003.
[6]I. Levin, KDD-99 Classifier Learning Contest LLSoft’s Results Overview, ACM SIGKDD Explorations 1(2) (2000), 67-75.
[7]Tenenbaum, J. B., de Silva, V., & Langford, J. C. (2000) A global geometric framework for nonlinear dimensionality reduction, Science, 290, pp. 2319–2323.
[8]Roweis, Sam T. & Saul, Lawrence K. (2000) Nonlinear dimensionality reduction by locally linear embedding, Science, 290, pp. 2323–2326.
[9]Roweis, S., Saul, L.: Nonlinear dimensionality reduction by locally linear embedding. Science 290 (2000) 2323–2326
[10]Belkin, M., Niyogi, P.: Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation 15 (2003) 1373–1396
[11]Zhang, Z., Zha, H.: Principal manifolds and nonlinear dimensionality reduction via tangent space alignment. SIAM Journal on Scientific Computing 26 (2004) 313–338
[12]CHEN You SHEN Hua-Wei LI Yang CHENG Xue-Qi , An Efficient Feature Selection Algorithm Toward Building Lightweight Intrusion Detection System. CHINESE JOURNAL OF COMPUTERS, 2007 30(8), 1398-1407